[RESOLVED] SaaS - EU (Paris) outage on 2019/01/25 - #20190125A

On 2019-01-25T16:37Z, an incident on the SQL Cluster stack impacted the ThingPark Wireless GUI, OSS-API, DX-API and Provisioning.

Current State


Resolved

Incident information and Service impact



  • Incident Start Time: 2019-01-25T16:37Z.
  • Service Restoration Time: 2019-01-25T16:45Z.
  • Service Impact Duration: 8 minutes for the impacted services (GUI, OSS-API, DX-API and Provisioning).
  • Severe service impact: GUI, OSS-API, DX-API and Provisioning.

Timeline (January 25th)



The status is updated in real time on the status page.
[04:37 pm] : The SQL Cluster went down while a new service was being enabled for ThingPark Exchange
[04:40 pm] : Problem identified (SQL Cluster)
[04:44 pm] : SQL service restarted
[04:45 pm] : Service restored and sanity checks started
[06:46 pm] : Incident marked as resolved


Root Cause Analysis


After a new service was enabled for ThingPark Exchange, the SQL Cluster went down, impacting DX-API, OSS-API, GUI and Provisioning. Service was restored by restarting the SQL Cluster stack; the restoration time corresponds to the time needed to re-synchronize the databases.


Actility Support