[resolved][2021-03-10] ECO platform is down: ThingPark Community Portal, POC, Dev1 total outage
2021-03-27T22:00:00Z
- ๐ community.thingpark.org:
- โ subscriber creation is working again, but still not available for customers as Interop Engine is not working yet,
- โ Single Sign-On (SSO) to interop engine is also working
- ๐ Interop engine validation needed.
- โ TPXLE + ADA working for
dev-ope
instance
- ๐ TPXLE not working for community instance without portal
- โ All needed operator instances are now created.
- ๐ฅ Next in line:
- Subscribers, basestations and devices migration.
- TPX decoders and drivers
- Restore Partners sites
---
Incident history:
Thingpark Community, PoCs and Dev1
โNetwork Server for PoCs
Major incident
โNetwork Server for Dev1
Major incident
โOSS API and GUI
Major incident
โThingpark X Location Engine
Major incident
โThingPark X IoT-Flow
Major incident
โDX API
Major incident
โdocumentation platform Major incident
- 2021-03-10T09:35:00Z: the hosting company (OVH) just communicated they won't be able to restore their datacenter today.
2021-03-10T06:51:00Z (morning, Paris time) : ๐ฅ A datacenter hosting some of our services has burnt down ๐ฅ
2021-03-11T13:50:00Z
after the fire ๐ฅ that damaged multiple datacentres in Strasbourg, France, yesterday, the entire ECO environment is still down.
OVH is in disaster-recovery mode. In the next few days, they will let us know what services / data are gone and what we can restore. We will then assess the situation.
Our teams are working on restoring our own manual backups to a different location.
2021-03-18, morning, Paris time. Disaster recovery plan is ongoing.
- โ The new
community platform is up, including
DX-API and
IPSec.
- ๐
IoTFlow,
ThingPark X Location Engine,
Interoperability Engine are not back online, yet.
- ๐ฅ next in line for re-establishment:
- TPX IoTFlow
-
ThingPark X Location Engine (TPXLE) +
Abeeway Device Analyzer (ADA)
- migration of operator instances, subscribers, basestations (gateways), devices
- provisioning of
Community offers (including
Interop)
2021-03-19, morning, Paris time. Disaster recovery plan is ongoing.
- โ
IoTFlow is installed
- โ
DX-API was updated to facilitate reprovisioning of subscribers and end-user accounts
- ๐
ThingPark X Location Engine,
Interoperability Engine are not back online, yet.
-๐ฅnext in line:
- validate
TPXLE and
ADA
- achieve creation of all operator instances
- migrate subscribers, basestations (gateways), devices
- provision Community and Partner portals offers (including Interop)
Related Articles
[RESOLVED] - OVH (our datacenter) outage on 2017/11/09 - #201701109A
Since the 2017/11/09 07:43 AM (CET) our datacenter at OVH is unreachable and this impact big part of Actility services. Actility services are accessible from certain locations/networks. Issues are located on some global routers by OVH (out of our ...
[RESOLVED] SaaS - EU outage on 2019-05-22 - #20190522[A]
Incident Description On 2019-05-22 - 03:08 AM, our SaaS-EU datacenter became unreachable. From the outside world, it looks like a complete outage. Incident information and Service impact Incident Start Time: 2019-05-22 - 03:08 AM. ...
[RESOLVED] - LRC outage for SaaS-EU on 2020-04-25
On 2020-04-25, our LRC end-to-end probe detects a traffic issue. The alarm is automatically cleared within five minutes: the LRC process was restarted after catastrophic memory failure. Incident information and Timeline (UTC) [summary] The problem ...
[RESOLVED] SaaS - EU (Paris) outage on 2019/01/15 - #20190115B
On 2019-01-15T01:25Z, following planned hardware maintenance and incident #20190115A, site B went offline. Actility technical teams are working on this issue. Current state Resolved. Incident information and Service impact Incident Start Time: ...
[RESOLVED] - LRC outage for SaaS- EU from 2018/04/12 - 2018/05
Since April 12, 2018, our LRC probe detects from time to time a lack of traffic; packets coming from gateways are no longer forwarded to AS. Initial incidents were detected by the on-call engineer who gathers enough information to find the root ...