mDash Outage Aug-16

Looks like mDash is down today. Our devices are unable to store data to the mDash database. Also see a Network Error when attempting to load the mDash web console.

Anyone else see similar symptoms?
-AD

Same here, started seeing some errors for a few hours but they went away.
It’s down again now, I’ve contacted support by mail.

Web console is working for me, I can query and see last written message was around 10 hours ago

@cesanta

Web console is partially down for me now, can’t view devices or customers but data, webapp, keys, account are all working.

Service is back operating as normal for me

Yes, appears to be working now.

It appears the outage was between 2024-08-16 12:00 and 2024-08-16 23:00 UTC and another shorter outage from about 2024-08-17 06:30 to 2024-08-17 09:00 UTC. During this time our devices were unable to write data to the mDash database. Presumably this data is lost.

Had another little glitch yesterday, wasn’t able to download my database.

Will keep and eye on this, as last time this happened the day before the whole service went down.

@Autodog out of interest do you have any mechanism for detecting this? Or just missing data? I turned on some CI jobs to check on the mDash service which have been working well for me checking hourly for a few key functions I need.

@klimbot no, I don’t have a system to check for mDash health, but that would be a good idea given the recent stability challenges.

It’s interesting that this most recent outage appeared to affect the console and device database writes but not the shadow registers. We may start looking for a means of detecting discrepancies between the two, but this is a unique fault that we haven’t seen before.

-AD