More

mystcb · 2025-10-29T16:23:20 1761755000

Update 16:57 UTC:

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. In addition. customers may experience issues accessing the Azure Portal. Customers can attempt to use programmatic methods (PowerShell, CLI, etc.) to access/utilize resources if they are unable to access the portal directly. We have failed the portal away from Azure Front Door (AFD) to attempt to mitigate the portal access issues and are continuing to assess the situation.

We are actively assessing failover options of internal services from our AFD infrastructure. Our investigation into the contributing factors and additional recovery workstreams continues. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:57 UTC on 29 October 2025

---

Update: 16:35 UTC:

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025

---

Azure Portal Access Issues

We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.

This message was last updated at 16:18 UTC on 29 October 2025

---

Message from the Azure Status Page: https://azure.status.microsoft/en-gb/status

planewave · 2025-10-29T17:18:47 1761758327

Azure Network Availability Issues

Starting at approximately 16:00 UTC, we began experiencing Azure Front Door issues resulting in a loss of availability of some services. We suspect that an inadvertent configuration change as the trigger event for this issue. We are taking two concurrent actions where we are blocking all changes to the AFD services and at the same time rolling back to our last known good state.

We have failed the portal away from Azure Front Door (AFD) to mitigate the portal access issues. Customers should be able to access the Azure management portal directly.

We do not have an ETA for when the rollback will be completed, but we will update this communication within 30 minutes or when we have an update.

This message was last updated at 17:17 UTC on 29 October 2025

croemer · 2025-10-29T18:24:00 1761762240

"We have initiated the deployment of our 'last known good' configuration. This is expected to be fully deployed in about 30 minutes from which point customers will start to see initial signs of recovery. Once this is completed, the next stage is to start to recover nodes while we route traffic through these healthy nodes."

"This message was last updated at 18:11 UTC on 29 October 2025"

croemer · 2025-10-29T20:32:31 1761769951

At this stage, we anticipate full mitigation within the next four hours as we continue to recover nodes. This means we expect recovery to happen by 23:20 UTC on 29 October 2025. We will provide another update on our progress within two hours, or sooner if warranted.

This message was last updated at 19:57 UTC on 29 October 2025

cyptus · 2025-10-29T17:20:01 1761758401

AFD is down quite often regionally in Europe for our services. In 50%+ the cases they just don‘t report it anywhere, even if its for 2h+.

RajT88 · 2025-10-29T17:23:01 1761758581

Spam those Azure tickets. If you have a CSAM, build them a nice powerpoint telling the story of all your AFD issues (that's what they are there for).

> In 50%+ the cases they just don‘t report it anywhere, even if its for 2h+.

I assume you mean publicly. Are you getting the service health alerts?

tomashubelbauer · 2025-10-29T17:54:50 1761760490

CSAM apparently also means Customer Success Account Manager for those who might have gotten startled by this message like me.

ifwinterco · 2025-10-29T20:16:36 1761768996

Alternative für Deutschland was strange enough, when I saw CSAM I was really wondering what thread I had stumbled into

cyptus · 2025-10-30T06:19:11 1761805151

haha :D

linohh · 2025-10-29T17:57:59 1761760679

Thank you, not going to google that shit.

andrewinardeer · 2025-10-29T19:07:21 1761764841

"Apply to become a CSAM mentor"

psunavy03 · 2025-10-29T17:50:38 1761760238

Some really unfortunate acronyms flying around the Microsoft ecosystem . . .

RajT88 · 2025-10-29T17:52:33 1761760353

Quite so. The acronym collision rate is high.

riffic · 2025-10-29T18:32:27 1761762747

In general, plain language works so much better than throwing bowls of alphabet soup around.

RajT88 · 2025-10-29T18:51:37 1761763897

That's a funny criticism to make on a tech forum.

But, for future reference:

site:microsoft.com csam

Noumenon72 · 2025-10-29T23:14:34 1761779674

That's an even 5:5 split between both meanings.

nijave · 2025-10-29T22:56:34 1761778594

Back when we used Azure the only outcome was them trying to upsell us on Premium Support

RajT88 · 2025-10-30T14:05:00 1761833100

Do you recall the kind of premium support? Azure Rapid Response?

nijave · 2025-11-07T13:43:49 1762523029

I think we weren't paying for support and it was standard Business Support they were pitching. At the time we were having pretty fundamental problems with Azure Single Server Postgres which was really just a terribly engineered solution which they admitted had some nasty issues (there was some bug that would cause the storage IO threads to deadlock causing Postgres to crash)

cyptus · 2025-10-29T17:41:26 1761759686

in many cases: no service health alerts, no status page updates and no confirmations from the support team in tickets. still we can confirm these issues from different customers accross europe. Mostly the issues are regional dependent.

cyberax · 2025-10-29T17:58:31 1761760711

> CSAM

Child Sex-Abuse Material?!? Well, a nice case of acronym collision.

mirekrusin · 2025-10-29T18:26:50 1761762410

They should rename to Success Customer Account Manager.

tanseydavid · 2025-10-29T21:41:48 1761774108

>> They should rename to Success Customer Account Manager.

No -- the one referencing crime should NEVER have be turned into an acronym.

Crimes should not be described in euphemistic terms (which is exactly what the acronym is)

rightbyte · 2025-10-30T09:32:00 1761816720

You could argue "pornography" is the euphemism?

xp84 · 2025-10-29T19:59:10 1761767950

Most companies just call 'em CSMs

red-iron-pine · 2025-10-30T13:02:46 1761829366

but that makes them sound like Managers, which is not what they are -- glorified sales people, really.

actual Managers hate that

pndy · 2025-10-29T20:16:43 1761769003

Supervisor Customer Account Manager: a remote kind of job, paid occasionally with gift cards

mirekrusin · 2025-10-29T21:03:16 1761771796

...performed by cheap, open weight LLM.

RajT88 · 2025-10-29T18:52:54 1761763974

Definitely the most baffling acronym collision I have seen with Microsoft. I did one time count 4 different products abbreviated VSTS at one point.

dotancohen · 2025-10-29T23:14:05 1761779645

Didn't MS have three things called "link" at one time? They were all spelled differently, of course.

SAI_Peregrinus · 2025-10-29T18:12:02 1761761522

They must really depend on their government contracts with this administration…

codeduck · 2025-10-29T20:05:21 1761768321

Oh dear. Will make for an awkward thing to have on your resume.

senderista · 2025-10-29T21:17:00 1761772620

"CSAM Ninja"

zemariagp · 2025-10-30T05:21:51 1761801711

Wait till you hear about the Keen Kubernetes Knowledge iniciative

red-iron-pine · 2025-10-30T13:04:34 1761829474

North American Zigbee Initiative

alias_neo · 2025-10-30T09:49:21 1761817761

Where do these alerts supposedly come from? I started having issues around 4PM (GMT), couldn't access portal, and couldn't make AKV requests from the CLI, and initially asked our Ops guys but with no info and a vague "There may be issues with Portal" on their status page, that was me done for the day.

llama052 · 2025-10-29T17:25:17 1761758717

I got a service health alert an hour after it started, saying the portal was having issues. Pretty useless and misleading.

RajT88 · 2025-10-29T17:29:39 1761758979

That should go into the presentation you provide your CSAM with as well.

Storytelling is how issues get addressed. Help the CSAM tell the story to the higher ups.

nevf1 · 2025-10-29T18:58:38 1761764318

This is the single most frustrating thing about these incidents. As you're harmstrung on what you can do or how you can react until Microsoft officially acknowledges a problem. Took nearly 90mins both today and when it happened on 9th October.

cyptus · 2025-10-29T19:05:32 1761764732

so true. instead of getting a fast feedback we are wasting time searching for our own issues first.

hallh · 2025-10-29T17:28:42 1761758922

Same experience. We've recently migrated fully away from AFD due to how unreliable it is.

8cvor6j844qw_d6 · 2025-10-29T16:40:35 1761756035

I'll be interested in the incident writeup since DNS is mentioned. It will be interesting in a way if it is similar to what happened at AWS.

Insanity · 2025-10-29T16:56:07 1761756967

It's pretty unlikely. AWS published a public 'RCA' https://aws.amazon.com/message/101925/. A race condition in a DNS 'record allocator' causing all DNS records for DDB to be wiped out.

I'm simplifying a bit, but I don't think it's likely that Azure has a similar race condition wiping out DNS records on _one_ system than then propagates to all others. The similarity might just end at "it was DNS".

parliament32 · 2025-10-29T17:04:46 1761757486

That RCA was fun. A distributed system with members that don't know about each other, don't bother with leader elections, and basically all stomp all over each other updating the records. It "worked fine" until one of the members had slightly increased latency and everything cascade-failed down from there. I'm sure there was missing (internal) context but it did not sound like a well-architected system at all.

nijave · 2025-10-29T23:02:15 1761778935

>slightly increased latency

They didn't provide any details on latency. It could have been delayed an hour or a day and no one noticed

RajT88 · 2025-10-29T17:25:07 1761758707

Needs STONITH

kyrra · 2025-10-29T16:57:28 1761757048

https://isitdns.com/

cdr420 · 2025-10-29T17:05:17 1761757517

It's always DNS

tempest_ · 2025-10-29T18:15:30 1761761730

It is a coin flip, heads DNS, tails BGP

r_lee · 2025-10-29T19:09:35 1761764975

THIS is the real deal. Some say it's always DNS but many times it's some routing fuckup with BGP. two most cursed 3 letter acronym technologies out there

chasd00 · 2025-10-29T19:51:19 1761767479

when a service goes down it's DNS when an entire nation or group of nations vanish it's BGP.

layer8 · 2025-10-29T17:41:56 1761759716

DNS has both naming and cache invalidation, so no surprise it’s among the hardest things to get right. ;)

dotancohen · 2025-10-29T23:16:03 1761779763

That's three of the hardest problems in CS ))

jjp · 2025-10-29T17:14:43 1761758083

Whilst the status message acknowledge's the issue with Front Door (AFD), it seems as though the rest of the actions are about how to get Portal/internal services working without relying on AFD. For those of us using Front Door does that mean we're in for a long haul?

llama052 · 2025-10-29T17:28:53 1761758933

Please migrate off of front door. It's been a failure mode since it came out historically. Anything else is better at this point

everfrustrated · 2025-10-29T17:45:33 1761759933

Didn't the underlying vendor they used for Azure Front Door go bankrupt? It's probably on life support.

guptadagger · 2025-10-30T01:56:57 1761789417

i understood that to be a different third party that provided a CDN and was different than afd. https://learn.microsoft.com/en-us/azure/frontdoor/migrate-cd...

progmetaldev · 2025-10-29T22:00:38 1761775238

Currently even the Front Door landing page is only partially loading.

https://azure.microsoft.com/en-us/products/frontdoor

NDizzle · 2025-10-29T18:14:11 1761761651

They briefly had a statement about using Traffic Manager to work with your AFD to work around this issue, with a link to learn.microsoft.com/...traffic-manager, and the link didn't work. Due to the same issue affecting everyone right now.

They quickly updated the message to REMOVE the link. Comical at this point.

Aperocky · 2025-10-29T18:38:02 1761763082

The statement is still there though on the status page though

NDizzle · 2025-10-29T19:24:46 1761765886

They re-added it once the site was accessible.

jdc0589 · 2025-10-29T16:23:49 1761755029

yea its not just the portal. microsoft.com is down too

mystcb · 2025-10-29T16:28:07 1761755287

Yeah, I am guessing it's just a placeholder till they get more info. I thought I saw somewhere that internally within Microsoft it's seen as a "Sev 1" with "all hands on deck" - Annoyingly I can't remember where I saw it, so if someone spots it before I do, please credit that person :D

Edit: Typo!

verst · 2025-10-29T19:31:23 1761766283

It's a Sev 0 actually (as one would expect - this isn't a big secret). I was on the engineering bridge call earlier for a bit. The Azure service I work on was minimally impacted (our customer facing dashboard could not load, but APIs and data layer were not impacted) but we found a workaround.

chad_c · 2025-10-29T16:36:57 1761755817

It was here https://news.ycombinator.com/item?id=45749054 but that comment has been deleted.

PeterCorless · 2025-10-29T17:22:42 1761758562

Seems all Microsoft-related domains are impacted in some way.

• https://www.xbox.com/en-US also doesn't fully paint. Header comes up, but not the rest of the page.

• https://www.minecraft.net/en-us is extremely slow, but eventually came up.

bossyTeacher · 2025-10-29T17:54:59 1761760499

It sure must be embarrassing for the website of the second richest company in the world to be down.

daxfohl · 2025-10-29T16:33:45 1761755625

Downdetector says aws and gcp are down too. Might be in for a fun day.

rozenmd · 2025-10-29T17:01:11 1761757271

From what I can tell, Downdetector just tracks traffic to their pages without actually checking if the site is down.

The other day during the AWS outage they "reported" OVH down too.

jdc0589 · 2025-10-29T16:36:05 1761755765

yea I saw that, but im not sure on how accurate that is. a few large apps/companies I know to be 100% on AWS in us-east-1 are cranking along just fine.

linhns · 2025-10-29T18:12:17 1761761537

Not sure if this is true. I just login to the console with no glitch.

NetMageSCW · 2025-10-29T16:59:22 1761757162

AWS was performance issues and I believe is resolved.

planewave · 2025-10-29T17:15:50 1761758150

yes, and it seems that at least for some login.microsoftonline.com is down too, which is part of the Entra login / SSO flow.

jonathanlydall · 2025-10-29T17:20:05 1761758405

Yet another reason to move away from Front Door.

We already had to do it for large files served from Blob Storage since they would cap out at 2MB/s when not in cache of the nearest PoP. If you’ve ever experienced slow Windows Store or Xbox downloads it’s probably the same problem.

I had a support ticket open for months about this and in the end the agent said “this is to be expected and we don’t plan on doing anything about it”.

We’ve moved to Cloudflare and not only is the performance great, but it costs less.

Only thing I need to move off Front Door is a static website for our docs served from Blob Storage, this incident will make us do it sooner rather than later.

out_sider · 2025-10-29T17:26:37 1761758797

we are considering the same but because our website uses APEX domain we would need to move all DNS resolver to cloudfront right ? Does it have as a nice "rule set builder" as azure ?

jonathanlydall · 2025-10-29T17:52:44 1761760364

Unless you pay for CloudFlare’s Enterpise plan, you’re required to have them host your DNS zone, you can use a different registrar as long as you just point your NS records to Cloudflare.

Be aware that if you’re using Azure as your registrar, it’s (probably still) impossible to change your NS records to point to CloudFlare’s DNS server, at least it was for me about 6 months ago.

This also makes it impossible to transfer your domain to them either, as CloudFlare’s domain transfer flow requires you set your NS records to point to them before their interface shows a transfer option.

In our case we had to transfer to a different registrar, we used Namecheap.

However, transferring a domain from Azure was also a nightmare. Their UI doesn’t have any kind of transfer option, I eventually found an obscure document (not on their Learn website) which had an az command which would let you get a transfer code which I could give to Namecheap.

Then I had to wait over a week for the transfer timeout to occur because there is no way on Azure side that I could find to accept the transfer immediately.

I found CloudFlare’s way of building rules quite easy to use, different from Front Door but I’m not doing anything more complex than some redirects and reverse proxying.

I will say that Cloudflare’s UI is super fast, with Front Door I always found it painfully slow when trying to do any kind of configuration.

Cloudflare also doesn’t have the problem that Front Door has where it requires a manual process every 6 months or so to renew the APEX certificate.

out_sider · 2025-10-29T18:06:39 1761761199

Thanks :). We don't use Azure as our registrar. It seems I'll have to plan for this then, we also had another issue, AFD has a hard 500ms tls handshake timeout (doesn't matter how much you put on the origin timeout settings) which means if our server was slow for some reason we would get 504 origin timeout.

Figs · 2025-10-29T18:09:39 1761761379

CloudFlare != CloudFront

out_sider · 2025-10-29T18:24:16 1761762256

I meant cloudfare

nosefrog · 2025-10-29T20:11:37 1761768697

Front Door is not good.

eddie_catflap · 2025-10-29T17:27:09 1761758829

We saw issues before 16:00 UTC - approx 15:38

ThatManulTheCat · 2025-10-29T16:38:11 1761755891

DNS. Ofc.

rconti · 2025-10-29T17:55:51 1761760551

Sounds like they need to move their portal to a region with more capacity for the desired instance type. /s

mystcb · 2025-10-29T16:22:07 1761754927

Updated 16:35 UTC

Azure Portal Access Issues

Starting at approximately 16:00 UTC, we began experiencing DNS issues resulting in availability degradation of some services. Customers may experience issues accessing the Azure Portal. We have taken action that is expected to address the portal access issues here shortly. We are actively investigating the underlying issue and additional mitigation actions. More information will be provided within 60 minutes or sooner.

This message was last updated at 16:35 UTC on 29 October 2025

----

Azure Portal Access Issues

We are investigating an issue with the Azure Portal where customers may be experiencing issues accessing the portal. More information will be provided shortly.

This message was last updated at 16:18 UTC on 29 October 2025

-- From the Azure status page

mystcb · on Dec 3, 2024

I'd say, people that need it. Which could be the same for all the other models out there.

To create one model that is great at everything is probably a pipedream. Much like creating a multi-tool that can do everything- but can it? I wouldn't trust a multi-tool to take a wheel nut off a wheel, but I would find it useful if I suddenly needed a cross-head screw taken out of something.

But then I also have a specific crosshead screwdriver that is good at just taking out cross-head screws.

Use the right tool for the right reason. In this case, there maybe a legal reason why someone might need to use it. It might be that this version of a model can create something better that another model can't. It might be that for cost reasons you are within AWS, that it makes sense to use a model at the cheaper cost than say something else.

So yeah, I am sure it will be great for some people, and terrible for others... just the way things go!

dgfitz · on Dec 3, 2024

> I'd say, people that need it.

Nobody needs Reddit hallucinations about programming.

mystcb · on Sept 17, 2024

> Do you still have to quit and restart the entire application after you give the permission?

The funny thing about this, even on Sonoma - I could click the button to allow it, when it said "restart app" I closed the box (or clicked cancel), and it worked anyway. Specifically, I noticed it more on things like Teams/Zoom where I was doing a screen share, it just "worked" - no need to restart the entire application.

mystcb · on May 31, 2024

Refreshed before typing this because I realised someone might have beaten me to it! - But that's a big difference here - even though the service is gone, you got the refund and still a usable device as a controller out of it...

Spotify has taken something that could be used generically too, and just decided to brick it.

Insert something about product and consumers and how its all just some big money game or something somewhere :D

ActionHank · on May 31, 2024

Spotify should have the ewaste generated by this decision accounted for when considering any ESG benefits or grants.

mystcb · on April 3, 2024

Uniqlo has something similar to this - they use RFID's on their stock so you can dump it into a bucket at the self-checkout, and it scans everything immediately [1]. You still have someone pop over to check if you are OK, but it is a lot quicker than self-scan or usually waiting for someone.

[1] https://www.rfidcard.com/uniqlo-has-rolled-out-rfid-technolo...

baby · on April 3, 2024

Was going to mention Uniqlo as well! Their self-checkout is the future.

mystcb · on Dec 13, 2023

Oddly enough, I am in the UK - and I do have it, but it was already turned off when I went there. I wonder if things have changed, or there are some canary releases of the box... or am I just completely unaware my account isn't considered a UK-based account?

mystcb · on Dec 4, 2023

I did a few things in my younger days - I used to like playing MUDs, and one day a few of my college friends wanted to create our own. So we created a fairly unknown MUD called "Faereal" which still happens to be used as my domain name for my personal stuff!

I was lucky enough to have a good friend and neighbour down the road who ran ExNet [1], who provided me with space to host my first server, and oh boy looking back, I am surprised I didn't blow everything up! [2] - Windows 98 connected directly into the internet, with a fairly terrible firewall and some random remote control software I found!.

Eventually, though another MUD, we were donated a more up-to-date box, which ran Linux, and we hosted that MUD and the Faereal MUD for a while, eventually adding in my own DNS server, website hosting (PHP), and that is how I ended up hosting friends websites.

That turned into a hobby where I started to write my own PHP, started helping firstly helping out on a game called "PhaseOne" which was essentially a copy of a game we were all playing at the time called "Planetarion" [3] -- (OMG As I looked for this, its still running!). Part of this code I created a "Team based chat area", which eventually became the primary base for something that has taken over nearly 20 years of my life.

The code became the custom-written forum code behind DDR:UK, a Dance Dance Revolution fan website for the UK, which through the founders we created the "official" Sim Packs for DDR simulators such as DWI [4] and Stepmania [5]. This eventually moved into us working at events such as the London MCM ComicCon [6], where we bought in actual DDR arcade machines, including a Stepmania run DDR Machine that used to sit in the Namco Station in Central London on the South Bank. (I would love to say it was a world first, but there was one group in the US that had a temporary setup... I would like to hope we are the world first permanent money-making one :D)

That got me into running a Japanese Culture Festival called Tokonatsu [7] which got me into learning AWS. This festival has now been running for 20 years!

So all in all, how did this help:

* Interviews, it's a great story to tell, and I always get a lot of fun looks!

* Experience, from hardware, to networking, to early days of internet, software, hosting etc etc. I went thought a LOT of sleepless nights when I was younger sorting this out, gave me a whole bunch of experience that I would never would have had.

* Networking, still talk to a lot of people today, and these people are key for where I am.

Honestly, the owner of ExNet, I couldn't have done any of this, if he hadn't of started me on the right path.

EDIT: Totally forgot to explain where I am now! So with all this, through support tech, manager of of datacentres, through lead engineers, etc etc... I am now the AWS Practice Lead for my company, a Principle Consultant, and I am writing this in the airport on the way back from AWS Re:Invent 2023 :D

So yeah, that is my story! Hope someone does eventually read it :D

[1] https://www.exnet.com/

[2] https://static.colinbarker.me.uk/img/blog/2020/07/faereal-se...

[3] http://www.planetarion.com/

[4] http://dwi.ddruk.com

[5] https://www.stepmania.com/

[6] https://www.mcmcomiccon.com/global/en-us.html

[7] https://www.tokonatsu.org.uk

mystcb · on Nov 29, 2023

Is it this[1] that you were looking for at all?

[1] https://news.ycombinator.com/item?id=33945628

mystcb · on Nov 29, 2023

It's currently Re:Invent 2023 [1], where AWS usually "store up" announcements for the week. Meaning a load of product announcements are being released in quick succession. That would explain a higher than usual number of articles and links being put up.

[1] https://reinvent.awsevents.com/