Hacker News

by stellioskon 9/29/2022, 1:15 PMwith 26 comments

by sebslomskion 9/29/2022, 1:31 PM

Interesting.

* Atlassian: We estimate the rebuilding effort to last for up to 2 more weeks: https://news.ycombinator.com/item?id=30990697

* Inside the longest Atlassian outage: https://news.ycombinator.com/item?id=31015813

* Atlassian products have been down for 4 days https://news.ycombinator.com/item?id=30973808

* Post-incident review on the Atlassian April 2022 outage https://news.ycombinator.com/item?id=31210469

by xorciston 9/29/2022, 1:38 PM

An availability of 99.9999% means a maximum of 31 seconds unavailable per year. The usual "five nines" is 5 minutes, and that's a tough target for anyone.

Given that their outage was from April 4 to April 19 this year, they should reach their target availability on average at the earliest in the year 45222. If they keep perfect uptime in the meantime, that is.

by warenton 9/29/2022, 1:18 PM

lol. They just had a multiple week outage this year. No, they cannot claim this level of availability until around May 2023. This is marketing nonsense trying to cover their massive April mistake.

by mkl95on 9/29/2022, 2:53 PM

> Atlassian Engineering recently published how it exceeded 99.9999% of availability with its Tenant Context Service (TCS).

What a misleading and cynical headline. Literally all Atlassian products I work with have some unexpected downtime every now and then.

by posneton 9/29/2022, 1:30 PM

The title is very misleading, it is just one of their micro-services that has that uptime.

by CyanLite2on 9/29/2022, 1:36 PM

Misleading Title.

Should be: "Besides that Mrs. Lincoln, how was the play?"

by kayodelycaonon 9/29/2022, 1:30 PM

Exactly which part of their system has 6 9s? It certainly hasn’t been Jira.

by grnmambaon 9/29/2022, 1:31 PM

This is the worst attempt at corporate propaganda I've seen in a while.

https://www.atlassian.com/engineering/post-incident-review-a...

by dangon 9/29/2022, 3:02 PM

Url changed from https://www.infoq.com/news/2022/09/atlassian-high-availabili..., which points to this.

by 0xbadcafebeeon 9/29/2022, 2:49 PM

Atlassian's status pages have had "active incidents" for the last two days straight: https://status.atlassian.com/

Six nines of availability means no more than 30 seconds downtime per year.

Maybe the fault tolerance of one system isn't such a big deal if you depend on 30 other systems?

by hericiumon 9/29/2022, 1:31 PM

Didn't Atlassian irreversibly lost Confluence data of some of their clients this year after weeks-long outage?

by fiparon 9/29/2022, 3:12 PM

I think this is relevant regarding the very misleading availability percentage in the title: https://rachelbythebay.com/w/2019/07/15/giant/

by atulvion 9/29/2022, 1:31 PM

Is JIRA not included in this calculation? They were down many times last year.

by jayanmnon 9/29/2022, 3:06 PM

>achieved this high availability by implementing highly-autonomous client sidecars, able to proactively shield themselves from complete AWS region failures.

complete region fail? How often does that happen?

by rwbhnon 9/29/2022, 5:45 PM

Actual title: Here’s how one of Atlassian’s critical services consistently gets above 99.9999% of availability

by jtthe13on 9/29/2022, 6:50 PM

Escaping confluence and transitioning to a competing service was the highlight of my summer.

Atlassian Exceeds 99.9999% of Availability Using Sidecars, Fault-Tolerant Design