Onepane's Post



Earlier this month, on March 11, Atlassian customers using Bitbucket Cloud faced degradation or downtime of the website and APIs. The outage lasted more than an hour. To be clear, even a few minutes of downtime can cost a large company like Atlassian millions of dollars. Identifying the exact root cause matters for every software incident or outage. This one was caused by a bug in the version of Amazon Aurora in use. To address it, they have implemented a temporary fix that triggers a vacuum process when discrepancies in the visibility map are detected. They are also fine-tuning the autovacuum settings as a long-term solution. For the duration of the incident, latency increased when accessing the bitbucket.org website and APIs, and Git requests over SSH and HTTPS were affected. We learn when we fail; this applies not only in life but also from a technical standpoint. For more on incidents and outages concerning Atlassian products, and for ideas on how to resolve incidents promptly, check this out: https://lnkd.in/giayJXbE #sre #incidentmanagement
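To make the temporary fix a little more concrete, here is a minimal sketch of the "vacuum when visibility-map discrepancies are detected" idea. It is not Atlassian's implementation. The function name `plan_vacuums`, the `threshold` parameter, and the input format (table name mapped to a count of discrepancies, gathered by whatever check is available) are all assumptions made for illustration.

```python
import re

# Assumption: table names are plain or schema-qualified identifiers.
# Anything else is rejected rather than interpolated into SQL.
_IDENT = re.compile(r"^[A-Za-z_][A-Za-z0-9_]*(\.[A-Za-z_][A-Za-z0-9_]*)?$")


def plan_vacuums(discrepancies, threshold=0):
    """Return VACUUM statements for tables with too many discrepancies.

    `discrepancies` is a hypothetical mapping of table name to the number
    of visibility-map discrepancies found for that table. How that number
    is collected is outside the scope of this sketch.
    """
    statements = []
    for table, count in sorted(discrepancies.items()):
        if not _IDENT.match(table):
            raise ValueError(f"unsafe table name: {table!r}")
        if count > threshold:
            # A plain VACUUM rebuilds the visibility map for the table.
            statements.append(f"VACUUM {table};")
    return statements


if __name__ == "__main__":
    # Example with made-up table names and counts.
    for stmt in plan_vacuums({"repositories": 3, "users": 0}):
        print(stmt)
```

The sketch only plans the statements; running them against a database, and deciding how often to run the check, is left to whatever scheduler and driver a team already uses.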
