Originally posted on Gigaom:
Whatever investments Twitter is making to improve the reliability of its system aren’t working, or at least not as well as they should be. The world’s favorite micro-blogging site blamed Thursday morning’s approximately two-hour outage on problems within its data centers (specifically, the parallel failure of its running system and its backup system), and it’s the second time in less than two months that Twitter’s infrastructure has brought the site down. Maybe it’s time for Twitter to talk openly about what it’s doing in there.
Don’t get me wrong: Twitter has been nothing if not generous in talking about the software it builds. The company has open-sourced numerous data-management tools and other pieces of code. It has occasionally (at least in 2009) been willing to share how it handles, stores, searches, and analyzes billions of data points relating to users and their tweets. But when it comes to the actual infrastructure on which this software runs?
Let’s just say Twitter is less than forthcoming. The explanation of today’s outage: