What really happened when Facebook shut down?

6

October

2021

No ratings yet.

Probably everyone noticed what happened to Facebook, Instagram, and Whatsapp this week. Because of a human error, the platforms were no longer reachable and caused millions of people to open their favourite app every half an hour to check if they could connect with their friends again. 

Now, what causes something like this? What happened when Facebook ‘was down’. I’ll try to shed a little bit of light on this topic and share some implications.

The internet is basically a network of networks. Those networks can be an internet service provider (ISP), a university system of computers, a government agency’s computer system, or something that contains information within a ‘network’, called an Autonomous System (AS). Now all of those networks need to be connected to access a file stored somewhere else in the network. You can get there using what you call a Borderline Gate Protocol (BGP); it figures out which route through the networks you should take to access the file you want. It does this by listening to the ‘Prefix announcement’. These announcements by an AS tell what IP Addresses (i.e. Facebook) they can deliver traffic to. The AS either hosts this IP Address itself or is connected to an AS that does.

Schematic view of the internet

All AS’es are shouting out the IP addresses they can deliver traffic to. The BGP picks up this signal and finds the best way to get there, hopping from AS to AS until it arrives at the IP address containing the correct data. What happened to Facebook was that someone updated the company’s BGP records, which took away ‘the map’ that made Facebook visible in the network. Whenever someone tried to connect to Facebook, no AS could tell where to go. You can compare this with your friends sharing their live location so you can go to their current location, but suddenly, you can’t see them on the map. It doesn’t mean they have evaporated or disappeared from the earth, but you don’t know where they are.

The outage has huge implications

That you can’t share a meme on Facebook for a few hours probably won’t do much harm, not liking a cat video neither. But communication channels like Whatsapp are commonly used for communication on the work floor (or as student for a project), and is increasingly used for connecting with businesses. Imagine an outage of Google’s DNS server, which millions of people rely on. That will not only affect consumers, but would disrupt commerce and critical infrastructure, factory production, fleet transport, basically any industry you can think of.

The Facebook outage has drawn attention to the vulnerability of the world to failures of this nature. The internet is a complex system of systems in which much can go wrong and following Murphy’s law “anything that can go wrong, will go wrong”. It also leaves room for criminals to highjack a BGP and redirect traffic to a malicious website; with 80,000 autonomous systems, it is not surprising that some would be untrustworthy.

Hopefully we can continue to share our social lives online without any inferences anytime soon, but when it happens, you know why and know the consequences.

Sources:

https://www.researchgate.net/figure/Internet-is-a-network-of-networks-interconnected-by-various-business-models_fig3_304797449

https://www.wired.com/story/why-facebook-instagram-whatsapp-went-down-outage/

https://krebsonsecurity.com/2021/10/what-happened-to-facebook-instagram-whatsapp/

Please rate this

1 thought on “What really happened when Facebook shut down?”

  1. Hi Julius,

    Thank you for this interesting article. Especially with the outage being so recent and since this topic is heavily discussed. I already saw on the news that this outage was caused by a human error, but did not see an explanation of how this error was actually caused. It is nice to read about the actual underlying system and problem in your article, it certainly gives a better understanding of how the outage actually occurred.

    What is very shocking to me, is that one single person has the ability to have such a huge impact on so many people by a single error. This really stimulated me to think about how big of an influence some singular platforms have on our daily life and how just a bunch of people are responsible for some platforms that are embedded in our lives. As you shared in the implications part of your blog, not only individuals, but also businesses and industries that heavily rely on these platforms are more vulnerable than we are aware of, since we know a single human can either create an error in the system or use the system maliciously.

    Though I thought the outage was quite nice, it did shed light upon how much I use the platforms owned by a single company maintained by a small number of people. And even though it did shed light upon this, if I am being honest, I don’t think it will change my behavior in using these apps in any shape or form….

Leave a Reply

Your email address will not be published. Required fields are marked *