CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (2024)

CrowdStrike would be feeling "very embarrassed" after issuing its Root Cause Analysis (RCA) of the faulty software update that led to potentially the largest global IT outage in history, experts say.

It came down to a mistake first-year programming students are taught how to avoid.

On July 19, the fateful Blue Screen of Death (BSOD) Friday, about 8.5 million Windows systems around the world went into meltdown when an update for CrowdStrike's Falcon sensor product went very wrong.

The US cybersecurity company released a preliminary report days after the incident.

Now a more in-depth, 12-page analysis has confirmed the root of the cause — one single undetected sensor.

Falcon's privileged access

CrowdStrike offers ransomware, malware and internet security products almost exclusively to businesses and large organisations.

The widespread outage has been linked to its Falcon sensor software, which is installed to look for threats and help lock them down.

Sigi Goode, a professor of information systems at the Australian National University, said Falcon had very privileged access.

It sits at what is called the kernel level of Windows.

"It's sitting as close to the engine that powers the operating system as possible," Professor Goode said.

"Kernel mode is constantly watching what you're doing and listening to requests from the applications you're using, and servicing them in a way that appears seamless to you."

He described kernel mode as the traffic police that Falcon sits alongside, saying, "I don't like to look of that vehicle, we should take a look at it".

CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (1)

The sensor 21 culprit

CrowdStrike is constantly updating Falcon.

On July 19, the company sent out a Rapid Response Content update to certain Windows hosts.

In the RCA, CrowdStrike called it the "Channel 291 Incident", in which a new capability was introduced into Falcon's sensors.

Sensors are like "a pathway for evidence," that tell it what sort of suspicious activity to look for, Professor Goode said.

"Falcon is looking at a range of sensors — a range of indicators — to see if something is wrong," he said.

CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (2)

When updates are sent, it changes the location or the number of sensors to check for a potential attack.

In this instance, Falcon expected the update to have 20 input fields, but it had 21 input fields.

This "count mismatch" is what caused the global crash, CrowdStrike said.

"The Content Interpreter expected only 20 values," the RCA report states.

"Therefore, the attempt to access the 21st value produced an out-of-bounds memory read beyond the end of the input data array and resulted in a system crash."

Because Falcon is so tightly integrated into the core of Windows, when it crashed it bought down the entire system causing the BSOD.

CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (3)

Professor Goode said some of the most common ways to compromise a system were to flood memory.

Essentially, you tell the computer to look for something "out of bounds".

"It was looking for something that wasn't there," he said.

"But Falcon had to look in that 21st location, because that's what it was told to do by the new template it was given."

How can this happen?

CrowdStrike has apologised for the failure which has led to its CEO, George Kurtz, being called to testify before the US Congress to explain what happened.

"We are using the lessons learned from this incident to better serve our customers," Mr Kurtz said in a statement this week.

"To this end, we have already taken decisive steps to help prevent this situation from repeating, and to help ensure that we — and you — become even more resilient."

CrowdStrike's quality assurance (QA) processes have come into question.

The company has said that its updates "go through an extensive QA process, which includes automated testing, manual testing, validation and rollout steps".

But Rapid Response Content, which was used in this instance, goes through a different process.

CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (4)

In the report, CrowdStrike admits that "lack of a specific test for non-wildcard matching criteria in the 21st field" contributed to "the confluence of these issues that resulted in a system crash".

Toby Murray, associate professor at the University of Melbourne's School of Computing and Information Systems, said the "dodgy data file update" was "embarrassing".

He said even basic checks by a human developer would have found the problem.

"That is an incredibly basic and fundamental mismatch that was always going to lead to catastrophic problems, sooner or later," he told the ABC.

"The fact that the CrowdStrike developers were able to have this obvious inconsistency between the data file format and the software code means that the most basic forms of quality review and assurance were not being correctly carried out."

CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (5)

Professor Goode said this kind of mistake shouldn't be happening.

He said the update should have been released through a staged deployment.

"When they wrote this report, they must have been feeling very embarrassed," he said.

"First-year programming students are taught about the 'stack', the series of instructions that need to be executed in a CPU (central processing unit)."

CrowdStrike announced it had engaged with two independent software security vendors to conduct further review of the Falcon sensor code for both security and quality assurance.

Calls for accountability

In the wake of the outage, regulators and businesses have been considering legal implications.

The incident sent airports into chaos, supermarket check-outs stopped working, and media outlets struggled to bring you the news.

In Australia alone, the impact on businesses has been estimated at more than $1 billion.

Australian Industry Group CEO Innes Willox told ABC's The Business he expected the damage bill from the glitch to run into the billions of dollars.

But he said it was still unclear whether affected businesses would be able to seek compensation from CrowdStrike for any losses incurred from the outages.

America's Delta Airlines last week said the outage had cost the company $US500 million ($760 million) and that it planned to take legal action to get compensation from the cybersecurity firm.

CrowdStrike has rejected the claim, saying in a letter from an external lawyer that it is "highly disappointed by Delta's suggestion that CrowdStrike acted inappropriately and strongly rejects any allegation that it was grossly negligent or committed misconduct".

Delta cancelled more than 6,000 flights over a six-day period, impacting more than 500,000 passengers.

It faces a US Transportation Department investigation into why it took so much longer for it to recover from the outage than other airlines.

CrowdStrike reveals root cause of global outage, and it's a mistake first-year programming students learn about (2024)

References

Top Articles
Mrs. Poindexter's Net Worth - Zophra
The Power of Pause – how leaders can transform productivity through rest - InfluencerWorldDaily.com
2016 Hyundai Sonata Refrigerant Capacity
Blackstone Launchpad Ucf
Helicopter Over Massapequa Now
El Paso Craigs
Tiffany's Breakfast Portage
The Clapping Song Lyrics by Belle Stars
Restored Republic June 6 2023
Void Client Vrchat
Creepshot. Org
Unveiling The Voice Behind Maui: The Actor Behind The Demigod
Pooch Parlor Covington Tn
Orange Craigslist Free Stuff
Peanut Oil Can Be Part Of A Healthy Diet — But Only If It's Used This Way
Varsity Competition Results 2022
Premier Auto Works-- The House Of Cash Car Deals
Masdar | Masdar’s Youth 4 Sustainability Announces COP28 Program to Empower Next Generation of Climate Leaders
Ttw Cut Content
How to track your Amazon order on your phone or desktop
Saltburn | Rotten Tomatoes
Long-awaited Ringu sequel Sadako doesn’t click with the 21st century
A Flame Extinguished Wow Bugged
Juego Friv Poki
Orlando Magic Account Manager
Contenidos del nivel A2
Mta Bus Forums
80 Maiden Lane Ny Ny 10038 Directions
Emerge Ortho Kronos
Axolotls for Sale - 10 Online Stores You Can Buy an Axolotl - Axolotl Nerd
Numerous people shot in Kentucky near Interstate 75, officials say | CNN
Winvic First UK Contractor to Use Innovative Technology that Operates Tower Cranes from the Ground
Kahoot Spamming Bots
Point After Salon
How Old Am I 1981
Age Gabriela Moura's Evolution from Childhood Dreams to TikTok Fame - Essential Tribune
Provo Craigslist
Gold Bowl Vidalia La Menu
How to get tink dissipator coil? - Dish De
Kayak Parts Amazon
Hatcher Funeral Home Aiken Sc
Windows 10 Defender Dateien und Ordner per Rechtsklick prüfen
Puppies For Sale in Netherlands (98) | Petzlover
Fineassarri
What Is Opm1 Treas 310 Deposit
02488 - Uitvaartcentrum Texel
Shaws Myaci
Roman Numerals Chart, Translation Tips & History
Sparkle Nails Phillipsburg
Saratoga Otb Results
LP Vinyl Samling pop rock thrash metal trance
Pollen Count Butler Pa
Latest Posts
Article information

Author: Chrissy Homenick

Last Updated:

Views: 5769

Rating: 4.3 / 5 (54 voted)

Reviews: 93% of readers found this page helpful

Author information

Name: Chrissy Homenick

Birthday: 2001-10-22

Address: 611 Kuhn Oval, Feltonbury, NY 02783-3818

Phone: +96619177651654

Job: Mining Representative

Hobby: amateur radio, Sculling, Knife making, Gardening, Watching movies, Gunsmithing, Video gaming

Introduction: My name is Chrissy Homenick, I am a tender, funny, determined, tender, glorious, fancy, enthusiastic person who loves writing and wants to share my knowledge and understanding with you.