Google says its AI-based bug hunter found 20 security vulnerabilities


Google’s AI-powered bug hunter has just reported its first batch of security vulnerabilities. 

Heather Adkins, Google’s vice president of security, announced Monday that its LLM-based vulnerability researcher Big Sleep found and reported 20 flaws in various popular open source software.

Adkins said that Big Sleep, which is developed by the company’s AI department DeepMind as well as its elite team of hackers Project Zero, reported its first-ever vulnerabilities, mostly in open source software such as audio and video library FFmpeg and image-editing suite ImageMagick. 

Given that the vulnerabilities are not fixed yet, we don’t have details of their impact or severity, as Google does not yet want to provide details, which is a standard policy when waiting for bugs to be fixed. But the simple fact that Big Sleep found these vulnerabilities is significant, as it shows these tools are starting to get real results, even if there was a human involved in this case. 

“To ensure high quality and actionable reports, we have a human expert in the loop before reporting, but each vulnerability was found and reproduced by the AI agent without human intervention,” Google’s spokesperson Kimberly Samra told TechCrunch. 

Royal Hansen, Google’s vice president of engineering, wrote on X that the findings demonstrate “a new frontier in automated vulnerability discovery.” 

LLM-powered tools that can look for and find vulnerabilities are already a reality. Other than Big Sleep, there’s RunSybil and XBOW, among others. 

Techcrunch event

San Francisco
|
October 27-29, 2025

XBOW has garnered headlines after it reached the top of one of the U.S. leaderboards at bug bounty platform HackerOne. It’s important to note that in most cases, these reports have a human at some point of the process to verify that the AI-powered bug hunter found a legitimate vulnerability, as is the case with Big Sleep.

Vlad Ionescu, co-founder and chief technology officer at RunSybil, a startup that develops AI-powered bug hunters, told TechCrunch that Big Sleep is a “legit” project, given that it has “good design, people behind it know what they’re doing, Project Zero has the bug finding experience and DeepMind has the firepower and tokens to throw at it.”

There is obviously a lot of promise with these tools, but also significant downsides. Several people who maintain different software projects have complained of bug reports that are actually hallucinations, with some calling them the bug bounty equivalent of AI slop. 

“That’s the problem people are running into, is we’re getting a lot of stuff that looks like gold, but it’s actually just crap,” Ionescu previously told TechCrunch.



Source link

Latest

The Witcher Season 4 will hit Netflix in October with its new Geralt

It's been quite the wait, but the fourth...

Samsung has to redesign Galaxy phones for the sake of Qi2

Samsung appears to be adopting Qi2 magnets in...

These are the richest people in Massachusetts on the Forbes list

Nine people from Massachusetts made the Forbes 400,...

Pilot union urges FAA to reject Rainmaker’s drone cloud-seeding plan

Rainmaker Technology’s bid to deploy cloud-seeding flares on...

Newsletter

Don't miss

The Witcher Season 4 will hit Netflix in October with its new Geralt

It's been quite the wait, but the fourth...

Samsung has to redesign Galaxy phones for the sake of Qi2

Samsung appears to be adopting Qi2 magnets in...

These are the richest people in Massachusetts on the Forbes list

Nine people from Massachusetts made the Forbes 400,...

Pilot union urges FAA to reject Rainmaker’s drone cloud-seeding plan

Rainmaker Technology’s bid to deploy cloud-seeding flares on...

Roblox hit with wrongful death lawsuit following a teen player’s suicide

Following her son's suicide, Becca Dallas filed a...

The Witcher Season 4 will hit Netflix in October with its new Geralt

It's been quite the wait, but the fourth season of The Witcher is almost here. In a teaser shared this weekend, Netflix finally...

Samsung has to redesign Galaxy phones for the sake of Qi2

Samsung appears to be adopting Qi2 magnets in the Galaxy S26 series, but there’s a key problem with that, and it’s going to...

These are the richest people in Massachusetts on the Forbes list

Nine people from Massachusetts made the Forbes 400, a ranking of the richest people in the United States for 2025. The richest 400...

LEAVE A REPLY

Please enter your comment!
Please enter your name here