infosec.place

Frederik Braun

What Mythos access got us. Now public. https://blog.mozilla.org/en/privacy-security/ai-security-zero-day-vulnerabilities/

Frederik Braun

freddy@security.plumbing

3 months ago

Reply to @freddy@security.plumbing

Holy patch notes, Batman! Firefox has obliterated over 200 sinister security bugs in this release alone - the most villainous vulnerabilities ever squashed in Firefox history. https://www.mozilla.org/en-US/security/advisories/mfsa2026-30/

Mae

Mae@is.badat.dev

3 months ago

Reply to @freddy@security.plumbing

@freddy I'm confused: the blog post mentions over 200 security bugs, but only 3 bugs are attributed to Anthropic

Brad Macpherson

brad@1040ste.net

3 months ago

Reply to @Mae@is.badat.dev

@Mae @freddy Almost like it's yet more bullshit from the bubble boosters.

Mae

Mae@is.badat.dev

3 months ago

Reply to @brad@1040ste.net

Edited 3 months ago

@brad @freddy they're non-public, but if you look at the references, multiple memory safety bugs fixed in this release are all grouped under the last 3 CVEs

Brad Macpherson

brad@1040ste.net

3 months ago

Reply to @Mae@is.badat.dev

@Mae @freddy Right, but not the ones attributed to "AI", am I reading that right?

buherator

3 months ago

Reply to @freddy@security.plumbing

@freddy Can you tell us about token costs?

Frederik Braun

freddy@security.plumbing

3 months ago

Reply to @buherator

@buherator no :)

Frederik Braun

freddy@security.plumbing

3 months ago

Reply to @Mae@is.badat.dev

Edited 3 months ago

@Mae @brad What Mae says is correct :). The ones found by Anthropic listed at the top were their work and reported to us. The ones at the bottom are the internal findings (some with use of their model, some manual, some with fuzzing)

Brad Macpherson

brad@1040ste.net

3 months ago

Reply to @freddy@security.plumbing

@freddy @Mae So, not 271 vulnerabilities found using the LLM, then? Is it - 3? More?

One could be excused for reading this paragraph as "271 vulnerabilities were identified and fixed simply by running the LLM-based tool over the code".

'As part of our continued collaboration with Anthropic, we had the opportunity to apply an early version of Claude Mythos Preview to Firefox. This week’s release of Firefox 150 includes fixes for 271 vulnerabilities identified during this initial evaluation.'

Frederik Braun

freddy@security.plumbing

3 months ago

Reply to @brad@1040ste.net

@brad @Mae Nah, you got it wrong, still.

There is no point in issuing 271 CVE identifiers when people have to update Firefox (or not). It's not like users have a choice which fixes to apply.

The LLM found way more than 271 vulns. We fixed the first 271 in Firefox 150. We lumped them into a somewhat arbitrary number of CVEs because we do not think our time is best spent writing advisory texts (or Mastodon posts for that matter ;)). More bug fixes will come. And then some more.

Jeffrey Yasskin

jyasskin@hachyderm.io

3 months ago

Reply to @freddy@security.plumbing

@freddy Did you learn any lessons that would apply at the specification level? Totally reasonable if these were all implementation bugs, but I'd love to learn about mistakes we're making at other levels too.

Frederik Braun

freddy@security.plumbing

3 months ago

Reply to @jyasskin@hachyderm.io

@jyasskin https://github.com/tc39/proposal-thenable-curtailment comes to mind :D

Brad Macpherson

brad@1040ste.net

3 months ago

Reply to @freddy@security.plumbing

@freddy @Mae The post implies that all the vulnerabilities fixed - regardless of whether you're counting CVEs, bug reports, or "trust me bro"s - were identified using the LLM. Which does not appear to be the case.

Curious that you'd leave "way more" vulnerabilities unpatched, too. Not really vulnerabilities? Or the patching has to be done by people, and the LLM is being used as a static analyser? That then raises the question of what code scanning was being done previously.

floyd aka floyd_ch

floyd@chaos.social

2 months ago

Reply to @freddy@security.plumbing

@freddy do you have any info on how the number of CVEs compares to the era of coverage-guided fuzzing (AFL and its successors)?

Frederik Braun

freddy@security.plumbing

2 months ago

Reply to @floyd@chaos.social

@floyd you mean amount over time? It’s hard to compare because (I think) we got better and were faster at putting scalable automation behind this than when fuzzing, AFL & ASan happened. Probably easier to do statistics when you look at a year or multiple years.

If you were to do bugs per week or month, this would beat all historic records

floyd aka floyd_ch

floyd@chaos.social

2 months ago

Reply to @freddy@security.plumbing

@freddy OK, good to know; yeah, per year would probably work. So it got easier to automate. I don't feel that way using other AI, without Mythos.