🤖 GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
"Recent advancements in Large Language Models (LLMs) have sparked interest in their formal reasoning capabilities, particularly in mathematics. The GSM8K benchmark is widely used to assess the mathematical reasoning of models on grade-school-level questions. While the performance of LLMs on GSM8K has significantly improved in recent years, i…"
Latest update on the DDoS attack from @brewsterkahle (Oct 11 @ 10:22am PT):
"The data is safe.
Services are offline as we examine and strengthen them. Sorry, but needed. @internetarchive staff is working hard.
Estimated Timeline: days, not weeks.
Thank you for the offers of pizza (we are set)."
Very kind of the 0-day to hit right at the start of a workday, TBH
https://blog.mozilla.org/security/2024/10/11/behind-the-scenes-fixing-an-in-the-wild-firefox-exploit/
Light on details, but there are some.
HyperDbg v0.10.2 is released!
This release comes with lots of bugfixes and improved stability. Check it out here:
https://github.com/HyperDbg/HyperDbg/releases/tag/v0.10.2
@futurebird if you want to read Vinge's "A Fire Upon The Deep" along with the author's notes, I've converted the 1993 Hugo and Nebula anthology CD-ROM into a website: https://deepness.trmm.net/
(Not "A Deepness in the Sky", as I originally wrote. Those responsible have been sacked, etc.)