infosec.place

buherator

@buherator

Posts

2620

Following

670

Followers

1497

"I'm interested in all kinds of astronomy."

Personal

https://scrapco.de

GitHub

https://github.com/v-p-b

Nudes

https://infosex.exchange

buherator

repeated

David Chisnall (Now with 50% more sarcasm!)

david_chisnall@infosec.exchange

10 months ago

I finally turned off GitHub Copilot yesterday. I’ve been using it for about a year on the ‘free for open-source maintainers’ tier. I was skeptical but didn’t want to dismiss it without a fair trial.

It has cost me more time than it has saved. It lets me type faster, which has been useful when writing tests where I’m testing a variety of permutations of an API to check error handling for all of the conditions.

I can recall three places where it has introduced bugs that took me more time to to debug than the total time saving:

The first was something that initially impressed me. I pasted the prose description of how to communicate with an Ethernet MAC into a comment and then wrote some method prototypes. It autocompleted the bodies. All very plausible looking. Only it managed to flip a bit in the MDIO read and write register commands. MDIO is basically a multiplexing system. You have two device registers exposed, one sets the command (read or write a specific internal register) and the other is the value. It got the read and write the wrong way around, so when I thought I was writing a value, I was actually reading. When I thought I was reading, I was actually seeing the value in the last register I thought I had written. It took two of us over a day to debug this. The fix was simple, but the bug was in the middle of correct-looking code. If I’d manually transcribed the command from the data sheet, I would not have got this wrong because I’d have triple checked it.

Another case it had inverted the condition in an if statement inside an error-handling path. The error handling was a rare case and was asymmetric. Hitting the if case when you wanted the else case was okay but the converse was not. Lots of debugging. I learned from this to read the generated code more carefully, but that increased cognitive load and eliminated most of the benefit. Typing code is not the bottleneck and if I have to think about what I want and then read carefully to check it really is what I want, I am slower.

Most recently, I was writing a simple binary search and insertion-deletion operations for a sorted array. I assumed that this was something that had hundreds of examples in the training data and so would be fine. It had all sorts of corner-case bugs. I eventually gave up fixing them and rewrote the code from scratch.

Last week I did some work on a remote machine where I hadn’t set up Copilot and I felt much more productive. Autocomplete was either correct or not present, so I was spending more time thinking about what to write. I don’t entirely trust this kind of subjective judgement, but it was a data point. Around the same time I wrote some code without clangd set up and that really hurt. It turns out I really rely on AST-aware completion to explore APIs. I had to look up more things in the documentation. Copilot was never good for this because it would just bullshit APIs, so something showing up in autocomplete didn’t mean it was real. This would be improved by using a feedback system to require autocomplete outputs to type check, but then they would take much longer to create (probably at least a 10x increase in LLM compute time) and wouldn’t complete fragments, so I don’t see a good path to being able to do this without tight coupling to the LSP server and possibly not even then.

Yesterday I was writing bits of the CHERIoT Programmers’ Guide and it kept autocompleting text in a different writing style, some of which was obviously plagiarised (when I’m describing precisely how to implement a specific, and not very common, lock type with a futex and the autocomplete is a paragraph of text with a lot of detail, I’m confident you don’t have more than one or two examples of that in the training set). It was distracting and annoying. I wrote much faster after turning it off.

So, after giving it a fair try, I have concluded that it is both a net decrease in productivity and probably an increase in legal liability.

Discussions I am not interested in having:

You are holding it wrong. Using Copilot with this magic config setting / prompt tweak makes it better. At its absolute best, it was a small productivity increase, if it needs more effort to use, that will be offset.
This other LLM is much better. I don’t care. The costs of the bullshitting far outweighed the benefits when it worked, to be better it would have to not bullshit, and that’s not something LLMs can do.
It’s great for boilerplate! No. APIs that require every user to write the same code are broken. Fix them, don’t fill the world with more code using them that will need fixing when the APIs change.
Don’t use LLMs for autocomplete, use them for dialogues about the code. Tried that. It’s worse than a rubber duck, which at least knows to stay silent when it doesn’t know what it’s talking about.

The one place Copilot was vaguely useful was hinting at missing abstractions (if it can autocomplete big chunks then my APIs required too much boilerplate and needed better abstractions). The place I thought it might be useful was spotting inconsistent API names and parameter orders but it was actually very bad at this (presumably because of the way it tokenises identifiers?). With a load of examples with consistent names, it would suggest things that didn't match the convention. After using three APIs that all passed the same parameters in the same order, it would suggest flipping the order for the fourth.

#GitHubCopilot #CHERIoT

buherator

repeated

clearbluejar

clearbluejar@infosec.exchange

10 months ago

Exciting! My talk recording just dropped from #OBTS v7! 🗣️✨ Learn how to patch diff on Apple with #Ghidra, #ghidriff, and #ipsw: "Patch Different on *OS": https://www.youtube.com/watch?v=Ellb76t7nrc

buherator

repeated

Alexander Popov

a13xp0p0v@infosec.exchange

10 months ago

Slides for my talk at H2HC 2024:

🤿 Diving into Linux kernel security 🤿

I described how to learn this complex area and knowingly configure the security parameters of your Linux-based system.

And I showed my open-source tools for that purpose!

https://a13xp0p0v.github.io/img/Alexander_Popov-H2HC-2024.pdf

buherator

10 months ago

Back in the day I reverse engineered Oracle Forms network protocol and published a bunch of writeups and tools about it:

https://github.com/silentsignal/oracle_forms/

I've always thought Forms is a niche in enterprise IT that's slowly dying out (for good), until I saw this video about our local nuclear power plant o.O

https://youtu.be/xsOAjgFLImg?si=_FJsd7EoEC1J3gim&t=4660

buherator

repeated

Charlie Stross

cstross@wandering.shop

10 months ago

WIRED article forecasting the generative AI bubble will burst in 2025. This is more optimistic than my own expectations, but if WIRED are printing it, it's the direction sentiment in Silicon Valley is running in.

(Hint: there's gold in AI, but it's in *analytical* AI, aka big data, not stochastic parrot bullshit.)

https://www.wired.com/story/generative-ai-will-need-to-prove-its-usefulness/

buherator

10 months ago

Reply to @eniko@peoplemaking.games

@eniko idk the history but IMO it'd make sanse to support the primary arch first (providing a memory safe option for devs too!) then use the abstraction of the bytecode as needed. E.g. a quick search shows that .NET 4 was available for Itanium too.

buherator

repeated

stacksmashing

stacksmashing@infosec.exchange

10 months ago

Call for SPI flashes at #38C3

I'm developing some SPI-flash tools and want to try a variety of devices and flash chips for testing.

Got devices where it's tricky to dump in-system or rare flash chips? I'd love to test them at #38c3 if you can bring them!

buherator

10 months ago

Reply to @eniko@peoplemaking.games

@eniko wasn't it for Windows Phone dev too?

buherator

10 months ago

Reply to @LukaszOlejnik@mastodon.social

@LukaszOlejnik good that our crusade against cookies is going great so far /s

buherator

repeated

Lukasz Olejnik

LukaszOlejnik@mastodon.social

10 months ago

My strategic privacy analysis. Is Google undoing a decade of progress on privacy? Their new policy allows invasive device fingerprinting for tracking user activity. Here’s my deep dive into what this means for privacy—and the future of AI. https://blog.lukaszolejnik.com/biggest-privacy-erosion-in-10-years-on-googles-policy-change-towards-fingerprinting/

buherator

repeated

luna aria ielenia

ielenia@ck.catwithaclari.net

10 months ago

buherator

repeated

GEBIRGE

GEBIRGE@infosec.exchange

10 months ago

My first article for @mogwailabs_gmbh just released. Thanks to @h0ng10 for making it happen. 🥳

https://mogwailabs.de/en/blog/2024/12/jndi-mind-tricks/

#jndi #java #deserialization

buherator

repeated

Taggart

mttaggart@infosec.exchange

10 months ago

Seems like a mitigation for a Tomcat TOCTOU vuln was incomplete.

(H/t) @AAKL

https://seclists.org/oss-sec/2024/q4/164

buherator

repeated

Taggart

mttaggart@infosec.exchange

10 months ago

Does Tidal compensate artists fairly? I'm ready to ditch Spotify, but I'd like to do it the right way.

buherator

repeated

screaminggoat

screaminggoat@infosec.exchange

10 months ago

Sophos security advisory 19 December 2024: Resolved Multiple Vulnerabilities in Sophos Firewall (CVE-2024-12727, CVE-2024-12728, CVE-2024-12729)

CVE-2024-12727 (9.8 critical) pre-auth SQL injection vulnerability in the email protection feature of Sophos Firewall
CVE-2024-12728 (9.8 critical) weak credentials vulnerability potentially allows privileged system access via SSH to Sophos Firewall
CVE-2024-12729 (8.8 high) post-auth code injection vulnerability in the User Portal allows authenticated users to execute code remotely in Sophos Firewall

Sophos has not observed these vulnerabilities to be exploited at this time.

#sophos #firewall #vulnerability #cve #infosec #cybersecurity

buherator

10 months ago

Reply to

@cR0w I'm not subscribed to that and it's too end-of-Q4 Friday to look them up...

buherator

10 months ago

Security Bulletin: #IBMi is vulnerable to bypassing Navigator for i interface restrictions and a server-side request forgery [CVE-2024-51463, CVE-2024-51464]

https://www.ibm.com/support/pages/node/7179509

buherator

repeated

HalvarFlake

HalvarFlake@mastodon.social

10 months ago

Somebody tell Elon: "Never go full retard."

buherator

repeated

Ars Technica

arstechnica@mastodon.social

10 months ago

Why AI language models choke on too much text
Compute costs scale with the square of the input size. That's not great.
https://arstechnica.com/ai/2024/12/why-ai-language-models-choke-on-too-much-text/?utm_brand=arstechnica&utm_social-type=owned&utm_source=mastodon&utm_medium=social

buherator

repeated

Aral Balkan

aral@mastodon.ar.al

10 months ago

Heads up: Folks on #Codeberg

You might get an email belittling your project, seemingly from Michael Bell (mikedesu) via noreply@codeberg.org (an issue is created on your repo and then deleted, leading to the notification).

This appears to be part of a smear campaign someone is running that started on GitHub. e.g., see:

https://www.techradar.com/pro/security/github-projects-are-being-targeted-with-malicious-action-in-apparent-attempt-to-frame-researcher

CC: @Codeberg – hope you can identify the account(s) responsible and block them. Example (deleted) issue: https://codeberg.org/kitten/app/issues/216

Show older

Posts

Following

Followers

David Chisnall (*Now with 50% more sarcasm!*)

clearbluejar

Alexander Popov

buherator

Charlie Stross

buherator

stacksmashing

buherator

buherator

Lukasz Olejnik

luna aria ielenia

GEBIRGE

Taggart

Taggart

screaminggoat

buherator

buherator

HalvarFlake

Ars Technica

Aral Balkan

Terms of Service

David Chisnall (Now with 50% more sarcasm!)