Posts
2528
Following
649
Followers
1466
"I'm interested in all kinds of astronomy."
Edited 7 months ago
#scraping
Show content
@mrose.ink.bsky.social said it perfectly:

https://bsky.app/profile/mrose.ink/post/3lbwpud2mes2n

"One enduring complication with all this is that scraping happens all the time for reasons that people *don’t* find inherently objectionable, and in fact support—the Wayback Machine, all kinds of public health and extremism research, etc. The mistake was assuming that goodwill transfers.

A key problem in the Disc Horse (and policy to a lesser extent) is reminding people that scraping as a technological process is Important, Actually, for all the things You Think Are Good, and any proposed solutions to curtail GAI training uses need to be VERY narrowly tailored to not impact those.

All the proposed solutions so far have had some critical flaw that makes them unworkable.

Manual consent? Ok, how do we implement that at scale? robots.txt style flags are fine, but they’re also not legally binding—and that’s good! If they were, Wayback wouldn’t be able to index!

So exclusion protocols can be ignored, For Good Reason. “What if we give an exclusion protocol the force of law for this specific use?” Closer, but there’s active debate in the courts about whether this is all a fair use, and if the answer is “yes,” then it doesn’t matter

…then best case scenario the tags are rendered null (because you can’t legally override fair use), and worst case you’ve just recreated a DMCA 1201 style lockout trick, and we have spent the last 25 years seeing just how incredibly those fuck up everything around them."
0
2
1
repeated
no mom its not a "bot net" it is a highly versatile cross platform networked RPC implementation
0
3
0
repeated

bert hubert 🇺🇦🇪🇺🇺🇦

23 years old, and if you replace a few hyped things with today's equivalents, the article is 100% fresh. Things have gotten even worse since then it appears. https://www.joelonsoftware.com/2001/04/21/dont-let-architecture-astronauts-scare-you/

0
3
0
@joepie91 OK, please let me know when the scraping stops because of our collective will!
0
0
0
repeated

“CrowdStrike Earnings: Cybersecurity Firm Posts Higher Revenue Amid Swing to Loss - WSJ”

https://www.wsj.com/business/earnings/crowdstrike-raises-outlook-post-higher-revenue-amid-swing-to-loss-dde5cf9f

So, I've long argued that all of software dev's dysfunctions can be traced to the fact that business outcomes do not depend on software quality, design, or reliability. As long as this dynamic continues the software we use will only get worse

2
3
0
@joepie91 - You're still assuming you can know about the scraping in the first place
- Money doesn't stink
1
0
0
@joepie91 Do you really think people who want to e.g. earn money with this give a flying fart if they are excluded from a community (which they weren't part of in the first place)?
1
0
0
@joepie91 Based on arguments I had over here people definitely believe that technical measures at the publishing platform (such as limiting search) can affect this. Also, what is the point of being outraged about the single person who is open about his scraping while I guarantee you a dozen other orgs do the same rn just don't talk about it?
1
0
0
Here we go again explaining supposedly technologically literate people that what they *publish* on the Internet can and will be scraped... Bluesky's explanation ("we can't enforce this") is on point btw.

RE: https://infosec.exchange/@josephcox/113551853623942786
1
1
3
#twitter #uspol
Show content
What I don't get about the post-election Twitter exodus is that for the masses (ofc not you, dear reader!) somehow it was OK to create content (and thus attract ad money) there, while *after* the owners friend got elected it's suddenly not?
1
1
6
@tmr232 Are the slides available somewhere?
1
0
0
repeated
repeated

bert hubert 🇺🇦🇪🇺🇺🇦

Earlier post, but in recent talks I'm encountering more and more organizations that are losing their last technical people. You can outsource a lot, but most places have a core thing that they should really own. And once your own technical department is no longer viable, you are hosed. The longer story: https://berthub.eu/articles/posts/your-tech-my-tech/

1
2
0
repeated

thesis: numbers stations are a form of microblogging

3
4
0
Why do BloodHound CE passwords expire?! 🤦
0
0
0
repeated

New post: Vulnerability Disclosure: Command Injection in Kemp LoadMaster Load Balancer (CVE-2024-7591) https://insinuator.net/2024/11/vulnerability-disclosure-command-injection-in-kemp-loadmaster-load-balancer-cve-2024-7591/

0
2
0
repeated

@penguin86 @vampiress it's ok, the floppy is write protected.

0
1
0
This effect lasted about 24h, now I get the same braindead content again :P

So much for "personalized experience"...

RE: https://infosec.place/objects/0fe974a7-6345-4ccc-a9a4-5dce0da786a9
0
0
2
repeated

What, it's already this time of the year again?! Yes, 'tis the season of reviewing and selecting our top picks from around 3.000 productions - and we would love to have you on the team as a juror! Sign up now:
https://2025.meteoriks.org/taking_part/juror/

0
2
0
[RSS] Hacking Barcodes for Fun & Profit...

https://blog.mantrainfosec.com/blog/16/hacking-barcodes-for-fun-profit

Old friend hacking Hungarian bottle recycling machines :) #DRS
0
1
0
Show older