Today I shared an article with my Dad, one that is no longer accessible online, by sharing it from my #Readeck archive.
Bookmarking & archiving everything is paying off, and Readeck did an amazing job.
@algernon ohh that's an interesting project! do you also use it on mobile? I currently use ArchiveBox for safekeeping, but copy-pasting is a pain in the ass 😞
@pcdevil the browser extension works in mobile firefox too, and I'm using that.
There is no "save into readeck" for other apps, though, so sometimes I have to copy links around.
@buherator no, it does not. It is a bookmark (& bookmark archiver) tool, not a full-on archiving one.
Imho, both kinds have their place, they solve different problems.
@buherator nope, sorry :(
That is not a use-case I have. I'll boost the question, perhaps someone in my circles can suggest something.
Go go #fedisearch!
@buherator @algernon Does https://www.httrack.com/ do what you need?
@buherator @algernon browsertrix crawler or zimit (uses browsertrix, but saves to the Zim format)
Unfortunately they use Brave as the Chromium backend :/
@TheDragon @pcdevil ooooooooooohhhhh.
That looks very interesting. I'll give it a try!
@buherator @algernon
I had some notes about web archival in my memex, I published them quickly:
https://brain.trainpats.eu/web%20archival.html
Don't rely on this URL, it might change soon.
I mostly recommend this page that I link to from there:
https://github.com/ArchiveBox/ArchiveBox/wiki/Web-Archiving-Community#other-archivebox-alternatives
@TheDragon @pcdevil I'll see if it can help me do two things:
It looks like it can do both. Will look closer later. This might solve a problem I've been banging my head against for quite a while!
Thank you!
@buherator @algernon Depends what kind of brokenness you need to solve for. It may be that writing a simple crawler with Playwright is your best bet if you want 100% support for every oddity.
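For what it's worth, here is a rough sketch of what such a crawler could look like with Playwright's Python sync API. Treat it as a starting point, not a finished tool: the names `crawl`, `normalize`, `same_site` and the `max_pages` limit are made up for illustration, it stays on a single host, and it only keeps the rendered HTML in memory (no assets, no WARC output, no error handling or retries).

```python
from collections import deque
from urllib.parse import urldefrag, urljoin, urlparse


def same_site(url: str, root: str) -> bool:
    """True if url is on the same host as root (hypothetical helper)."""
    return urlparse(url).netloc == urlparse(root).netloc


def normalize(base: str, href: str) -> str:
    """Resolve href against base and drop any #fragment (hypothetical helper)."""
    return urldefrag(urljoin(base, href))[0]


def crawl(start_url: str, max_pages: int = 20) -> dict[str, str]:
    """Breadth-first crawl of one site; returns {url: rendered HTML}."""
    # Imported here so the helpers above work without Playwright installed.
    from playwright.sync_api import sync_playwright

    pages: dict[str, str] = {}
    queue = deque([start_url])
    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        while queue and len(pages) < max_pages:
            url = queue.popleft()
            if url in pages:
                continue  # already visited
            # Wait for the network to go quiet so JS-built pages get rendered.
            page.goto(url, wait_until="networkidle")
            pages[url] = page.content()
            # Collect the raw href attribute of every link on the page.
            hrefs = page.eval_on_selector_all(
                "a[href]", "els => els.map(e => e.getAttribute('href'))"
            )
            for href in hrefs:
                link = normalize(url, href)
                if same_site(link, start_url) and link not in pages:
                    queue.append(link)
        browser.close()
    return pages
```

Calling `crawl("https://example.com/")` would give you a dict of page URLs to their rendered HTML, which you can then write to disk however you like. The site-specific oddities (logins, infinite scroll, cookie banners) would go right after the `page.goto` call.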