The original post: /r/datahoarder by /u/EDACerton on 2024-12-22 20:15:38.

One of my friends is a technical writer/editor, but the company they work for is about to go bankrupt. All of their work lives on the company's website, and they expect the site to disappear when the business collapses (taking their portfolio of work with it).

They asked me to scrape/archive the site so that they would have a copy of their work. I've tried HTTrack, but I'm getting poor results because the site relies heavily on JavaScript.

Does anyone know of any tools that could scrape all of their pages to something like PDFs?
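For what it's worth, the closest thing I've found to a workaround is headless Chromium's `--print-to-pdf` flag, since a real browser engine renders the JavaScript before printing. A rough sketch of what I mean (the `urls.txt` file and the `chromium` binary name are assumptions; on some systems the binary is `google-chrome` or `chromium-browser`):

```python
# Sketch: render JS-heavy pages with headless Chromium and save each as a PDF.
# Assumes a urls.txt with one URL per line; "chromium" is an assumed binary name.
import subprocess
from pathlib import Path
from urllib.parse import urlparse

def pdf_command(url: str, out_dir: str = "archive") -> list[str]:
    """Build the headless-Chromium command line that prints one page to PDF."""
    # Derive a filename from the URL path ("/" or empty path becomes "index").
    name = urlparse(url).path.strip("/").replace("/", "_") or "index"
    out = Path(out_dir) / f"{name}.pdf"
    return [
        "chromium",
        "--headless",
        "--disable-gpu",
        f"--print-to-pdf={out}",
        url,
    ]

if Path("urls.txt").exists():
    Path("archive").mkdir(exist_ok=True)
    for url in Path("urls.txt").read_text().splitlines():
        if url.strip():
            subprocess.run(pdf_command(url.strip()), check=True)
```

This still needs a list of URLs up front, though, so I'd love a tool that handles the crawling part too.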