Archive it or you will miss it

June 19, 2017 on Drew DeVault's blog

Let’s open with some quotes from the Wikipedia article on link rot:

In 2014, bookmarking site Pinboard’s owner Maciej Cegłowski reported a “pretty steady rate” of 5% link rot per year… approximately 50% of the URLs in U.S. Supreme Court opinions no longer link to the original information… (analysis of) more than 180,000 links from references in… three major open access publishers… found that overall 24.5% of links cited were no longer available.

I hate link rot. It has always been common, and still is, as servers disappear and domains expire. Today, though, link rot is on the rise under the influence of more sinister factors. Abuse of the DMCA. Region locking. Paywalls. Maybe it just no longer serves the interests of a walled garden to host the content. Maybe the walled garden went out of business. Users rely on platforms to host content, and links rot by the millions when the platforms die. Movies disappear from Netflix. Music vanishes from Spotify. Accounts are banned from SoundCloud. YouTube channels are banned over false DMCA requests issued by robots.

At this point, link rot is an axiom of the internet. In the face of this, I keep a personal offline archive of anything I want to see twice. When I see a cool YouTube video I like, I archive the entire channel right away. Rather than subscribe to it, I update my archive with a cron job. I scrape content out of RSS feeds and into offline storage, and I have dozens of websites archived with wget. I mirror most git repositories I’m interested in. I have DRM-free offline copies of all of my music, TV shows, and movies, ill-gotten or not.
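As a rough illustration of the git mirroring mentioned above, here is a minimal sketch of a helper that a cron job could call. The function name `mirror_repo`, the `ARCHIVE_DIR` variable, and the directory layout are assumptions made for this example, not a description of my actual setup.

```shell
#!/bin/sh
# Hypothetical helper: keep a bare mirror of a git repository up to date.
# ARCHIVE_DIR and the git/ subdirectory are assumptions for illustration.
ARCHIVE_DIR="${ARCHIVE_DIR:-$HOME/archives}"

mirror_repo() {
    url="$1"
    # Derive a directory name from the last path component of the URL.
    name="$(basename "$url" .git).git"
    dest="$ARCHIVE_DIR/git/$name"
    if [ -d "$dest" ]; then
        # Existing mirror: fetch new refs. Without --prune, refs that
        # were deleted upstream are kept in the local copy.
        git -C "$dest" remote update
    else
        mkdir -p "$ARCHIVE_DIR/git"
        # --mirror makes a bare clone that copies every ref.
        git clone --mirror "$url" "$dest"
    fi
}
```

Calling `mirror_repo` with a repository URL once creates the mirror; calling it again later refreshes it, so the same line can sit in a crontab entry.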

I suggest you do the same. It’s sad that it’s come to this. Let’s all do ourselves a favor. Don’t build unsustainable platforms and ask users to trust you with their data. Pay for your domain. Give people DRM-free downloads. Don’t cripple your software when it can’t call home. If you run a website, let archive.org scrape it.

And archive anything you want to see again.

# Every Sunday at midnight, mirror the site into ~/archives
0 0 * * 0 cd ~/archives && wget -m https://drewdevault.com
