<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en-us" xmlns="http://www.w3.org/2005/Atom"><title>Simon Willison's Weblog: archives</title><link href="http://simonwillison.net/" rel="alternate"/><link href="http://simonwillison.net/tags/archives.atom" rel="self"/><id>http://simonwillison.net/</id><updated>2025-12-30T23:51:33+00:00</updated><author><name>Simon Willison</name></author><entry><title>TIL: Downloading archived Git repositories from archive.softwareheritage.org</title><link href="https://simonwillison.net/2025/Dec/30/software-heritage/#atom-tag" rel="alternate"/><published>2025-12-30T23:51:33+00:00</published><updated>2025-12-30T23:51:33+00:00</updated><id>https://simonwillison.net/2025/Dec/30/software-heritage/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://til.simonwillison.net/github/software-archive-recovery"&gt;TIL: Downloading archived Git repositories from archive.softwareheritage.org&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Back in February I &lt;a href="https://simonwillison.net/2025/Feb/7/sqlite-s3vfs/"&gt;blogged about&lt;/a&gt; a neat Python library called &lt;code&gt;sqlite-s3vfs&lt;/code&gt; for accessing SQLite databases hosted in an S3 bucket, released as MIT licensed open source by the UK government's Department for Business and Trade.&lt;/p&gt;
&lt;p&gt;I went looking for it today and found that the &lt;a href="https://github.com/uktrade/sqlite-s3vfs"&gt;github.com/uktrade/sqlite-s3vfs&lt;/a&gt; repository is now a 404.&lt;/p&gt;
&lt;p&gt;Since this is taxpayer-funded open source software I saw it as my moral duty to try and restore access! It turns out &lt;a href="https://archive.softwareheritage.org/browse/origin/directory/?origin_url=https://github.com/uktrade/sqlite-s3vfs"&gt;a full copy&lt;/a&gt; had been captured by &lt;a href="https://archive.softwareheritage.org/"&gt;the Software Heritage archive&lt;/a&gt;, so I was able to restore  the repository from there. My copy is now archived at &lt;a href="https://github.com/simonw/sqlite-s3vfs"&gt;simonw/sqlite-s3vfs&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;The process for retrieving an archive was non-obvious, so I've written up a TIL and also published a new &lt;a href="https://tools.simonwillison.net/software-heritage-repo#https%3A%2F%2Fgithub.com%2Fuktrade%2Fsqlite-s3vfs"&gt;Software Heritage Repository Retriever&lt;/a&gt; tool which takes advantage of the CORS-enabled APIs provided by Software Heritage. Here's &lt;a href="https://gistpreview.github.io/?3a76a868095c989d159c226b7622b092/index.html"&gt;the Claude Code transcript&lt;/a&gt; from building that.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://news.ycombinator.com/item?id=46435308#46438857"&gt;Hacker News comment&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/archives"&gt;archives&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/git"&gt;git&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/github"&gt;github&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/open-source"&gt;open-source&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/tools"&gt;tools&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai"&gt;ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/til"&gt;til&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/generative-ai"&gt;generative-ai&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/llms"&gt;llms&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/ai-assisted-programming"&gt;ai-assisted-programming&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/claude-code"&gt;claude-code&lt;/a&gt;&lt;/p&gt;



</summary><category term="archives"/><category term="git"/><category term="github"/><category term="open-source"/><category term="tools"/><category term="ai"/><category term="til"/><category term="generative-ai"/><category term="llms"/><category term="ai-assisted-programming"/><category term="claude-code"/></entry><entry><title>Using static websites for tiny archives</title><link href="https://simonwillison.net/2024/Oct/17/using-static-websites-for-tiny-archives/#atom-tag" rel="alternate"/><published>2024-10-17T23:02:18+00:00</published><updated>2024-10-17T23:02:18+00:00</updated><id>https://simonwillison.net/2024/Oct/17/using-static-websites-for-tiny-archives/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://alexwlchan.net/2024/static-websites/"&gt;Using static websites for tiny archives&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Alex Chan:&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Over the last year or so, I’ve been creating static websites to browse my local archives. I’ve done this for a variety of collections, including:&lt;/p&gt;
&lt;ul&gt;
&lt;li&gt;paperwork I’ve scanned&lt;/li&gt;
&lt;li&gt;documents I’ve created&lt;/li&gt;
&lt;li&gt;screenshots I’ve taken&lt;/li&gt;
&lt;li&gt;web pages I’ve bookmarked&lt;/li&gt;
&lt;li&gt;video and audio files I’ve saved&lt;/li&gt;
&lt;/ul&gt;
&lt;/blockquote&gt;
&lt;p&gt;This is &lt;em&gt;such&lt;/em&gt; a neat idea. These tiny little personal archive websites aren't even served through a localhost web server - they exist as folders on disk, and Alex browses them by opening up the &lt;code&gt;index.html&lt;/code&gt; file directly in a browser.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://social.alexwlchan.net/@alex/113318585934019063"&gt;@alex&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/archives"&gt;archives&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/html"&gt;html&lt;/a&gt;&lt;/p&gt;



</summary><category term="archives"/><category term="html"/></entry><entry><title>NYT Flash-based visualizations work again</title><link href="https://simonwillison.net/2024/Jan/21/nyt-flash-based-visualizations-work-again/#atom-tag" rel="alternate"/><published>2024-01-21T05:58:18+00:00</published><updated>2024-01-21T05:58:18+00:00</updated><id>https://simonwillison.net/2024/Jan/21/nyt-flash-based-visualizations-work-again/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://flowingdata.com/2024/01/10/nyt-flash-based-visualizations-work-again/"&gt;NYT Flash-based visualizations work again&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
The New York Times are using the open source &lt;a href="https://ruffle.rs/"&gt;Ruffle&lt;/a&gt; Flash emulator - built using Rust, compiled to WebAssembly - to get their old archived data visualization interactives working again.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/archives"&gt;archives&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/flash"&gt;flash&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/new-york-times"&gt;new-york-times&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/rust"&gt;rust&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/webassembly"&gt;webassembly&lt;/a&gt;&lt;/p&gt;



</summary><category term="archives"/><category term="flash"/><category term="new-york-times"/><category term="rust"/><category term="webassembly"/></entry><entry><title>SIARD: Software Independent Archiving of Relational Databases</title><link href="https://simonwillison.net/2022/May/4/siard/#atom-tag" rel="alternate"/><published>2022-05-04T22:40:27+00:00</published><updated>2022-05-04T22:40:27+00:00</updated><id>https://simonwillison.net/2022/May/4/siard/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="https://www.bar.admin.ch/bar/en/home/archiving/tools/siard-suite.html"&gt;SIARD: Software Independent Archiving of Relational Databases&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
I hadn’t heard of this before but it looks really interesting: the Federal Archives of Switzerland developed a standard for archiving any relational database as a zip file full of XML which is “is used in over 50 countries around the globe”.

    &lt;p&gt;&lt;small&gt;&lt;/small&gt;Via &lt;a href="https://twitter.com/MAndrewWaugh/status/1521981238890082304"&gt;@MAndrewWaugh&lt;/a&gt;&lt;/small&gt;&lt;/p&gt;


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/archives"&gt;archives&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/databases"&gt;databases&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/xml"&gt;xml&lt;/a&gt;&lt;/p&gt;



</summary><category term="archives"/><category term="databases"/><category term="xml"/></entry><entry><title>If You Don't Date Your Work, It Sucks.</title><link href="https://simonwillison.net/2010/Jan/18/if/#atom-tag" rel="alternate"/><published>2010-01-18T17:46:24+00:00</published><updated>2010-01-18T17:46:24+00:00</updated><id>https://simonwillison.net/2010/Jan/18/if/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://www.protocolostomy.com/2010/01/18/if-you-dont-date-your-work-it-sucks/"&gt;If You Don&amp;#x27;t Date Your Work, It Sucks.&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
I learnt this lesson the hard way, when I realised that I had no idea exactly what year I created my earliest web-facing projects.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/archives"&gt;archives&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/dates"&gt;dates&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/year"&gt;year&lt;/a&gt;&lt;/p&gt;



</summary><category term="archives"/><category term="dates"/><category term="year"/></entry></feed>