Minimal, self-hosted website archiver for Are.na Blocks. Built with HTMX, FastAPI, and Wget.
Are.na Web Archiver
A minimal, self-hosted utility for Are.na built with HTMX, FastAPI, and Wget.
Details
- Websites are archived as HTML and WARC files.
- If no WARC filename is provided, the Are.na block ID will be used.
- Archives are saved in:
  - `/tmp/arena_archives` when running with Docker.
  - `/tmp` (inside the project repository) when running locally.
- Mirror → creates a 1:1 copy of the entire website.
- Snapshot → archives only a single page.
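
As an illustration of the options above, here is a minimal sketch of how a Wget command line could be assembled for each mode. It is not the project's actual code: the function name `build_wget_command`, its parameters, and the choice of Wget flags are assumptions made for this example. Only the fallback to the block ID and the Docker output directory come from the notes above.

```python
from pathlib import Path
from typing import List, Optional


def build_wget_command(
    url: str,
    block_id: int,
    mode: str = "snapshot",
    warc_name: Optional[str] = None,
    out_dir: str = "/tmp/arena_archives",  # Docker path from the notes above
) -> List[str]:
    """Return a Wget argument list for a snapshot or a mirror (hypothetical helper)."""
    # Fall back to the Are.na block ID when no WARC filename is given.
    name = warc_name or str(block_id)
    cmd = [
        "wget",
        f"--directory-prefix={out_dir}",                    # where the HTML copy is saved
        f"--warc-file={(Path(out_dir) / name).as_posix()}",  # Wget appends the WARC extension
        "--page-requisites",   # also fetch images, CSS, and scripts the page needs
        "--convert-links",     # rewrite links so the copy works offline
        "--adjust-extension",  # save HTML documents with an .html suffix
    ]
    if mode == "mirror":
        # Recursive download of the site, without climbing above the start URL.
        cmd += ["--mirror", "--no-parent"]
    elif mode != "snapshot":
        raise ValueError(f"unknown mode: {mode!r}")
    cmd.append(url)
    return cmd
```

The returned list can be passed to `subprocess.run`. Building a list rather than a shell string avoids quoting problems when the URL contains characters such as `&` or `?`.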
Recommendations
- Use this tool to preserve old web content that might disappear one day.
- It’s intentionally minimal, so expect only the bare essentials.
- Some sites may cause infinite loops when mirroring (this error is not currently handled).
- For very large websites and archives, consider using a more robust archiving tool.
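
Since runaway mirrors are not handled by the tool, one possible workaround is to run the archiving command under a wall-clock timeout. The sketch below shows that idea only; `run_with_timeout` and its parameters are hypothetical names and are not part of this project.

```python
import subprocess
from typing import List


def run_with_timeout(cmd: List[str], timeout_s: float) -> bool:
    """Run cmd and return True if it exits with status 0 within timeout_s seconds.

    Hypothetical helper: on timeout, subprocess.run kills the child process
    and raises TimeoutExpired, which is reported here as a failure.
    """
    try:
        result = subprocess.run(cmd, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return False
    return result.returncode == 0
```

A call such as `run_with_timeout(["wget", "--mirror", url], 600)` would give up after ten minutes. Files downloaded before the timeout stay on disk, so a partial archive may remain. Wget's own `--level` option, which limits recursion depth, is another way to bound a crawl.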
Get Started
- Clone repo
- Install Python, the uv package manager, and Wget on your machine
- Start the server using `make server`
- Access on http://127.0.0.1:8000
- See docs at http://127.0.0.1:8000/docs
Get Started with Docker
- Run `make docker-create-volume`
- Run `make docker-build`
- Run `make docker-run`
- Access on http://127.0.0.1:8000
- See docs at http://127.0.0.1:8000/docs
Contribute
See https://github.com/pivilartisant/arena-web-archiver/issues
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
