Are.na Web Archiver

A minimal, self-hosted website archiver for Are.na blocks, built with HTMX, FastAPI, and Wget.


Details

  • Websites are archived as HTML and WARC files.
  • If no WARC filename is provided, the Are.na block ID will be used.
  • Archives are saved in:
    • /tmp/arena_archives when running with Docker.
    • /tmp (inside the project repository) when running locally.
  • Mirror → creates a 1:1 copy of the entire website.
  • Snapshot → archives only a single page.
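The difference between the two modes, and the block-ID fallback for the WARC filename, can be sketched as a small command builder. This is an illustration only, not the project's actual code: the function name `build_wget_command`, its parameters, and the exact flag selection are assumptions. The Wget flags themselves (`--warc-file`, `--page-requisites`, `--convert-links`, `--adjust-extension`, `--recursive`, `--level`, `--directory-prefix`) are standard GNU Wget options.

```python
from typing import List, Optional


def build_wget_command(
    url: str,
    block_id: int,
    mode: str = "snapshot",
    warc_name: Optional[str] = None,
    out_dir: str = "/tmp/arena_archives",  # Docker location from the list above
) -> List[str]:
    """Build a wget argument list for a snapshot or a mirror (hypothetical helper)."""
    # Fall back to the Are.na block ID when no WARC filename is provided.
    warc_base = warc_name or str(block_id)

    cmd = [
        "wget",
        f"--directory-prefix={out_dir}",         # where the HTML files are written
        f"--warc-file={out_dir}/{warc_base}",    # wget appends the WARC extension
        "--page-requisites",                     # fetch images, CSS, and scripts
        "--convert-links",                       # rewrite links for offline viewing
        "--adjust-extension",                    # save HTML pages with .html names
    ]

    if mode == "mirror":
        # Assumed mirror behaviour: follow links with no depth limit.
        cmd += ["--recursive", "--level=inf"]
    elif mode != "snapshot":
        raise ValueError(f"unknown mode: {mode!r}")

    cmd.append(url)
    return cmd
```

A snapshot call such as `build_wget_command("https://example.com", 12345)` yields a single-page command whose WARC file is named after the block ID, while `mode="mirror"` adds the recursive flags.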

Recommendations

  • Use this tool to preserve old web content that might disappear one day.
  • It's intentionally minimal, so expect the bare essentials.
  • Some sites may cause infinite loops when mirroring (this error is not currently handled).
  • For very large websites and archives, consider using a more robust archiving tool.
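Since runaway mirrors are not handled by the tool, one possible guard is a wall-clock limit around the archiving process. The sketch below is an assumption about how such a guard could look, not something the project implements; `run_with_timeout` is a hypothetical name, and the demo uses a harmless Python child process in place of Wget.

```python
import subprocess
import sys
from typing import List


def run_with_timeout(cmd: List[str], timeout_s: float) -> bool:
    """Run cmd and return True if it exited on its own, False if it timed out."""
    try:
        # subprocess.run kills the child and raises TimeoutExpired on timeout.
        subprocess.run(
            cmd,
            timeout=timeout_s,
            check=False,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
        return True
    except subprocess.TimeoutExpired:
        return False


if __name__ == "__main__":
    # Stand-in for a mirror job that never finishes.
    slow_job = [sys.executable, "-c", "import time; time.sleep(5)"]
    finished = run_with_timeout(slow_job, timeout_s=0.5)
    print("finished" if finished else "timed out")
```

A complementary option is to cap recursion depth with Wget's `--level` flag, which bounds how far a mirror can follow links.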

Get Started

  1. Clone the repository
  2. Install Python, the uv package manager, and Wget on your machine
  3. Start the server using make server
  4. Open http://127.0.0.1:8000 in your browser
  5. See docs at http://127.0.0.1:8000/docs
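Before running make server, it can help to confirm that the required command-line tools are on the PATH. The following is a minimal sketch; `missing_tools` is a hypothetical helper, and the default tool names (`uv`, `wget`, `make`) are assumed from the steps above.

```python
import shutil
from typing import Iterable, List


def missing_tools(tools: Iterable[str] = ("uv", "wget", "make")) -> List[str]:
    """Return the names of tools that cannot be found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing tools:", ", ".join(missing))
    else:
        print("All required tools are installed.")
```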

Get Started with Docker

  1. Run make docker-create-volume
  2. Run make docker-build
  3. Run make docker-run
  4. Open http://127.0.0.1:8000 in your browser
  5. See docs at http://127.0.0.1:8000/docs

Contribute

See https://github.com/pivilartisant/arena-web-archiver/issues

License

Shield: CC BY-NC-SA 4.0

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
