@ArchiveBox

ArchiveBox

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Pinned

  1. ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    Python 20.3k 1.1k

  2. archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 176 14

  3. archivebox-proxy Public

    Official ArchiveBox MITM proxy: saves URLs of all requests passing through to an ArchiveBox server for archival.

    Python 9

  4. internet-archiving-talk Public

    Forked from pirate/internet-archiving-talk

    🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.

    JavaScript 13 1

  5. good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    300 8

  6. pydantic-pkgr Public

    A modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 7

Repositories

Showing 10 of 16 repositories
  • ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

  • readability-extractor Public

    JavaScript/Node.js wrapper around Mozilla's Readability library, so that ArchiveBox can call it as a one-shot CLI command to extract each page's article text.

    JavaScript 35 13 0 1 Updated Jun 19, 2024
  • pip-archivebox Public

    Official Python package for ArchiveBox, the self-hosted internet archiving solution.

    14 GPL-3.0 2 0 3 Updated Jun 18, 2024
  • pydantic-pkgr Public

    A modern Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 7 MIT 0 0 0 Updated Jun 12, 2024
  • archivebox-spreadsheet-bot Public

    A bot that integrates ArchiveBox with Google Sheets for new URL ingestion, archived URL management, and automated QA (optionally AI-powered).

    2 GPL-3.0 1 0 0 Updated May 22, 2024
  • debian-archivebox Public

    Home of the official apt/deb package for Ubuntu/Debian-based systems.

    Python 18 GPL-3.0 5 0 1 Updated May 21, 2024
  • good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    300 MIT 8 0 0 Updated May 11, 2024
  • docs Public

    Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution.

    CSS 12 3 0 4 Updated May 7, 2024
  • archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 176 MIT 14 16 0 Updated Apr 11, 2024
  • community Public

    A wiki of the broader Web Archiving Community: important organizations, alternative projects, blog posts, and more.

    4 0 0 0 Updated Feb 21, 2024