Explore web archive collections

Introduction


  • This page provides resources on searching, browsing, and analyzing web archive collections.

Resources


  • Explore Archive-It: https://archive-it.org/explore

    • Search and browse web archive collections created with the Internet Archive's Archive-It service.

  • Internet Archive Wayback Machine: https://web.archive.org/

    • Explore more than 555 billion web pages saved over time.

  • Archives Unleashed Project: https://archivesunleashed.org/

    • The Archives Unleashed project aims to make petabytes of historical internet content accessible to scholars and others interested in researching the recent past. The project team develops web archive search and data analysis tools to enable scholars, librarians and archivists to access, share, and investigate recent history since the early days of the World Wide Web.

  • Archives Unleashed Cloud: https://cloud.archivesunleashed.org/

    • The Archives Unleashed Cloud is an open-source cloud-based analysis tool that helps researchers and scholars conduct web archive analysis. It supports the priorities of accessibility and usability of web archives by providing users with a web-based front end to access the Archives Unleashed Toolkit. It has been primarily developed by Archives Unleashed project co-investigator and developer, Nick Ruest.

  • Warclight: Archives Unleashed Demo: https://warclight.archivesunleashed.org/

    • Warclight is a Project Blacklight based Rails engine that supports the discovery of web archives held in the WARC and ARC formats. It allows faceted full-text search, record view, and other advanced discovery options. Warclight is designed to work with web archive data that is indexed via the UK Web Archive's webarchive-discovery project.