Introduction to web archiving

Introduction


  • This page provides selected resources that introduce basic concepts in web archiving.

  • Many resources are derived from the Archive-It User Guide.

Learning outcomes


After reviewing this material, learners will be able to:

  1. Describe basic terms used in web archiving

  2. Locate additional resources to support further study and training

Introduction to web archiving


These resources will introduce basic concepts in web archiving:

Questions


  1. How do libraries and archives create web archives?

  2. What is a robots.txt file? How can it affect web archiving technology?

  3. What is a crawler trap?

  4. What is the difference between the Archive-It standard crawler (Heritrix) and Brozzler?

Further reading


See the web archiving reading list for additional resources.