Jump to content

International Internet Preservation Consortium

From Wikipedia, the free encyclopedia
(Redirected from Web Curator Tool)
International Internet Preservation Consortium
AbbreviationIIPC
FormationJuly 2003; 21 years ago (2003-07)
PurposeAcquire, preserve and make accessible knowledge and information from the Internet for future generations everywhere, promoting global exchange and international relations.[1]
Websitehttp://netpreserve.org/

The International Internet Preservation Consortium is an international organization of libraries and other organizations established to coordinate efforts to preserve internet content for the future.[2] It was founded in July 2003 by 12 participating institutions,[1] and had grown to 35 members by January 2010.[3] As of January 2022, there are 52 members.

Membership is open to archives, museums, libraries (including national libraries), and cultural heritage institutions.[1][4]


Members

[edit]

National libraries

[edit]

Participating national libraries and archives include:[5]

Participating organisations

[edit]

Other participating organizations include:[5]

Past members

[edit]

WebCite used to be, but is no longer, a member of the IIPC.[6] In a 2012 message, its founder Gunther Eysenbach commented that "WebCite has no funding, and IIPC charges 4000 Euro/yr in membership fees."[7]

Projects

[edit]

The IIPC sponsors and collaborates on a number of different projects with its member organizations.

Current projects

[edit]
  • Support for transitioning to pywb (Python Wayback).[8]
  • Collaborative Collections: IIPC members are collaborating to build public web archive collections based on transnational themes or events of mutual interest. Topics of existing collections include: European Refugee Crisis, Intergovernmental Organizations, Olympics, World War I Commemoration, Climate Change, Artificial Intelligence, and Novel Coronavirus (COVID-19).[9]
  • Memento: aggregate metadata of the IIPC archives and provide access to Memento.[10]

IIPC also maintains an electronic mailing list open to anyone interested in issues associated with web harvesting, archiving, and quality maintenance issues.[11]

Past projects

[edit]
  • Developing Bloom Filters for Web Archives’ Holdings.[12]
  • Improving the Dark and Stormy Archives Framework by Summarizing the Collections of the National Library of Australia[13]
  • LinkGate: Core Functionality and Future Use Cases.[14]
  • Asking questions with web archives – introductory notebooks for historians: The project output is a set of 16 Jupyter notebooks that demonstrate how specific historical research questions can be explored by analysing data from web archives.[15][16][17]
  • IIPC sponsored a project on "cross-archival search strategies" which included the creation of an archive focused on the 2010 Winter Olympics.[18]
  • Starting in 2006, the National Library of New Zealand and the British Library developed the Web Curator Tool, an open-source workflow management system for selective web archiving. Since 2017 the Royal Library of the Netherlands has collaboratively developed the tool with the National Library of New Zealand. Version 3.2.1[19] was released in August 2024, and is available at GitHub.[20] The Web Curator Tool is built upon Java and utilizes Internet Archive’s technology, the Heritrix web archiving crawler, and replay tools such as OpenWayback[21] and Pywb.[22]
  • IIPC Web Archiving Doctoral Support Award: grant to provide three years of funding for a student to earn a PhD in Interdisciplinary Information Science at The University of North Texas College of Information.[23]
  • IIPC Member Staff Exchange: onsite training by experts for participating IIPC members to use Heritrix 3 web crawler.[24]
  • Working group on Statistics and Quality Indicators for Web Archiving: development of guidelines on the management and evaluation of Web archiving activities and products.[25]

References

[edit]
  1. ^ a b c "Mission & Goals | IIPC". www.netpreserve.org. International Internet Preservation Consortium. Archived from the original on 2017-06-06. Retrieved 2015-09-12.
  2. ^ "International Internet Preservation Consortium" (Press release). International Internet Preservation Consortium. May 5, 2004. Archived from the original on May 1, 2012.
  3. ^ "Web Archives Registry Launched". News & Events. Library of Congress. January 29, 2010. Archived from the original on April 8, 2011. Retrieved 2011-04-17.
  4. ^ Hiiragi, Wasuke; Shigeo Sugimoto; Tetsuo Sakaguchi. "Web archiving in the world - International Internet Preservation Consortium (IIPC) and their activities". The Journal of Information Science and Technology Association. 58 (8). Japan.
  5. ^ a b "Members". International Internet Preservation Consortium. 2020.
  6. ^ "WebCite Consortium FAQ". webcitation.org. WebCite. Archived from the original on 2008-08-28.
  7. ^ "Twitter post". 2012-06-11. Archived from the original on 2014-01-07. Retrieved 2013-03-10.
  8. ^ "Support for transitioning to pywb". International Internet Preservation Consortium. Retrieved 24 January 2021.
  9. ^ "Collaborative Collections". International Internet Preservation Consortium. Retrieved 24 January 2021.
  10. ^ "Memento". International Internet Preservation Consortium. Retrieved 17 March 2014.
  11. ^ "Web Curators Mailing List". International Internet Preservation Consortium. Archived from the original on 2014-01-25. Retrieved 2017-10-17.
  12. ^ "Developing Bloom Filters for Web Archives' Holdings". International Internet Preservation Consortium. Retrieved 24 January 2021.
  13. ^ "Improving the Dark and Stormy Archives Framework by Summarizing the Collections of the National Library of Australia". International Internet Preservation Consortium. Retrieved 24 January 2021.
  14. ^ "LinkGate: Core Functionality and Future Use Cases". International Internet Preservation Consortium. Retrieved 24 January 2021.
  15. ^ "Asking questions with web archives – introductory notebooks for historians". International Internet Preservation Consortium. Retrieved 24 January 2021.
  16. ^ "Web Archives". GLAM Workbench. Retrieved 24 January 2021.
  17. ^ "IIPC RSS webinar: Tim Sherratt: Jupyter notebooks for web archives". International Internet Preservation Consortium. Archived from the original on 2021-12-20. Retrieved 24 January 2021.
  18. ^ "2010 Winter Olympics". California Digital Library. 2010. Archived from the original on 2011-09-02.
  19. ^ "The Web Curator Tool Release History". ReadTheDocs. Retrieved 2024-10-30.
  20. ^ "Web Curator Tool". GitHub. Retrieved 2024-10-30.
  21. ^ "OpenWayback". International Internet Preservation Consortium. Retrieved 29 October 2024.
  22. ^ "pywb". GitHub. Retrieved 29 October 2024.
  23. ^ "PhD Sponsorship". International Internet Preservation Consortium. Archived from the original on 17 October 2014. Retrieved 17 March 2014.
  24. ^ "Staff Exchange". International Internet Preservation Consortium. Archived from the original on 7 November 2014. Retrieved 17 March 2014.
  25. ^ "Statistics and Quality Indicators for Web Archiving". International Internet Preservation Consortium. Archived from the original on 7 November 2014. Retrieved 17 March 2014.
[edit]