Downloads & Free Reading Options - Results

Web Archiving by Julien Masanès

Read "Web Archiving" by Julien Masanès through these free online access and download options.

Search for Downloads

Search by Title or Author

Books Results

Source: The Internet Archive

The internet Archive Search Results

Available books for downloads and borrow from The internet Archive

1Web Archiving And The IIPC - German Subtitles

By

Source: https://www.youtube.com/watch?v=07luBORrFr4 Uploader: IIPC

“Web Archiving And The IIPC - German Subtitles” Metadata:

  • Title: ➤  Web Archiving And The IIPC - German Subtitles
  • Author:

“Web Archiving And The IIPC - German Subtitles” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 47.71 Mbs, the file-s for this book were downloaded 16 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find Web Archiving And The IIPC - German Subtitles at online marketplaces:


289J7-UFA3: Web Archiving And The Global Response To COVID-19…

Perma.cc archive of https://cgs.illinois.edu/news/2020-06-02/web-archiving-and-global-response-covid-19 created on 2022-09-06 20:34:56.738885+00:00.

“89J7-UFA3: Web Archiving And The Global Response To COVID-19…” Metadata:

  • Title: ➤  89J7-UFA3: Web Archiving And The Global Response To COVID-19…

Edition Identifiers:

Downloads Information:

The book is available for download in "web" format, the size of the file-s is: 1.64 Mbs, the file-s for this book were downloaded 214 times, the file-s went public at Wed Sep 07 2022.

Available formats:
Archive BitTorrent - Item CDX Index - Item CDX Meta-Index - Metadata - WARC CDX Index - Web ARChive GZ -

Related Links:

Online Marketplaces

Find 89J7-UFA3: Web Archiving And The Global Response To COVID-19… at online marketplaces:


3Web Archiving In The United States 2017 NDSA Survey Instrument

The survey was available from October 2 - November 20, 2017 via SurveyMonkey. Survey respondents were solicited through mailing lists, blogs, social media and other channels. After the survey closed, the working group reviewed a total of 156 responses and removed 39 responses that were tests, substantially incomplete, or from outside the United States, leaving a total of 119 responses for analysis. Respondents were not required to answer all questions. Percentages reported for individual questions reflect the number of responses to that question rather than the total number of respondents participating in the survey.

“Web Archiving In The United States 2017 NDSA Survey Instrument” Metadata:

  • Title: ➤  Web Archiving In The United States 2017 NDSA Survey Instrument
  • Language: English

“Web Archiving In The United States 2017 NDSA Survey Instrument” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 7.70 Mbs, the file-s for this book were downloaded 62 times, the file-s went public at Thu Sep 20 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Web Archiving In The United States 2017 NDSA Survey Instrument at online marketplaces:


4Archive-It Advanced Training: Web Archiving For Lone Arrangers

By

Archive-It partners discuss and answer questions about how they each manage their collecting as the only person responsible for web archiving at their institution.

“Archive-It Advanced Training: Web Archiving For Lone Arrangers” Metadata:

  • Title: ➤  Archive-It Advanced Training: Web Archiving For Lone Arrangers
  • Author:
  • Language: English

“Archive-It Advanced Training: Web Archiving For Lone Arrangers” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 262.81 Mbs, the file-s for this book were downloaded 156 times, the file-s went public at Fri Jul 24 2020.

Available formats:
Archive BitTorrent - Item Tile - MPEG4 - Metadata - Thumbnail -

Related Links:

Online Marketplaces

Find Archive-It Advanced Training: Web Archiving For Lone Arrangers at online marketplaces:


5Requirements For Archiving IETF Email Lists And For Providing Web-Based Browsing And Searching

By

The IETF makes heavy use of email lists to conduct its work. Participants frequently need to search and browse the archives of these lists and have asked for improved search capabilities. The current archive mechanism could also be made more efficient. This memo captures the requirements for improved email list archiving and searching systems. This document is not an Internet Standards Track specification; it is published for informational purposes.

“Requirements For Archiving IETF Email Lists And For Providing Web-Based Browsing And Searching” Metadata:

  • Title: ➤  Requirements For Archiving IETF Email Lists And For Providing Web-Based Browsing And Searching
  • Author:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 3.50 Mbs, the file-s for this book were downloaded 24 times, the file-s went public at Tue Jan 24 2023.

Available formats:
Archive BitTorrent - DjVuTXT - Djvu XML - HTML - Item Tile - JSON - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Requirements For Archiving IETF Email Lists And For Providing Web-Based Browsing And Searching at online marketplaces:


6Web Archiving At Henderson Libraries

Presentation from Community Webs program cohort member Henderson District Public Libraries.

“Web Archiving At Henderson Libraries” Metadata:

  • Title: ➤  Web Archiving At Henderson Libraries

“Web Archiving At Henderson Libraries” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 14.69 Mbs, the file-s for this book were downloaded 783 times, the file-s went public at Mon Jun 18 2018.

Available formats:
Archive BitTorrent - JPEG Thumb - MPEG4 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find Web Archiving At Henderson Libraries at online marketplaces:


7Why Archiving The Internet Is Vital And How A Decentralized Web Can Help

By

Why archiving the internet is vital for the future, how a decentralized web can help, and why looking at history is a good way to predict the future Source: https://the-future-rules-forkast-news-x-filecoin-foundation.simplecast.com/episodes/archiving-the-internet-x2EIV1xM Uploader: tubeup.py

“Why Archiving The Internet Is Vital And How A Decentralized Web Can Help” Metadata:

  • Title: ➤  Why Archiving The Internet Is Vital And How A Decentralized Web Can Help
  • Author:

“Why Archiving The Internet Is Vital And How A Decentralized Web Can Help” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "audio" format, the size of the file-s is: 22.93 Mbs, the file-s for this book were downloaded 17 times, the file-s went public at Wed Dec 14 2022.

Available formats:
Archive BitTorrent - Columbia Peaks - Item Tile - JPEG - JPEG Thumb - JSON - Metadata - PNG - Spectrogram - Unknown - VBR MP3 -

Related Links:

Online Marketplaces

Find Why Archiving The Internet Is Vital And How A Decentralized Web Can Help at online marketplaces:


8IIPC WAC2021: WASAPIfying Private Web Archiving Tools For Persistence And Collaboration

By

IIPC WAC 2021: SESSION 10: ARCHIVING FRAMEWORKS & TOOLS Mat Kelly: WASAPIfying private web archiving tools for persistence and collaboration https://netpreserve.org/ga2021/ Source: https://www.youtube.com/watch?v=BsXPBfrEC_8 Uploader: IIPC

“IIPC WAC2021: WASAPIfying Private Web Archiving Tools For Persistence And Collaboration” Metadata:

  • Title: ➤  IIPC WAC2021: WASAPIfying Private Web Archiving Tools For Persistence And Collaboration
  • Author:

“IIPC WAC2021: WASAPIfying Private Web Archiving Tools For Persistence And Collaboration” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 72.27 Mbs, the file-s for this book were downloaded 18 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC WAC2021: WASAPIfying Private Web Archiving Tools For Persistence And Collaboration at online marketplaces:


9DIY Web Archiving Zine V4

By

Do It Yourself guide to archiving web content. Version 4. 23 pages. Covers Webrecorder, Archivweb.page and Browsertrix. "You can be part of making sure the stuff you care about on the web doesn't disappear! Anything you access via the web, including datasets, fanfic, encyclopedia articles, videos, websites, online museum exhibits, online community forums, news articles, and more.  We're hoping to give you concrete tools and steps that you can take like right now today to start preserving things that you are concerned might no longer be there in a month, 3 months, a year. This zine introduces you to accessible tools and best practices to get started, and why you should act rather than assuming others will take care of things."

“DIY Web Archiving Zine V4” Metadata:

  • Title: DIY Web Archiving Zine V4
  • Author: ➤  
  • Language: English

“DIY Web Archiving Zine V4” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 11.07 Mbs, the file-s for this book were downloaded 7 times, the file-s went public at Thu Apr 10 2025.

Available formats:
Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find DIY Web Archiving Zine V4 at online marketplaces:


10IIPC Training Video Case Study, Topic 1: Building Web Archiving Skills

By

Produced by the International Internet Preservation Consortium Training Working Group, this video features IIPC members discussing how they became web archiving professionals, what technical skills are useful to have when starting out, what all librarians and archivists should know about web archiving, and more! This video is 1 of 8 in a series of topical training videos. Full length videos of the interviews are also available. Filmed in June 2019 in Zagreb, Croatia. Source: https://www.youtube.com/watch?v=74vnixC_jf8 Uploader: IIPC

“IIPC Training Video Case Study, Topic 1: Building Web Archiving Skills” Metadata:

  • Title: ➤  IIPC Training Video Case Study, Topic 1: Building Web Archiving Skills
  • Author:

“IIPC Training Video Case Study, Topic 1: Building Web Archiving Skills” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 318.01 Mbs, the file-s for this book were downloaded 9 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC Training Video Case Study, Topic 1: Building Web Archiving Skills at online marketplaces:


11206.12 LT18 Web Archiving At The National Library Of Colombia

By

Biblioteca Nacional de Colombia has begun the Colombian web archive as a part of digital deposit managed by Biblioteca Nacional de Colombia. Initially, for this task we used HTTrack harvesting tool however this tool didn’t preserve contents in warc digital format. So, Biblioteca Nacional de Colombia decided to change for webrecorder.io online harvesting tool. The purpose of the lighting talk is to make people aware of the importance of our project. Throughout the course of our project we have overcome obstacles that have created difficulties, such as no support from legislative law, lack of funds and lack of an infrastructure and needed to preserve digital memory of Colombia. However, we have managed to work together with Ministry of Commerce and National Directorate of Copyright to modify the legislation that included changes in the legal e-deposit that allows the Library to preserve Colombian internet. Finally, we have to implement the appropriate harvesting system and a needed infrastructure to preserve the digital memory of Colombia.

“206.12 LT18 Web Archiving At The National Library Of Colombia” Metadata:

  • Title: ➤  206.12 LT18 Web Archiving At The National Library Of Colombia
  • Author:

Edition Identifiers:

Downloads Information:

The book is available for download in "data" format, the size of the file-s is: 6.04 Mbs, the file-s for this book were downloaded 1 times, the file-s went public at Mon Aug 23 2021.

Available formats:
Archive BitTorrent - Metadata - ZIP -

Related Links:

Online Marketplaces

Find 206.12 LT18 Web Archiving At The National Library Of Colombia at online marketplaces:


12Decentralization And Archiving: Threats, Challenges And Opportunities For Decentralized Web Archiving In A Changing World.

By

This talk occurred at DWeb Camp 2023 at Camp Navarro, CA. It was given by  Lorena Ramirez-Lopez, Sawood Alam, Jack Cushman, Benedict Lau, Tessa Walsh, and Ilya Kreymer. Link to SCHED This panel will focus on challenging but important questions about the role of decentralization and how it affects/is affected by digital preservation, specifically web and social media archiving.  Speakers Dr. Sawood Alam: Dr. Sawood is a Web and Data Scientist of the Wayback Machine at the Internet Archive (IA). He pursued his Masters and PhD degrees from the Old Dominion University while working with the Web Science and Digital Libraries (WS-DL) Research Group. Jack Cushman: Jack Cushman is the director of the Harvard Library Innovation Lab, a software and design lab at the Harvard Law School Library building tools and communities for open knowledge.  Tessa Walsh: Senior Applications and Tools Engineer, Webrecorder Ilya Kreymer: Webrecorder

“Decentralization And Archiving: Threats, Challenges And Opportunities For Decentralized Web Archiving In A Changing World.” Metadata:

  • Title: ➤  Decentralization And Archiving: Threats, Challenges And Opportunities For Decentralized Web Archiving In A Changing World.
  • Author: ➤  

“Decentralization And Archiving: Threats, Challenges And Opportunities For Decentralized Web Archiving In A Changing World.” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 2577.22 Mbs, the file-s for this book were downloaded 76 times, the file-s went public at Thu Aug 10 2023.

Available formats:
Archive BitTorrent - Intermediate ASR JSON - Item Tile - MP3 - MPEG4 - Metadata - PNG - SubRip - Thumbnail - Web Video Text Tracks - Whisper ASR JSON - h.264 720P -

Related Links:

Online Marketplaces

Find Decentralization And Archiving: Threats, Challenges And Opportunities For Decentralized Web Archiving In A Changing World. at online marketplaces:


13Ukrainian-web-archiving-KPIX-4-13-2022.mov

By

Ukrainian Web Archiving KPIX April 13, 2022 includes show clip of Brewster Kahle Source: https://archive.org/details/ukrainian-web-archiving-kpix-4-13-2022 Uploader: [email protected]

“Ukrainian-web-archiving-KPIX-4-13-2022.mov” Metadata:

“Ukrainian-web-archiving-KPIX-4-13-2022.mov” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 203.32 Mbs, the file-s for this book were downloaded 10 times, the file-s went public at Tue Dec 13 2022.

Available formats:
Archive BitTorrent - Item Tile - JPEG - JPEG Thumb - JSON - Metadata - QuickTime - Thumbnail - Unknown - h.264 -

Related Links:

Online Marketplaces

Find Ukrainian-web-archiving-KPIX-4-13-2022.mov at online marketplaces:


14IIPC WAC202: Contemporary Art Knowledge A Community Based Approach To Web Archiving

By

IIPC WAC 2021: SESSION 6: ARCHIVING COMMUNITIES Hélène Brousseau: Contemporary art knowledge: a community-based approach to web archiving https://netpreserve.org/ga2021/ Source: https://www.youtube.com/watch?v=_z44i56Syug Uploader: IIPC

“IIPC WAC202: Contemporary Art Knowledge A Community Based Approach To Web Archiving” Metadata:

  • Title: ➤  IIPC WAC202: Contemporary Art Knowledge A Community Based Approach To Web Archiving
  • Author:

“IIPC WAC202: Contemporary Art Knowledge A Community Based Approach To Web Archiving” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 130.99 Mbs, the file-s for this book were downloaded 2 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC WAC202: Contemporary Art Knowledge A Community Based Approach To Web Archiving at online marketplaces:


15ODU Web Archiving Activities 2017

Presentation by Michael Nelson at the National Symposium on Web Archiving Interoperability

“ODU Web Archiving Activities 2017” Metadata:

  • Title: ➤  ODU Web Archiving Activities 2017
  • Language: English

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 9.21 Mbs, the file-s for this book were downloaded 188 times, the file-s went public at Tue Jun 13 2017.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find ODU Web Archiving Activities 2017 at online marketplaces:


16IIPC WAC2021: A Beginner's Journey To Capture A Local Event Through Web Archiving Activities

By

IIPC WAC 2021 SESSION 17: LIGHTINING TALK Yoo Young Lee, Marina Bokovay: Starting Small but Dreaming Big - A beginner's journey to capture a local event through web archiving activities Source: https://www.youtube.com/watch?v=7PAMkAsESYU Uploader: IIPC

“IIPC WAC2021: A Beginner's Journey To Capture A Local Event Through Web Archiving Activities” Metadata:

  • Title: ➤  IIPC WAC2021: A Beginner's Journey To Capture A Local Event Through Web Archiving Activities
  • Author:

“IIPC WAC2021: A Beginner's Journey To Capture A Local Event Through Web Archiving Activities” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 32.93 Mbs, the file-s for this book were downloaded 7 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC WAC2021: A Beginner's Journey To Capture A Local Event Through Web Archiving Activities at online marketplaces:


17Web Archiving And The IIPC

By

Source: https://www.youtube.com/watch?v=wG7dRjtGWDk Uploader: IIPC

“Web Archiving And The IIPC” Metadata:

  • Title: Web Archiving And The IIPC
  • Author:

“Web Archiving And The IIPC” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 47.17 Mbs, the file-s for this book were downloaded 7 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find Web Archiving And The IIPC at online marketplaces:


18Requirements For Improvements To The IETF Email List Archiving, Web-Based Browsing, And Search Tool

By

The web-based IETF email archive search tool based on the requirements captured in RFC 6778 was deployed in January 2014. This memo captures the requirements for a set of improvements that have been identified during its initial years of community use.

“Requirements For Improvements To The IETF Email List Archiving, Web-Based Browsing, And Search Tool” Metadata:

  • Title: ➤  Requirements For Improvements To The IETF Email List Archiving, Web-Based Browsing, And Search Tool
  • Author:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 3.00 Mbs, the file-s for this book were downloaded 45 times, the file-s went public at Thu Jan 26 2023.

Available formats:
Archive BitTorrent - DjVuTXT - Djvu XML - HTML - Item Tile - JSON - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find Requirements For Improvements To The IETF Email List Archiving, Web-Based Browsing, And Search Tool at online marketplaces:


19Web Archiving And Retrieval Appliance

The web-based IETF email archive search tool based on the requirements captured in RFC 6778 was deployed in January 2014. This memo captures the requirements for a set of improvements that have been identified during its initial years of community use.

“Web Archiving And Retrieval Appliance” Metadata:

  • Title: ➤  Web Archiving And Retrieval Appliance

Edition Identifiers:

Downloads Information:

The book is available for download in "software" format, the size of the file-s is: 733.72 Mbs, the file-s for this book were downloaded 1028 times, the file-s went public at Thu Oct 25 2007.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVu - DjVuTXT - Djvu XML - GZIP - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Web Archiving And Retrieval Appliance at online marketplaces:


20Advanced Training - The Web Archiving Systems API (WASAPI)

By

Archive-It advanced training webinar on the Web Archiving Systems API (WASAPI) -- a tool to access, download, and transfer WARC file data and metadata.

“Advanced Training - The Web Archiving Systems API (WASAPI)” Metadata:

  • Title: ➤  Advanced Training - The Web Archiving Systems API (WASAPI)
  • Author:

“Advanced Training - The Web Archiving Systems API (WASAPI)” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 371.57 Mbs, the file-s for this book were downloaded 1596 times, the file-s went public at Thu May 30 2019.

Available formats:
Archive BitTorrent - Item Tile - MPEG4 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find Advanced Training - The Web Archiving Systems API (WASAPI) at online marketplaces:


21Ukrainian Web Archiving KPIX April 13, 2022

By

Ukrainian Web Archiving KPIX April 13, 2022 includes show clip of Brewster Kahle

“Ukrainian Web Archiving KPIX April 13, 2022” Metadata:

  • Title: ➤  Ukrainian Web Archiving KPIX April 13, 2022
  • Author:
  • Language: English

“Ukrainian Web Archiving KPIX April 13, 2022” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 202.92 Mbs, the file-s for this book were downloaded 4749 times, the file-s went public at Thu Apr 14 2022.

Available formats:
Archive BitTorrent - Item Tile - Metadata - QuickTime - Thumbnail - h.264 -

Related Links:

Online Marketplaces

Find Ukrainian Web Archiving KPIX April 13, 2022 at online marketplaces:


22Github.com-iipc-awesome-web-archiving_-_2017-06-17_12-08-45

By

An Awesome List for getting started with web archiving Awesome Web Archiving Introduction An Awesome List for getting started with web archiving. Inspired by the awesome list. Table of Contents Training/Documentation Tools & Software Community Resources Deprecated Contribute Please ensure your pull request adheres to the following guidelines: Use the following format: [Name](link) (Status: Stable or In Development ) - Brief Description of what the module does Make an individual pull request for each new item. Link additions should be inserted alphabetically to the relavant category. New categories or improvements to the existing categorization are welcome. Check your spelling and grammar. The pull request and commit should have a useful title. License To the extent possible under law, the owner has waived all copyright and related or neighboring rights to this work. The List Training/Documentation Introductions to web archiving concepts: What is a web archive? video from the UK Web Archive YouTube Channel Glossary of Archive-It and Web Archiving Terms More advanced material: Awesome Memento docs.warcbase.org The WARC Ecosystem warcbase workshop Tools & Software Acquisition ArchiveFacebook (Stable) - A Mozilla Firefox add-on for individuals to archive their Facebook accounts. Brozzler (Stable) - A distributed web crawler (爬虫) that uses a real browser (chrome or chromium) to fetch pages and embedded urls and to extract links. F(b)arc (Stable) - A commandline tool and Python library for archiving data from Facebook using the Graph API . Heritrix (Stable) - An open source, extensible, web-scale, archival quality web crawler. grab-site (Stable) - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns. HTTrack (Stable) - An open source website copying utility. Lentil (Stable) - A Ruby on Rails Engine that supports the harvesting of images from Instagram and provides several browsing views, mechanisms for sharing, tools for users to select their favorite images, an administrative interface for moderating images, and a system for harvesting images and submitting donor agreements in preparation of ingest into external repositories. SiteStory (Stable) - A transactional archive that selectively captures and stores transactions that take place between a web client (browser) and a web server. twarc (Stable) - A command line tool and Python library for archiving Twitter JSON data. WARCreate (Stable) - A Google Chrome extension for archiving an individual webpage or website to a WARC file. WAIL (Stable) - A graphical user interface (GUI) atop multiple web archiving tools intended to be used as an easy way for anyone to preserve and replay web pages; Python , Electron . Web2Warc (Stable) - An easy-to-use and highly customizable crawler that enables anyone to create their own little Web archives (WARC/CDX). Webrecorder (Stable) - Create high-fidelity, interactive recordings of any web site you browse. Wget (Stable) - An open source file retrieval utility that of version 1.14 supports writing warcs . Wget-lua (Stable) - Wget with Lua extension. Wpull (Stable) - A Wget-compatible (or remake/clone/replacement/alternative) web downloader and crawler. Warcat - Tool and library for handling Web ARChive (WARC) files. Replay PyWb (Stable) - A Python (2 and 3) implementation of web archival replay tools, sometimes also known as 'Wayback Machine'. OpenWayback (Stable) - The open source project aimed to develop Wayback Machine, the key software used by web archives worldwide to play back archived websites in the user's browser. Webrecorder Player Webrecorder Player for Desktop OSX/Windows/Linux). (Built with Electron + Webrecorder) Search & Discovery Shine (Stable) - A prototype web archives exploration UI, based on a Solr back-end that has been populated using the warc-discovery indexer. warc-discovery (Stable) - WARC and ARC indexing and discovery tools. WARClight (In Development) - Blacklight instance operating on WARCs indexed using warc-discovery Utilities HadoopConcatGz (Stable) - A Splitable Hadoop InputFormat for Concatenated GZIP Files (and *.warc.gz) Jwat (Stable) - Libraries and tools for reading/writting/validating WARC/ARC/GZIP files. Warcat (Stable) - Tool and library for handling Web ARChive (WARC) files. wasapi-downloader (Stable) - Java command line application to download crawls from WASAPI. WarcPartitioner (Stable) - Partition (W)ARC Files by MIME Type and Year Analysis ArchiveSpark (Stable) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation. warcbase (Stable) - Warcbase is an open-source platform for managing analyzing web archives. Community Resources Blogs and Scholarship IIPC Blog Web Archiving Roundtable - Currently dormant, but is a great archive of web archiving resources and links. The Web as History - An open-source book that provides a conceptual overview to web archiving research, as well as several case studies. Mailing Lists IIPC OpenWayback WASAPI Slack Ask @netpreserve for access to the IIPC Slack Ask @ianmilligan1 for access to the Archives Unleashed Slack , a researcher group of people working with web archives. Twitter IIPC #webarchives Deprecated pywb Wayback Web Recorder (Archiver) (Sunsetted) - A bare-bones example of how to create a simple web recording and replay system. Warrick (Unknown) - An open source downloadable tool or web service for reconstructing websites from web archives, using Memento . To restore the repository, download the bundle iipc-awesome-web-archiving_-_2017-06-17_12-08-45.bundle and run: git clone iipc-awesome-web-archiving_-_2017-06-17_12-08-45.bundle -b master Source: https://github.com/iipc/awesome-web-archiving Uploader: iipc Upload date: 2017-06-17

“Github.com-iipc-awesome-web-archiving_-_2017-06-17_12-08-45” Metadata:

  • Title: ➤  Github.com-iipc-awesome-web-archiving_-_2017-06-17_12-08-45
  • Author:

“Github.com-iipc-awesome-web-archiving_-_2017-06-17_12-08-45” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "software" format, the size of the file-s is: 0.16 Mbs, the file-s for this book were downloaded 65 times, the file-s went public at Sat Jun 17 2017.

Available formats:
Archive BitTorrent - Item Tile - JPEG - JPEG Thumb - Metadata - Unknown -

Related Links:

Online Marketplaces

Find Github.com-iipc-awesome-web-archiving_-_2017-06-17_12-08-45 at online marketplaces:


23The Web Archiving Life Cycle Model

By

A white paper describing the steps and phases that an institution or individual experiences as they implement and develop a web archiving program.

“The Web Archiving Life Cycle Model” Metadata:

  • Title: ➤  The Web Archiving Life Cycle Model
  • Authors:
  • Language: English

“The Web Archiving Life Cycle Model” Subjects and Themes:

Edition Identifiers:

  • Internet Archive ID: WALCM

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 24.15 Mbs, the file-s for this book were downloaded 627 times, the file-s went public at Thu Jun 01 2017.

Available formats:
Abbyy GZ - Archive BitTorrent - Cloth Cover Detection Log - Daisy - DjVuTXT - Djvu XML - EPUB - Item Tile - Metadata - OCR Page Index - OCR Search Text - Page Numbers JSON - Scandata - Single Page Processed JP2 ZIP - Text PDF - chOCR - hOCR -

Related Links:

Online Marketplaces

Find The Web Archiving Life Cycle Model at online marketplaces:


24K12 Web Archiving Program Students

By

This video features 8th grade participants in the Archive-It K12 Web Archiving Program. The program, now in it's fifth year, asks students to collaboratively think about what topics they want to collect web content on and to save for future generations. The students in this video are from Moran Middle School in Wallingford, CT. They have been asked to reflect on their experiences in the program. The K-12 Web Archiving program has been a partnership between the Internet Archive and the Library of Congress. For more information please visit the website at http://archive-it.org/k12/

“K12 Web Archiving Program Students” Metadata:

  • Title: ➤  K12 Web Archiving Program Students
  • Author:

“K12 Web Archiving Program Students” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 168.23 Mbs, the file-s for this book were downloaded 5049 times, the file-s went public at Fri Feb 25 2011.

Available formats:
512Kb MPEG4 - Animated GIF - Archive BitTorrent - Item Tile - MPEG4 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find K12 Web Archiving Program Students at online marketplaces:


25Web Archiving In The United States A 2017 Survey

By

National Digital Stewardship Alliance Releases 2017 Web Archiving Survey Report From the National Digital Stewardship Alliance (via Digital Library Federation): The National Digital Stewardship Alliance is pleased to announce the release of the 2017 Web Archiving Survey Report.

“Web Archiving In The United States A 2017 Survey” Metadata:

  • Title: ➤  Web Archiving In The United States A 2017 Survey
  • Author: ➤  
  • Language: English

“Web Archiving In The United States A 2017 Survey” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 13.39 Mbs, the file-s for this book were downloaded 67 times, the file-s went public at Wed Dec 12 2018.

Available formats:
Abbyy GZ - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Web Archiving In The United States A 2017 Survey at online marketplaces:


26IIPC Web Archiving Conference 2019 In Zagreb: Dr Anat Ben-David (keynote)

By

IIPC WAC 2019 Keynote Dr Anat Ben-David: Web archives as Memoryware: Critical reflections on sources and methods for web history http://netpreserve.org/ga2019/programme/keynote-speakers/ Source: https://www.youtube.com/watch?v=2kRC2X88kF4 Uploader: IIPC

“IIPC Web Archiving Conference 2019 In Zagreb: Dr Anat Ben-David (keynote)” Metadata:

  • Title: ➤  IIPC Web Archiving Conference 2019 In Zagreb: Dr Anat Ben-David (keynote)
  • Author:

“IIPC Web Archiving Conference 2019 In Zagreb: Dr Anat Ben-David (keynote)” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 465.55 Mbs, the file-s for this book were downloaded 6 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC Web Archiving Conference 2019 In Zagreb: Dr Anat Ben-David (keynote) at online marketplaces:


27Archiving Web Pages With Hadoop And Pig

By

Amateur video of Aaron Binns' presentation at the Web Archiving Cooperative (WAC) Workshop at Stanford University on June 30, 2012.

“Archiving Web Pages With Hadoop And Pig” Metadata:

  • Title: ➤  Archiving Web Pages With Hadoop And Pig
  • Author:

“Archiving Web Pages With Hadoop And Pig” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 1736.16 Mbs, the file-s for this book were downloaded 187 times, the file-s went public at Thu Jul 05 2012.

Available formats:
Animated GIF - Archive BitTorrent - Item Tile - MPEG4 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find Archiving Web Pages With Hadoop And Pig at online marketplaces:


28Code4Lib 2008 Lightning Talk: Web Archiving Service

By

Mike Wooldridge describes the Web Archiving Service at the California Digital Library.

“Code4Lib 2008 Lightning Talk: Web Archiving Service” Metadata:

  • Title: ➤  Code4Lib 2008 Lightning Talk: Web Archiving Service
  • Author:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 253.27 Mbs, the file-s for this book were downloaded 78 times, the file-s went public at Tue Jun 03 2008.

Available formats:
512Kb MPEG4 - Animated GIF - Archive BitTorrent - Item Tile - MPEG2 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find Code4Lib 2008 Lightning Talk: Web Archiving Service at online marketplaces:


29Jason Scott Presentation At American Archivists Meeting ( Web Archiving Group) August 2014

By

Presentation by Jason Scott at the American Archivists Meeting, Web Archiving Group, in August 2014.

“Jason Scott Presentation At American Archivists Meeting ( Web Archiving Group) August 2014” Metadata:

  • Title: ➤  Jason Scott Presentation At American Archivists Meeting ( Web Archiving Group) August 2014
  • Author:
  • Language: English

“Jason Scott Presentation At American Archivists Meeting ( Web Archiving Group) August 2014” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "audio" format, the size of the file-s is: 603.65 Mbs, the file-s for this book were downloaded 92 times, the file-s went public at Sat Aug 23 2014.

Available formats:
Archive BitTorrent - Checksums - Flac - Flac FingerPrint - Item Tile - Metadata - Ogg Vorbis - PNG - VBR MP3 - WAVE -

Related Links:

Online Marketplaces

Find Jason Scott Presentation At American Archivists Meeting ( Web Archiving Group) August 2014 at online marketplaces:


30SOS: Save Our Site! Archiving Web Content

 A recorded webinar with TechSoup about the Community Webs program offered by the Internet Archive in partnership with WebJunction, and with funding from the Institute of Museum and Library Services. The Community Webs program offers public librarians a chance to participate in a program of continuing education, training, and services to enable public libraries to build collections of historically-valuable, web published materials documenting their local communities.  

“SOS: Save Our Site! Archiving Web Content” Metadata:

  • Title: ➤  SOS: Save Our Site! Archiving Web Content

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 223.07 Mbs, the file-s for this book were downloaded 444 times, the file-s went public at Fri Aug 11 2017.

Available formats:
Archive BitTorrent - Item Tile - MPEG4 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find SOS: Save Our Site! Archiving Web Content at online marketplaces:


31IIPC WAC2021: Capturing Social Movements Web Archiving Needs Of Activist Collections

By

IIPC WAC 2021: SESSION 13: LIGHTNING TALK Bethany Aylward: Capturing social movements: Web archiving needs of activist collections in Yorkshire Source: https://www.youtube.com/watch?v=nY_1ECjMrnM Uploader: IIPC

“IIPC WAC2021: Capturing Social Movements Web Archiving Needs Of Activist Collections” Metadata:

  • Title: ➤  IIPC WAC2021: Capturing Social Movements Web Archiving Needs Of Activist Collections
  • Author:

“IIPC WAC2021: Capturing Social Movements Web Archiving Needs Of Activist Collections” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 17.10 Mbs, the file-s for this book were downloaded 5 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC WAC2021: Capturing Social Movements Web Archiving Needs Of Activist Collections at online marketplaces:


32Why I Started Archiving The Internet And How The Decentralized Web Could Make It Last

By

Brewster Kahle started the Internet Archive, a non-profit to save digital content in 1996. Learn more about how and why he's archived much of the internet and how decentralized storage solutions like Filecoin can help. Source: https://www.youtube.com/watch?v=Ne_NyqYkQgo Uploader: Filecoin

“Why I Started Archiving The Internet And How The Decentralized Web Could Make It Last” Metadata:

  • Title: ➤  Why I Started Archiving The Internet And How The Decentralized Web Could Make It Last
  • Author:

“Why I Started Archiving The Internet And How The Decentralized Web Could Make It Last” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 89.70 Mbs, the file-s for this book were downloaded 5 times, the file-s went public at Wed Dec 14 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - MPEG4 - Metadata - Thumbnail - Unknown -

Related Links:

Online Marketplaces

Find Why I Started Archiving The Internet And How The Decentralized Web Could Make It Last at online marketplaces:


33IIPC Training Video Case Study, Topic 4: The Evolution And Challenges Of Web Archiving

By

Produced by the International Internet Preservation Consortium Training Working Group, this video features IIPC members discussing significant changes they've seen since starting web archiving, their biggest challenges, and the future goals of their institution's web archiving program. This video is 4 of 8 in a series of topical training videos. Full length videos of the interviews are also available. Filmed in June 2019 in Zagreb, Croatia. Source: https://www.youtube.com/watch?v=5kkQ5kLiuDk Uploader: IIPC

“IIPC Training Video Case Study, Topic 4: The Evolution And Challenges Of Web Archiving” Metadata:

  • Title: ➤  IIPC Training Video Case Study, Topic 4: The Evolution And Challenges Of Web Archiving
  • Author:

“IIPC Training Video Case Study, Topic 4: The Evolution And Challenges Of Web Archiving” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 356.60 Mbs, the file-s for this book were downloaded 5 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC Training Video Case Study, Topic 4: The Evolution And Challenges Of Web Archiving at online marketplaces:


34The Web Ahead 97: Archiving The Internet With Jason Scott

By

The Internet Archive is a treasure trove of digitized culture — films, software, audio, websites and more. How it it being collected, and how might the Internet Archive be our best hope for preserving the history of this era, as we invent the web? Jason Scott joins Jen Simmons to talk about the challenges of archiving in the digital age.

“The Web Ahead 97: Archiving The Internet With Jason Scott” Metadata:

  • Title: ➤  The Web Ahead 97: Archiving The Internet With Jason Scott
  • Author:
  • Language: English

“The Web Ahead 97: Archiving The Internet With Jason Scott” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "audio" format, the size of the file-s is: 105.82 Mbs, the file-s for this book were downloaded 71 times, the file-s went public at Sun Mar 01 2015.

Available formats:
Archive BitTorrent - Columbia Peaks - Item Tile - Metadata - Ogg Vorbis - PNG - Spectrogram - VBR MP3 -

Related Links:

Online Marketplaces

Find The Web Ahead 97: Archiving The Internet With Jason Scott at online marketplaces:


35Web Archiving And The IIPC - Japanese Subtitles

By

Source: https://www.youtube.com/watch?v=jGrNrkr28ZI Uploader: IIPC

“Web Archiving And The IIPC - Japanese Subtitles” Metadata:

  • Title: ➤  Web Archiving And The IIPC - Japanese Subtitles
  • Author:

“Web Archiving And The IIPC - Japanese Subtitles” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 48.40 Mbs, the file-s for this book were downloaded 7 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find Web Archiving And The IIPC - Japanese Subtitles at online marketplaces:


36Github.com-iipc-awesome-web-archiving_-_2017-06-19_15-54-40

By

An Awesome List for getting started with web archiving Awesome Web Archiving Introduction An Awesome List for getting started with web archiving. Inspired by the awesome list. Table of Contents Training/Documentation Tools & Software Community Resources Deprecated Contribute Please ensure your pull request adheres to the following guidelines: Use the following format: [Name](link) (Status: Stable or In Development ) - Brief Description of what the module does Make an individual pull request for each new item. Link additions should be inserted alphabetically to the relavant category. New categories or improvements to the existing categorization are welcome. Check your spelling and grammar. The pull request and commit should have a useful title. License To the extent possible under law, the owner has waived all copyright and related or neighboring rights to this work. The List Training/Documentation Introductions to web archiving concepts: What is a web archive? video from the UK Web Archive YouTube Channel Glossary of Archive-It and Web Archiving Terms More advanced material: Awesome Memento docs.warcbase.org The WARC Ecosystem warcbase workshop Tools & Software Acquisition ArchiveFacebook (Stable) - A Mozilla Firefox add-on for individuals to archive their Facebook accounts. Brozzler (Stable) - A distributed web crawler (爬虫) that uses a real browser (chrome or chromium) to fetch pages and embedded urls and to extract links. F(b)arc (Stable) - A commandline tool and Python library for archiving data from Facebook using the Graph API . grab-site (Stable) - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns. Heritrix (Stable) - An open source, extensible, web-scale, archival quality web crawler. html2warc (Stable) - A simple script to covert offline data into a single warc file HTTrack (Stable) - An open source website copying utility. Lentil (Stable) - A Ruby on Rails Engine that supports the harvesting of images from Instagram and provides several browsing views, mechanisms for sharing, tools for users to select their favorite images, an administrative interface for moderating images, and a system for harvesting images and submitting donor agreements in preparation of ingest into external repositories. SiteStory (Stable) - A transactional archive that selectively captures and stores transactions that take place between a web client (browser) and a web server. twarc (Stable) - A command line tool and Python library for archiving Twitter JSON data. WARCreate (Stable) - A Google Chrome extension for archiving an individual webpage or website to a WARC file. WAIL (Stable) - A graphical user interface (GUI) atop multiple web archiving tools intended to be used as an easy way for anyone to preserve and replay web pages; Python , Electron . Web2Warc (Stable) - An easy-to-use and highly customizable crawler that enables anyone to create their own little Web archives (WARC/CDX). Webrecorder (Stable) - Create high-fidelity, interactive recordings of any web site you browse. Wget (Stable) - An open source file retrieval utility that of version 1.14 supports writing warcs . Wget-lua (Stable) - Wget with Lua extension. Wpull (Stable) - A Wget-compatible (or remake/clone/replacement/alternative) web downloader and crawler. Replay PyWb (Stable) - A Python (2 and 3) implementation of web archival replay tools, sometimes also known as 'Wayback Machine'. OpenWayback (Stable) - The open source project aimed to develop Wayback Machine, the key software used by web archives worldwide to play back archived websites in the user's browser. Webrecorder Player Webrecorder Player for Desktop OSX/Windows/Linux). (Built with Electron + Webrecorder) Search & Discovery Shine (Stable) - A prototype web archives exploration UI, based on a Solr back-end that has been populated using the warc-discovery indexer. warc-discovery (Stable) - WARC and ARC indexing and discovery tools. WARClight (In Development) - Blacklight instance operating on WARCs indexed using warc-discovery Utilities HadoopConcatGz (Stable) - A Splitable Hadoop InputFormat for Concatenated GZIP Files (and *.warc.gz) Jwat (Stable) - Libraries and tools for reading/writting/validating WARC/ARC/GZIP files. Warcat (Stable) - Tool and library for handling Web ARChive (WARC) files. wasapi-downloader (Stable) - Java command line application to download crawls from WASAPI. WarcPartitioner (Stable) - Partition (W)ARC Files by MIME Type and Year webarchive-indexing - Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. The Archive Browser - The Archive Browser is a program that lets you browse the contents of archives, as well as extract them. It will let you open files from inside archives, and lets you preview them using Quick Look. WARC is supported. (OSX only, Proprietary app) The Unarchiver - Program to extract the contents of many archive formats, inclusive of WARC, to a file system. Free variant of The Archive Browser. (OSX only, Proprietary app) Analysis ArchiveSpark (Stable) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation. warcbase (Stable) - Warcbase is an open-source platform for managing analyzing web archives. Community Resources Blogs and Scholarship IIPC Blog Web Archiving Roundtable - Currently dormant, but is a great archive of web archiving resources and links. The Web as History - An open-source book that provides a conceptual overview to web archiving research, as well as several case studies. Mailing Lists IIPC OpenWayback WASAPI Slack Ask @netpreserve for access to the IIPC Slack Ask @ianmilligan1 for access to the Archives Unleashed Slack , a researcher group of people working with web archives. Twitter IIPC #webarchives Deprecated pywb Wayback Web Recorder (Archiver) (Sunsetted) - A bare-bones example of how to create a simple web recording and replay system. Warrick (Unknown) - An open source downloadable tool or web service for reconstructing websites from web archives, using Memento . To restore the repository download the bundle iipc-awesome-web-archiving_-_2017-06-19_15-54-40.bundle and run: git clone iipc-awesome-web-archiving_-_2017-06-19_15-54-40.bundle -b master Source: https://github.com/iipc/awesome-web-archiving Uploader: iipc Upload date: 2017-06-19

“Github.com-iipc-awesome-web-archiving_-_2017-06-19_15-54-40” Metadata:

  • Title: ➤  Github.com-iipc-awesome-web-archiving_-_2017-06-19_15-54-40
  • Author:

“Github.com-iipc-awesome-web-archiving_-_2017-06-19_15-54-40” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "software" format, the size of the file-s is: 0.17 Mbs, the file-s for this book were downloaded 54 times, the file-s went public at Mon Jun 19 2017.

Available formats:
Archive BitTorrent - Item Tile - JPEG - JPEG Thumb - Metadata - Unknown -

Related Links:

Online Marketplaces

Find Github.com-iipc-awesome-web-archiving_-_2017-06-19_15-54-40 at online marketplaces:


37WARC: Jobs.code4lib.org-jobs-web-archiving 20160802

An Awesome List for getting started with web archiving Awesome Web Archiving Introduction An Awesome List for getting started with web archiving. Inspired by the awesome list. Table of Contents Training/Documentation Tools & Software Community Resources Deprecated Contribute Please ensure your pull request adheres to the following guidelines: Use the following format: [Name](link) (Status: Stable or In Development ) - Brief Description of what the module does Make an individual pull request for each new item. Link additions should be inserted alphabetically to the relavant category. New categories or improvements to the existing categorization are welcome. Check your spelling and grammar. The pull request and commit should have a useful title. License To the extent possible under law, the owner has waived all copyright and related or neighboring rights to this work. The List Training/Documentation Introductions to web archiving concepts: What is a web archive? video from the UK Web Archive YouTube Channel Glossary of Archive-It and Web Archiving Terms More advanced material: Awesome Memento docs.warcbase.org The WARC Ecosystem warcbase workshop Tools & Software Acquisition ArchiveFacebook (Stable) - A Mozilla Firefox add-on for individuals to archive their Facebook accounts. Brozzler (Stable) - A distributed web crawler (爬虫) that uses a real browser (chrome or chromium) to fetch pages and embedded urls and to extract links. F(b)arc (Stable) - A commandline tool and Python library for archiving data from Facebook using the Graph API . grab-site (Stable) - The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns. Heritrix (Stable) - An open source, extensible, web-scale, archival quality web crawler. html2warc (Stable) - A simple script to covert offline data into a single warc file HTTrack (Stable) - An open source website copying utility. Lentil (Stable) - A Ruby on Rails Engine that supports the harvesting of images from Instagram and provides several browsing views, mechanisms for sharing, tools for users to select their favorite images, an administrative interface for moderating images, and a system for harvesting images and submitting donor agreements in preparation of ingest into external repositories. SiteStory (Stable) - A transactional archive that selectively captures and stores transactions that take place between a web client (browser) and a web server. twarc (Stable) - A command line tool and Python library for archiving Twitter JSON data. WARCreate (Stable) - A Google Chrome extension for archiving an individual webpage or website to a WARC file. WAIL (Stable) - A graphical user interface (GUI) atop multiple web archiving tools intended to be used as an easy way for anyone to preserve and replay web pages; Python , Electron . Web2Warc (Stable) - An easy-to-use and highly customizable crawler that enables anyone to create their own little Web archives (WARC/CDX). Webrecorder (Stable) - Create high-fidelity, interactive recordings of any web site you browse. Wget (Stable) - An open source file retrieval utility that of version 1.14 supports writing warcs . Wget-lua (Stable) - Wget with Lua extension. Wpull (Stable) - A Wget-compatible (or remake/clone/replacement/alternative) web downloader and crawler. Replay PyWb (Stable) - A Python (2 and 3) implementation of web archival replay tools, sometimes also known as 'Wayback Machine'. OpenWayback (Stable) - The open source project aimed to develop Wayback Machine, the key software used by web archives worldwide to play back archived websites in the user's browser. Webrecorder Player Webrecorder Player for Desktop OSX/Windows/Linux). (Built with Electron + Webrecorder) Search & Discovery Shine (Stable) - A prototype web archives exploration UI, based on a Solr back-end that has been populated using the warc-discovery indexer. warc-discovery (Stable) - WARC and ARC indexing and discovery tools. WARClight (In Development) - Blacklight instance operating on WARCs indexed using warc-discovery Utilities HadoopConcatGz (Stable) - A Splitable Hadoop InputFormat for Concatenated GZIP Files (and *.warc.gz) Jwat (Stable) - Libraries and tools for reading/writting/validating WARC/ARC/GZIP files. Warcat (Stable) - Tool and library for handling Web ARChive (WARC) files. wasapi-downloader (Stable) - Java command line application to download crawls from WASAPI. WarcPartitioner (Stable) - Partition (W)ARC Files by MIME Type and Year webarchive-indexing - Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. The Archive Browser - The Archive Browser is a program that lets you browse the contents of archives, as well as extract them. It will let you open files from inside archives, and lets you preview them using Quick Look. WARC is supported. (OSX only, Proprietary app) The Unarchiver - Program to extract the contents of many archive formats, inclusive of WARC, to a file system. Free variant of The Archive Browser. (OSX only, Proprietary app) Analysis ArchiveSpark (Stable) - An Apache Spark framework (not only) for Web Archives that enables easy data processing, extraction as well as derivation. warcbase (Stable) - Warcbase is an open-source platform for managing analyzing web archives. Community Resources Blogs and Scholarship IIPC Blog Web Archiving Roundtable - Currently dormant, but is a great archive of web archiving resources and links. The Web as History - An open-source book that provides a conceptual overview to web archiving research, as well as several case studies. Mailing Lists IIPC OpenWayback WASAPI Slack Ask @netpreserve for access to the IIPC Slack Ask @ianmilligan1 for access to the Archives Unleashed Slack , a researcher group of people working with web archives. Twitter IIPC #webarchives Deprecated pywb Wayback Web Recorder (Archiver) (Sunsetted) - A bare-bones example of how to create a simple web recording and replay system. Warrick (Unknown) - An open source downloadable tool or web service for reconstructing websites from web archives, using Memento . To restore the repository download the bundle iipc-awesome-web-archiving_-_2017-06-19_15-54-40.bundle and run: git clone iipc-awesome-web-archiving_-_2017-06-19_15-54-40.bundle -b master Source: https://github.com/iipc/awesome-web-archiving Uploader: iipc Upload date: 2017-06-19

“WARC: Jobs.code4lib.org-jobs-web-archiving 20160802” Metadata:

  • Title: ➤  WARC: Jobs.code4lib.org-jobs-web-archiving 20160802

Edition Identifiers:

Downloads Information:

The book is available for download in "web" format, the size of the file-s is: 3906.85 Mbs, the file-s for this book were downloaded 9088 times, the file-s went public at Wed May 24 2017.

Available formats:
Archive BitTorrent - Item CDX Index - Item CDX Meta-Index - Metadata - WARC CDX Index - Web ARChive GZ -

Related Links:

Online Marketplaces

Find WARC: Jobs.code4lib.org-jobs-web-archiving 20160802 at online marketplaces:


38IIPC Training Video Case Study, Topic 5: Web Archiving Collecting Policies

By

Produced by the International Internet Preservation Consortium Training Working Group, this video features IIPC members discussing web archiving collection policies and scope. This video is 5 of 8 in a series of topical training videos. Full length videos of the interviews are also available. Filmed in June 2019 in Zagreb, Croatia. Source: https://www.youtube.com/watch?v=-NxJXrUTJ8A Uploader: IIPC

“IIPC Training Video Case Study, Topic 5: Web Archiving Collecting Policies” Metadata:

  • Title: ➤  IIPC Training Video Case Study, Topic 5: Web Archiving Collecting Policies
  • Author:

“IIPC Training Video Case Study, Topic 5: Web Archiving Collecting Policies” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 220.34 Mbs, the file-s for this book were downloaded 4 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC Training Video Case Study, Topic 5: Web Archiving Collecting Policies at online marketplaces:


39Web Archiving And The IIPC - Arabic Subtitles

By

Source: https://www.youtube.com/watch?v=9bI-ODibhdU Uploader: IIPC

“Web Archiving And The IIPC - Arabic Subtitles” Metadata:

  • Title: ➤  Web Archiving And The IIPC - Arabic Subtitles
  • Author:

“Web Archiving And The IIPC - Arabic Subtitles” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 47.29 Mbs, the file-s for this book were downloaded 11 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find Web Archiving And The IIPC - Arabic Subtitles at online marketplaces:


40Web Archiving And The IIPC - Spanish Subtitles

By

Source: https://www.youtube.com/watch?v=ZK-ODeTeXxU Uploader: IIPC

“Web Archiving And The IIPC - Spanish Subtitles” Metadata:

  • Title: ➤  Web Archiving And The IIPC - Spanish Subtitles
  • Author:

“Web Archiving And The IIPC - Spanish Subtitles” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 46.34 Mbs, the file-s for this book were downloaded 6 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find Web Archiving And The IIPC - Spanish Subtitles at online marketplaces:


41Archiving Software Surrogates On The Web For Future Reference

By

Software has long been established as an essential aspect of the scientific process in mathematics and other disciplines. However, reliably referencing software in scientific publications is still challenging for various reasons. A crucial factor is that software dynamics with temporal versions or states are difficult to capture over time. We propose to archive and reference surrogates instead, which can be found on the Web and reflect the actual software to a remarkable extent. Our study shows that about a half of the webpages of software are already archived with almost all of them including some kind of documentation.

“Archiving Software Surrogates On The Web For Future Reference” Metadata:

  • Title: ➤  Archiving Software Surrogates On The Web For Future Reference
  • Authors:

“Archiving Software Surrogates On The Web For Future Reference” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 0.59 Mbs, the file-s for this book were downloaded 20 times, the file-s went public at Sat Jun 30 2018.

Available formats:
Archive BitTorrent - Metadata - Text PDF -

Related Links:

Online Marketplaces

Find Archiving Software Surrogates On The Web For Future Reference at online marketplaces:


42IIPC WAC2021: Web Archiving In A Multilingual Environment: An EU Experience

By

IIPC WAC 2021: SESSION 5: MULTI-LINGUAL & COLLABORATIVE ARCHIVING Silvia Sevilla: Web archiving in a multilingual environment: an EU experience https://netpreserve.org/ga2021/ Source: https://www.youtube.com/watch?v=CLDuVVUdlNU Uploader: IIPC

“IIPC WAC2021: Web Archiving In A Multilingual Environment: An EU Experience” Metadata:

  • Title: ➤  IIPC WAC2021: Web Archiving In A Multilingual Environment: An EU Experience
  • Author:

“IIPC WAC2021: Web Archiving In A Multilingual Environment: An EU Experience” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 88.64 Mbs, the file-s for this book were downloaded 3 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC WAC2021: Web Archiving In A Multilingual Environment: An EU Experience at online marketplaces:


43IIPC Web Archiving Conference 2019 In Zagreb: Access Policies, Challenges, And Approaches (panel)

By

ABBIE GROTKE, ALEXANDRE CHAUTEMPS, NICOLA BINGHAM, MARIA RYAN, ALEX THURMAN & DANIEL GOMES: Access policies, challenges, and approaches http://netpreserve.org/ga2019/programme/abstracts/#40 Source: https://www.youtube.com/watch?v=s9tNJHT7rZo Uploader: IIPC

“IIPC Web Archiving Conference 2019 In Zagreb: Access Policies, Challenges, And Approaches (panel)” Metadata:

  • Title: ➤  IIPC Web Archiving Conference 2019 In Zagreb: Access Policies, Challenges, And Approaches (panel)
  • Author:

“IIPC Web Archiving Conference 2019 In Zagreb: Access Policies, Challenges, And Approaches (panel)” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 564.45 Mbs, the file-s for this book were downloaded 2 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC Web Archiving Conference 2019 In Zagreb: Access Policies, Challenges, And Approaches (panel) at online marketplaces:


4429H9-T5LQ: Web Archiving | IIPC

Perma.cc archive of http://www.netpreserve.org/web-archiving/overview created on 2017-05-12 16:03:40+00:00.

“29H9-T5LQ: Web Archiving | IIPC” Metadata:

  • Title: ➤  29H9-T5LQ: Web Archiving | IIPC

Edition Identifiers:

Downloads Information:

The book is available for download in "web" format, the size of the file-s is: 0.57 Mbs, the file-s for this book were downloaded 199 times, the file-s went public at Sat May 13 2017.

Available formats:
Archive BitTorrent - Item CDX Index - Item CDX Meta-Index - Metadata - WARC CDX Index - Web ARChive GZ -

Related Links:

Online Marketplaces

Find 29H9-T5LQ: Web Archiving | IIPC at online marketplaces:


45IIPC Web Archiving Conference 2019 In Zagreb: Studying National Webs (panel)

By

NIELS BRÜGGER, DITTE LAURSEN, FRIEDEL GEERAERT, KEES TESZELSZKY, VALÉRIE SCHAFER & DANIEL GOMES: Opportunities and challenges in collecting and studying national webs http://netpreserve.org/ga2019/programme/abstracts/#36 Source: https://www.youtube.com/watch?v=xH6oN1p6NCU Uploader: IIPC

“IIPC Web Archiving Conference 2019 In Zagreb: Studying National Webs (panel)” Metadata:

  • Title: ➤  IIPC Web Archiving Conference 2019 In Zagreb: Studying National Webs (panel)
  • Author:

“IIPC Web Archiving Conference 2019 In Zagreb: Studying National Webs (panel)” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 544.69 Mbs, the file-s for this book were downloaded 14 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find IIPC Web Archiving Conference 2019 In Zagreb: Studying National Webs (panel) at online marketplaces:


46Web Archiving Crawl Download Automation Demo 3

By

Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.

“Web Archiving Crawl Download Automation Demo 3” Metadata:

  • Title: ➤  Web Archiving Crawl Download Automation Demo 3
  • Author:
  • Language: English

“Web Archiving Crawl Download Automation Demo 3” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 25.74 Mbs, the file-s for this book were downloaded 116 times, the file-s went public at Thu Jun 08 2017.

Available formats:
Archive BitTorrent - Item Tile - MPEG4 - Metadata - Ogg Video - Thumbnail -

Related Links:

Online Marketplaces

Find Web Archiving Crawl Download Automation Demo 3 at online marketplaces:


47Internet Archive - How Would You Capture Your Community’s Web Presence? @buffalolibrary Is Archiving Local Theatre, Festivals, And Community Groups With @archiveitorg: Join Them In #CommunityWebs. Apply Via Call For Applications At

By

How would you capture your community’s web presence? @buffalolibrary is archiving local theatre, festivals, and community groups with @archiveitorg: https://t.co/gtps5ax1jL. Join them in #CommunityWebs. Apply via Call for Applications at https://t.co/jFbrojO63N. https://t.co/iji83Iiomc Source: https://twitter.com/internetarchive/status/1469336103257735169 Uploader: Internet Archive

“Internet Archive - How Would You Capture Your Community’s Web Presence? @buffalolibrary Is Archiving Local Theatre, Festivals, And Community Groups With @archiveitorg: Join Them In #CommunityWebs. Apply Via Call For Applications At” Metadata:

  • Title: ➤  Internet Archive - How Would You Capture Your Community’s Web Presence? @buffalolibrary Is Archiving Local Theatre, Festivals, And Community Groups With @archiveitorg: Join Them In #CommunityWebs. Apply Via Call For Applications At
  • Author:

“Internet Archive - How Would You Capture Your Community’s Web Presence? @buffalolibrary Is Archiving Local Theatre, Festivals, And Community Groups With @archiveitorg: Join Them In #CommunityWebs. Apply Via Call For Applications At” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 0.18 Mbs, the file-s for this book were downloaded 9 times, the file-s went public at Wed Dec 14 2022.

Available formats:
Archive BitTorrent - Item Tile - JPEG - JPEG Thumb - JSON - MPEG4 - Metadata - Thumbnail - Unknown - h.264 IA -

Related Links:

Online Marketplaces

Find Internet Archive - How Would You Capture Your Community’s Web Presence? @buffalolibrary Is Archiving Local Theatre, Festivals, And Community Groups With @archiveitorg: Join Them In #CommunityWebs. Apply Via Call For Applications At at online marketplaces:


48Web Archiving At National Libraries - Findings Of Stakeholders’ Consultation By The Internet Archive

By

Report on Internet Archive's Stakeholder Consultation between November 2015 and March 2016, with the aim to understand current practices, and then review Internet Archive’s current services in this light and explore new aspects for national libraries. This document reports on the consultation and summarises the findings.

“Web Archiving At National Libraries - Findings Of Stakeholders’ Consultation By The Internet Archive” Metadata:

  • Title: ➤  Web Archiving At National Libraries - Findings Of Stakeholders’ Consultation By The Internet Archive
  • Author:
  • Language: English

“Web Archiving At National Libraries - Findings Of Stakeholders’ Consultation By The Internet Archive” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "texts" format, the size of the file-s is: 8.75 Mbs, the file-s for this book were downloaded 908 times, the file-s went public at Thu May 26 2016.

Available formats:
Abbyy GZ - Animated GIF - Archive BitTorrent - DjVuTXT - Djvu XML - Item Tile - Metadata - Scandata - Single Page Processed JP2 ZIP - Text PDF -

Related Links:

Online Marketplaces

Find Web Archiving At National Libraries - Findings Of Stakeholders’ Consultation By The Internet Archive at online marketplaces:


49WAC2021: Memento Tracer: An Innovative Approach Towards Balancing Web Archiving At Scale & Quality

By

IIPC WAC 2021: SESSION 10: ARCHIVING FRAMEWORKS & TOOLS Martin Klein & Herbert Van de Sompel: Memento Tracer - an innovative approach towards balancing web archiving at scale and quality https://netpreserve.org/ga2021/ Source: https://www.youtube.com/watch?v=hEe3B7rc7vE Uploader: IIPC

“WAC2021: Memento Tracer: An Innovative Approach Towards Balancing Web Archiving At Scale & Quality” Metadata:

  • Title: ➤  WAC2021: Memento Tracer: An Innovative Approach Towards Balancing Web Archiving At Scale & Quality
  • Author:

“WAC2021: Memento Tracer: An Innovative Approach Towards Balancing Web Archiving At Scale & Quality” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 70.05 Mbs, the file-s for this book were downloaded 12 times, the file-s went public at Tue May 24 2022.

Available formats:
Archive BitTorrent - Item Tile - JSON - Metadata - Thumbnail - Unknown - WebM - h.264 -

Related Links:

Online Marketplaces

Find WAC2021: Memento Tracer: An Innovative Approach Towards Balancing Web Archiving At Scale & Quality at online marketplaces:


50Jefferson Bailey And Helge Holzman - Web Archiving And Data Services

Presentation to Internet Archive staff

“Jefferson Bailey And Helge Holzman - Web Archiving And Data Services” Metadata:

  • Title: ➤  Jefferson Bailey And Helge Holzman - Web Archiving And Data Services
  • Language: English

“Jefferson Bailey And Helge Holzman - Web Archiving And Data Services” Subjects and Themes:

Edition Identifiers:

Downloads Information:

The book is available for download in "movies" format, the size of the file-s is: 348.10 Mbs, the file-s for this book were downloaded 120 times, the file-s went public at Fri Jan 15 2021.

Available formats:
Archive BitTorrent - Intermediate ASR JSON - Item Tile - MP3 - MPEG4 - Metadata - PNG - SubRip - Thumbnail - Web Video Text Tracks - Whisper ASR JSON - h.264 720P - h.264 IA -

Related Links:

Online Marketplaces

Find Jefferson Bailey And Helge Holzman - Web Archiving And Data Services at online marketplaces:


Buy “Web Archiving” online:

Shop for “Web Archiving” on popular online marketplaces.