The WARC File Format (Version 0.16)
draft-kunze-warc-00

Document Type Expired Internet-Draft (individual)
Authors John Kunze  , Gordon Mohr  , Michael Stack 
Last updated 2008-07-05
Stream (None)
Intended RFC status (None)
Formats
Expired & archived
pdf htmlized bibtex
Stream Stream state (No stream defined)
Consensus Boilerplate Unknown
RFC Editor Note (None)
IESG IESG state Expired
Telechat date
Responsible AD (None)
Send notices to (None)

This Internet-Draft is no longer active. A copy of the expired Internet-Draft can be found at
https://www.ietf.org/archive/id/draft-kunze-warc-00.txt

Abstract

The WARC (Web ARChive) format specifies a method for combining multiple digital resources into an aggregate archival file together with related information. Resources are dated, identified by URIs, and preceded by simple text headers. By convention, files of this format are named with the extension ".warc" and have the MIME type application/warc. The WARC file format is a revision and generalization of the ARC format used by the Internet Archive to store information blocks harvested by web crawlers. This document specifies version 0.16 of the WARC format.

Authors

John Kunze (jak@ucop.edu)
Gordon Mohr (gojomo@archive.org)
Michael Stack (stack@archive.org)

(Note: The e-mail addresses provided for the authors of this Internet-Draft may no longer be valid.)