The Canterbury Corpus

Welcome to the Canterbury Corpus

The Canterbury Corpus is a benchmark that lets researchers evaluate lossless compression methods. This site provides the test files, along with compression test results for many research compression methods.

Site Contents

purpose
What the Corpus is, and why
summary
A summary of the compression test results
details
More detailed results, including some statistical analysis
corpora
Descriptions of the various corpora
methods
Descriptions of the compression methods
research
Research on the corpus and compression in general (includes papers and reports in PDF format)
related
Links to related web sites dealing with lossless compression and compression in general
credits
Who did what

This page last updated Tuesday, November 20, 2001 by Matt Powell, Department of Computer Science, University of Canterbury