3DCompression.com 
the homepage for information on compressing 3D graphics and other complex datasets for fast transmission and cheap storage
   
Home Page
Demos/Downloads
Short Overview
Info Theory Backgrnd
Links
   New Links
   People
   Companies
   Software
   3D models 
   Standards Info
   Related Books
   Conferences
   Papers
   Patents
   Other related sites
About the author and this site
Ask a question

 
 
 

A static mirror of this site is available at Georgia Tech: mirror.
 

Send us your questions, comments, and new links. our email
 

This site and all its pages are Copyright (C) 1999 - 2001, Davis King, all rights reserved.

Net Resources on Data Compression and Information Theory

Data Compression is the art of squeezing a large amount of information into a small space.  Compression algorithms are widely used in applications that depend on storing and transmitting data efficiently, including the Internet, fax machines, cell phones, DVD players, database applications, and satellite communications.

Information Theory provides the scientific basis for data compression and for other technologies such as encryption, error correction, pattern recognition, signal processing, and similarity comparisons.  Information theory has also been widely influential in research on theoretical computer science, the physics of computation, fiber optics, psychology, and even music and molecular biology.

Overviews and Tutorials

  • Information theory and first principles
    • Encyclopedia Britannica article on Information Theory
    • Encyclopedia Britannica article on data compression
  • Text compression
    • Arithmetic Coding -- Mark Nelson, "Arithmetic Coding + Statistical Modeling=Data Compression," Dr. Dobb's Journal, February, 1991
    • Huffman coding and LZ coding, with java applet demos
    • huffman coding (a/k/a minimum-redundancy coding)
    • dictionary coding (lempel-ziv, LZW, gzip)
  • Mathematical advances in compression
    • wavelet compression
    • fractal compression -- siggraph '92 course notes from a good webpage on fractal image encoding
    • vector quantization
    • markov modeling
  • JPEG, MPEG and other standards for multimedia compression

Seminal Papers and literature surveys

  • C. E. Shannon, "A Mathematical Theory of Communication," Bell System Tech Journal,vol. 27, pp. 379-423 and 623-656, July and October, 1948.  (the original paper on information theory)
  • Kolmogorov Complexity/ algorithmic information theory
  • Huffman, paper on huffman coding
  • Witten, Neal, and Cleary, "Arithmetic Coding for Data Compression," Comm. ACM, 30(6):520--541, June 1987 (long considered the standard for arithmetic coding)
  • Moffat, Neal, and Witten, "Arithmetic Coding Revisited", ACM Transactions on Information Systems, 16(3):256-294, July 1998 (modern improvements to the speed of arithmetic coding)
  • wavelets for compression
  • R. M. Gray and D. L. Neuhoff, "Quantization", invited to IEEE Transactions on Information Theory, October 1998. (a comprehensive overview of scalar and vector quantization)
  • fractal compression

Source Code/Implementation Notes

  • Michael Schinler's notes on Practical Huffman coding
  • Implementation of arithmetic coding  from Moffat, Neal, and Witten
  • Another directory with links to sources for LZ and dynamic markov coding

Other Web Directories

  • Dr. Dobb's Journal data compression page -- a snazzy, well-organized page that links some of the best sites for programmers seeking to understand and implement compression algorithms
  • The Data Compression Library -- a good directory organized by topic.  allows users to search, rate sites, and submit new sites
  • Links2go compression page -- many excellent links, little organization
  • Dr. Ross's Compression Crypt -- stylish, high quality overview, from a former compression researcher
  • Arimura's compression bookmarks -- compression links clearly organized by subject areas
  • Compression Pointers -- alphabetical list of people in the field, plus companies and research projects
  • A nice overview of data compression, emphasizing huffman coding, markov modeling, and images (in a really long file)

Companies and Organizations

Books

  • D. Slepian, editor, Key Papers in the Development of Information Theory, New York: IEEE Press, 1974.
  • Managing Gigabytes: Compressing and Indexing Documents and Images by Ian H. Witten, Alistair Moffat, and Timothy C. Bell, published by Morgan Kaufmann, San Francisco, 1999.
  • W. Weaver and C. E. Shannon, The Mathematical Theory of Communication, Urbana, Illinois: University of Illinois Press, 1949, republished in paperback 1963.

Researchers

Some Frequently Asked Questions on Data Compression

  •  Comp.compression FAQ
  1. Can all data sets be compressed? How far can a data set be compressed?
  2. Can any compression algorithm achieve the maximum possible compression rate for a text string?
  3. How can a string of symbols be compressed with less than one bit per symbol?
  4. Why isn't arithmetic coding used more often?
  5. What is the best compression method to use for (text? images? music? speech? video? 3D graphics?)
  6. Are these algorithms patented?
  7. What are open research topics in compression?

 
 

Copyright (C) 1999-2001, Davis King

This page, the database, and all the pages on this site are copyright (C) 1999-2001, Davis King, all rights reserved, and it may not be reproduced or mirrored in any form or medium without permission, except for automatic caching or, with proper citation and attribution, for "fair use." Anyone is welcome to link to any of the pages on this site; we have included all the links here in good faith and will remove any links at the request of the author or owner of the linked page.  This site is intended for informational and research purposes only, and nothing in this site should be construed as an offer to sell or an endorsement of any product, or as a warranty of the suitability or effectiveness of any technology for any particular application.