skip to main content
10.5555/3171004.3171038guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
research-article
Free access

Towards the ultimate text compression method for help systems

Published: 02 April 1991 Publication History

Abstract

State-of-the-art in the area of text compression for practical applications seems to be rapidly approaching its theoretical limit. This limit was estimated at 1.2 bits per character [6] (on the basis of the ability of a human to predict textual data input), and it establishes the ultimate goal of every investigation. This paper reports a research which led to an algorithm displaying a good-faith agreement between the objective of compression and the performance of contemporary computational resources, bringing the efficiency as high as 1.6 bits per character. It presents a step-by-step reasoning which directed the study toward the discovery of an intuitively perfect technique and discusses various heuristic rules that have been utilized to amplify the rudimentary engine.

References

[1]
Abrahamson, D.M. {1989}. An Adaptive Dependency Source Model for Data Compression, Communications of the ACM, January 1989, Vol. 32. No. 1. pp. 77--83.
[2]
Bentley, J.L., Sleator, D.D., Tarjan, R.E. and Wei, V.K. {1986}. A Locally Adaptive Data Compression Scheme, Communications of the ACM, April 1986, Vol. 29. No. 4., pp. 320--330.
[3]
Faller, N. {1973}. An Adaptive System for Data Compression, Conference Record of the Seventh IEEE Asilomar Conference on Circuits and Systems, pp. 593--597.
[4]
Gallager, R.G. {1978}. Variations on a Theme by Huffman, IEEE Transactions on Information Theory 24:6, pp. 668--674.
[5]
Huffman, D.A. {1952}. A Method for Construction of Minimum-Redundancy Codes, Proceedings of the IRE 40, pp. 1098--1101.
[6]
Shannon, C.E. {1951}. Prediction of Entropy of Printed English Text, Bell System Technical Journal 30, pp. 50--64.
[7]
Skaba, W. {1990}. Adaptive Huffman Squeezing with Refresh-Decay Heuristic, Submitted to: Communications of the ACM, July 1990.
[8]
Storer, J.A. {1988}. Data Compression: Methods and Theory, Computer Science Press, Rockville, Maryland, pp. 89--95.
[9]
Tazelaar, J.M., et al. {1988}. In Depth: Hypertext, BYTE October 1988, Vol. 13, No. 10, pp. 234--268.
[10]
Welch, T.A. {1984}. A Technique for High-Performance Data Compression, IEEE Computer 17:6, pp. 8--19.
[11]
Ziv, J. and Lempel, A. {1978}. Compression of Individual Sequences via Variable-Rate Coding, IEEE Transactions on Information Theory 24:5, pp. 530--536.

Recommendations

Comments

Information & Contributors

Information

Published In

RIAO '91: Intelligent Text and Image Handling - Volume 2
April 1991
450 pages

Publisher

LE CENTRE DE HAUTES ETUDES INTERNATIONALES D'INFORMATIQUE DOCUMENTAIRE

Paris, France

Publication History

Published: 02 April 1991

Author Tags

  1. data compression
  2. huffman coding
  3. off-line methods

Qualifiers

  • Research-article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 44
    Total Downloads
  • Downloads (Last 12 months)29
  • Downloads (Last 6 weeks)3
Reflects downloads up to 28 Dec 2024

Other Metrics

Citations

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media