My watch list

Entropy encoding

In information theory an entropy encoding is a lossless data compression scheme that is independent of the media’s specific characteristics.

Additional recommended knowledge

Safe Weighing Range Ensures Accurate Results

How to quickly check pipettes?

Guide to balance cleaning: 8 simple steps

One of the main types of entropy coding assigns codes to symbols so as to match code lengths with the probabilities of the symbols. Typically, these entropy encoders are used to compress data by replacing symbols represented by equal-length codes with symbols represented by codes where the length of each codeword is proportional to the negative logarithm of the probability. Therefore, the most common symbols use the shortest codes.

According to Shannon's source coding theorem, the optimal code length for a symbol is −log_bP, where b is the number of symbols used to make output codes and P is the probability of the input symbol.

Two of the most common entropy encoding techniques are Huffman coding and arithmetic coding. If the approximate entropy characteristics of a data stream are known in advance (especially for signal compression), a simpler static code may be useful. These static codes include universal codes (such as Elias gamma coding or Fibonacci coding) and Golomb codes (such as unary coding or Rice coding).

Entropy as a measure of similarity

Besides using entropy encoding as a way to compress (and losslessly recover) digital data, an entropy encoder can also be used to measure the amount of similarity between streams of data. This is done by generating an entropy coder/compressor for each class of data; unknown data is then classified by feeding the uncompressed data to each compressor and seeing which compressor yields the highest compression. The coder with the best compression is probably the coder trained on the data that was most similar to the unknown data.

An earlier (open content) version of the above article was posted on PlanetMath.

v • d • e

Data compression methods

Lossless compression methods

Theory	Entropy · Complexity · Redundancy
Entropy encoding	Huffman · Adaptive Huffman · Arithmetic (Shannon-Fano · Range) · Golomb · Exp-Golomb · Universal (Elias · Fibonacci) · Asymmetric binary
Dictionary	RLE · LZ77/78 · LZW · LZWL · LZO · DEFLATE · LZMA · LZX
Others	CTW · BWT · PPM · DMC

Audio compression methods

Theory	Convolution · Sampling · Nyquist–Shannon theorem
Audio codec parts	LPC (LAR · LSP) · WLPC · CELP · ACELP · A-law · μ-law · MDCT · Fourier transform · Psychoacoustic model
Others	Dynamic range compression · Speech compression · Sub-band coding

Image compression methods

Terms	Color space · Pixel · Chroma subsampling · Compression artifact
Methods	RLE · Fractal · Wavelet · SPIHT · DCT · KLT
Others	Bit rate · Test images · PSNR quality measure · Quantization

Video compression

Terms	Video Characteristics · Frame · Frame types · Video quality
Video codec parts	Motion compensation · DCT · Quantization
Others	Video codecs · Rate distortion theory (CBR · ABR · VBR)

Timeline of information theory, data compression, and error-correcting codes

See Compression Formats and Standards for formats and Compression Software Implementations for codecs

Category: Entropy and information

This article is licensed under the GNU Free Documentation License. It uses material from the Wikipedia article "Entropy_encoding". A list of authors is available in Wikipedia.

Last viewed

Entropy encoding

Additional recommended knowledge

Safe Weighing Range Ensures Accurate Results

How to quickly check pipettes?

Guide to balance cleaning: 8 simple steps

Entropy as a measure of similarity

About chemeurope.com

About LUMITOS

Advertise with LUMITOS