DATA COMPRESSION STUDY GUIDE


Terminology and Basics

Run-length coding Huffman coding LZW

Recommended Problems

C level

  1. Decode each of the following LZW-encoded messages or explain briefly why it is not a valid LZW-encoded message. (Recall that codeword 80 is reserved to signify end of file.)
    41 42 43 44 80 
    42 41 4E 82 41 80
    42 41 83 80
    41 42 81 82 80
    41 42 81 83 80
    42 41 4E 44 41 4E 41 80
    
    Answers
  2. Fall 2012 Final, #12 (Huffman)
  3. Spring 2012 Final, #10 (BW)
  4. Textbook 5.5.3

B level

  1. Consider the following Huffman trie of a message over the 5-character alphabet {A, B, C, D, E}:

    Identify each statement with the best matching description (below tagged with letters A to D).
    ---- The frequency of A is strictly less than the frequency of B.
    ---- The frequency of C is greater than or equal to the frequency of A.
    ---- The frequency of D is strictly greater than the frequency of A.
    ---- The frequency of D is greater than or equal to that of A, B, and C combined.
    ---- The frequency of E is strictly less than that of
        A, B, and C combined.
        A. True for all messages.
        B. False for all messages.
        C. Depends on the message.
    
    Answers
  2. Textbook 5.5.13
  3. Textbook 5.5.17

A level

  1. Fall 2012 Final, #13