EECS 1520 Chapter Notes - Chapter 3: Bundesautobahn 66, Huffman Coding, Extended Ascii
mahika.c06 and 40040 others unlocked
54
EECS 1520 Full Course Notes
Verified Note
54 documents
Document Summary
English language character set: 26 letters (upper and lower case), punctuation, numeric digits, etc. Determine the minimum number of bits needed and pair them off. Each character is coded as byte (8 bits) 8th bit is unused (or used for parity check or to indicate extended character set) 128 codes (= 72 ) two general types of codes: Convert hexadecimal to binary in the ascii table. Convert to decimal: 01001010 = 64 + 8+ 2 = 74. 65,536 possible codes (= 162 ) from the standard: contains 38,887 distinct coded characters covering the principal written languages of the world. see: www. unicode. org for more information. Google: unicode and look at wikipedia entry. Meant to be a superset of ascii: the first 256 characters in unicode correspond to the extended ascii character set. Text compression: keyword encoding idea: substitute a frequently used word with a single character. Example: as (^), the (~), and (+), that ($), etc.