EECS 1520 Lecture Notes - Lecture 11: Huffman Coding, Extended Ascii, Compression Ratio

harlequinmanatee745

14 Oct 2018

School: York University

Department: Electrical Engineering and Computer Science

Course: EECS 1520

Professor: Andriy Pavlovych

REPRESENTING TEXT

• Text is a combination of characters

• To represent every character, a list of characters is made, and each one is assigned a binary string

• A character set is a list of characters and the codes used to represent each one

• Experts in the computer industry agree on the specifics of a character set, thereby creating a standard for sharing data
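The idea of a character set can be sketched in a few lines of Python. This is a toy illustration only: the five characters, the 3-bit width, and the names `char_to_code`, `encode`, and `decode` are assumptions made for the example, not part of any real standard.

```python
# Toy character set: each character in the list is assigned a
# fixed-width binary string (hypothetical 3-bit codes).
characters = ["a", "b", "c", "d", "e"]
WIDTH = 3

# Character -> binary string, e.g. "a" -> "000", "b" -> "001"
char_to_code = {ch: format(i, f"0{WIDTH}b") for i, ch in enumerate(characters)}
# Binary string -> character, used for decoding
code_to_char = {code: ch for ch, code in char_to_code.items()}


def encode(text):
    """Replace each character by its binary string."""
    return "".join(char_to_code[ch] for ch in text)


def decode(bits):
    """Split the bit string into WIDTH-sized chunks and look each one up."""
    chunks = (bits[i:i + WIDTH] for i in range(0, len(bits), WIDTH))
    return "".join(code_to_char[chunk] for chunk in chunks)


print(encode("bead"))           # 001100000011
print(decode(encode("bead")))   # bead
```

Both sides must use the same table for decoding to work, which is why an agreed standard matters.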

American Standard Code for Information Interchange (ASCII) character set

• The first 32 codes (0 to 31) are control characters (or hidden characters); they control how the text appears but do not appear as text themselves

• Decimal (Arabic) digits (i.e., 0, 1, 2, etc.) start at code 48

o Ex. the code for 6 is 48 + 6 = 54

• Uppercase Roman letters (i.e., A, B, C, etc.) start at 65

o Ex. the code for J (10th letter) is 65 + (10 - 1) = 74

• Lowercase Roman letters (i.e., a, b, c, etc.) start at 97

o Ex. the code for j (10th letter) is 97 + (10 - 1) = 106

• Notice that corresponding uppercase and lowercase letters are separated by 32 (106 - 74 = 32)
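The ASCII arithmetic above can be checked with Python's built-in `ord` (character to code) and `chr` (code to character). The helper names `digit_code`, `upper_code`, and `lower_code` are made up for this sketch.

```python
def digit_code(d):
    """Code of the decimal digit d (0-9): digits start at 48."""
    return ord("0") + d


def upper_code(n):
    """Code of the n-th uppercase letter (A is the 1st): starts at 65."""
    return ord("A") + (n - 1)


def lower_code(n):
    """Code of the n-th lowercase letter (a is the 1st): starts at 97."""
    return ord("a") + (n - 1)


print(digit_code(6), chr(digit_code(6)))      # 54 6
print(upper_code(10), chr(upper_code(10)))    # 74 J
print(lower_code(10), chr(lower_code(10)))    # 106 j

# Matching uppercase and lowercase letters are always 32 apart.
print(lower_code(10) - upper_code(10))        # 32
```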

Unicode character set

• Even extended ASCII (8 bits, 256 characters) is not enough for international use

• Unicode, as originally designed, uses 16 bits per character and can represent 2^16 = 65,536 (over 65 thousand) characters

• Unicode is a superset of ASCII

o The first 256 characters in the Unicode character set correspond exactly to the extended ASCII character set

• For organization, Unicode is divided into blocks of characters, with each block having a common theme
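A short Python check of the numbers above. One assumption is stated loudly: "extended ASCII" is taken here to mean the ISO 8859-1 (Latin-1) 8-bit encoding, which is the one whose 256 codes line up with the first 256 Unicode code points; other 8-bit extensions of ASCII differ in the upper half.

```python
# Number of distinct values that fit in 16 bits.
print(2 ** 16)  # 65536

# Decode all 256 possible byte values as Latin-1 (assumed meaning of
# "extended ASCII") and compare with Unicode code points 0..255.
latin1_chars = bytes(range(256)).decode("latin-1")
unicode_first_256 = "".join(chr(i) for i in range(256))
print(latin1_chars == unicode_first_256)  # True

# Plain ASCII codes are unchanged in Unicode: A is still 65.
print(ord("A"))  # 65
```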

Data Compression

• A reduction in the amount of space needed to store a piece of data

• Compression ratio: the size of the compressed data divided by the size of the original data

• A data compression technique can be

o Lossless: the data can be retrieved without losing any of the original information

o Lossy: some information may be lost in the process of compaction

• Ex.

o Keyword encoding
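The compression ratio and the lossless property defined above can be illustrated with Python's standard `zlib` module. This is only a sketch: `zlib` is a general-purpose lossless compressor, not the keyword encoding named in the notes, and the sample text and the helper name `compression_ratio` are assumptions for the example.

```python
import zlib


def compression_ratio(original, compressed):
    """Size of the compressed data divided by the size of the original."""
    return len(compressed) / len(original)


# Repetitive sample data (made up); repetition compresses well.
original = b"the quick brown fox jumps over the lazy dog. " * 40
compressed = zlib.compress(original)

ratio = compression_ratio(original, compressed)
print(len(original), len(compressed), round(ratio, 3))

# Lossless: decompressing gives back exactly the original bytes.
restored = zlib.decompress(compressed)
print(restored == original)  # True
```

A ratio below 1 means the compressed data is smaller than the original; the closer to 0, the greater the saving.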
