.pf{position:relative;background-color:#fff;overflow:hidden;margin:0;border:0}.pc{position:absolute;border:0;padding:0;margin:0;top:0;left:0;width:100%;height:100%;overflow:hidden;display:block;transform-origin:0 0;-ms-transform-origin:0 0;-webkit-transform-origin:0 0}.bi{position:absolute;border:0;margin:0}.c{position:absolute;border:0;padding:0;margin:0;overflow:hidden;display:block}.t{position:absolute;white-space:pre;font-size:1px;transform-origin:0 100%;-ms-transform-origin:0 100%;-webkit-transform-origin:0 100%;unicode-bidi:bidi-override;-moz-font-feature-settings:"liga" 0}.t:after{content:''}.t:before{content:'';display:inline-block}.t span{position:relative;unicode-bidi:bidi-override}._{display:inline-block;color:transparent;z-index:-1}.pi{display:none}@media screen{.pf{margin:13px auto;box-shadow:1px 1px 3px 1px #333;border-collapse:separate}}.ff1{font-family:ff1;line-height:.94;font-style:normal;font-weight:400;visibility:visible}@font-face{font-family:ff2;src:url(data:application/font-woff;base64,d09GRgABAAAAAAfoAA8AAAAAD3QAAQAAAAAAAAAAAAAAAAAAAAAAAAAAAABGRlRNAAAHzAAAABwAAAAcXf8aCkdERUYAAAewAAAAHAAAAB4AJwANT1MvMgAAAdAAAAA/AAAAVlTQmaZjbWFwAAACLAAAAEYAAAFK9fQL/2N2dCAAAAQoAAAAHAAAABwYdQMgZnBnbQAAAnQAAAFtAAAEKP36yaxnbHlmAAAEVAAAAiQAAATosOqghmhlYWQAAAFYAAAANgAAADbrqilQaGhlYQAAAZAAAAAeAAAAJAWvAyZobXR4AAACEAAAABwAAAAcCg4AbGxvY2EAAAREAAAAEAAAABABogPObWF4cAAAAbAAAAAdAAAAIAIpAe9uYW1lAAAGeAAAAQMAAAIlYh80MXBvc3QAAAd8AAAAMwAAAEqHOwM0cHJlcAAAA+QAAABEAAAATZYPIDMAAQAAAAEAAP/t8G1fDzz1AB8D6AAAAAC2Q2AAAAAAANNNhZkADAAAAtsC0gAAAAgAAgAAAAAAAHicY2BkYGC6xAAEzPIMDP//M91mYGRABewAWKAD2wAAeJxjYGRgYGBnMGFgZgABJjApB2Iw7gQxAQt0ARwAAAB4nGNgZFRknMDAysDA1MW0h4GBoQdCMz5gMGRkAooysDIzwACjAwMCBKS5pgAphd8MTJdAfAjJwAgiALSKCV0AAWwAIQAAAAABTQAAAAEAAAJBAAwB9AAzAx8ADHicY2BgYGaAYBkGRgYQcAHyGMF8FgYNIM0GpBkZmBgUfjP8/w/kg+n/j1lZoOqBgJGNAc5hZAISTAyogBFixXAGAM/OCPAAAHicrdC7TsMwFAZguymFUi7lUig1lWxF7VAFsTNlcCKhLIEy2AsXqZVo3wEpSxcPPIvZzNYXQ3DiRlGAqMrQJbZPrF+/P4OQFxnUjMUHxu/S4O+F4aj/iZrIeXq8Mgh7lAYzrvEzHGoeDEYMdo5HQ+0MwnvhSqqoup0oGtLXl4muD+wKP6ZKXlONxmIG3wfBtC9Jvp1KeQM59TSnbnOUhIR5ljC3CRDwBZe2vIhqZxiLO6ETTrTPJWGMBnoZC73khEkJtxp5U1jfZt2s8zZ0boxgs7NKGUMGREilslNtyPRSKaLgJXbiMoNRNoCXpnecQWCwH9tfvstIOnCZy6CH5JDd9KKxCKAJS5vs/iNFvEDaKpLuQb2WJd3fEOlBFdLDSqTtctIj6NxOSY/LSd01oPnALxFOVsJJifDJL+HT9cKdovAZtO1Y4fMNCXerCF9UEu6VCxPo3EuFL3Nhn2hUFE7+gKKNk/cL5D+gKTaWAAAAeJzbwSCizbiLgQkIxbQZ9zMwM9gxmDNoM8gzCDMwMMhoM+wHynihCu1iYAFCae0dDIwKrrWZEi47GLiAHAZtAIURCvIUAAAUAEsAWgAAABH/OgALAgUADALKABEAIQJ5AAAAKgAqACoAKgEGAU4CdHicrZLPbhJRFMbPvfOPOtgyVSiRWDIQsaUwUEaMTYiKiaTauCTFRdOkqdpHcOEL+AAu6FvAaqDbGlxoMnErD+DGHTuF+B3AcRJTSaozOfnO/d2bk3PP/UiSTSTy8pQUMsjpCCrVuoYa/Vbp6Nqw1lUkUuoojDXGXUO/+qPWFcxdy7Zuu1bWFktfBwN5On5pyxbKUZRIPpKfaUs89Oj61hlJlFcoiWwTSsg8UmIeGf5MV4YeSWgca4KuzTUNvgm95c+0OCSPxOOWR2Yp1SOTrtx/PgcpBqkQ0BnoFAlAgkEiBHIMciEgGUhSA5BnkJ+BXKyHzuUIW+hGRWh+n1Zw3rRWd9AR9lexX9foLZI2Lw7qFkWQJBEbiHuIXUQLcYJ4jYgc9CkdVMlwXdx0g2/sl7dF1sJfdat37rqVxFrcEdmMflP8sY7jlHhz3tzff9KIpqPmDTOdOf+dHh83RX/SEP29Zw/qqvpUVZKZ3VC6N2mgBZeE+Cg/TL3wgl/Oo0gJL4GIoCfd56nyhJQRv0+P1Gm2FLyKyvNSQ4+gMdBmA1SDAWoxLiJHuJ5t2ddgI9GefBKuO3kFF72T7fGOHKAL9tEX9hGVyBW9i7x0RgVkxUu5qggtQwvQ7blWg/tcwkNlBuUQWOzUZQbLIbDOYD0EHAZOCPzVqVNQYVC52LooCloAZRM7VPgPJv5V5Z9MjHOGu9DMh03+FnlaHp28H38n+gmAuRkHeJylj7tqAkEUhr/xFgKSyiLllCay4m6hwSZIQATRrCmMZRYZFmHZhfVS5EHyLnmlPEX+1SlTBBw4M9+c858b0OYLQ3UMHR4917jh1XOdAd+eG3RMx3OTtnnx3JL/U0rTuJWne86quMYdz57rfPDuuSHNj+cm9+bBc4uuiVkQs2LOhgk9ZjgyTroP7NiSsBQfZSzi1Xwz6c1cdnKH3TZZuqO8bwqlEmSSlvq69JglgikFuYpUbymFwxLR13qWsey/bS/qiCEBI1mkCiFPalDkh2lRps5G/YEd27/GkzsaBqMgGoTKuHLTtUIle4mrzaymuGzD2pX7XZHbUHNc2+UXvN9aoQB4nGNgYgCD/xYMRgzYADsQMzIwMTAzMjEyM7Kwl+ZlujkZGHD4Jeam+qbqwRlGAMQVCPQAeJxjYGRgYOABYjEgZmJgBEI2IGYB8xgAA+AANQAAAAEAAAAA1G40cAAAAAC2Q2AAAAAAANNNhZk=) format("woff")}.ff2{font-family:ff2;line-height:.722;font-style:normal;font-weight:400;visibility:visible}.m0{transform:matrix(.320260,0,0,.320260,0,0);-ms-transform:matrix(.320260,0,0,.320260,0,0);-webkit-transform:matrix(.320260,0,0,.320260,0,0)}.ls0{letter-spacing:0}.sc0{text-shadow:-.015em 0 transparent,0 .015em transparent,.015em 0 transparent,0 -.015em transparent}@media screen and (-webkit-min-device-pixel-ratio:0){.sc0{-webkit-text-stroke:.015em transparent;text-shadow:none}}.ws0{word-spacing:0}._0{margin-left:-4.1216px}._1{margin-left:-1.6128px}.fc0{color:#000}.fs0{font-size:44.8px}.y27{bottom:14.347678px}.y26{bottom:45.092676px}.y25{bottom:92.235006px}.y24{bottom:108.632338px}.y1{bottom:115.293728px}.y23{bottom:125.029670px}.y0{bottom:130.66624px}.y22{bottom:141.427002px}.y21{bottom:157.824334px}.y20{bottom:174.221666px}.y1f{bottom:190.618998px}.y1e{bottom:207.016330px}.y1d{bottom:223.413662px}.y1c{bottom:239.810994px}.y1b{bottom:256.208326px}.y1a{bottom:272.605658px}.y19{bottom:289.002990px}.y18{bottom:319.747987px}.y17{bottom:366.890317px}.y16{bottom:383.287649px}.y15{bottom:399.684981px}.y14{bottom:416.082313px}.y13{bottom:432.479645px}.y12{bottom:448.876977px}.y11{bottom:465.274309px}.y10{bottom:481.671641px}.yf{bottom:498.068973px}.ye{bottom:514.466305px}.yd{bottom:530.863637px}.yc{bottom:547.260969px}.yb{bottom:563.658302px}.ya{bottom:580.055634px}.y9{bottom:596.452966px}.y8{bottom:612.850298px}.y7{bottom:643.595295px}.y6{bottom:690.737625px}.y5{bottom:707.134957px}.y4{bottom:723.532289px}.y3{bottom:739.929621px}.y2{bottom:770.674618px}.h3{height:32.7488px}.h1{height:731.474734px}.h2{height:783.997438px}.h0{height:1014.58492px}.w1{width:87.110826px}.w2{width:599.527453px}.w0{width:783.997438px}.x2{left:0}.x3{left:40.99333px}.x4{left:81.98666px}.x1{left:92.234993px}.x0{left:117.215303px}.x5{left:122.97999px}

COMPSCI 98 Lecture Notes - Lecture 4: Universal Coded Character Set, English Alphabet, Gzip

New homework assignment on hu man trees, which will be put up this weekend (09/31 - 10/01) Code for trees and hu man encoding trees provided. Way of encoding text that"s used across many programming languages and systems. Utf-8: correspondence between those integers and bytes (0 to 255) A byte is 8 bits and can encode any integer 0-255. Variable-length encoding: integers vary in the number of bytes required to encode them. In python: string length is measured in characters, bytes length in bytes. Fewer bytes are used for more common characters, while more bytes are used for less common characters. Demo in class demonstrating various utf-8, ascii, and encoding functionalities in. One of the types in python is a bytes value, which is a range. We require an encoding without a deterministic decoding (with no collisions) 5-bit representation accounts for lower-case letters of the english alphabet, but no upper-case letters.

United States

Directed Group Study

Computer Science

John De Nero

University of California - Berkeley

The Structure and Interpretation of Computer Programs

Data Structures

Great Ideas of Computer Architecture (Machine Structures)

Physics for Scientists and Engineers

Designing Information Devices and Systems I

CIS 2500 Lecture Notes - Lecture 2: Binary File, Text Editor, Scanf Format String

CSC 1100 Lecture Notes - Lecture 4: Conio.H, Assembly Language, Machine Code

INFO 200 Study Guide - Midterm Guide: Determinism, Likert Scale, Royal Institute Of Technology

COMPSCI 98 Lecture Notes - Lecture 4: Universal Coded Character Set, English Alphabet, Gzip

Document Summary

Get access

Related Documents

CIS 2500 Lecture Notes - Lecture 2: Binary File, Text Editor, Scanf Format String

CSC 1100 Lecture Notes - Lecture 4: Conio.H, Assembly Language, Machine Code

INFO 200 Study Guide - Midterm Guide: Determinism, Likert Scale, Royal Institute Of Technology