IST 195 Lecture Notes - Lecture 6: Parallel Computing, Clickstream
Document Summary
We live in a world ruled by big data. A huge amount of data is being collected and stored: commerce, e-commerce, bank transactions, public economic data. The world created 1 ZB (zettabyte) of data, or 1 billion terabytes, in 2011. 1,826 PB (petabytes) of data is pushed over the internet daily. Variety = different forms of data; where it's coming from. Velocity = how fast it's coming in. There are more than these, but these are the most common. Unstructured = structure is not formally defined or anticipated. People are now attempting to structure this kind of data. Geofences take a physical area and build a digital fence around it. Companies would use database vendors to process large amounts of data. Google was one of the first companies to tackle big data. They chopped the data into chunks and mapped each chunk to a different computer to be processed; the results were then compiled into one report. Parallel computing = multiple computers working on the same task at the same time.
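The chunk-and-compile idea described above can be sketched in a few lines of Python. This is only an illustration, not Google's actual system: the function names (split_into_chunks, count_words, combine, parallel_word_count) and the sample data are hypothetical, and worker threads on one machine stand in for the separate computers.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(records, n_chunks):
    """Chop the data into roughly equal chunks, one per worker."""
    size = max(1, -(-len(records) // n_chunks))  # ceiling division
    return [records[i:i + size] for i in range(0, len(records), size)]


def count_words(chunk):
    """Map step: a worker counts the words in its own chunk only."""
    counts = Counter()
    for line in chunk:
        counts.update(line.lower().split())
    return counts


def combine(partial_counts):
    """Compile step: merge every worker's partial result into one report."""
    total = Counter()
    for counts in partial_counts:
        total.update(counts)
    return total


def parallel_word_count(records, n_workers=4):
    """Split the records, process the chunks side by side, then merge."""
    chunks = split_into_chunks(records, n_workers)
    # Assumption: threads play the role of the separate computers.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial = list(pool.map(count_words, chunks))
    return combine(partial)


# Hypothetical sample data, made up for this sketch.
sample = ["big data big", "data velocity", "variety of data"]
report = parallel_word_count(sample, n_workers=3)
print(report.most_common(2))
```

The final report is the same as if one worker had counted everything; the only difference is that the counting work is shared out, which is the point of parallel computing.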