MIS 303 Lecture Notes - Lecture 6: Betweenness Centrality, Cluster Analysis, Text Mining
Document Summary
Data warehouse: a logical collection of information, gathered from many different operational databases, that supports business analysis activities and decision-making tasks, the primary purpose of a data warehouse is, to aggregate information throughout an organization. Multidimensional analysis: databases contain information in a series of two-dimensional tables. In a data warehouse and data mart, information is multidimensional, it contains layers of columns and rows: dimension: a particular attribute of information, cube: common term for the representation of multidimensional information. Information and cleansing: an organization must maintain high-quality data in the data warehouse. Information cleansing or scrubbing: a process that weeds out and fixes or discards inconsistent, incorrect, or incomplete information. Information cleansing activities: missing records or attributes, redundant records, missing keys or other required data, erroneous relationships or references. Inaccurate data: making them one customer, standardizing customer name from operational systems.