18C5T13 Study Guide - Final Guide: Fraud, Medical Diagnosis, The Algorithm

36 views17 pages

Document Summary

Classification: basic concepts, general approach to solving a classification problem. Decision tree induction: working of decision tree, building a decision tree, methods for expressing an attribute test conditions, measures for selecting the best split, algorithm for decision tree induction. Classification is a form of data analysis that extracts models describing important data classes. Such models, called classifiers, predict categorical (discrete, unordered) class labels. We can build a classification model to categorize bank loan applications as either safe or risky. Such analysis can help provide us with a better understanding of the data at large. Many classification methods have been proposed by researchers in machine learning, pattern recognition, and statistics. And developing scalable classification and prediction techniques capable of handling large amounts of disk- resident data. Classification has numerous applications, including: fraud detection, target marketing, performance prediction, manufacturing, medical diagnosis, basic concepts: