Chapter 10 Data Mining
Instructions: Please submit your work in one single Excel file with one tab/worksheet for each problem.

Cluster Analysis
1. (25 points) Apply single linkage cluster analysis to Berkeley, Cal Tech, UCLA, and UNC in the Excel file Colleges and Universities Cluster Analysis Worksheet and draw a dendrogram illustrating the clustering process.
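
As a cross-check on the spreadsheet work, single linkage can be sketched in a few lines of Python. This is a minimal illustration, not the required Excel solution. It assumes Euclidean distance, and the labels and feature values in `toy` are made up, not taken from the worksheet. In single linkage, the distance between two clusters is the distance between their closest members, and the two closest clusters are merged at each step.

```python
from itertools import combinations
import math

def single_linkage(points):
    """Agglomerative clustering with single linkage.

    points: dict mapping a label to a tuple of numeric features.
    Returns the list of merges as (cluster_a, cluster_b, distance).
    """
    clusters = [frozenset([name]) for name in points]
    merges = []

    def dist(a, b):
        # Single linkage: distance between the closest pair of members.
        return min(math.dist(points[p], points[q]) for p in a for q in b)

    while len(clusters) > 1:
        # Pick the pair of clusters with the smallest single-linkage distance.
        a, b = min(combinations(clusters, 2), key=lambda pair: dist(*pair))
        merges.append((sorted(a), sorted(b), dist(a, b)))
        clusters = [c for c in clusters if c not in (a, b)] + [a | b]
    return merges

# Hypothetical, made-up feature vectors (NOT the worksheet data).
toy = {"A": (0.0, 0.0), "B": (0.0, 1.0), "C": (4.0, 0.0), "D": (4.0, 3.0)}
merges = single_linkage(toy)
for left, right, d in merges:
    print(left, "+", right, "at distance", round(d, 3))
```

The merge distances, in order, give the heights at which branches join in the dendrogram. If the worksheet variables are on very different scales, standardizing them before computing distances is a common choice, though the assignment does not state which to use.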

Classification
2. In the Excel file Credit Risk Data, classify the following record:

a. (25 points) Using the k-NN algorithm for k = 1 to 5.
b. (25 points) Using discriminant analysis.
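
The k-NN step in part (a) can be sketched as follows. This is only an illustration under stated assumptions: Euclidean distance on numeric features, a simple majority vote, and made-up records in `toy_train` and `toy_query` that are not from the Credit Risk Data file. Categorical fields in the real data would first need a numeric coding, which is not shown here.

```python
from collections import Counter
import math

def knn_classify(train, query, k):
    """Majority vote among the k training records nearest to query.

    train: list of (feature_tuple, label) pairs.
    """
    # Sort by Euclidean distance to the query and keep the k closest.
    nearest = sorted(train, key=lambda rec: math.dist(rec[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    # On a tied vote, most_common returns the label encountered first,
    # which here is the label of the nearer neighbor.
    return votes.most_common(1)[0][0]

# Hypothetical, made-up records (NOT the Credit Risk Data file).
toy_train = [
    ((1.0, 1.0), "Low"),
    ((1.0, 2.0), "Low"),
    ((2.0, 1.0), "Low"),
    ((5.0, 5.0), "High"),
    ((6.0, 5.0), "High"),
]
toy_query = (2.0, 2.0)

results = {k: knn_classify(toy_train, toy_query, k) for k in range(1, 6)}
print(results)
```

Running the classifier for each k from 1 to 5 shows whether the predicted class is stable as more neighbors are included. Even values of k can produce tied votes, so the tie-breaking rule is worth stating in the write-up.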

Association
3. (25 points) The Excel file Automobile Options provides data on options ordered together for a particular model of automobile. Consider the following rules:
a. Rule 1: If Fastest Engine, then Traction Control.
b. Rule 2: If Faster Engine and 16-inch Wheels, then 3 Year Warranty.
Compute the support, confidence, and lift for each of these rules.
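
The three measures can be sketched in Python as a check on the spreadsheet formulas. The sketch assumes the common definitions: support is the fraction of transactions containing every item in the rule, confidence is that count divided by the number of transactions containing the antecedent, and lift is confidence divided by the fraction of transactions containing the consequent. Some texts report support as a count instead of a fraction. The orders in `toy_orders` are made up and are not the Automobile Options data.

```python
def rule_metrics(transactions, antecedent, consequent):
    """Support, confidence, and lift for the rule antecedent -> consequent.

    transactions: list of sets of item names.
    Support is reported as a fraction of all transactions.
    Assumes the antecedent and consequent each occur at least once.
    """
    n = len(transactions)
    antecedent, consequent = set(antecedent), set(consequent)
    n_a = sum(antecedent <= t for t in transactions)      # contain antecedent
    n_c = sum(consequent <= t for t in transactions)      # contain consequent
    n_both = sum((antecedent | consequent) <= t for t in transactions)
    support = n_both / n
    confidence = n_both / n_a
    lift = confidence / (n_c / n)
    return support, confidence, lift

# Hypothetical, made-up orders (NOT the Automobile Options file).
toy_orders = [
    {"Fastest Engine", "Traction Control"},
    {"Fastest Engine", "Traction Control", "3 Year Warranty"},
    {"Fastest Engine"},
    {"Traction Control"},
    {"3 Year Warranty"},
]
support, confidence, lift = rule_metrics(
    toy_orders, {"Fastest Engine"}, {"Traction Control"}
)
print(support, confidence, lift)
```

A lift above 1 suggests the antecedent and consequent occur together more often than they would if they were independent. For a rule with two antecedent items, such as Rule 2, both items are passed together in the `antecedent` set.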
