关闭

cluster analysis needed to be done

该项目收到1 来自天才威客的竞标,平均竞标价格为₹277 INR / hour

为像这样的项目获取免费报价
雇主工作
项目预算
₹100 - ₹400 INR / hour
全部竞标
1
项目描述

1. Open the HEART dataset and create some visualizations to get familiar with the data. (Note: You do not need to submit these visualizations to Moodle for this problem.)

2. Create clusters over patients who have died. To do so, filter the data over the entire dataset over Status = Dead. Remove missing values.

3. Click the New Cluster icon on the toolbar and assign all the measure variables except Metropolitan Relative Weight and Age at Start to it.

4. Click the Properties tab. Notice that the number of clusters is set to 5, which is the default. Five clusters were crated with cluster IDs 0-4. Change the number of clusters to 4.

5. Increase the Visible Roles to 7. Maximize the cluster matrix. Right-click on one of the cells that have Age of Death on the X axis. Select Plot Age of Death by Cluster ID. Which cluster has the patients who died the youngest?

6. Create a box plot of Smoking by Cluster ID. Which cluster represents those that were heavy smokers in this dataset?

7. Minimize the cluster matrix and maximize the parallel coordinates plot. The plot shows the cluster IDs on the left side of the plot and the effects along the top. The clusters are colored differently. The bar sizes on the left represent the number of observations in each cluster. The minimum and maximum values for each effect are shown at the top and bottom of the effect. By looking at the plot with all the clusters shown, what can you assess? For example, which cluster appears to have the patients with the highest cholesterol?

8. Which cluster can be classified as follows: Non-smokers who were older in age at death, had lower cholesterol, and had lower blood pressure?

9. Characterize each of the other clusters.

10. Which cluster is the most different; that is, it has the largest Within-Cluster SS?

11. Right-click any of the cells in the cluster matrix. Select Derive a Cluster ID Variable. A new variable is created and appears in the Data pane. This may be used now as an input to other models

需要技能

在寻找赚取金钱的机会?

  • 设定您的预算和时间框架
  • 大致描述您的建议方案
  • 为您的工作领取工资

雇用同样在该项目上竞标的威客

    • Forbes
    • The New York Times
    • Time
    • Wall Street Journal
    • Times Online