# Data Mining with Orange: Heart Disease Dataset

Problem Description
The dataset used in this exercise is the heart disease dataset in heart_disease.tab, obtained from the Orange datasets repository. It describes risk factors for heart disease. The attribute diameter narrowing is the (binary) class attribute: class 1 means there is diameter narrowing; class 0 means there is none.
The main aim of this exercise is to predict heart disease, in terms of diameter narrowing, from the other attributes in the dataset. Since the class attribute is binary, this is a classification problem. The software to be used is Orange. However, feel free to try any ideas you may have to tackle the problem with any other software.
The exercise is described step by step, so that you can get a better understanding of the various aspects and questions involved in the KDD (Knowledge Discovery in Databases) process.

Data Understanding
The first step in approaching the problem is to get acquainted with the data. Answering the following questions will help you to better understand the data. The data file heart_disease.tab contains some information about the data stored in it.
Load the data file in Orange.

For each attribute find the following information.
The attribute type, e.g. nominal, ordinal, numeric.
Percentage of missing values in the data.
Max, min, mean, standard deviation.
Are there any records that have a value for the attribute that no other record has?
Study the histogram at the lower right and informally describe how the attribute seems to influence the risk for heart disease. What do the pop-up messages that appear when you move the mouse over the graphic mean?
Are there any outliers for the attribute under consideration?
Investigate the possibility of using the Orange widgets to detect outliers.
Use the Visualize widgets to draw 2D scatter plots for each pair of attributes.
Which attributes seem to be the most/least linked to heart disease? Summarize your findings concerning the predictive value of each attribute in a table.
Does any pair of attributes seem to be correlated?
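The per-attribute figures listed above (percentage of missing values, min, max, mean, standard deviation, and values held by a single record) can be cross-checked outside Orange. The following is a minimal Python sketch, not part of the exercise itself. The `cholesterol` column and its values are made up for illustration, and `None` is assumed to mark a missing value.

```python
import statistics

# Made-up sample of one numeric attribute; None marks a missing value.
cholesterol = [233, 286, None, 250, 204, 236, 564, 233]

def summarize(values):
    """Return the missing-value percentage and basic statistics of a numeric column."""
    present = [v for v in values if v is not None]
    counts = {v: present.count(v) for v in set(present)}
    return {
        "missing_pct": 100 * (len(values) - len(present)) / len(values),
        "min": min(present),
        "max": max(present),
        "mean": statistics.mean(present),
        "stdev": statistics.stdev(present),  # sample standard deviation
        # Values that occur in exactly one record.
        "unique_values": sorted(v for v, c in counts.items() if c == 1),
    }

summary = summarize(cholesterol)
for name, value in summary.items():
    print(name, value)
```

The same function can be applied to each numeric attribute in turn; nominal attributes would need a frequency count instead of mean and standard deviation.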

Also investigate possible multivariate associations of attributes with the class attribute, i.e. study scatter plots of two attributes X and Y and try to identify possible "dense" heart disease areas (if any).
If you find "dense" heart disease areas in any scatter plot, then quantify the heart disease rate in these areas with respect to the entire dataset.
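One way to quantify a "dense" area is to compare the heart disease rate inside the area with the rate in the entire dataset. The sketch below does this in plain Python. The records, the attribute names (`age`, `max_heart_rate`) and the region boundaries are all hypothetical and only illustrate the arithmetic.

```python
# Made-up records: (age, max_heart_rate, diameter_narrowing)
records = [
    (63, 150, 0), (67, 108, 1), (67, 129, 1), (37, 187, 0),
    (41, 172, 0), (62, 160, 1), (57, 163, 0), (63, 147, 1),
    (53, 155, 1), (56, 153, 0), (44, 173, 0), (60, 132, 1),
]

def disease_rate(rows):
    """Fraction of rows whose class is 1 (diameter narrowing)."""
    return sum(cls for _, _, cls in rows) / len(rows)

# Hypothetical "dense" area read off a scatter plot:
# age of at least 60 and maximum heart rate of at most 150.
region = [r for r in records if r[0] >= 60 and r[1] <= 150]

overall = disease_rate(records)
in_region = disease_rate(region)
ratio = in_region / overall  # how much higher the rate is inside the area

print(f"overall rate: {overall:.2f}")
print(f"rate in area: {in_region:.2f} ({len(region)} records)")
print(f"ratio: {ratio:.2f}")
```

Reporting the number of records in the area alongside the rate helps to judge whether a high rate rests on enough data.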

Data Preprocessing
The second step is to preprocess the data such that the transformed data is in a more suitable form for the mining algorithms.

Attribute selection.
Investigate the possibility of using the widget AttributeSelection for selecting a subset of attributes with good predictive capability. Then briefly describe the widget you used and compare your results with the conclusions you reached in the previous section.
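Attribute selection is often based on a score that measures how much an attribute tells about the class. Information gain is one common choice; whether the widget you use applies it is something to check in its settings, so treat the following only as a sketch of the idea. The attribute names and values are made up.

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(values, labels):
    """Class entropy minus the weighted entropy after splitting on the attribute."""
    total = len(labels)
    remainder = 0.0
    for v in set(values):
        subset = [lab for val, lab in zip(values, labels) if val == v]
        remainder += len(subset) / total * entropy(subset)
    return entropy(labels) - remainder

# Made-up class labels and two made-up nominal attributes.
labels = [1, 1, 1, 1, 0, 0, 0, 0]
attributes = {
    "chest_pain": ["yes", "yes", "yes", "no", "no", "no", "no", "yes"],
    "gender": ["m", "f", "m", "f", "m", "f", "m", "f"],
}

gains = {name: information_gain(vals, labels) for name, vals in attributes.items()}
ranking = sorted(gains, key=gains.get, reverse=True)
for name in ranking:
    print(f"{name}: {gains[name]:.3f}")
```

In this toy data `gender` is split evenly across both classes, so its gain is zero, while `chest_pain` separates the classes somewhat and ranks first.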

Handling missing values.
Consider the following methods for handling missing values and investigate each possibility within Orange. Note that, as a rule of thumb, if an attribute has more than 5% missing values, the affected records should not be deleted; instead, it is advisable to impute the missing values using a suitable method.
Replace the missing values by the attribute mean if the attribute is numeric, or by the attribute mode if the attribute is categorical. Save the resulting dataset without missing values in the file heart_disease2.tab.
Investigate the possibility of using (linear) regression to estimate the missing values for each attribute. Save the resulting dataset without missing values in the file heart_disease3.tab.
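The two imputation strategies above can be sketched in a few lines of plain Python. This is an illustration of the idea, not of how Orange implements it. The column names and values are made up, `None` is assumed to mark a missing value, and the regression variant assumes a single predictor column with no missing values.

```python
import statistics

def impute_mean(values):
    """Replace missing numeric values by the mean of the observed ones."""
    fill = statistics.mean([v for v in values if v is not None])
    return [fill if v is None else v for v in values]

def impute_mode(values):
    """Replace missing categorical values by the most frequent observed one."""
    fill = statistics.mode([v for v in values if v is not None])
    return [fill if v is None else v for v in values]

def impute_regression(x, y):
    """Fill missing y values from x with a least-squares line fitted on complete pairs."""
    pairs = [(a, b) for a, b in zip(x, y) if b is not None]
    mean_x = statistics.mean([a for a, _ in pairs])
    mean_y = statistics.mean([b for _, b in pairs])
    slope = (sum((a - mean_x) * (b - mean_y) for a, b in pairs)
             / sum((a - mean_x) ** 2 for a, _ in pairs))
    intercept = mean_y - slope * mean_x
    return [intercept + slope * a if b is None else b for a, b in zip(x, y)]

# Made-up columns.
age = [40, 50, 60, 70]
blood_pressure = [120, 130, None, 150]
gender = ["m", "f", None, "m"]

print(impute_mean(blood_pressure))
print(impute_mode(gender))
filled_bp = impute_regression(age, blood_pressure)
print(filled_bp)
```

Mean imputation ignores the other attributes, whereas the regression variant uses the relation between `age` and `blood_pressure` in the complete records, so the two methods generally fill in different values.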

Eliminating outliers.
Eliminate the outlier records and save the resulting dataset without outliers in the file heart_disease4.tab.
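Eliminating outliers requires a rule that states what counts as an outlier. The exercise does not prescribe one, so the sketch below assumes a common convention: a value is flagged if it lies more than 1.5 times the interquartile range below the first quartile or above the third quartile. The sample values are made up.

```python
import statistics

def iqr_outliers(values, k=1.5):
    """Return the values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]

# Made-up values of one numeric attribute (no missing values).
cholesterol = [204, 233, 233, 236, 250, 286, 564]

outliers = iqr_outliers(cholesterol)
cleaned = [v for v in cholesterol if v not in outliers]
print("outliers:", outliers)
print("cleaned:", cleaned)
```

This rule looks at one attribute at a time. Records that are unusual only in combination with other attributes need a multivariate method instead.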
Mining the Data
The third step is to use some of the classifier algorithms available in Orange to discover hidden patterns in the data. You should repeat the steps described below for each of the datasets you created during preprocessing, as well as for the original dataset (if possible).
1. Use more than one classifier (Decision Tree, SVM, K Nearest Neighbor).
(a) What can you conclude? Compare your conclusions with the conclusions you reached in the Data Understanding section.
(b) Compare the accuracy of the classifier on the training set with the accuracy estimate obtained through 10-fold cross-validation. How do you explain the difference (if any)?

(c) Describe the patterns you obtained and compare with your previous conclusions.
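The gap between training accuracy and cross-validated accuracy can be seen with a very small experiment. The sketch below uses a 1-nearest-neighbour classifier on made-up one-dimensional data. It is a plain Python illustration, not the procedure Orange runs internally. Because the toy set has only ten records, ten folds amount to leaving one record out at a time.

```python
def predict_1nn(train, x):
    """Return the label of the training record whose value is closest to x."""
    return min(train, key=lambda row: abs(row[0] - x))[1]

def accuracy(train, test):
    """Fraction of test records classified correctly by 1-NN built on train."""
    hits = sum(predict_1nn(train, x) == label for x, label in test)
    return hits / len(test)

def cross_val_accuracy(data, folds):
    """Average accuracy over the folds; fold i holds out every folds-th record."""
    scores = []
    for i in range(folds):
        test = data[i::folds]
        train = [row for j, row in enumerate(data) if j % folds != i]
        scores.append(accuracy(train, test))
    return sum(scores) / folds

# Made-up records (value, class) with distinct values and some noisy labels.
data = [(1, 0), (2, 0), (3, 1), (4, 0), (5, 0),
        (6, 1), (7, 1), (8, 0), (9, 1), (10, 1)]

train_acc = accuracy(data, data)                # evaluated on the training records
cv_acc = cross_val_accuracy(data, folds=10)     # evaluated on held-out records

print(f"training accuracy: {train_acc:.2f}")
print(f"cross-validated accuracy: {cv_acc:.2f}")
```

On the training set every record is its own nearest neighbour, so the classifier looks perfect. Cross-validation scores each record with a model that has not seen it, which gives a lower and more honest estimate.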

Clustering Tendency
Investigate whether there is a clustering tendency in the dataset. You may start by clustering the data with the k-means clustering algorithm.
1. Do not use the class attribute, diameter narrowing, for clustering.
2. Find a suitable value for k, i.e. the number of clusters you are going to build. Justify your choice of k.
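One common way to justify a choice of k is to compute the within-cluster sum of squares for several values of k and look for the point after which it stops dropping sharply. The sketch below implements a bare-bones k-means on made-up one-dimensional values. The deterministic initialization is an assumption made for reproducibility; practical implementations usually use several random restarts.

```python
def kmeans_1d(points, k, iterations=20):
    """Plain k-means on 1-D points; returns (centroids, within-cluster sum of squares)."""
    ordered = sorted(points)
    # Deterministic start: spread the initial centroids evenly over the sorted points.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda c: (p - centroids[c]) ** 2)
            clusters[nearest].append(p)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [sum(c) / len(c) if c else centroids[i]
                     for i, c in enumerate(clusters)]
    wss = sum(min((p - c) ** 2 for c in centroids) for p in points)
    return centroids, wss

# Made-up values that form three visible groups.
points = [1.0, 1.2, 0.8, 5.0, 5.3, 4.7, 9.0, 9.1, 8.9]

wss_by_k = {k: kmeans_1d(points, k)[1] for k in range(1, 5)}
for k, wss in wss_by_k.items():
    print(f"k={k}: within-cluster sum of squares = {wss:.3f}")
```

For this toy data the sum of squares falls steeply up to k = 3 and only marginally afterwards, which supports k = 3. On the heart disease data the bend may be far less pronounced, and that in itself says something about the clustering tendency.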

Predicting Performance
In the previous step you have built several models. Finally, you need to compare the different models and describe your final conclusions.
1. Orange outputs several performance measures. Choose some of the performance measures and motivate your choice.
2. Summarize in a table the performance measures for each classifier and each dataset.
3. What can you conclude?
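When choosing performance measures, it helps to know how the usual ones derive from the confusion matrix. The following sketch computes accuracy, precision, recall and F1 for the positive class (class 1, diameter narrowing). The actual and predicted labels are made up, and the function names are illustrative.

```python
def confusion_counts(actual, predicted):
    """Return (tp, fp, fn, tn) with class 1 as the positive class."""
    pairs = list(zip(actual, predicted))
    tp = sum(a == 1 and p == 1 for a, p in pairs)
    fp = sum(a == 0 and p == 1 for a, p in pairs)
    fn = sum(a == 1 and p == 0 for a, p in pairs)
    tn = sum(a == 0 and p == 0 for a, p in pairs)
    return tp, fp, fn, tn

def scores(actual, predicted):
    """Accuracy, precision, recall and F1 for the positive class."""
    tp, fp, fn, tn = confusion_counts(actual, predicted)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {
        "accuracy": (tp + tn) / len(actual),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }

# Made-up labels: 4 patients with narrowing, 6 without.
actual = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 1, 0, 1, 0, 0, 0, 0, 0]

result = scores(actual, predicted)
for name, value in result.items():
    print(f"{name}: {value:.2f}")
```

In a medical setting a missed case (false negative) is usually more costly than a false alarm, which is an argument for reporting recall next to accuracy.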

Conclusions
Describe your final conclusions and indicate which risk factors for heart disease
