Data Preprocessing Java Code

Closed. Posted 7 years ago. Paid on delivery.

In this project, the students are to implement data pre-processing techniques and apply them to a gene expression dataset.

The dataset contains 62 samples collected from colon-cancer patients. 40 of the samples are labeled as "negative" and 22 are labeled as "positive." Each tuple (row) in the dataset is a sample containing the readings for the genes, and the class (which is the last column) of the sample. Each gene is an attribute. The columns are separated by ",", which is a commonly used format in data mining. We will refer to the genes as G0, ..., GN, assigned in the left-to-right order as given in the original file.
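The row layout described above could be read into memory roughly as follows. This is only a sketch: the class name `GeneDataset` and its fields are hypothetical, and it assumes the file has no header row, no quoted fields, and purely numeric gene readings, none of which the description confirms.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical holder for the parsed dataset; all names are illustrative. */
final class GeneDataset {
    final double[][] values; // values[i][j] = reading of gene Gj in sample i
    final String[] labels;   // labels[i] = class of sample i (the last column)

    private GeneDataset(double[][] values, String[] labels) {
        this.values = values;
        this.labels = labels;
    }

    /** Parses comma-separated rows; assumes no header row and no quoted fields. */
    static GeneDataset parse(List<String> lines) {
        double[][] values = new double[lines.size()][];
        String[] labels = new String[lines.size()];
        for (int i = 0; i < lines.size(); i++) {
            String[] cols = lines.get(i).trim().split(",");
            int geneCount = cols.length - 1; // every column but the last is a gene
            values[i] = new double[geneCount];
            for (int j = 0; j < geneCount; j++) {
                values[i][j] = Double.parseDouble(cols[j].trim());
            }
            labels[i] = cols[geneCount].trim();
        }
        return new GeneDataset(values, labels);
    }

    public static void main(String[] args) {
        // Two made-up rows with two genes each, for illustration only.
        GeneDataset d = parse(Arrays.asList("8.5,3.1,negative", "2.0,7.4,positive"));
        System.out.println(d.values.length + " samples, " + d.values[0].length + " genes");
    }
}
```

In a real run the lines would come from the dataset file, for example via `java.nio.file.Files.readAllLines`.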

You will write a C++ or Java program to handle the following two tasks:

Task 1. Discretize the data using equi-density binning with 3 bins for each of the first k attributes.
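One possible reading of equi-density (equal-frequency) binning is sketched below: sort one gene's values and place the cut points so that each bin receives roughly the same number of samples. The class and method names are hypothetical, and placing each cut at the midpoint between neighbouring sorted values is an assumption; the assignment may expect a different boundary or tie-handling rule.

```java
import java.util.Arrays;

/** Sketch of equal-frequency binning for a single attribute; names are illustrative. */
final class EquiDensityBinning {

    /** Returns bins-1 cut points; assumes values.length >= bins. */
    static double[] cutPoints(double[] values, int bins) {
        double[] sorted = values.clone();
        Arrays.sort(sorted);
        double[] cuts = new double[bins - 1];
        for (int b = 1; b < bins; b++) {
            int idx = (int) ((long) b * sorted.length / bins); // first index of bin b
            // Assumption: the cut is the midpoint between the two neighbouring values.
            cuts[b - 1] = (sorted[idx - 1] + sorted[idx]) / 2.0;
        }
        return cuts;
    }

    /** Maps a value to its bin index: 0 for v <= cuts[0], and so on upward. */
    static int binOf(double v, double[] cuts) {
        int bin = 0;
        while (bin < cuts.length && v > cuts[bin]) {
            bin++;
        }
        return bin;
    }

    public static void main(String[] args) {
        double[] gene = {9, 1, 8, 2, 7, 3}; // made-up readings for one gene
        double[] cuts = cutPoints(gene, 3);
        for (double v : gene) {
            System.out.println(v + " -> bin " + binOf(v, cuts));
        }
    }
}
```

When several samples share a value that straddles a cut, the bins end up with unequal counts; how to resolve such ties is left open here.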

Task 2. Use the entropy-based binning method to discretize all genes and to select the top-k genes, ranked in decreasing order of information gain. Use 3 bins for each gene. Information gain for three bins is a generalization of the two-bin case (based on size-weighted entropy). To get three bins, first divide the range of a given attribute into two bins, and then divide one of those two bins into two more bins. The two splits should maximize the size-weighted entropy gain for the three intervals. (For the second split, choose between the two candidate splits, one for the left interval and one for the right interval, based on size-weighted entropy gain.)
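The two-step split described above might be sketched as follows for a single gene. It is a greedy interpretation, not necessarily the intended solution: pick the first cut that minimizes size-weighted entropy, find the best further cut inside each half, and keep whichever of the two gives the lower total over the three intervals. The names `EntropyBinning`, `Split`, and `threeBins` are hypothetical, and the class labels are assumed to be encoded as booleans (true for "positive").

```java
import java.util.Arrays;
import java.util.Comparator;

/** Greedy three-bin entropy discretization for one attribute; names are illustrative. */
final class EntropyBinning {

    /** Cut points in ascending order plus the resulting information gain. */
    static final class Split {
        final double[] cuts;
        final double gain;
        Split(double[] cuts, double gain) { this.cuts = cuts; this.gain = gain; }
    }

    /** Entropy in bits of a two-class distribution given its counts. */
    static double entropy(int pos, int neg) {
        int n = pos + neg;
        double h = 0.0;
        for (int c : new int[] {pos, neg}) {
            if (c > 0) {
                double p = (double) c / n;
                h -= p * Math.log(p) / Math.log(2);
            }
        }
        return h;
    }

    /** Entropy of the sorted interval [lo, hi) multiplied by its size. */
    static double weighted(boolean[] y, int lo, int hi) {
        int pos = 0;
        for (int i = lo; i < hi; i++) if (y[i]) pos++;
        return (hi - lo) * entropy(pos, hi - lo - pos);
    }

    /** Best split index inside (lo, hi), only between distinct values; -1 if none. */
    static int bestSplit(double[] x, boolean[] y, int lo, int hi) {
        int best = -1;
        double bestCost = Double.POSITIVE_INFINITY;
        for (int s = lo + 1; s < hi; s++) {
            if (x[s - 1] == x[s]) continue; // cannot cut between equal values
            double cost = weighted(y, lo, s) + weighted(y, s, hi);
            if (cost < bestCost) { bestCost = cost; best = s; }
        }
        return best;
    }

    /** Midpoint between the two sorted values on either side of split index s. */
    static double mid(double[] x, int s) { return (x[s - 1] + x[s]) / 2.0; }

    static Split threeBins(double[] values, boolean[] positive) {
        int n = values.length;
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) order[i] = i;
        Arrays.sort(order, Comparator.comparingDouble(i -> values[i]));
        double[] x = new double[n];
        boolean[] y = new boolean[n];
        int totalPos = 0;
        for (int i = 0; i < n; i++) {
            x[i] = values[order[i]];
            y[i] = positive[order[i]];
            if (y[i]) totalPos++;
        }
        double base = entropy(totalPos, n - totalPos); // entropy before any split

        int first = bestSplit(x, y, 0, n);
        if (first < 0) return new Split(new double[0], 0.0); // all values equal
        int left = bestSplit(x, y, 0, first);
        int right = bestSplit(x, y, first, n);
        if (left < 0 && right < 0) { // only two distinct values: a single cut
            double cost = weighted(y, 0, first) + weighted(y, first, n);
            return new Split(new double[] {mid(x, first)}, base - cost / n);
        }
        double costLeft = left < 0 ? Double.POSITIVE_INFINITY
                : weighted(y, 0, left) + weighted(y, left, first) + weighted(y, first, n);
        double costRight = right < 0 ? Double.POSITIVE_INFINITY
                : weighted(y, 0, first) + weighted(y, first, right) + weighted(y, right, n);
        boolean useLeft = costLeft <= costRight;
        int a = useLeft ? left : first;
        int b = useLeft ? first : right;
        double cost = Math.min(costLeft, costRight);
        return new Split(new double[] {mid(x, a), mid(x, b)}, base - cost / n);
    }

    public static void main(String[] args) {
        // Made-up readings and labels for one gene, for illustration only.
        Split s = threeBins(new double[] {1, 2, 3, 4, 5, 6},
                new boolean[] {false, false, true, true, false, false});
        System.out.println(Arrays.toString(s.cuts) + " gain=" + s.gain);
    }
}
```

Running `threeBins` on every gene and sorting the genes by `gain` in descending order would then give a top-k selection; that ranking step is omitted here to keep the sketch short.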

Data Mining, Java

Project ID: #13120155

About the project

11 proposals. Remote project. Active 7 years ago.

11 freelancers are bidding on this job, with an average bid of $89/hour

dobreiiita

Hello, I am a Java expert and am interested in this project. I have reviewed the attached files and am confident I can handle it perfectly. I have a lot of experience helping students with assignments, so I will k...

$100 USD in 2 days
(376 reviews)
7.4
koustav2006

Hi, I am good at core Java programming and familiar with the required data processing algorithms. I can get the work done in 24 hours. With Regards, Koustav

$80 USD in 1 day
(48 reviews)
5.1
moeenahmed21

I am an experienced C++ and Java developer. I will solve your problem and develop the program. Feel free to contact me for further discussion. Regards, Moeen Ahmed

$70 USD in 1 day
(13 reviews)
4.1
GITTechBAY

I am ready to work on your task as per the given requirements. Please message me; I am available online 24/7 for status updates.

$30 USD in 1 day
(7 reviews)
2.3
zain9674

Thank you for taking the time to review our bid! I just checked the description you provided for the project, and it would be a pleasure to assist you. I am really eager to work on your project with f...

$23 USD in 0 days
(0 reviews)
0.0
Dextersmind2

Hi, I can certainly do your task. I have more than 8 years of experience; please reply to me so we can discuss the details. High grades and a short deadline are guaranteed. Regards

$277 USD in 15 days
(0 reviews)
0.0