Find Jobs
Hire Freelancers

Data mining

$30-250 USD

进行中
已发布大约 7 年前

$30-250 USD

货到付款
The group project is your opportunity to further explore an area of data mining you're interested in and also to gain practical experience in the domain. For your proposal, you must select team members and a topic. A custom rubric will be created for grading your project based on your topic choice, group size, and project difficulty. You will have complete freedom in what domain you choose, what technologies you use (including, for instance, programming languages, databases, etc), and what type of analysis you seek to perform, with the only restriction that the instructor will require full access to these tools in order to grade your work (thus, don't choose proprietary software unless the instructor has access to a license for it). Another concern is that your choices must carry a significant enough difficulty for you to be able to earn an A on your assignment. There must also be enough work to share between group members, so if you have a larger group, there should be multiple significantly difficult steps in your proposal for the knowledge discovery process. Every project should have some significant difficulty added to at least one step in the process. As a simple rule of thumb, for every added group member, there should be at least one additional step with significant difficulty. Please submit your proposal early. If it is accepted, you will receive full credit. If it is not, you will have as many opportunities to resubmit for full credit as you need up until the due date. After the due date, you will receive partial credit once your submission is accepted. What to Submit Include the following in your submission: data source data cleaning plan data integration plan data selection plan data mining plan pattern evaluation plan For each of these steps, also indicate what tools will be used, whether any significant programming will be needed, and if so what language will be used. You may also assign tasks to group members in your proposal, but this is not necessary at this step (it will be in your final report). Every member of the group needs to submit a proposal, but you can copy it. This way I will know that everyone is in board. Domain Choice Your proposal should include a choice of domain. This, essentially, is identifying the type of data you will be using in your project, and to some extent what type of analysis you'll be able to do. Here are some examples: social network analysis financial data analysis product recommendation sports predictions There are many more possible domains -- feel free to discuss any ideas you have with me in office hours or on the project discussion board on blackboard. Data Source Determine where, specifically, you will retrieve the data you will use for your project. There are two ways you can add difficulty to your assignment in this step: choose a less typical type of data (text mining, linked data, etc) or create your data set (ex. crawling the web, manual collection from surveys, etc). Finding an existing but seldom used data set will also add some credit for difficulty. Data Cleaning Plan If your data requires significant cleaning before it is used, this may be a more significant step. Indicate and explain how much work you expect for this step. Data Integration Plan If you are using multiple data sources, or if you intend to move your data set in to a database or data warehouse, explain your plan here. Otherwise, indicate this is not applicable. Data Selection Plan At the very least, indicate how you will select attributes from your data set for performing further analysis. Indicate if you intend to use data mining tools to determine what the subset of attributes should be, or if you intend to use a more complex technique to transform the data, such as principal component analysis. Data Mining Plan Determine what type of analysis you will perform (classification, clustering, outlier analysis, regression, mining association rules, etc). Give some detail on what you will be looking for, how you plan to do this analysis (i.e. using WEKA, writing a classification program in Python, etc), and what your expectations are for results. Again, use the discussion forum to discuss any tools you think might be interesting for other students. Pattern Evaluation Plan If you plan to perform an analysis on the results from the data mining step (for instance, cross validation, visualization, etc), indicate such here.
项目 ID: 13336797

关于此项目

3提案
远程项目
活跃7 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
颁发给:
用户头像
I learned SAS and SAS is well known for accuracy in cleaning the data.
$222 USD 在10天之内
0.0 (0条评论)
0.0
0.0

关于客户

INDIA的国旗
India
0.0
0
会员自6月 26, 2016起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。