data gathering

已完成 已发布的 Aug 13, 2010 货到付款
已完成 货到付款

create a web application that does data mining on twitter data

## Deliverables

Need a server side application that will collect data in the following manner:

1. Search for the string? X? in twitter

it will take the rest of the sentence after the string X till the period? punctuation? mark (.) and save it to a MySQL database into a table called APPSUGGESTION in a field called SUGGESTION

2. Go over each of the rows in the table and check how many times they appear on? Google? and save it to the? [url removed, login to view] field

3. Go over each of the rows in the table and check how many times they appear on twitter and save it to the? [url removed, login to view] field

the? APPSUGGESTION ? table should have these columns:

CREATIONTIME (datetime when this row was created)

SUGGESTIONDATE (datetime when this suggestion was made on tweeter)?

SUGGESTION (string with up to 1000 characters)

SOURCETYPE (string with up to 40 characters? should say TWITTER by default , we might add more sources in the future)?

SOURCE (string with up to 40 characters - twitter user that tweeted this suggestion)

GOOGLEOCCURRENCECOUNT (long - number of times the same suggestion was found on google)

TWITTEROCCURRENCECOUNT ((long - number of times the same suggestion was found on twitter)

this application will run every Y minutes automatically

there should be one web page to configure both X and Y:

X = how frequently the application will run (in minutes)

Y = which search sentence? to use

* the web page label for X = "Analyze sentences? beginning? with :" + X

* the web page label for Y = "Run every " + Y + " minutes"

for example if I setup the following?

* Analyze sentences? beginning? with :? "looking for an app that"

* Run every 30 minutes

one of the rows in the DB might look like this:

CREATIONTIME =? "5/1/2008 8:30:52 AM"

SUGGESTIONDATE =? "4/1/2008 8:30:52 AM"

SUGGESTION = "finds how many people are in a picture"

SOURCETYPE = "TWITTER"

SOURCE = "agulander"

GOOGLEOCCURRENCECOUNT = 23

TWITTEROCCURRENCECOUNT = 7

工程 Java JavaScript Linux MySQL PHP 项目管理 软件构架 软件测试 网络主机 网站管理 网站测试

项目ID: #3646514

关于项目

3个方案 远程项目 活跃的Aug 13, 2010

授予:

melfel

See private message.

$60.56 USD 在14天内
(96条评论)
6.0

有3名威客正在参与此工作的竞标,均价$74/小时

bachosl

See private message.

$76.5 USD 在14天内
(49条评论)
4.9
bitwaretech

See private message.

$85 USD 在14天内
(0条评论)
0.0