Grabber

已关闭 已发布的 Apr 29, 2006 货到付款
已关闭 货到付款

An application in which I input: a) a list of keywords / keyphrases. b) a list of domain names (urls) c) the number of result links to crawl from the google result page d) the number of text snippets to grab per page e) a local folder filled with html files named: [url removed, login to view], [url removed, login to view] (html files are named using the same list of keyword / keyphrases supplied to the application) For each keyword/keyphrase the application will do the following: 1) perform a search on [url removed, login to view] with this string: key phrase1 site:url1 key phrase1 site:url2 ---- keyword2 site:url1 keyword2 site:url2 ecc. for all weywords/keyphrases and urls. 2) from each google result page, it will crawl the number of result links that I have input (starting from the first result and on). From each crawled page, it will grab the text surrounding the keyword, or the text surrounding each word of the key phrase, up to the number set in d) The rule shall be to grab text starting from the beginning of a sentence and up to its end (both marked by a punctuation). One piece of text (snippet) per word per page will be stored. 3) For each keyword/keyphrase, it will then randomly mix together all text snippets gathered (from the different domains I have indidated and from the various pages of each domain) for that keyword/keyphrase so that they result in this final content: Also for each keyword/phrase it find article from diff article sites. - Random sentence. Random sentence. - Random sentce. Random sentence. And so forth. 4) Lastly, it will replace each tag (e.g." <%content%>) inserted previously by me on each of the html files in the folder, with the final content corresponding to the keyword/keyphrase after which the html file is named. I am open to suggestions on modifications or enhancement, Thank you

## Deliverables

Rent A Coder requirements notice: As originally posted, this bid request does not have complete details. Should a dispute arise and this project go into arbitration "as is", the contract's vagueness might cause it to be interpreted against you, even though you were acting in good-faith. So for your protection, if you are interested in this project, please work-out and document the requirements onsite.

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

windows

工程 网络营销 MySQL PHP 软件构架 软件测试 网络主机 网站管理 网站测试

项目ID: #3468070

关于项目

4个方案 远程项目 活跃的May 15, 2006

有4名威客正在参与此工作的竞标,均价$149/小时

ImaginationDev

See private message.

$170 USD 在8天内
(35条评论)
5.2
acruc

See private message.

$170 USD 在8天内
(19条评论)
5.1
codebugvw

See private message.

$170 USD 在8天内
(7条评论)
4.1
grabert

See private message.

$85 USD 在8天内
(0条评论)
0.0