Custom Automation Script Needed

$100-500 USD

Completed
Posted over 13 years ago

Paid on delivery
I need a script or application that can scan 12 million web pages that I specify, look for a specific string in each page's HTML source, and identify which pages contain this string and which do not.

## Deliverables

I have 12 text files with 1,000,000 URLs in each file (12 million URLs total). I need a script or application that can visit each of the 12 million URLs, look for a specific word in the page's HTML source, and provide me with two lists: one containing the URLs where the word was found, and another containing the URLs that do NOT contain the word I specify. Out of 12 million, maybe 8 million will contain the keyword and 4 million will not.

I have a strong Windows 2008 server with a 100 Mbps unlimited connection, OR we can do this with PHP and a cron job on a Linux server, which I can obtain. PLEASE tell me how you plan to build this application.

I would also like the script to run fast, with at least 200 threads. Remember, it is just loading the HTML source, no images or scripts on the page. It should also be able to auto-resume if it crashes or stops in the middle, without having to start from the first URL again.

PLEASE ANSWER THESE QUESTIONS IN YOUR BID OR I WILL IGNORE IT COMPLETELY:

1) How fast can you get me a fully functional program that will not freeze up and have to be constantly restarted?
2) Will you build a Windows Server 2008 compatible application, or use PHP/MySql?
3) How will your script handle errors such as page timeouts, unable to load URL, 404 errors, 500 errors, etc.?
4) Approximately how long do you estimate it will take to scan all 12 million pages?

I need this completed ASAP, so please keep this in mind.

1) All deliverables will be considered "work made for hire" under U.S. Copyright law. Employer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the employer on the site per the worker's Worker Legal Agreement).
2) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.
3) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):
a) For web sites or other server-side deliverables intended to only ever exist in one place in the Employer's environment: deliverables must be installed by the Worker in ready-to-run condition in the Employer's environment.
b) For all others, including desktop software or software the employer intends to distribute: a software installation package that will install the software in ready-to-run condition on the platform(s) specified in this project.

## Platform

Windows Server 2008 OR PHP/MySql
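
The scanning, error-handling, and auto-resume requirements in the description could be approached roughly as in the sketch below. It is only an illustration under stated assumptions: the posting asks for a Windows Server 2008 application or PHP/MySQL, so Python is a swapped-in language here, and the function names (`fetch_html`, `classify`, `already_done`, `scan`) and the output file names (`found.txt`, `missing.txt`, `error.txt`) are hypothetical. Resume works by skipping any URL already written to the found or missing list; URLs that failed are written to a separate error list and retried on the next run.

```python
import os
import urllib.request
from concurrent.futures import ThreadPoolExecutor

def fetch_html(url, timeout=10):
    """Download only the HTML source; images and scripts are never requested."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")

def classify(url, keyword, fetch=fetch_html):
    """Return 'found', 'missing', or 'error' for a single URL."""
    try:
        return "found" if keyword in fetch(url) else "missing"
    except Exception:
        # Timeouts, unreachable hosts, and 404/500 responses all end up here
        # (urllib raises HTTPError for 4xx/5xx status codes).
        return "error"

def already_done(out_dir):
    """Collect URLs recorded by an earlier run so a restart can skip them."""
    done = set()
    for name in ("found", "missing"):
        path = os.path.join(out_dir, name + ".txt")
        if os.path.exists(path):
            with open(path, encoding="utf-8") as f:
                done.update(line.strip() for line in f if line.strip())
    return done

def scan(urls, keyword, out_dir, workers=200, fetch=fetch_html):
    """Classify every URL not yet recorded and append each result to disk."""
    os.makedirs(out_dir, exist_ok=True)
    done = already_done(out_dir)
    todo = [u for u in urls if u not in done]
    outs = {name: open(os.path.join(out_dir, name + ".txt"), "a", encoding="utf-8")
            for name in ("found", "missing", "error")}
    counts = {"found": 0, "missing": 0, "error": 0}
    try:
        with ThreadPoolExecutor(max_workers=workers) as pool:
            results = pool.map(lambda u: classify(u, keyword, fetch), todo)
            for url, status in zip(todo, results):
                outs[status].write(url + "\n")
                outs[status].flush()  # keep the files current in case of a crash
                counts[status] += 1
    finally:
        for f in outs.values():
            f.close()
    return counts
```

A run over one of the 1,000,000-URL files might look like `scan(open("urls_01.txt").read().split(), "keyword", "results")`, where `urls_01.txt` is a placeholder file name. As a rough estimate only, assuming an average of one second per page and 200 pages in flight at once, 12,000,000 pages would take about 60,000 seconds, or roughly 17 hours; real throughput depends on the target servers and the error rate.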
Project ID: 2982005

About this project

15 proposals
Remote project
Active 13 years ago

Awarded to:
See private message.
$85 USD in 5 days
4.9 (79 reviews)
5.2
15 freelancers are bidding on average $238 USD for this job
See private message.
$244.80 USD in 5 days
4.9 (41 reviews)
6.8
See private message.
$425 USD in 5 days
4.9 (20 reviews)
5.1
See private message.
$233.75 USD in 5 days
5.0 (23 reviews)
4.9
See private message.
$102 USD in 5 days
5.0 (13 reviews)
4.5
See private message.
$242.25 USD in 5 days
4.8 (20 reviews)
4.0
See private message.
$93.50 USD in 5 days
5.0 (19 reviews)
3.9
See private message.
$148.75 USD in 5 days
5.0 (18 reviews)
3.4
See private message.
$85 USD in 5 days
4.8 (14 reviews)
2.6
See private message.
$102 USD in 5 days
5.0 (6 reviews)
1.5
See private message.
$102 USD in 5 days
5.0 (1 review)
0.0
See private message.
$110.50 USD in 5 days
0.0 (0 reviews)
0.0
See private message.
$170 USD in 5 days
0.0 (0 reviews)
0.0
See private message.
$1,275 USD in 5 days
0.0 (0 reviews)
0.0
See private message.
$153 USD in 5 days
0.0 (0 reviews)
0.0

About the client

Riverside, United States
4.9
241
Payment method verified
Member since Jan 13, 2010

Client verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)