crawling / spidering data from website

已关闭 已发布的 Aug 14, 2005 货到付款
已关闭 货到付款

We need the company data from the following websites:

1. [[url removed, login to view]][1] (Approx. 2 mil. Companies)

2. [[url removed, login to view]][2]

3. <[url removed, login to view]>(28.000 companies in Japan)

4. [[url removed, login to view]][3](1.4 mil. Companies)

5. <[url removed, login to view]>

6. <[url removed, login to view]>

7. [[url removed, login to view]][4]

8. [[url removed, login to view]][5]

9. <[url removed, login to view]>

10. <[url removed, login to view]> ...

we need as much data as possible, such as names, descriptions, URLs - everything.

You must consider that some of the websites have timebased IP-locks. Meaning that you can only crawl a certain number of data per IP per hour. The websites are different that way. You will probably need more IP adresses.

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

the data needs to be comma seperated text file.

工程 MySQL PHP 软件构架 软件测试

项目ID: #3849798

关于项目

6个方案 远程项目 活跃的Sep 4, 2005

有6名威客正在参与此工作的竞标,均价$1063/小时

siarheiaksi

See private message.

$1700 USD 在21天内
(45条评论)
5.6
bharatinfoways

See private message.

$2550 USD 在21天内
(14条评论)
5.1
outbox

See private message.

$1870 USD 在21天内
(8条评论)
4.2
NBitsindia

See private message.

$1275 USD 在21天内
(8条评论)
4.3
jackerlight

See private message.

$425 USD 在21天内
(27条评论)
4.0
vw1612773vw

See private message.

$127.5 USD 在21天内
(30条评论)
3.7
vadimrootckovsky

See private message.

$127.5 USD 在21天内
(2条评论)
1.5