[login to view URL] Website Scraper

已完成 已发布的 Mar 2, 2011 货到付款
已完成 货到付款

I need a program coded that will scrape the website, [url removed, login to view] for search results. Like here's an example - I might do a search for "catering" in California and I would like to scrape all these results. If you go to this url you can see an example of the listings that appear and how they appear - [url removed, login to view]

I need the following data scraped into CSV format:

Business Name

Address

City

State

Zip

Phone number

Website

Contact Name

Years in Business

NAICS Code

Some of that data like phone number is not visible on the search results page so the listing has to be opened/clicked to get that data. In some listings some info is not available every time like website or years in business or naics code.

I would like this project done asap preferably by the end of the weekend. I'm looking to work with coders that have time to put to this project to complete asap and not busy with other projects at the same time.

I would like the program to run fast but, I don't know if running too fast could be a problem if the site could detect a scraper and if it's running too many searches or something ban my IP? so that should be taken into consideration.

I would prefer to have this program coded in C#. If you bid let me know your understanding of the project, experience with scrapers, and give me an idea of how it will work, and the interface. In my searches I will also be selecting various options on the left side bar that you see on the [url removed, login to view] search results like date started, public/private, etc..

Also, I'm not sure if this is possible but, I would prefer that if I do the same search for "catering" in California more than once, like once today and again at the end of the week, that it only scrapes new results and does not download previously scraped data again.

I am on Windows 7 - 64-bit.

Let me know if you have any questions

Thanks

C# 编程 MySQL PHP 软件构架 软件测试 网络主机 网站管理 网站测试

项目ID: #3142716

关于项目

2个方案 远程项目 活跃的Mar 2, 2011

授予:

eried

See private message.

$85 USD 在5天内
(79条评论)
5.5

有2名威客正在参与此工作的竞标,均价$85/小时

matfizvw

See private message.

$85 USD 在5天内
(53条评论)
6.0