C# .NET 3.5 - Crawler and Parser

Closed | Posted Mar 30, 2009 | Paid on delivery

I want to be able to pass a URL or several URLs to a service and have it analyze the web pages for some key optimizations.

Either I will pass a specific URL to the service and it will process just that one page,

or

I will pass a starting URL to the service and it will crawl the site for all of its pages (deduplicated, please). Don't write your own crawler code; check CodePlex for an existing one if you need it. Also make sure the crawler stays on the same domain.
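As a rough illustration of the deduplication and same-domain requirements, here is a minimal sketch of the bookkeeping that would sit around whichever crawler is chosen. The `CrawlFrontier` class and its members are hypothetical names, not part of any existing crawler library. It assumes "same domain" means an exact host match (so `www.example.com` and `example.com` would count as different hosts) and that URLs differing only in their `#fragment` are the same page.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper: decides which URLs the crawl should still visit.
public class CrawlFrontier
{
    private readonly string host;
    private readonly HashSet<string> seen = new HashSet<string>();
    private readonly Queue<Uri> pending = new Queue<Uri>();

    // Assumes the starting URL is absolute.
    public CrawlFrontier(Uri start)
    {
        host = start.Host;
        TryAdd(start);
    }

    // Returns true only if the URL is http(s), on the starting host,
    // and has not been queued before.
    public bool TryAdd(Uri url)
    {
        if (!url.IsAbsoluteUri) return false;
        if (url.Scheme != Uri.UriSchemeHttp && url.Scheme != Uri.UriSchemeHttps) return false;
        if (!string.Equals(url.Host, host, StringComparison.OrdinalIgnoreCase)) return false;

        // Drop the #fragment so page.html#top and page.html count as one page.
        string key = url.GetLeftPart(UriPartial.Query);
        if (!seen.Add(key)) return false;

        pending.Enqueue(url);
        return true;
    }

    public bool HasPending
    {
        get { return pending.Count > 0; }
    }

    public Uri Next()
    {
        return pending.Dequeue();
    }
}
```

The crawl loop would call `Next()` to get a page, fetch it, and pass every link it finds to `TryAdd`; single-page mode simply never calls `TryAdd` after construction.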

It will then download all of the HTML, CSS, and images from the site to the server and run some tests against them:

1- Flag the page and image name for any image that was scaled in HTML. For example, someone uploads a 1000x1000 image and then drags it down to 200x200 in a WYSIWYG editor.

2- Flag any uploaded image larger than some threshold value, such as 500 KB. I should be able to set this in some config area in the code.

3- List each page with the number of images on the page, the size of each image, the total size of all images, the number of CSS files, the number of JS files, and the total size of all JS and CSS on the page.
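The first two checks could be sketched roughly as follows. Every name here (`ImageUse`, `ImageChecks`, `MaxImageBytes`) is hypothetical. The sketch assumes the crawler has already parsed the `width`/`height` attributes from the `<img>` tag and read the real pixel dimensions and byte size from the downloaded file; scaling applied through CSS is not covered.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical record for one <img> reference found on a page.
public class ImageUse
{
    public string PageUrl { get; set; }
    public string ImageName { get; set; }
    public int? HtmlWidth { get; set; }   // width attribute in the markup, if present
    public int? HtmlHeight { get; set; }  // height attribute in the markup, if present
    public int ActualWidth { get; set; }  // pixel width of the downloaded file
    public int ActualHeight { get; set; } // pixel height of the downloaded file
    public long FileBytes { get; set; }   // size of the downloaded file
}

public static class ImageChecks
{
    // The "config area": change this value to move the size threshold.
    public static long MaxImageBytes = 500 * 1024;

    // Check 1: the markup asks for a different size than the file really has.
    public static bool IsScaledInHtml(ImageUse img)
    {
        bool widthDiffers = img.HtmlWidth.HasValue && img.HtmlWidth.Value != img.ActualWidth;
        bool heightDiffers = img.HtmlHeight.HasValue && img.HtmlHeight.Value != img.ActualHeight;
        return widthDiffers || heightDiffers;
    }

    // Check 2: the file is larger than the configured threshold.
    public static bool IsOversized(ImageUse img)
    {
        return img.FileBytes > MaxImageBytes;
    }

    // Runs both checks and returns one line per finding.
    public static List<string> Flag(IEnumerable<ImageUse> images)
    {
        var flags = new List<string>();
        foreach (ImageUse img in images)
        {
            if (IsScaledInHtml(img))
            {
                flags.Add(string.Format("{0}: {1} ({2}x{3}) is scaled in HTML",
                    img.PageUrl, img.ImageName, img.ActualWidth, img.ActualHeight));
            }
            if (IsOversized(img))
            {
                flags.Add(string.Format("{0}: {1} is {2} bytes, over the {3}-byte limit",
                    img.PageUrl, img.ImageName, img.FileBytes, MaxImageBytes));
            }
        }
        return flags;
    }
}
```

A static field is the simplest possible "config area"; reading the same value from `App.config` would be a natural alternative if the threshold should change without recompiling.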

This is what we will start with. Over time I will want to register more things to check, almost like modules. I can think of many more, for example checking resources for gzip. I will want to register a new module in the system (via code) and have the service run it along with the existing ones. So let me know your approach here.
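One possible shape for the "register a module via code" idea is a small interface plus a registry that runs every registered check against a page. This is only a sketch of one approach, not a prescribed design: `IPageCheck`, `CheckRegistry`, `PageSnapshot`, and `GzipCheck` are all hypothetical names, and the gzip module assumes the crawler records the `Content-Encoding` response header for each page.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical snapshot of what the crawler stored for one page.
public class PageSnapshot
{
    public string Url { get; set; }
    public string ContentEncoding { get; set; } // value of the Content-Encoding header, or null
}

// Contract every check module implements.
public interface IPageCheck
{
    string Name { get; }
    IEnumerable<string> Run(PageSnapshot page);
}

// Holds the registered modules and runs them all against a page.
public class CheckRegistry
{
    private readonly List<IPageCheck> checks = new List<IPageCheck>();

    public void Register(IPageCheck check)
    {
        if (check == null) throw new ArgumentNullException("check");
        checks.Add(check);
    }

    // Returns the findings of each module, keyed by module name.
    public Dictionary<string, List<string>> RunAll(PageSnapshot page)
    {
        var results = new Dictionary<string, List<string>>();
        foreach (IPageCheck check in checks)
        {
            results[check.Name] = new List<string>(check.Run(page));
        }
        return results;
    }
}

// Example module: flags pages that were not served gzip-compressed.
public class GzipCheck : IPageCheck
{
    public string Name
    {
        get { return "gzip"; }
    }

    public IEnumerable<string> Run(PageSnapshot page)
    {
        var findings = new List<string>();
        if (!string.Equals(page.ContentEncoding, "gzip", StringComparison.OrdinalIgnoreCase))
        {
            findings.Add(page.Url + " was not served with gzip compression");
        }
        return findings;
    }
}
```

Adding a new check then means writing one class and one `registry.Register(new GzipCheck());` line; the crawl and reporting code does not change.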

Maybe Workflow Foundation fits nicely here with WCF. I'm picky about code and want this done really well, so let me know your plan.

## Deliverables

1) Complete and fully-functional working program(s) in executable form as well as complete source code of all work done.

2) Deliverables must be in ready-to-run condition, as follows (depending on the nature of the deliverables):

a) For web sites or other server-side deliverables intended to only ever exist in one place in the Buyer's environment--Deliverables must be installed by the Seller in ready-to-run condition in the Buyer's environment.

b) For all others including desktop software or software the buyer intends to distribute: A software installation package that will install the software in ready-to-run condition on the platform(s) specified in this bid request.

3) All deliverables will be considered "work made for hire" under U.S. Copyright law. Buyer will receive exclusive and complete copyrights to all work purchased. (No GPL, GNU, 3rd party components, etc. unless all copyright ramifications are explained AND AGREED TO by the buyer on the site per the coder's Seller Legal Agreement).

## Platform

.NET 3.5, WCF, WF (Windows Workflow Foundation)

Skills: .NET, Amazon Web Services, ASP, C# Programming, MySQL, Odd Jobs, SQL

Project ID: #3771304

About the project

12 proposals | Remote project | Active Apr 20, 2009

12 freelancers are bidding on this job, with an average bid of $319/hour

| Freelancer | Proposal | Bid | Delivery | Reviews | Rating |
| --- | --- | --- | --- | --- | --- |
| radzivil | See private message. | $255 USD | 14 days | 92 | 6.0 |
| utsavsoftech | See private message. | $1020 USD | 14 days | 5 | 5.8 |
| logicalxpression | See private message. | $552.5 USD | 14 days | 2 | 3.9 |
| codexp3rts | See private message. | $300.05 USD | 14 days | 6 | 4.5 |
| sudhakarj21 | See private message. | $255 USD | 14 days | 10 | 3.1 |
| Technovice | See private message. | $212.5 USD | 14 days | 5 | 2.8 |
| netedge1992vw | See private message. | $17 USD | 14 days | 2 | 1.3 |
| vinodkumarb | See private message. | $595 USD | 14 days | 2 | 1.3 |
| z0424155 | See private message. | $85 USD | 14 days | 3 | 0.8 |
| abhichamp | See private message. | $191.25 USD | 14 days | 1 | 0.8 |
| pooonpooo | See private message. | $85 USD | 14 days | 1 | 0.0 |
| alextominvw | See private message. | $255 USD | 14 days | 0 | 0.0 |