
Scrape HP website

$100-500 USD

Completed
Posted over 11 years ago

Paid on delivery
Scrape the HP website for drivers.

MUST USE: Perl, Web::Scraper

We are looking for quality, clean, reusable, modern Perl. Comments are expected, so fluent English speakers only.

## Deliverables

START URL: [login to view URL]

Follow these product links on the start URL:

- Handheld Printing
- Multifunction and All-in-One
- Network Print Servers
- Printers

SECOND URLS: For each of the products above, traverse the links under it until you reach the product page (which may or may not be multiple levels deep). For example, follow these links:

Printers > HP LaserJet Printers > HP LaserJet P4500 Printer series > HP LaserJet P4515xm Printer

(If you followed the instructions correctly, you should reach this URL: [login to view URL])

You will now have arrived at the product page for the HP LaserJet P4515xm Printer. $PRODUCT_NAME1 should be the text of the last link traversed to get here, in this case 'HP LaserJet P4515xm Printer'. We also need $PRODUCT_NAME2 set to 'HP LaserJet P4510 Printer series', which appears on the product page itself.

The scraper must cover all languages and all operating systems. For the purposes of this explanation, make sure English (American) is selected, and select Microsoft Windows 7 (32-bit) to reach the third URL.

THIRD URL: If you followed the instructions correctly, you should be here: [login to view URL]

This is where we get the rest of our variables. The first variables are $TYPE and $TYPE_DESCRIPTION. For example, 'Driver - Universal Print Driver' gives:

$TYPE = Driver
$TYPE_DESCRIPTION = Universal Print Driver

(Note: sometimes the heading is a single word such as 'Firmware', in which case set both variables to 'Firmware', or whatever the single type is.)

For each set ($TYPE, $TYPE_DESCRIPTION) we need to get each download and the information for it.
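The $TYPE / $TYPE_DESCRIPTION rule above can be sketched as follows. This is only an illustration of the splitting logic, written in Python for brevity; the deliverable itself must still be Perl with Web::Scraper. The function name `split_type` is hypothetical, and the sketch assumes the separator is always a hyphen surrounded by spaces, as in the 'Driver - Universal Print Driver' example.

```python
from typing import Tuple


def split_type(heading: str) -> Tuple[str, str]:
    """Split a download-section heading into (TYPE, TYPE_DESCRIPTION).

    Assumption: the two parts are separated by ' - ' as in the example
    'Driver - Universal Print Driver'. A heading without that separator
    (for example 'Firmware') is used for both values.
    """
    text = " ".join(heading.split())  # collapse stray whitespace
    # Split on the first ' - ' only, so later hyphens stay in the description.
    kind, separator, description = text.partition(" - ")
    if not separator or not description:
        return text, text
    return kind.strip(), description.strip()


if __name__ == "__main__":
    print(split_type("Driver - Universal Print Driver"))
    print(split_type("Firmware"))
```

Splitting on the first separator only is a design choice made here so that a description containing its own hyphens is not truncated; the brief does not say how such headings should be handled.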
For the first download on our page we could create a row (CSV, tab-delimited, or MySQL) that would look like:

PRODUCT_NAME1,PRODUCT_NAME2,TYPE,TYPE_DESCRIPTION,DESCRIPTION,VERSION,DATE,SIZE,PRODUCT_URL,DRIVER_URL
HP LaserJet P4515xm Printer,HP LaserJet P4510 Printer series,Driver,Universal Print Driver,1 - HP Universal Print Driver for Windows PCL6,5.5.0.12834,27 Jun 2012,[login to view URL],$DIRECT_URL_TO_DOWNLOAD

Notice the last value, DRIVER_URL, which has a value of $DIRECT_URL_TO_DOWNLOAD. I'm leaving that for you to figure out, as the download button uses JavaScript to construct a URL.

NOTES:

1. If, on the product's download page, a download item's description says '(Downloadable Driver Not Available)', skip it. (ex. [login to view URL])
2. Follow the same rule if the download link says 'obtain software' (ex. same as the example above).

REQUIREMENTS:

1. Written in Perl, using Web::Scraper.
2. Be familiar with Perl best practices: modular, documented, don't repeat yourself, etc.
3. Modern Perl, please.
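The row layout and the two skip rules from the notes can be sketched as below. Again this is an illustration in Python rather than the required Perl, and several details are assumptions: the names `should_skip` and `rows_to_csv`, the `LINK_TEXT` key used to carry the download link's label, and the choice of case-insensitive substring matching for the skip phrases. Only the ten column names come from the brief.

```python
import csv
import io
from typing import Dict, Iterable

# Column order taken from the example row in the brief.
FIELDS = [
    "PRODUCT_NAME1", "PRODUCT_NAME2", "TYPE", "TYPE_DESCRIPTION",
    "DESCRIPTION", "VERSION", "DATE", "SIZE", "PRODUCT_URL", "DRIVER_URL",
]

# Skip phrases from NOTES 1 and 2, lowercased for comparison (assumption:
# matching is case-insensitive and the phrase may sit inside longer text).
SKIP_DESCRIPTION = "(downloadable driver not available)"
SKIP_LINK_TEXT = "obtain software"


def should_skip(description: str, link_text: str) -> bool:
    """Return True when a download item must be left out of the output."""
    return (SKIP_DESCRIPTION in description.lower()
            or SKIP_LINK_TEXT in link_text.lower())


def rows_to_csv(downloads: Iterable[Dict[str, str]]) -> str:
    """Write the kept download items as CSV text with a header row.

    Each item is a dict keyed by FIELDS, plus a hypothetical 'LINK_TEXT'
    key that is used for the skip check and not written out.
    """
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS,
                            extrasaction="ignore", lineterminator="\n")
    writer.writeheader()
    for item in downloads:
        if should_skip(item.get("DESCRIPTION", ""), item.get("LINK_TEXT", "")):
            continue
        writer.writerow(item)  # missing columns are written as empty strings
    return buffer.getvalue()
```

Using a CSV writer rather than joining on commas keeps rows valid when a description itself contains a comma or quote; the same output could equally be tab-delimited or inserted into MySQL, as the brief allows.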
Project ID: 2781069

About this project

5 proposals
Remote project
Active 12 years ago

Awarded to:
See private message.
$200.60 USD in 10 days
4.9 (36 reviews)
4.9

5 freelancers are bidding on this job at an average of $337 USD

See private message.
$233.75 USD in 10 days
4.9 (228 reviews)
6.8

See private message.
$400 USD in 10 days
5.0 (88 reviews)
5.5

See private message.
$499.80 USD in 10 days
5.0 (3 reviews)
2.2

See private message.
$350.20 USD in 10 days
0.0 (0 reviews)
0.0

About the client

United States
5.0
1
Member since September 30, 2012

Client verification

Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)