Custom Nutch Parser Plugin with mapping feature

$250-750 USD

Closed
Posted about 12 years ago

Paid on delivery
I am looking for someone with experience writing custom Nutch plugins. Project details will be given only to those who meet the first-round requirements: you must have solid experience with Nutch and be able to show it.

I am not after a parser for one particular site. I am after a custom parser that works on ANY DOM TREE STRUCTURE. In other words, after Nutch crawls a page, I want every piece of data stored in a field that is named automatically. E.g. if the page contains <Div>Someinfohere</Div>, then the field that extracts that data is called <fieldname Div1>Someinfohere<field>. If there are multiple Divs on a page, call them Div1, Div2, etc. The DOM structure will be your guide, e.g. HTML|DIV|TABLE|TR|TD| Some info here. That is the first step: creating order from HTML.

The second step is an easy way for me to map <fieldname Div1> to a Solr [login to view URL] field. I think the best way would be to store the data in a database of some description and build a simple GUI so that I can easily map <fieldname Div1> to <solr schema field>. The choice of technology is yours as long as it runs on a LAMP stack; PHP and MySQL are preferred.

I will be crawling approx. 10,000 sites, so this has to handle any HTML template I throw at it.

I'm on a tight budget, so don't go crazy with your bid.
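To make the two steps concrete, here is a minimal sketch of the naming and mapping idea. It is only an illustration in standalone Python, not a Nutch plugin (a real one would be written in Java against Nutch's parse-filter extension point). The names FieldExtractor, extract_fields and map_to_solr are hypothetical, the plain dict stands in for the database-backed mapping the GUI would edit, and the Solr field name "title" is an assumed example.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Void elements have no closing tag, so they are kept off the path stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class FieldExtractor(HTMLParser):
    """Hypothetical helper: names each text-bearing element Tag1, Tag2, ..."""

    def __init__(self):
        super().__init__()
        self.stack = []                 # open elements as (tag, field_name)
        self.counts = defaultdict(int)  # per-tag counter: div -> 1, 2, ...
        self.fields = {}                # field_name -> {"path", "text"}

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        self.counts[tag] += 1
        self.stack.append((tag, f"{tag.capitalize()}{self.counts[tag]}"))

    def handle_endtag(self, tag):
        # Pop back to the nearest matching open tag; tolerates unclosed tags.
        for i in range(len(self.stack) - 1, -1, -1):
            if self.stack[i][0] == tag:
                del self.stack[i:]
                break

    def handle_data(self, data):
        text = data.strip()
        if not text or not self.stack:
            return
        _, name = self.stack[-1]
        # DOM path in the posting's notation, e.g. HTML|BODY|DIV
        path = "|".join(t.upper() for t, _ in self.stack)
        field = self.fields.setdefault(name, {"path": path, "text": ""})
        field["text"] = (field["text"] + " " + text).strip()


def extract_fields(html):
    """Step 1: turn arbitrary HTML into automatically named fields."""
    parser = FieldExtractor()
    parser.feed(html)
    parser.close()
    return parser.fields


def map_to_solr(fields, mapping):
    """Step 2: rename extracted fields to Solr schema fields.

    `mapping` is an assumed stand-in for rows stored in the database.
    """
    return {mapping[name]: f["text"]
            for name, f in fields.items() if name in mapping}


page = ("<html><body><div>Someinfohere</div>"
        "<div><table><tr><td>Some info here</td></tr></table></div>"
        "</body></html>")
fields = extract_fields(page)
print(fields["Div1"])   # {'path': 'HTML|BODY|DIV', 'text': 'Someinfohere'}
print(fields["Td1"])    # {'path': 'HTML|BODY|DIV|TABLE|TR|TD', 'text': 'Some info here'}
print(map_to_solr(fields, {"Div1": "title"}))  # {'title': 'Someinfohere'}
```

In this sketch only elements that directly contain text become fields, so the second Div (which just wraps a table) is counted as Div2 but produces no field of its own. Numbering by document order also means field names shift whenever a site changes its template, which is one reason the mapping would probably need to be stored per site.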
Project ID: 1562360

About the project

2 proposals
Remote project
Active 12 years ago

2 freelancers are bidding on average $500 USD for this job
Pls check PMB.
$750 USD in 1 day
0.0 (0 reviews)

About the client

Parramatta, Australia
4.2
7
Member since April 12, 2010

Client verification
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)