Find Jobs
Hire Freelancers

Develop resume parser for a specialized type of resumes

$30-250 USD

进行中
已发布将近 9 年前

$30-250 USD

货到付款
I have thousands of resumes to scan that are in PDF format. I need to take those resumes and convert them to XML format. All resumes follow a similar format and are of the same type of candidate. All are in English. I have specific needs for the resume parsing. Usually, a resume parser focuses on work experience and focuses little on related areas such as academic awards and hobbies. The resume parser I need is one that focuses on things that a normal HR resume parser will not focus on - I need it to focus on the person's hobbies, academic qualifications, guess the person's age, guess the person's gender, etc. Work experience is still important but not as important as the other information. I have attached sample files from publicly available resumes that resemble the type of resumes that need to be parsed to give you a better idea of what we need to do. Further details will be provided upon request.
项目 ID: 7916389

关于此项目

8提案
远程项目
活跃9 年前

想赚点钱吗?

在Freelancer上竞价的好处

设定您的预算和时间范围
为您的工作获得报酬
简要概述您的提案
免费注册和竞标工作
颁发给:
用户头像
I've done a lot of work with Python and parsing data. I did some research and found the best/most reliable way to grab the text from the pdf is to use the xpdf package which includes a binary which does a pdf to txt conversion. Then all that remains is to parse the text into python and find a way to guess the information you want. For age, I think using the graduation years from school would be a good starting point, with tweaking based on other factors such as vocabulary used, etc. For gender, use the degree type/work experience and we can use probabilities to determine the likely gender. Any other classification can also be done once all the text is in the Python script.
$155 USD 在2天之内
0.0 (0条评论)
0.0
0.0
8威客以平均价$191 USD来参与此工作竞价
用户头像
Hi! I have good experience in python programming and data parsing. I think I can help you with this task.
$200 USD 在3天之内
5.0 (173条评论)
5.9
5.9
用户头像
A proposal has not yet been provided
$200 USD 在3天之内
4.9 (6条评论)
3.8
3.8
用户头像
Hello. How are u I saw your description and sample pdfs. I think that main point is to extract text from pdf . and I have convert to XML Format. I can complete well. I want to discuss with u, Please contact me. I'll wait your good reply. Bye Huang.
$189 USD 在3天之内
5.0 (8条评论)
3.5
3.5
用户头像
I am student pursuing my degree and have more free time to work and also working on a project based on python. Familiar with regular expressions module in python.
$166 USD 在3天之内
0.0 (0条评论)
0.0
0.0
用户头像
I have experience in file type parsing. I have developed doc, xls, ppt, pdf and rtf file parsers.
$250 USD 在3天之内
0.0 (0条评论)
0.0
0.0

关于客户

UNITED STATES的国旗
Cambridge, United States
4.9
6
付款方式已验证
会员自6月 22, 2015起

客户认证

谢谢!我们已通过电子邮件向您发送了索取免费积分的链接。
发送电子邮件时出现问题。请再试一次。
已注册用户 发布工作总数
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
加载预览
授予地理位置权限。
您的登录会话已过期而且您已经登出,请再次登录。