Gentlemen,
Please read carefully before you post your bid!
I have 40 XML files, each containing about 10 000 URLs (400 000 links in total).
Each URL looks like this one:
[[login to view URL]|bg][1]
I need the HTML files for those URLs.
That means about **400 000** result pages.
URL -> HTML file
**The first XML file is attached. I want each output file name to be the value of the "q" parameter in the URL plus .htm. Example:**
<a href="[login to view URL] year&langpair=en|bg" target="_blank">a</a>
The output file should be
**academic [login to view URL]**
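The naming rule above can be sketched in Python as follows. This is only a minimal illustration: the function name `output_name` and the `example.com` URL are hypothetical, and the sketch assumes every link carries a "q" query parameter whose value is safe to use as a file name.

```python
from urllib.parse import urlsplit, parse_qs


def output_name(url: str) -> str:
    """Return '<value of q>.htm' for a URL that has a 'q' query parameter."""
    # parse_qs decodes '+' and %XX escapes, so 'academic+year' becomes 'academic year'.
    query = parse_qs(urlsplit(url).query)
    values = query.get("q")
    if not values:
        raise ValueError(f"no 'q' parameter in {url!r}")
    # If 'q' appears more than once, this sketch uses the first value.
    return values[0] + ".htm"


if __name__ == "__main__":
    # Hypothetical URL, shaped like the example in the posting.
    print(output_name("http://example.com/translate?q=academic+year&langpair=en|bg"))
```

Values of "q" that contain characters not allowed in file names (such as `/`) would need extra handling that this sketch leaves out.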
Note that [login to view URL] or [login to view URL] might not work in your country. I believe you can use your country's domain instead of .bg or .com (I have tested it with .[login to view URL] and it works).
The task seems pretty simple. Here is the catch.
**Google will block you if you try to download them all at once. In fact, they will block your IP address every 10 or 100 requests.**
The only solution found so far is to change the IP address.
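One possible way to change the IP address is to send each request through a different proxy. The sketch below is an assumption, not part of the job description: the proxy addresses are hypothetical placeholders, and the helper names `fetch_via_proxy` and `fetch_rotating` are made up for illustration. It uses only the Python standard library.

```python
import itertools
import urllib.request


def fetch_via_proxy(url, proxy, timeout=30):
    """Download url through one HTTP proxy and return the raw bytes."""
    handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    opener = urllib.request.build_opener(handler)
    with opener.open(url, timeout=timeout) as response:
        return response.read()


def fetch_rotating(url, proxies, fetch=fetch_via_proxy, max_attempts=None):
    """Try the proxies in turn until one returns a page; raise if all fail."""
    attempts = max_attempts or len(proxies)
    last_error = None
    for proxy in itertools.islice(itertools.cycle(proxies), attempts):
        try:
            return fetch(url, proxy)
        except OSError as err:  # URLError and HTTPError are OSError subclasses
            last_error = err
    raise RuntimeError(f"all {attempts} attempts failed for {url!r}") from last_error


# Hypothetical proxy list; real addresses would have to be supplied.
PROXIES = ["http://10.0.0.1:8080", "http://10.0.0.2:8080"]
```

A call such as `fetch_rotating(url, PROXIES)` would move to the next proxy whenever a request fails, for example after a block.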
Good luck
## Deliverables
I'll give the XML files to the winner.