Hi,
I am a UK based developer that specialises in data scanning and machine learning algorithms. I have already developed a solution that will fit your needs (very quick scanning). The software requires a text delimited file as an input and provides the same format as an output.
Please contact me for more information.
I have been developing for 6 years with J2EE technologies. Most of the projects I have worked were having CMS with main activities being document management centered.