I need someone to extract information from tables in a PDF file and transfer it to an Excel file.
The Excel file is attached, and the PDF file is hosted here: [login to view URL]
There are five sheets in the Excel file, and they correspond to the following tables/pages in the PDF file (page numbers listed are those in the text of the PDF):
US_Army_General_Officers_Active = UNITED STATES ARMY GENERAL OFFICERS ACTIVE DURING THE CIVIL WAR ERA (Pages 701-762)
Union_Militia_General_Officers = UNION MILITIA GENERAL OFFICERS (Pages 762-767)
United_States_Navy_Flag_Grades = UNITED STATES NAVY FLAG GRADES (Pages 768-772)
Confederate_General_Officers = CONFEDERATE STATES ARMY GENERAL OFFICERS AND NAVY FLAG OFFICERS ACTIVE DURING THE CIVIL WAR ERA (Pages 787-801)
Confederate_Militia_Officers = CONFEDERATE STATES ARMY MILITIA GENERAL OFFICERS (Pages 801-807)
Some notes: If an entry in the PDF has no name, it belongs to the individual named in the previous line.
Rank should be the first column in each sheet (the rank is listed before each table in the PDF).
The Excel tables should match the format of those in the PDF as closely as possible (with the exception that the Excel date format should be Month Day, Year).
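For anyone scripting part of this, the three notes above could be sketched in Python roughly as follows. This is only an illustration and assumes the table rows have already been pulled out of the PDF as dictionaries. The column keys "Name" and "Rank" and the function names are hypothetical, not taken from the Excel file.

```python
from datetime import date

MONTHS = [
    "January", "February", "March", "April", "May", "June", "July",
    "August", "September", "October", "November", "December",
]


def fill_missing_names(rows, name_key="Name"):
    """Give each nameless entry the name from the previous line.

    `name_key` is an assumed column name; adjust it to the real sheet.
    """
    filled = []
    previous_name = None
    for row in rows:
        row = dict(row)  # copy so the input rows are left untouched
        name = (row.get(name_key) or "").strip()
        if name:
            previous_name = name
        elif previous_name is not None:
            row[name_key] = previous_name
        filled.append(row)
    return filled


def add_rank_column(rows, rank, rank_key="Rank"):
    """Put the rank (the heading printed before a table) in the first column."""
    return [{rank_key: rank, **row} for row in rows]


def format_date(d):
    """Render a date as 'Month Day, Year', e.g. date(1861, 5, 4) -> 'May 4, 1861'."""
    return f"{MONTHS[d.month - 1]} {d.day}, {d.year}"
```

The month names are spelled out in a list rather than taken from `strftime` so the output does not depend on the machine's locale or on platform-specific flags for dropping the leading zero of the day.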