Extracting text out of scanned PDF - 26/05/2023 07:50 EDT

Geschlossen Veröffentlicht vor 12 Monaten Bezahlt bei Lieferung
Geschlossen Bezahlt bei Lieferung

I need a freelancer who can extract text, containing special characters, from a 500 pages scanned PDFs for data entry purposes. The extracted text needs to be formatted as Excel/CSV files. The ideal candidate should have experience in data entry and knowledge of OCR software. Attention to detail is crucial. I would like to get the code, relying on freely available python libraries, so I can audit it and reuse later if needed.

An example page is shown in attachment. The page contains 2 tables. I need all the data to be extracted an put in a 3 or 6 column csv (whether you append the table on the right in the same columns as the table on the left is up to you).

Datenerfassung Datenverarbeitung OCR Python

Projekt-ID: #36658293

Über das Projekt

88 Vorschläge Remote Projekt Aktiv vor 10 Monaten