This site uses cookies.
Some of these cookies are essential to the operation of the site,
while others help to improve your experience by providing insights into how the site is being used.
For more information, please see the ProZ.com privacy policy.
Issues with scanned literary text PDF file converting to machine readable literary text Word file
Thread poster: Wei Ralph
Wei Ralph United States Local time: 22:23 Member (2013) English to Chinese + ...
Feb 25, 2021
Issues with scanned literary text PDF file converting to machine readable literary text Word file: 1. How to obtain clean machine readable literary text Word format from scanned literary text PDF file? 2. How to obtain clean machine readable literary text Word layout from scanned literary text PDF file? Any experience you have to share will be greatly appreciated.
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Gerard de Noord France Local time: 05:23 Member (2003) English to Dutch + ...
Have you tried opening the PDF in Word?
Feb 25, 2021
Have you tried opening the PDF file with File/Open in a current version of Word?
Cheers, Gerard
Jorge Payan
Mamadou djiby Wane
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Jorge Payan Colombia Local time: 22:23 Member (2002) German to Spanish + ...
Next: get yourself an OCR software
Feb 25, 2021
Gerard de Noord wrote:
Have you tried opening the PDF file with File/Open in a current version of Word?
Cheers, Gerard
If what Gerard de Noord suggests fail, you should try ABBYY FineReader or similar.
Saludos
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Samuel Murray Netherlands Local time: 05:23 Member (2006) English to Afrikaans + ...
@Wei
Feb 26, 2021
Wei Ralph wrote: How [can I] obtain clean machine readable literary text [in] Word [with correct] format [and/or layout] from [a] scanned literary text PDF file?
You need to either use a very good OCR program or you have to hire a typist. If you hire a good typist, you would not need to do anything further, but even the best OCR programs only get it 95% right, requiring you to fix layout and formatting manually afterwards.
Wei Ralph
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Wei Ralph United States Local time: 22:23 Member (2013) English to Chinese + ...
TOPIC STARTER
Issues with scanned literary text PDF file converting to machine readable literary text Word file
Feb 26, 2021
Mr. Murray,
Do you have an email address that can receive a page of this PDF file? or I can go ahead and upload a page here.
Wei Ralph
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Wei Ralph United States Local time: 22:23 Member (2013) English to Chinese + ...
TOPIC STARTER
Issues with scanned literary text PDF file converting to machine readable literary text Word file
Feb 26, 2021
Gerard,
Did try and not successful. A OCR specific software is probably the next best thing , other than time consuming typing.
Wei Ralph
Subject:
Comment:
The contents of this post will automatically be included in the ticket generated. Please add any additional comments or explanation (optional)
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Manage your TMs and Terms ... and boost your translation business
Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.