Pages in topic:   [1 2 3] >
Poll: Would you use a PDF text extractor integrated into a CAT tool?
投稿者: ProZ.com Staff
ProZ.com Staff
ProZ.com Staff
サイトのスタッフ
Dec 22, 2008

This forum topic is for the discussion of the poll question "Would you use a PDF text extractor integrated into a CAT tool?".

View the poll here

A forum topic will appear each time a new poll is run. For more information, see: http://proz.com/topic/33629


 
Miroslav Jeftic
Miroslav Jeftic  Identity Verified
Local time: 05:18
2009に入会
英語 から セルビア語
+ ...
Yes, but Dec 22, 2008

Only if it works. That is, if it is producing the same or better results than the usual OCR tools. Otherwise, no.

 
Alexander Kondorsky
Alexander Kondorsky  Identity Verified
ロシア連邦
Local time: 06:18
英語 から ロシア語
+ ...
Other Dec 22, 2008

I don't know a program capable of correctly extracting text from a PDF file so that such text could be immediately exported into CAT tool. Sadly, there is no effective way of translating PDF files with a CAT tool. PDF is a curse of translators!

 
DZiW (X)
DZiW (X)
ウクライナ
英語 から ロシア語
+ ...
Should I find it appropriate Dec 22, 2008

As far as I know there are 2 (two) types of PDF
(a) with text layer
(b) scanned image (without text layer)

My Customers and I just decided to use variant (a) and we have no problems.
Cheers and good luck

[Редактировалось 2008-12-22 15:18 GMT]


 
Deborah do Carmo
Deborah do Carmo  Identity Verified
ポルトガル
Local time: 04:18
オランダ語 から 英語
+ ...
Yes Dec 22, 2008

If the results are fit for purpose - i.e. (i) I am able to translate the converted text using the CAT tool immediately, without any need to keep going back to the original PDF to check what it actually written there and (ii) the converted text is properly formatted so I don't have to fiddle around afterwards once I've cleaned up the file.

As far as cost is concerned, as long as something is user-friendly and increases my productivity, I'm willing to invest (within reason).
... See more
If the results are fit for purpose - i.e. (i) I am able to translate the converted text using the CAT tool immediately, without any need to keep going back to the original PDF to check what it actually written there and (ii) the converted text is properly formatted so I don't have to fiddle around afterwards once I've cleaned up the file.

As far as cost is concerned, as long as something is user-friendly and increases my productivity, I'm willing to invest (within reason).

Who is asking????
Collapse


 
Charlie Bavington
Charlie Bavington  Identity Verified
Local time: 04:18
フランス語 から 英語
Free market research :-) Dec 22, 2008

What an odd question.
Unless you were thinking of putting such a product on the market, of course.

Ah well, ho ho ho anyway


 
Tim Drayton
Tim Drayton  Identity Verified
サイプラス
Local time: 06:18
トルコ語 から 英語
+ ...
other Dec 22, 2008

I do not use a CAT tool. Curiously, could PDF files end up being a niche market for those of us who do not use them?

 
Brandis (X)
Brandis (X)
Local time: 05:18
英語 から ドイツ語
+ ...
with Alexander.. Dec 22, 2008

Hi! direct page scans are big headache, especially when the scan orientation and the resolution are bad. Only today I have shifted from Abby 8.0 to Abby 9.0 extended edition, still trying to find out how this is better than the older version. I wish you all otherwise a great x-mas celebration. BR Brandis

 
Marie-Hélène Hayles
Marie-Hélène Hayles  Identity Verified
Local time: 05:18
イタリア語 から 英語
+ ...
Other Dec 22, 2008

In my experience (mainly of revising translations of extracted texts) the formatting produced by text extractors is a total nightmare - fonts expanded and reduced seemingly at random, ridiculous line spacings, text boxes everywhere, random lines dotted here and there - it takes ages to get the formatting to a reasonable standard. So the answer is probably "no", unless it was better than the text extractors already on the market. As it is, ... See more
In my experience (mainly of revising translations of extracted texts) the formatting produced by text extractors is a total nightmare - fonts expanded and reduced seemingly at random, ridiculous line spacings, text boxes everywhere, random lines dotted here and there - it takes ages to get the formatting to a reasonable standard. So the answer is probably "no", unless it was better than the text extractors already on the market. As it is, I'd be more inclined to extract the text, clean up the formatting and only then run it through a CAT - and that's only worth my while for a small fraction of the (numerous) scanned PDFs I translate.
It would also depend on exactly what CAT tool was offering the text extractor, rather than any difference in price.
Collapse


 
neilmac
neilmac
スペイン
Local time: 05:18
スペイン語 から 英語
+ ...
I'm with Miroslav Dec 22, 2008

... only if it REALLY works. Properly. With no hassle.

 
Amy Duncan (X)
Amy Duncan (X)  Identity Verified
ブラジル
Local time: 00:18
ポルトガル語 から 英語
+ ...
Interesting point, Tim... Dec 22, 2008

Tim Drayton wrote:

I do not use a CAT tool. Curiously, could PDF files end up being a niche market for those of us who do not use them?


I don't use a CAT tool, either. I know people complain about translating PDF files, but I don't mind translating them at all. I just pop them up on the screen, open a Word document and half-size it, and translate away.


 
bookwormkt
bookwormkt
Local time: 04:18
フランス語 から 英語
+ ...
A what to do what?? Dec 22, 2008

I am very new to this and don't really understand the question! I tend to get work as word documents. I print them out and translate it on alternate lines on file paper, using pencil. After working on it, I word process the translation and send it out as a word document.
Is this a bit old fashioned?
As work picks up, I will investigate these tools.


 
Nahit Karataşlı
Nahit Karataşlı  Identity Verified
トルコ
Local time: 06:18
英語 から トルコ語
+ ...
Does a translator have to extract the text? Dec 22, 2008

Does a translator have to extract the text? IMHO, an accurate answer to the question would make the issue clear. If a translator uses a CAT tool, he/she should be provided with the document format that is supported by his/her CAT tool, and he/she must be responsible for the translation (content) and TM only.

 
Miroslav Jeftic
Miroslav Jeftic  Identity Verified
Local time: 05:18
2009に入会
英語 から セルビア語
+ ...
Well, Dec 22, 2008

You are right, but that isn't always the case. I (and I'm not the only one) had experience where the source files were available only in pdf and it was take it or leave it. Of course, it's the easiest way to reject such offers, but again, it's not always the case.

 
Lesley Clarke
Lesley Clarke  Identity Verified
メキシコ
Local time: 21:18
スペイン語 から 英語
Free Marketing? Dec 22, 2008

I answered OTHER, because quality would obviously be the deciding factor, integrated or not.

 
Pages in topic:   [1 2 3] >


To report site rules violations or get help, contact a site moderator:

このフォーラムのモデレーター
Jared Tabor[Call to this topic]

You can also contact site staff by submitting a support request »

Poll: Would you use a PDF text extractor integrated into a CAT tool?






CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »