TMX import takes 10 hours?
论题张贴者: Gyula Erdesz
Gyula Erdesz
Gyula Erdesz
匈牙利
Local time: 23:19
正式会员 (自2009)
English英语译成Hungarian匈牙利语
+ ...
Nov 7, 2011

Dear colleagues,

Yesterday I tried to import a middle-sized TMX file with 220.000 segments into an internal memory of Swordfish 3.0.6. As I did not know the capabilities of the software I intentionally chose the faster import option and left my computer alone for all day. But to my astonishment, Swordfish was not able to import the memory in 12 hours. At that point I interrupted the import process.

Is there any possibility to make the import faster or more effective? As
... See more
Dear colleagues,

Yesterday I tried to import a middle-sized TMX file with 220.000 segments into an internal memory of Swordfish 3.0.6. As I did not know the capabilities of the software I intentionally chose the faster import option and left my computer alone for all day. But to my astonishment, Swordfish was not able to import the memory in 12 hours. At that point I interrupted the import process.

Is there any possibility to make the import faster or more effective? As this translation memory size is quite average in my practice, plus I receive memory updates for almost every translation projects, I really need to find a solution for a faster and more effective import.

Shall I use a third party database engine (e.g. Oracle or SQL) memory instead of the internal memory of the software? Would it make the import process faster? If the import itself is so slow, how fast can Swordfish handle huge memories during translation?

Thank you in advance for sharing your ideas, opinions and experience.

Best regards,

Gyula Erdész
Collapse


 
Tomás Cano Binder, BA, CT
Tomás Cano Binder, BA, CT  Identity Verified
西班牙
Local time: 23:19
正式会员 (自2005)
English英语译成Spanish西班牙语
+ ...
Just one idea Nov 7, 2011

I am NOT a user of Swordfish, but to me, this looks more like a setup/performance problem in your computer than a problem with Swordfish handling large memories. With any reasonably fast CAT system, and I assume Swordfish is one of them, importing 220K segments should not take more than 10-15 minutes.

Just for the sake of the experiment, how about installing a more professional-level
... See more
I am NOT a user of Swordfish, but to me, this looks more like a setup/performance problem in your computer than a problem with Swordfish handling large memories. With any reasonably fast CAT system, and I assume Swordfish is one of them, importing 220K segments should not take more than 10-15 minutes.

Just for the sake of the experiment, how about installing a more professional-level database management system, like MySQL (the Community Server is free) in your machine and use Swordfish with it to see whether this improves the situation?

As per Swordfish's User Guide you can use MySQL with the software.
Collapse


 
WolfgangS
WolfgangS
法国
Local time: 23:19
正式会员 (自2007)
English英语译成German德语
+ ...
TMX import takes 10 hours Nov 7, 2011

Hi
I am an occasional user of Swordfish. The other day I imported a very large TMX (257 MB) into a new empty TM. It "only" took 3 hours ...
iMac 2,7 GHz,
Lion OSX
8 GB of RAM

I really would like to install MySQL but don't know how to do it so step by step instruction would be welcome.

Regards
Wolfgang


 
Gyula Erdesz
Gyula Erdesz
匈牙利
Local time: 23:19
正式会员 (自2009)
English英语译成Hungarian匈牙利语
+ ...
主题发起人
hardly a setup question Nov 7, 2011

Tomás Cano Binder, CT wrote:

With any reasonably fast CAT system, and I assume Swordfish is one of them, importing 220K segments should not take more than 10-15 minutes.



Dear Tomás,

After the unsuccessful import with Swordfish, I imported the TMX file into memoQ within 6 minutes. Based on this, I do not think that a setup/performance problem slows down Swordfish.

Thanks for the hint with MySQL database. It definitely wort a try.

Regards,

Gyula


 
Gyula Erdesz
Gyula Erdesz
匈牙利
Local time: 23:19
正式会员 (自2009)
English英语译成Hungarian匈牙利语
+ ...
主题发起人
And then? Nov 7, 2011

Wolfgang Schoene wrote:

I am an occasional user of Swordfish. The other day I imported a very large TMX (257 MB) into a new empty TM. It "only" took 3 hours ...
iMac 2,7 GHz,
Lion OSX
8 GB of RAM


Thank you for your feedback, Wolfgang.

My computer has got the same parameters as yours, but I use Linux OS.

How Swordfish behaves when you use the imported huge TM? Do you need to wait several seconds when Swordfish does TM lookup?

Regards,

Gyula


 
WolfgangS
WolfgangS
法国
Local time: 23:19
正式会员 (自2007)
English英语译成German德语
+ ...
And then ... Nov 7, 2011

Hi Gyula,
once the TM has been imported, lookup during the translation process is fast and fast means immediate. What is very slow, though, is Concordance search.

Wolfgang


 
Rodolfo Raya
Rodolfo Raya  Identity Verified
Local time: 19:19
English英语译成Spanish西班牙语
Split the file Nov 7, 2011

Hi,

Split the TMX file in several smaller pieces and import the pieces to the database.

Regards,
Rodolfo


 
Piotr Bienkowski
Piotr Bienkowski  Identity Verified
波兰
Local time: 23:19
正式会员 (自2005)
English英语译成Polish波兰语
+ ...
Some optimization? Nov 7, 2011

Rodolfo Raya wrote:

Hi,

Split the TMX file in several smaller pieces and import the pieces to the database.

Regards,
Rodolfo


Maybe the TMX could use some optimization as well, like removing duplicates, "noise" segments that do not contain anything useful, etc?

PB


 
Michael Beijer
Michael Beijer  Identity Verified
英国
Local time: 22:19
会员
Dutch荷兰语译成English英语
+ ...
Hi Piotr Jul 5, 2013

Piotr Bienkowski wrote:

Rodolfo Raya wrote:

Hi,

Split the TMX file in several smaller pieces and import the pieces to the database.

Regards,
Rodolfo


Maybe the TMX could use some optimization as well, like removing duplicates, "noise" segments that do not contain anything useful, etc?

PB


You wouldn't happen to have any tips on how to go about this would you?

I am trying to move a large number of TMXs generated by memoQ to CafeTran and have managed to remove all of the useless codes (I think so, at least, I did it with Olifant). However, I notice that my TMXs contain quite a few duplicates and empty segments.

Michael


 


To report site rules violations or get help, contact a site moderator:

本论坛的版主
Maya Gorgoshidze[Call to this topic]

You can also contact site staff by submitting a support request »

TMX import takes 10 hours?






Pastey
Your smart companion app

Pastey is an innovative desktop application that bridges the gap between human expertise and artificial intelligence. With intuitive keyboard shortcuts, Pastey transforms your source text into AI-powered draft translations.

Find out more »
LinguaCore
AI Translation at Your Fingertips

The underlying LLM technology of LinguaCore offers AI translations of unprecedented quality. Quick and simple. Add a human linguistic review at the end for expert-level quality at a fraction of the cost and time.

More info »