ITEM METADATA RECORD
Title: Linguistics-based word alignment for medical translators
Authors: Vanallemeersch, Tom ×
Wermuth, Cornelia #
Issue Date: Jan-2008
Series Title: Journal of Specialised Translation, issue 9, pages 20-38
Abstract: Tools assisting professional translators memorise translated sentences but provide limited functionality for the identification of terms and their translation equivalents in translated texts. In this paper, we propose a word alignment approach aiming to improve efficiency and usability of this functionality, through identification of cognates and exploitation of linguistic knowledge, such as lemmas in bilingual glossaries and dictionaries. Our approach focuses on content words, is applicable to parallel texts of various sizes, and minimises the need for user parameter tuning and preprocessing steps. The method, implemented as the FragmALex system, tackles certain types of divergences between source and target text by creating and grouping links between fragments (word parts, words and word groups). The system output consists of fragment links in their original context. We performed a case study of Dutch and French medical articles, using a medical glossary and a general-purpose dictionary of restricted size.
Comparison of the output with a gold standard shows that the addition of the dictionary to the system accounts for a higher increase in recall (completeness of alignment) than the addition of the glossary, while the decrease in precision remains low with either resource.
ISSN: 1740-357X
VABB publication type: VABB-1
Publication status: published
KU Leuven publication type: IT
Appears in Collections: Formal and Computational Linguistics (ComForT), Leuven
Multimodality, Interaction and Discourse, Campus Sint-Andries Antwerp
Linguistics Research Unit - miscellaneous
× corresponding author
# (joint) last author

Files in This Item:
File: Paper_Linguistics-based_word_alignment.pdf
Description: Main article
Status: Published
Size: 223Kb
Format: Adobe PDF

All items in Lirias are protected by copyright, with all rights reserved.