Title: Improving fuzzy matching through syntactic knowledge
Authors: Vanallemeersch, Tom ×
Vandeghinste, Vincent #
Issue Date: 27-Nov-2014
Publisher: ASLING
Host Document: Translating and the Computer vol:36 pages:217-227
Conference: Translating and the Computer edition:36 location:London date:27-28 November 2014
Abstract: Fuzzy matching in translation memories (TM) is mostly string-based in current CAT tools.
These tools look for TM sentences highly similar to an input sentence, using edit distance to
detect the differences between sentences. Current CAT tools use limited or no linguistic
knowledge in this procedure. In the recently started SCATE project, which aims at improving
translators’ efficiency, we apply syntactic fuzzy matching in order to detect abstract similarities
and to increase the number of fuzzy matches. We parse TM sentences in order to create
hierarchical structures identifying constituents and/or dependencies. We calculate TER
(Translation Error Rate) between an existing human translation of an input sentence and the
translation of its fuzzy match in TM. This allows us to assess the usefulness of syntactic
matching with respect to string-based matching. First results hint at the potential of syntactic
matching to lower TER for sentences with a low match score in a string-based setting.
ISBN: 9782970073628
Publication status: published
KU Leuven publication type: IC-p
Appears in Collections: Formal and Computational Linguistics (ComForT), Leuven
× corresponding author
# (joint) last author
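
The string-based baseline described in the abstract (edit distance between an input sentence and TM sentences, then scoring the retrieved translation against a human reference) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the function names, whitespace tokenization, the normalization of the match score by the longer sentence, and a simplified TER that counts only insertions, deletions and substitutions (standard TER also allows block shifts). The syntactic matching proposed in the paper is not shown.

```python
def edit_distance(a, b):
    """Word-level Levenshtein distance between two token lists."""
    prev = list(range(len(b) + 1))
    for i, tok_a in enumerate(a, start=1):
        curr = [i]
        for j, tok_b in enumerate(b, start=1):
            cost = 0 if tok_a == tok_b else 1
            curr.append(min(prev[j] + 1,          # delete tok_a
                            curr[j - 1] + 1,      # insert tok_b
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def fuzzy_score(input_sentence, tm_sentence):
    """Hypothetical match score in [0, 1]; 1.0 means identical token sequences."""
    a, b = input_sentence.split(), tm_sentence.split()
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


def best_match(input_sentence, tm):
    """Return the (source, target) pair in tm whose source scores highest."""
    return max(tm, key=lambda pair: fuzzy_score(input_sentence, pair[0]))


def simplified_ter(hypothesis, reference):
    """Edits per reference word, WITHOUT the shift operation of standard TER."""
    hyp, ref = hypothesis.split(), reference.split()
    return edit_distance(hyp, ref) / max(len(ref), 1)


# Toy translation memory (invented example data).
tm = [
    ("the cat sat on the mat", "de kat zat op de mat"),
    ("press the red button", "druk op de rode knop"),
]
source, target = best_match("the dog sat on the mat", tm)
print(source, "->", target)
print(round(fuzzy_score("the dog sat on the mat", source), 3))
```

In this sketch, a lower `simplified_ter` between the retrieved target and a human translation of the input would indicate a more useful match, which mirrors the evaluation idea in the abstract.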

Files in This Item:
File: Asling_paper.pdf
Status: Published
Size: 596Kb
Format: Adobe PDF

