
Terminology

Publication date: 2014-01-01
Volume: 20 Pages: 279 - 303
Publisher: John Benjamins Publishing Co.

Author:

Bertels, Ann
Speelman, Dirk

Keywords:

specialized corpora, distributional semantics, multidimensional scaling (MDS), semantic similarity, second-order and third-order co-occurrences, Social Sciences, Linguistics, Language & Linguistics, 2004 Linguistics, Languages & Linguistics, 4703 Language studies, 4704 Linguistics

Abstract:

This paper presents an innovative approach, within the framework of distributional semantics, for the exploration of semantic similarity in a technical corpus. Complementing a previous quantitative semantic analysis conducted in the same domain of machining terminology, this paper sets out to discover fine-grained semantic distinctions in an attempt to explore the semantic heterogeneity of a number of technical items. Multidimensional scaling (MDS) analysis was carried out in order to cluster first-order co-occurrences of a technical node with respect to shared second-order and third-order co-occurrences. By taking into account the association values between relevant first-order and second-order co-occurrences, semantic similarities and dissimilarities between first-order co-occurrences could be determined, as well as proximities and distances on a graph. In our discussion of the methodology and results of statistical clustering techniques for semantic purposes, we pay special attention to the linguistic and terminological interpretation.
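
The general idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not state which dissimilarity measure or MDS variant was used, so cosine distance and classical (Torgerson) MDS are assumed here. The word labels and association values are hypothetical toy data. Rows stand for first-order co-occurrences of a node, columns for second-order co-occurrences, and each cell for an association value.

```python
import numpy as np


def cosine_distances(assoc):
    """Pairwise cosine distance between the rows of an association matrix."""
    norms = np.linalg.norm(assoc, axis=1, keepdims=True)
    unit = assoc / norms
    sim = np.clip(unit @ unit.T, -1.0, 1.0)
    dist = 1.0 - sim
    np.fill_diagonal(dist, 0.0)
    return dist


def classical_mds(dist, n_dims=2):
    """Classical (Torgerson) MDS: embed points from a distance matrix."""
    n = dist.shape[0]
    # Double-center the squared distances to obtain a Gram matrix.
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    # Keep the n_dims largest eigenvalues; clip tiny negatives to zero.
    order = np.argsort(eigvals)[::-1][:n_dims]
    top_vals = np.clip(eigvals[order], 0.0, None)
    return eigvecs[:, order] * np.sqrt(top_vals)


# Hypothetical first-order co-occurrences (rows) of a technical node.
FIRST_ORDER = ["word_a", "word_b", "word_c", "word_d"]

# Hypothetical association values with four second-order
# co-occurrences (columns); toy numbers, not data from the paper.
ASSOC = np.array([
    [5.0, 4.0, 0.0, 0.0],
    [4.0, 5.0, 0.0, 0.0],
    [0.0, 0.0, 5.0, 4.0],
    [0.0, 0.0, 4.0, 5.0],
])

DIST = cosine_distances(ASSOC)
COORDS = classical_mds(DIST, n_dims=2)

for word, (x, y) in zip(FIRST_ORDER, COORDS):
    print(f"{word}: ({x:.3f}, {y:.3f})")
```

In this toy setup, word_a and word_b share the same second-order co-occurrences and therefore land close together in the two-dimensional layout, while word_c and word_d form a second group far from the first. Proximity on the plot thus reflects shared second-order context, which is the kind of pattern the abstract proposes to interpret linguistically.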