Extended Semantic Web Conference 2024 (ESWC), Date: 2024/05/26 - 2024/05/30, Location: Hersonissos, Heraklion, Greece

Publication date: 2024-05-19
Volume: 14665 Pages: 178 - 198
ISBN: 978-3-031-60635-9
Publisher: Springer, Cham

The Semantic Web

Author:

Dasoulas, Ioannis ; Yang, Duo ; Dimou, Anastasia

Keywords:

Flanders Make at KU Leuven

Abstract:

With the Machine Learning (ML) field rapidly evolving, ML pipelines continuously grow in number, complexity and components. Online platforms (e.g., OpenML, Kaggle) aim to gather and disseminate ML experiments. However, the available knowledge is fragmented: each platform represents either distinct components of the ML process or overlapping components in different ways. To address this problem, we leverage semantic web technologies to model and integrate ML datasets, experiments, software and scientific works into MLSea, a resource consisting of: (i) MLSO, an ontology that models ML datasets, pipelines and implementations; (ii) MLST, taxonomies with collections of ML knowledge formulated as controlled vocabularies; and (iii) MLSea-KG, an RDF graph containing ML datasets, pipelines, implementations and scientific works from diverse sources. MLSea paves the way for improving the search, explainability and reproducibility of ML pipelines.