The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

Training-less Multi-label Text Classification Using Knowledge Bases and Word Embeddings

Author: Wael Alkhatib, Steffen Schnitzer, and Christoph Rensing
Date: May 2019
Kind: In proceedings
Book title: The 12th International Conference on Knowledge Science, Engineering and Management (KSEM 2019)
Editor: Christos Douligeris, Dimitris Karagiannis, Dimitris Apostolou
Keywords: semantics, knowledge base, ontology, text classification
Research Area(s): Knowledge Media
Abstract: Traditional multi-label text classifiers suffer from the high dimensionality of the feature space, label imbalance, and training overhead. In this work, we depart from traditional approaches, which rely on intensive feature engineering and linguistic analysis, by introducing a novel ontology-based, training-less multi-label text classifier. We transform the classification task into a graph matching problem, which removes the need for training. Experimental results on the EUR-Lex dataset show that our method offers competitive F1-macro performance relative to these traditional approaches, performing fairly across the different labels despite the training-less configuration.
URL: 978-3-030-29563-9_10
