Verlagslink: https://www.scitepress.org/Papers/2022/109806/109806.pdf
Verlagslink DOI: 10.5220/0010980600003116
Titel: Towards more reliable text classification on edge devices via a Human-in-the-Loop
Sprache: Englisch
Autorenschaft: Andersen, Jakob Smedegaard  
Zukunft, Olaf 
Schlagwörter: Hybrid Intelligent Systems; Machine Learning; Text Classification; Interactive Machine Learning; Time Efficiency
Erscheinungsdatum: 2022
Verlag: SciTePress
Teil der Schriftenreihe: Proceedings of the 14th International Conference on Agents and Artificial Intelligence 
Bandangabe: 2 : ICAART
Anfangsseite: 636
Endseite: 646
Konferenz: International Conference on Agents and Artificial Intelligence 2022 
Zusammenfassung: 
Reliably classifying huge amounts of textual data is a primary objective of many machine learning applications. However, state-of-the-art text classifiers require extensive computational resources, which limit their applicability in real-world scenarios. In order to improve the application of lightweight classifiers on edge devices, e.g. personal work stations, we adapt the Human-in-the-Loop paradigm to improve the accuracy of classifiers without re-training by manually validating and correcting parts of the classification outcome. This paper performs a series of experiments to empirically assess the performance of the uncertainty-based Human-in-the-Loop classification of nine lightweight machine learning classifiers on four real-world classification tasks using pre-trained SBERT encodings as text features. Since time efficiency is crucial for interactive machine learning pipelines, we further compare the training and inference time to enable rapid interactions. Our results indicate that lightweight classifiers with a human in the loop can reach strong accuracies, e.g. improving a classifier’s F1-Score from 90.19 to 97% when 22.62% of a dataset is classified manually. In addition, we show that SBERT based classifiers are time efficient and can be re-trained in < 4 seconds using a Logistic Regression model.
URI: http://hdl.handle.net/20.500.12738/12790
ISBN: 978-989-758-547-0
Begutachtungsstatus: Diese Version hat ein Peer-Review-Verfahren durchlaufen (Peer Review)
Einrichtung: Forschungsgruppe Big Data Lab 
Department Informatik 
Fakultät Technik und Informatik 
Dokumenttyp: Konferenzveröffentlichung
Enthalten in den Sammlungen:Publications without full text

Zur Langanzeige

Seitenansichten

102
checked on 26.12.2024

Google ScholarTM

Prüfe

HAW Katalog

Prüfe

Volltext ergänzen

Feedback zu diesem Datensatz


Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons Creative Commons