Publisher URL: | https://www.scitepress.org/Papers/2022/109806/109806.pdf | Publisher DOI: | 10.5220/0010980600003116 | Title: | Towards more reliable text classification on edge devices via a Human-in-the-Loop | Language: | English | Authors: | Andersen, Jakob Smedegaard Zukunft, Olaf |
Keywords: | Hybrid Intelligent Systems; Machine Learning; Text Classification; Interactive Machine Learning; Time Efficiency | Issue Date: | 2022 | Publisher: | SciTePress | Part of Series: | Proceedings of the 14th International Conference on Agents and Artificial Intelligence | Volume number: | 2 : ICAART | Startpage: | 636 | Endpage: | 646 | Conference: | International Conference on Agents and Artificial Intelligence 2022 | Abstract: | Reliably classifying huge amounts of textual data is a primary objective of many machine learning applications. However, state-of-the-art text classifiers require extensive computational resources, which limit their applicability in real-world scenarios. In order to improve the application of lightweight classifiers on edge devices, e.g. personal work stations, we adapt the Human-in-the-Loop paradigm to improve the accuracy of classifiers without re-training by manually validating and correcting parts of the classification outcome. This paper performs a series of experiments to empirically assess the performance of the uncertainty-based Human-in-the-Loop classification of nine lightweight machine learning classifiers on four real-world classification tasks using pre-trained SBERT encodings as text features. Since time efficiency is crucial for interactive machine learning pipelines, we further compare the training and inference time to enable rapid interactions. Our results indicate that lightweight classifiers with a human in the loop can reach strong accuracies, e.g. improving a classifier’s F1-Score from 90.19 to 97% when 22.62% of a dataset is classified manually. In addition, we show that SBERT based classifiers are time efficient and can be re-trained in < 4 seconds using a Logistic Regression model. |
URI: | http://hdl.handle.net/20.500.12738/12790 | ISBN: | 978-989-758-547-0 | Review status: | This version was peer reviewed (peer review) | Institute: | Forschungsgruppe Big Data Lab Department Informatik Fakultät Technik und Informatik |
Type: | Chapter/Article (Proceedings) |
Appears in Collections: | Publications without full text |
Show full item record
Add Files to Item
Note about this record
Export
This item is licensed under a Creative Commons License