Publisher URL: https://www.scitepress.org/Papers/2022/109806/109806.pdf
Publisher DOI: 10.5220/0010980600003116
Title: Towards more reliable text classification on edge devices via a Human-in-the-Loop
Language: English
Authors: Andersen, Jakob Smedegaard  
Zukunft, Olaf 
Keywords: Hybrid Intelligent Systems; Machine Learning; Text Classification; Interactive Machine Learning; Time Efficiency
Issue Date: 2022
Publisher: SciTePress
Part of Series: Proceedings of the 14th International Conference on Agents and Artificial Intelligence 
Volume number: 2 : ICAART
Startpage: 636
Endpage: 646
Conference: International Conference on Agents and Artificial Intelligence 2022 
Abstract: 
Reliably classifying huge amounts of textual data is a primary objective of many machine learning applications. However, state-of-the-art text classifiers require extensive computational resources, which limit their applicability in real-world scenarios. In order to improve the application of lightweight classifiers on edge devices, e.g. personal work stations, we adapt the Human-in-the-Loop paradigm to improve the accuracy of classifiers without re-training by manually validating and correcting parts of the classification outcome. This paper performs a series of experiments to empirically assess the performance of the uncertainty-based Human-in-the-Loop classification of nine lightweight machine learning classifiers on four real-world classification tasks using pre-trained SBERT encodings as text features. Since time efficiency is crucial for interactive machine learning pipelines, we further compare the training and inference time to enable rapid interactions. Our results indicate that lightweight classifiers with a human in the loop can reach strong accuracies, e.g. improving a classifier’s F1-Score from 90.19 to 97% when 22.62% of a dataset is classified manually. In addition, we show that SBERT based classifiers are time efficient and can be re-trained in < 4 seconds using a Logistic Regression model.
URI: http://hdl.handle.net/20.500.12738/12790
ISBN: 978-989-758-547-0
Review status: This version was peer reviewed (peer review)
Institute: Forschungsgruppe Big Data Lab 
Department Informatik 
Fakultät Technik und Informatik 
Type: Chapter/Article (Proceedings)
Appears in Collections:Publications without full text

Show full item record

Page view(s)

102
checked on Dec 25, 2024

Google ScholarTM

Check

HAW Katalog

Check

Add Files to Item

Note about this record


This item is licensed under a Creative Commons License Creative Commons