Verlagslink DOI: | 10.1007/s10957-024-02391-9 | Titel: | On optimizing ensemble models using column generation | Sprache: | Englisch | Autorenschaft: | Aziz, Vanya Wu, Ouyang Nowak, Ivo Hendrix, Eligius M.T. Kronqvist, Jan |
Schlagwörter: | Column generation; Ensemble; Linear programming; Machine learning; Pricing problem | Erscheinungsdatum: | 22-Feb-2024 | Verlag: | Springer | Zeitschrift oder Schriftenreihe: | Journal of optimization theory and applications | Zeitschriftenband: | 203 | Anfangsseite: | 1794 | Endseite: | 1819 | Zusammenfassung: | In recent years, an interest appeared in integrating various optimization algorithms in machine learning. We study the potential of ensemble learning in classification tasks and how to efficiently decompose the underlying optimization problem. Ensemble learning has become popular for machine learning applications and it is particularly interesting from an optimization perspective due to its resemblance to column generation. The challenge for learning is not only to obtain a good fit for the training data set, but also good generalization, such that the classifier is generally applicable. Deep networks have the drawback that they require a lot of computational effort to get to an accurate classification. Ensemble learning can combine various weak learners, which individually require less computational time. We consider binary classification problems studying a three-phase algorithm. After initializing a set of base learners refined by a bootstrapping approach, base learners are generated using the solution of an linear programming (LP) master problem and then solving a machine learning sub-problem regarding a reduced data set, which can be viewed as a so-called pricing problem. We theoretically show that the algorithm computes an optimal ensemble model in the convex hull of a given model space. The implementation of the algorithm is part of an ensemble learning framework called decolearn. Numerical experiments with CIFAR-10 data set show that the base learners are diverse and that both the training and generalization error are reduced after several iterations. |
URI: | https://hdl.handle.net/20.500.12738/17171 | ISSN: | 1573-2878 | Begutachtungsstatus: | Diese Version hat ein Peer-Review-Verfahren durchlaufen (Peer Review) | Einrichtung: | Department Maschinenbau und Produktion Fakultät Technik und Informatik |
Dokumenttyp: | Zeitschriftenbeitrag |
Enthalten in den Sammlungen: | Publications without full text |
Zur Langanzeige
Volltext ergänzen
Feedback zu diesem Datensatz
Export
Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons