Novel distributional reinforcement and ensemble learning algorithms

Aziz, Vanya

doi:10.48441/4427.3295

Hinweis Dies ist nicht die aktuellste Version dieses Werkes. Die aktuellste Version findet sich unter:http://dx.doi.org/10.48441/4427.3295.2

Zitierlink: https://doi.org/10.48441/4427.3295

Verlagslink:	https://hdl.handle.net/10630/39287
Titel:	Novel distributional reinforcement and ensemble learning algorithms
Sonstige Titel:	Nuevos algoritmos de aprendizaje por refuerzo distributivo y aprendizaje ensamblador
Sprache:	Englisch
Autorenschaft:	Aziz, Vanya
Schlagwörter:	Robótica - Tesis doctorales; Programación lineal; Aprendizaje automático (Inteligencia artificial); Redes neuronales (Informática); Distributional Reinforcement Learning; Soft Actor-Critic; Robotics; Linear Programming; Ensemble
Erscheinungsdatum:	2025
Prüfungsdatum:	2025
Verlag:	UMA Editorial
Zusammenfassung:	This dissertation focuses on Deep Reinforcement Learning (DRL), a neural network-based approach for solving Markov Decision Processes in high-dimensional spaces with unknown transition dynamics. The main contribution of this thesis is the development of a novel state-of-the-art distributional reinforcement learning algorithm within the maximum-entropy Actor-Critic framework. This algorithm, termed ”Cramér-based Soft Distributional Soft Actor-critic” (C-DSAC), demonstrates superior performance to other RL algorithms, especially in environments with high-dimensional spaces and complex dynamics. Its performance is shown to be partly rooted in a phenomenon arising in Cramér-metric-based Distributional Reinforcement Learning, referred to as confidence-driven model updates. This mechanism ensures that the value function approximator is updated more conservatively when confidence in its estimates is low. Theoretical justifications for the algorithm are provided, demonstrating its convergence in the policy evaluation setting and, under widely accepted mild assumptions, in the control setting as well. Beyond foundational algorithmic research, this thesis contributes to the practical application of RL in robotics. Given the crucial role of multi-joint robotic systems in modern production technology, a RL meta-algorithm called ”Reinforcement Learning - Inverse Kinematics” (RL-IK) is devised. This approach enhances the applicability of reinforcement learning to robotic control tasks by significantly accelerating convergence to near-optimal policies compared to standard RL methods. An essential prerequisite for real-world RL applications in control systems is machine perception for state identification. To address challenges in this field, this thesis explores novel Supervised Learning (SL) approaches, validated on image classification tasks, with a focus on ensemble learning strategies.
URI:	https://hdl.handle.net/20.500.12738/19053
DOI:	10.48441/4427.3295
Begutachtungsstatus:	Diese Version wurde begutachtet (fachspezifisches Begutachtungsverfahren)
Einrichtung:	Universidad de Málaga Universidad de Málaga. Departamento de Ingeniería mecánica, térmica y de fluidos Department Maschinenbau und Produktion (ehemalig, aufgelöst 10.2025) Fakultät Technik und Informatik (ehemalig, aufgelöst 10.2025)
Dokumenttyp:	Dissertation/Habilitation
Abschlussarbeitentyp:	Dissertation
Hinweise zur Quelle:	Aziz, Vanya. (2025). Novel distributional reinforcement and ensemble learning algorithms, I-VII, 1-153. dissertation. UMA Editorial. https://hdl.handle.net/10630/39287
Betreuer*in:	Hendrix, Eligius María Theodorus Nowak, Ivo
Enthalten in den Sammlungen:	Publications with full text

Dateien zu dieser Ressource:

Datei	Beschreibung	Größe	Format
TD_AZIZ_Vanya.pdf		3.56 MB	Adobe PDF	Öffnen/Anzeigen

Zur Langanzeige

Google Scholar^TM

Prüfe

HAW Katalog

Prüfe

Feedback zu diesem Datensatz

Export

Versionsgeschichte

Version	Datensatz	Datum	Zusammenfassung
2	doi:10.48441/4427.3295.2	2026-04-02 07:58:52.153	updated version (only minor changes): 2026-03-30
1	doi:10.48441/4427.3295	2026-03-12 14:23:29.0	old version: 2025

Ausgewählte Version

Diese Ressource wurde unter folgender Copyright-Bestimmung veröffentlicht: Lizenz von Creative Commons

Dateien zu dieser Ressource:

Google ScholarTM

HAW Katalog

Feedback zu diesem Datensatz

Google Scholar^TM