Optimale Trajektorien mit Reinforcement Learning

Büchler, Dieter

DC Field	Value	Language
dc.contributor.advisor	Meisel, Andreas	-
dc.contributor.author	Büchler, Dieter
dc.date.accessioned	2020-09-29T11:35:29Z	-
dc.date.available	2020-09-29T11:35:29Z	-
dc.date.created	2012
dc.date.issued	2012-06-27
dc.identifier.uri	http://hdl.handle.net/20.500.12738/5838	-
dc.description.abstract	Diese Arbeit umfasst die Modellierung, die Umsetzung und den Test eines Zustandssignals für einen Reinforcement Learning Agenten. Ziel ist es, so schnell, wie möglich, über eine Rennstrecke zu fahren, was mit der Suche nach einer optimalen Fahrspur verbunden ist. Mittel des Neural Fitted Q Iteration Algorithmus wird in einem kontinuierlichen Zustand- und Aktionsraum und ohne Modell der Umwelt Daten für die Q-Funktion gesammelt. Die Approximation dieser Funktion wird mit einem künstlichen neuronalen Netz umgesetzt.	de
dc.description.abstract	This work covers the development, the implementation and the test of a state signal for a Reinforcement Learning agent. The aim is to drive as fast as possible over a race circuit. That involves searching for an optimal racing line. Data for the Q-function is collected in a continuous action-state-space without a model of the environment using the Neural Fitted Q Iteration algorithm. The function approximation of the Q-function is done by an artificial neural network.	en
dc.language.iso	de	de
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	-
dc.subject.ddc	004 Informatik
dc.title	Optimale Trajektorien mit Reinforcement Learning	de
dc.type	Thesis
openaire.rights	info:eu-repo/semantics/openAccess
thesis.grantor.department	Department Informatik
thesis.grantor.place	Hamburg
thesis.grantor.universityOrInstitution	Hochschule für angewandte Wissenschaften Hamburg
tuhh.contributor.referee	Rauscher-Scheibe, Annabella	-
tuhh.gvk.ppn	718295412
tuhh.identifier.urn	urn:nbn:de:gbv:18302-reposit-58405	-
tuhh.note.extern	publ-mit-pod
tuhh.note.intern	1
tuhh.oai.show	true	en_US
tuhh.opus.id	1751
tuhh.publication.institute	Department Informatik
tuhh.type.opus	Bachelor Thesis	-
dc.subject.gnd	Bestärkendes Lernen <Künstliche Intelligenz>
dc.type.casrai	Supervised Student Publication	-
dc.type.dini	bachelorThesis	-
dc.type.driver	bachelorThesis	-
dc.type.status	info:eu-repo/semantics/publishedVersion
dc.type.thesis	bachelorThesis
dcterms.DCMIType	Text	-
tuhh.dnb.status	domain	-
item.creatorGND	Büchler, Dieter	-
item.grantfulltext	open	-
item.openairetype	Thesis	-
item.advisorGND	Meisel, Andreas	-
item.fulltext	With Fulltext	-
item.languageiso639-1	de	-
item.cerifentitytype	Publications	-
item.creatorOrcid	Büchler, Dieter	-
item.openairecristype	http://purl.org/coar/resource_type/c_46ec	-
Appears in Collections:	Theses