DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Meisel, Andreas | - |
dc.contributor.author | Rohden, Andre | |
dc.date.accessioned | 2020-09-29T15:01:11Z | - |
dc.date.available | 2020-09-29T15:01:11Z | - |
dc.date.created | 2019 | |
dc.date.issued | 2019-04-17 | |
dc.identifier.uri | http://hdl.handle.net/20.500.12738/8674 | - |
dc.description.abstract | Reinforcement Learning ermöglicht einem selbstlernenden Agenten, ein unbemanntes Flugobjekt in unkontrollierten Flugzuständen zu stabilisieren. Um dies zu erreichen, wird ein Deep Deterministic Policy Gradient Algorithmus angewendet. Durch Erweiterungen wie Experience Replay Speicher, parametrisiertes Rauschen, Prioritized Experience Replay, Hindsight Experience Replay und Curriculum Learning lassen sich darüber hinaus Umgebungen mit sparse Reward trainieren. | de |
dc.description.abstract | Reinforcement learning allows a self-learning agent to stabilize an unmanned aerial vehicle in uncontrolled flight states. To achieve this, a deep deterministic policy gradient algorithm is applied. With extensions such as experience replay memory, parameterized noise, prioritized experience replay, hindsight experience replay and curriculum learning, the agent can also be trained in environments with sparse rewards. | en |
dc.language.iso | de | de |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | - |
dc.subject | reinforcement learning | en |
dc.subject | deep deterministic policy gradient | en |
dc.subject | experience replay memory | en |
dc.subject | curriculum learning | en |
dc.subject | quadcopter | en |
dc.subject.ddc | 004 Informatik | |
dc.title | Stabilisierung unkontrollierter Flugzustände mit Reinforcement Learning | de |
dc.title.alternative | Stabilization of uncontrolled flight states with reinforcement learning | en |
dc.type | Thesis | |
openaire.rights | info:eu-repo/semantics/openAccess | |
thesis.grantor.department | Department Informatik | |
thesis.grantor.place | Hamburg | |
thesis.grantor.universityOrInstitution | Hochschule für angewandte Wissenschaften Hamburg | |
tuhh.contributor.referee | Fohl, Wolfgang | - |
tuhh.gvk.ppn | 1663355266 | |
tuhh.identifier.urn | urn:nbn:de:gbv:18302-reposit-86762 | - |
tuhh.note.intern | 1 | |
tuhh.oai.show | true | en_US |
tuhh.opus.id | 4675 | |
tuhh.publication.institute | Department Informatik | |
tuhh.type.opus | Masterarbeit | - |
dc.subject.gnd | Operante Konditionierung | |
dc.type.casrai | Supervised Student Publication | - |
dc.type.dini | masterThesis | - |
dc.type.driver | masterThesis | - |
dc.type.status | info:eu-repo/semantics/publishedVersion | |
dc.type.thesis | masterThesis | |
dcterms.DCMIType | Text | - |
tuhh.dnb.status | domain | - |
item.advisorGND | Meisel, Andreas | - |
item.languageiso639-1 | de | - |
item.fulltext | With Fulltext | - |
item.creatorGND | Rohden, Andre | - |
item.openairetype | Thesis | - |
item.grantfulltext | open | - |
item.creatorOrcid | Rohden, Andre | - |
item.cerifentitytype | Publications | - |
item.openairecristype | http://purl.org/coar/resource_type/c_46ec | - |
Appears in Collections: | Theses |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
masterthesis_rohden.pdf | | 18.29 MB | Adobe PDF | View/Open |
Items in REPOSIT are protected by copyright, with all rights reserved, unless otherwise indicated.