DC FieldValueLanguage
dc.contributor.authorSharafi, Nahal-
dc.contributor.authorMartin, Christoph-
dc.contributor.authorHallerberg, Sarah-
dc.date.accessioned2025-07-11T06:26:08Z-
dc.date.available2025-07-11T06:26:08Z-
dc.date.issued2025-05-12-
dc.identifier.issn2470-0053en_US
dc.identifier.urihttps://hdl.handle.net/20.500.12738/17863-
dc.description.abstractNeural networks have become a widely adopted tool for tackling a variety of problems in machine learning and artificial intelligence. In this contribution, we use the mathematical framework of local stability analysis to gain a deeper understanding of the learning dynamics of feedforward neural networks. We derive equations for the tangent operator of the learning dynamics of three-layer networks learning regression tasks. The results are valid for an arbitrary number of nodes and arbitrary choices of activation functions. Applying the results to a network learning a regression task, we investigate numerically how stability indicators relate to the final training loss. Although the specific results vary with different choices of initial conditions and activation functions, we demonstrate that it is possible to predict the final training loss by monitoring finite-time Lyapunov exponents during the training process.en
dc.language.isoenen_US
dc.publisherAmerican Physical Societyen_US
dc.relation.ispartofPhysical review / publ. by The American Institute of Physics. Een_US
dc.subjectDynamical systemsen_US
dc.subjectArtificial neural networksen_US
dc.subjectChaos & nonlinear dynamicsen_US
dc.subject.ddc620: Ingenieurwissenschaftenen_US
dc.titleWeight dynamics of learning networksen
dc.typeArticleen_US
dc.identifier.pmid40534006en
dc.identifier.scopus2-s2.0-105005161529en
dc.description.versionPeerRevieweden_US
tuhh.container.endpage054208-14en_US
tuhh.container.issue5en_US
tuhh.container.startpage054208-1en_US
tuhh.container.volume111en_US
tuhh.oai.showtrueen_US
tuhh.publication.instituteDepartment Maschinenbau und Produktionen_US
tuhh.publication.instituteFakultät Technik und Informatiken_US
tuhh.publisher.doi10.1103/PhysRevE.111.054208-
tuhh.type.opus(wissenschaftlicher) Artikel-
dc.contributor.orcid#NODATA#en
dc.contributor.orcid0000-0002-3510-0429en
dc.contributor.orcid0000-0002-7026-7937en
dc.rights.cchttps://creativecommons.org/licenses/by/4.0/en_US
dc.type.casraiJournal Article-
dc.type.diniarticle-
dc.type.driverarticle-
dc.type.statusinfo:eu-repo/semantics/publishedVersionen_US
dcterms.DCMITypeText-
dc.contributor.departmentcityHamburgen
dc.contributor.departmentcityHamburgen
dc.contributor.departmentcityHamburgen
dc.contributor.departmentcountryGermanyen
dc.contributor.departmentcountryGermanyen
dc.contributor.departmentcountryGermanyen
dc.contributor.departmenturlhttps://api.elsevier.com/content/affiliation/affiliation_id/60032697en
dc.contributor.departmenturlhttps://api.elsevier.com/content/affiliation/affiliation_id/60032697en
dc.contributor.departmenturlhttps://api.elsevier.com/content/affiliation/affiliation_id/60032697en
dc.source.typearen
tuhh.container.articlenumber054208en
dc.funding.number01S19079en
dc.funding.sponsorBundesministerium für Bildung und Forschungen
dc.relation.acronymBMBFen
local.comment.externalarticle number: 054208en_US
item.creatorGNDSharafi, Nahal-
item.creatorGNDMartin, Christoph-
item.creatorGNDHallerberg, Sarah-
item.grantfulltextnone-
item.openairetypeArticle-
item.fulltextNo Fulltext-
item.languageiso639-1en-
item.cerifentitytypePublications-
item.creatorOrcidSharafi, Nahal-
item.creatorOrcidMartin, Christoph-
item.creatorOrcidHallerberg, Sarah-
item.openairecristypehttp://purl.org/coar/resource_type/c_6501-
crisitem.author.deptDepartment Maschinenbau und Produktion-
crisitem.author.deptDepartment Maschinenbau und Produktion-
crisitem.author.deptDepartment Maschinenbau und Produktion-
crisitem.author.parentorgFakultät Technik und Informatik-
crisitem.author.parentorgFakultät Technik und Informatik-
crisitem.author.parentorgFakultät Technik und Informatik-
Appears in Collections:Publications without full text
Show simple item record

Google ScholarTM

Check

HAW Katalog

Check

Add Files to Item

Note about this record


This item is licensed under a Creative Commons License Creative Commons