Please use this identifier to cite or link to this item: https://doi.org/10.48441/4427.1962
DC Field | Value | Language
dc.contributor.author | Witte, Anja | -
dc.contributor.author | Lange, Sascha | -
dc.contributor.author | Lins, Christian | -
dc.date.accessioned | 2024-10-17T13:36:44Z | -
dc.date.available | 2024-10-17T13:36:44Z | -
dc.date.issued | 2024-08-23 | -
dc.identifier.issn | 2731-667X | en_US
dc.identifier.uri | https://hdl.handle.net/20.500.12738/16387 | -
dc.description.abstract | The amount of labelled data in industrial use cases is limited because the annotation process is time-consuming and costly. Since self-supervised pretraining such as MAE has enabled training segmentation models with fewer labels in research, it is also an interesting direction for industry. The reduction in required labels is achieved by pretraining on large amounts of unlabelled images with the aim of learning image features. This paper analyses the influence of MAE pretraining on the efficiency of label usage for semantic segmentation with UNETR, investigated for the use case of log-yard cranes. Additionally, two transfer learning cases, with respect to crane type and perspective, are considered in the context of label-efficiency. The results show that MAE is successfully applicable to the use case. With respect to segmentation, an IoU improvement of 3.26% is reached while using 2000 labels. Across all experiments, the strongest positive influence is found at lower label amounts. The highest effect is achieved with transfer learning regarding cranes, where IoU and Recall increase by about 4.31% and 8.58%, respectively. Further analyses show that the improvements result from a better distinction between the background and the segmented crane objects. | en
dc.language.iso | en | en_US
dc.publisher | Springer | en_US
dc.relation.ispartof | Industrial artificial intelligence | en_US
dc.subject | Masked autoencoder | en_US
dc.subject | Self-supervised pretraining | en_US
dc.subject | Semantic segmentation | en_US
dc.subject | UNETR | en_US
dc.subject | Label-efficiency | en_US
dc.subject | Log-yard cranes | en_US
dc.subject.ddc | 004: Informatik | en_US
dc.title | Masked autoencoder: influence of self-supervised pretraining on object segmentation in industrial images | en
dc.type | Article | en_US
dc.identifier.doi | 10.48441/4427.1962 | -
dc.description.version | PeerReviewed | en_US
openaire.rights | info:eu-repo/semantics/openAccess | en_US
tuhh.container.issue | 1 | en_US
tuhh.container.volume | 2 | en_US
tuhh.identifier.urn | urn:nbn:de:gbv:18302-reposit-195713 | -
tuhh.oai.show | true | en_US
tuhh.publication.institute | Department Informatik | en_US
tuhh.publication.institute | Fakultät Technik und Informatik | en_US
tuhh.publisher.doi | 10.1007/s44244-024-00020-y | -
tuhh.type.opus | (wissenschaftlicher) Artikel | -
tuhh.type.rdm | true | -
dc.rights.cc | https://creativecommons.org/licenses/by/4.0/ | en_US
dc.type.casrai | Journal Article | -
dc.type.dini | article | -
dc.type.driver | article | -
dc.type.status | info:eu-repo/semantics/publishedVersion | en_US
dcterms.DCMIType | Text | -
tuhh.container.articlenumber | 7 (2024) | en_US
local.comment.external | Witte, A., Lange, S. & Lins, C. Masked autoencoder: influence of self-supervised pretraining on object segmentation in industrial images. Industrial Artificial Intelligence 2, 7 (2024). https://doi.org/10.1007/s44244-024-00020-y | en_US
tuhh.apc.status | false | en_US
item.creatorGND | Witte, Anja | -
item.creatorGND | Lange, Sascha | -
item.creatorGND | Lins, Christian | -
item.grantfulltext | open | -
item.languageiso639-1 | en | -
item.fulltext | With Fulltext | -
item.creatorOrcid | Witte, Anja | -
item.creatorOrcid | Lange, Sascha | -
item.creatorOrcid | Lins, Christian | -
item.cerifentitytype | Publications | -
item.openairetype | Article | -
item.openairecristype | http://purl.org/coar/resource_type/c_6501 | -
crisitem.author.dept | Department Informatik | -
crisitem.author.orcid | 0000-0003-3714-0069 | -
crisitem.author.parentorg | Fakultät Technik und Informatik | -
Appears in Collections: Publications with full text
Files in This Item:
File | Description | Size | Format
2024_Witte_MaskedAutoencoder.pdf | - | 4.54 MB | Adobe PDF
This item is licensed under a Creative Commons Attribution 4.0 International License.