https://www.mdu.se/

mdu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Enhancing Drone Surveillance with NeRF: Real-World Applications and Simulated Environments
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems. Saab Aeronautics, Stockholm, Sweden. (HERO)
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems. (HERO)ORCID iD: 0000-0002-3093-1610
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems. (HERO)
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems. (HERO)ORCID iD: 0000-0001-6289-1521
Show others and affiliations
2024 (English)In: 2024 AIAA DATC/IEEE 43rd Digital Avionics Systems Conference (DASC), Institute of Electrical and Electronics Engineers (IEEE), 2024, article id 204263Conference paper, Published paper (Refereed)
Abstract [en]

Machine Learning (ML) systems require representative and diverse datasets to accurately learn the objective task. Insupervised learning data needs to be accurately annotated, whichis an expensive and error-prone process. We present a methodfor generating synthetic data tailored to the use-case achievingexcellent performance in a real-world usecase. We provide amethod for producing automatically annotated synthetic visualdata of multirotor unmanned aerial vehicles (UAV) and otherairborne objects in a simulated environment with a high degreeof scene diversity, from collection of 3D models to generation ofannotated synthetic datasets (synthsets). In our data generationframework SynRender we introduce a novel method of usingNeural Radiance Field (NeRF) methods to capture photo-realistichigh-fidelity 3D-models of multirotor UAVs in order to automatedata generation for an object detection task in diverse environments. By producing data tailored to the real-world setting, ourNeRF-derived results show an advantage over generic 3D assetcollection-based methods where the domain gap between thesimulated and real-world is unacceptably large. In the spirit ofkeeping research open and accessible to the research communitywe release our dataset VISER DroneDiversity used in this project,where visual images, annotated boxes, instance segmentation anddepth maps are all generated for each image sample.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024. article id 204263
Keywords [en]
datasets, neural networks, synthetic data generation, automatic annotation, dataset generation
National Category
Computer graphics and computer vision
Research subject
Computer Science
Identifiers
URN: urn:nbn:se:mdh:diva-69153DOI: 10.1109/DASC62030.2024.10749011ISI: 001453360400088Scopus ID: 2-s2.0-85211243547ISBN: 9798350349610 (print)OAI: oai:DiVA.org:mdh-69153DiVA, id: diva2:1913934
Conference
2024 AIAA DATC/IEEE 43rd Digital Avionics Systems Conference (DASC), San Diego, CA, USA, 29/9-3/10, 2024
Available from: 2024-11-18 Created: 2024-11-18 Last updated: 2025-12-03Bibliographically approved
In thesis
1. Synthetic Data in Data-driven Systems
Open this publication in new window or tab >>Synthetic Data in Data-driven Systems
2025 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Dataset generation is cumbersome yet of great importance for successful training of machine learning models. Collecting real-world data is expensive and sometimes prohibited, considering e.g. safety aspects or legal restrictions. By generating the bulk of training data by synthetic means it is possible to impose arbitrary and extensive scene randomization for increased data diversity.

Methods to quantify similarity between datasets on a statistical level are important tools to detect Out-of-Distribution (OOD) data and domain alignment. We have studied how such methods can be used to correlate model prediction accuracy drop when exposed to OOD-data.

Domain adaptation can be applied as an additional step to synthetic data, to decrease the gap to real world datasets, however it can introduce inadvertent label-flipping, a sort of semantic inconsistency between synthetic source and domain adapted output. Therefore, we pursuit another way of reducing the domain gap, by generating high-fidelity digital representations of real-world scenes and objects. We do this through the use of Neural Radience Fields and Gaussian Splats. These methods allow us to render objects of interest for a detection problem, with the perfect annotation of synthetically produced data, and a high degree of realism which we show improves detection accuracy compared to traditionally generated visual content.

Abstract [sv]

Generering av data för AI-modeller är besvärligt men av stor betydelse för väl-fungerande träning av maskininlärningsmodeller. Att samla in riktig sensordata är dyrt och ibland inte möjligt, med hänsyn till exempelvis säkerhetsaspekter eller juridiska begränsningar. Genom att generera huvuddelen av träningsdata på syntetisk väg är det möjligt att införa omfattande scenrandomisering vilket leder till ökad datadiversifiering. Metoder för att kvantifiera likheter mellan datamängder på statistisk nivå är viktiga verktyg för att identifiera när data ligger utanför den tänkta distributionen. Vi har studerat hur sådana metoder kan användas för att korrelera hur en modellsprecision sjunker när den exponeras för osedd data. Domänanpassning kan tillämpas som ett ytterligare steg till syntetisk data, för att minska gapet till riktig sensordata, men detta kan innebära att man introducerar oavsiktliga annoteringsfel, en sorts semantisk inkonsistens mellan syntetisk källdata och domänanpassad utdata. Därför går vi en annan väg för att minska domängapet genom att generera digitala representationer med hög kvalitet av verkliga scener och föremål. Vi gör detta genom att använda Neural Radience Fields (NeRF) och Gaussiska Splats. Dessa metoder gör det möjligt för oss att skapa objekt av intresse för ett detektionsproblem, med automatisk annotering baserad på syntetiskt framställda data, och en hög grad av realism som vi visar förbättrar detektionsnoggrannheten jämfört med traditionellt genererat visuellt innehåll.

Place, publisher, year, edition, pages
Västerås: Mälardalens Universitet, 2025. p. 186
Series
Mälardalen University Press Licentiate Theses, ISSN 1651-9256 ; 370
Keywords
datasets, neural networks, synthetic data generation, automatic annotation, dataset generation
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-69154 (URN)978-91-7485-689-7 (ISBN)
Presentation
2025-01-30, Delta, Mälardalens universitet, Västerås, 13:00 (English)
Opponent
Supervisors
Available from: 2024-11-18 Created: 2024-11-18 Last updated: 2025-12-03Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Lindén, JoakimBurresi, GiovanniForsberg, HåkanDaneshtalab, Masoud

Search in DiVA

By author/editor
Lindén, JoakimBurresi, GiovanniForsberg, HåkanDaneshtalab, MasoudSöderquist, Ingemar
By organisation
Embedded SystemsMälardalen University
Computer graphics and computer vision

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 595 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf