https://www.mdu.se/

mdu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Enhancing Object Detection for Autonomous Machines in Private Construction Sites Through Federated Learning
RISE Research Institutes of Sweden, School of Innovation, Design and Engineering, Sweden.
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.
RISE Research Institutes of Sweden, School of Innovation, Design and Engineering, Sweden.
2024 (English)In: Proc. - Int. Conf. Comput. Technol. Dev., TechDev, Institute of Electrical and Electronics Engineers (IEEE) , 2024, p. 39-43Conference paper, Published paper (Refereed)
Abstract [en]

A critical enabler of autonomous construction equipment is object detection, a computer vision task integral to navigation, task execution, and safety. However, challenging conditions at construction sites, such as mud splashes, dirt, and vibrations, can degrade object detection performance by causing sensor occlusions and image blurriness. Traditional adversarial training methods, which enhance model robustness by using perturbed data, are limited in construction environments due to the scarcity of diverse real-world adversarial data and the dynamic nature of these sites. Additionally, privacy concerns and site-specific data variability hinder data sharing across different construction sites. To overcome these challenges, this paper explores federated learning as a solution to enhance the robustness and adaptability of object detection models while preserving data privacy. FL enables continuous online learning without direct data exchange, offering a scalable and privacy-preserving approach to training models across diverse construction environments. Experimental results demonstrate that our approach improves model performance on the ConstScene dataset by up to ≈ 4.4% compared to the centralized AI model for object detection.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024. p. 39-43
Keywords [en]
Adversarial Training, Construction Site, Federated Learning, Object Detection, Privacy, Adversarial machine learning, Contrastive Learning, Differential privacy, Generative adversarial networks, Autonomous constructions, Autonomous machines, Condition, Construction environment, Construction sites, Detection performance, Navigation tasks, Objects detection, Task executions
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:mdh:diva-70739DOI: 10.1109/TechDev64369.2024.00016Scopus ID: 2-s2.0-105000707526ISBN: 9798331539771 (print)OAI: oai:DiVA.org:mdh-70739DiVA, id: diva2:1949256
Conference
Proceedings - 2024 13th International Conference on Computer Technologies and Development, TechDev 2024
Available from: 2025-04-02 Created: 2025-04-02 Last updated: 2025-11-21Bibliographically approved
In thesis
1. Enhancing Perception System Robustness Against Attacks and Natural Perturbations
Open this publication in new window or tab >>Enhancing Perception System Robustness Against Attacks and Natural Perturbations
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Deep learning has led to major progress in computer vision, but modern Deep Neural Networks (DNNs) are still highly vulnerable to input perturbations, which limits their robustness in safety-critical applications. This challenge becomes even more critical in real-world industrial environments, such as autonomous machinery operating on construction sites, where visual data is influenced by unpredictable weather conditions, variable lighting, and physical wear and degradation. In addition, data scarcity, privacy constraints, and domain shift prevent the direct application of conventional large-scale training pipelines. 

 

This thesis addresses these challenges by proposing a comprehensive, multi-level framework that strengthens model-level robustness against adversarial attacks, enhances data-level robustness to natural environmental perturbations, and improves adaptive learning under distributed and data-constrained conditions, enabling reliable deployment of visual perception models in complex, safety-critical environments.

 

The first contribution focuses on the robustness of model-level attacks against adversarial attacks. A meta-heuristic search method is proposed to automatically discover activation functions that increase resistance to adversarial perturbations without requiring adversarial training. A hybrid search strategy further improves convergence efficiency, yielding Convolutional Neural Networks (CNNs) that outperform standard architectures under adversarial attacks while maintaining competitive clean-data accuracy.

 

The second contribution introduces ConstScene, a large-scale semantic segmentation dataset representing real and synthetic construction-site imagery under diverse weather and sensor degradation conditions. Experiments reveal significant performance drops when models trained on clean data are exposed to perturbed inputs, demonstrating the need for environment-specific robustness benchmarks.

 

The third contribution introduces an integrated framework that combines Federated Learning (FL) for decentralized collaborative training with Few-Shot Learning (FSL) for sample-efficient domain adaptation, supported by server-side Hyperparameter Optimization (HPO). The proposed approach enables effective model adaptation across distributed construction sites without sharing raw data, significantly improving robustness across heterogeneous client datasets.

 

In general, this thesis proposes three contributions to enhance robustness in perception systems: model-level robustness against adversarial attacks, introducing the ConstScene dataset for benchmarking performance under real-world degradations and data-level robustness against natural perturbations, and an integrated framework enabling decentralized, sample-efficient model adaptation across heterogeneous environments.

Place, publisher, year, edition, pages
Västerås: Mälardalen University, 2026. p. 225
Series
Mälardalen University Press Dissertations, ISSN 1651-4238 ; 452
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-74467 (URN)978-91-7485-736-8 (ISBN)
Public defence
2026-01-23, Gamma, Mälardalens universitet, Västerås, 10:00 (English)
Opponent
Supervisors
Available from: 2025-11-25 Created: 2025-11-21 Last updated: 2026-01-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Salimi, MaghsoodLoni, Mohammad

Search in DiVA

By author/editor
Salimi, MaghsoodLoni, Mohammad
By organisation
Embedded Systems
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 120 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf