https://www.mdu.se/

mdu.sePublications
Operational message
There are currently operational disruptions. Troubleshooting is in progress.
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
ConstScene: A Dataset and Model for Advancing Robust Semantic Segmentation in Construction Environment
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.
Future Solutions Department, Volvo Construction Equipment, Eskilstuna, Sweden.
Future Solutions Department, Volvo Construction Equipment, Eskilstuna, Sweden.
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.ORCID iD: 0000-0003-0416-1787
Show others and affiliations
2025 (English)In: Pattern Recognition and Artificial Intelligence: 4th International Conference, ICPRAI 2024, Jeju Island, South Korea, July 03-06, 2024, Proceedings, Part II, Springer Nature , 2025, p. 242-253Conference paper, Published paper (Refereed)
Abstract [en]

The increasing demand for autonomous machines in construction environments necessitates the development of robust object detection algorithms that can perform effectively across various weather and environmental conditions. This paper introduces a new semantic segmentation dataset specifically tailored for construction sites, taking into account the diverse challenges posed by adverse weather and environmental conditions. The dataset is designed to enhance the training and evaluation of object detection models, fostering their adaptability and reliability in real-world construction applications. Our dataset comprises annotated images captured under a wide range of different weather conditions, including but not limited to sunny days, rainy periods, foggy atmospheres, and low-light situations. Additionally, environmental factors such as the existence of dirt/mud on the camera lens are integrated into the dataset through actual captures and synthetic generation to simulate the complex conditions prevalent in construction sites. We also generate synthetic images of the annotations including precise semantic segmentation masks for various objects commonly found in construction environments, such as wheel loader machines, personnel, cars, and structural elements. To demonstrate the dataset’s utility, we evaluate state-of-the-art object detection algorithms on our proposed benchmark. The results highlight the dataset’s success in adversarial training models across diverse conditions, showcasing its efficacy compared to existing datasets that lack such environmental variability.

Place, publisher, year, edition, pages
Springer Nature , 2025. p. 242-253
Series
Lecture Notes in Computer Science, ISSN 0302-9743 ; 14893 LNCS
Keywords [en]
Adversarial Attacks, Construction Environment, Dataset, Robust Object Detection, Semantic Segmentation, Adversarial machine learning, Camera lenses, Image annotation, Adverse weather, Autonomous machines, Condition, Construction sites, Environmental conditions, Object detection algorithms
National Category
Computer and Information Sciences
Identifiers
URN: urn:nbn:se:mdh:diva-70418DOI: 10.1007/978-981-97-8705-0_16ISI: 001584476100016Scopus ID: 2-s2.0-85219205516ISBN: 9789819787043 (print)OAI: oai:DiVA.org:mdh-70418DiVA, id: diva2:1944071
Conference
4th International Conference, ICPRAI 2024, Jeju Island, South Korea, July 03-06, 2024
Available from: 2025-03-12 Created: 2025-03-12 Last updated: 2025-12-03Bibliographically approved
In thesis
1. Enhancing Perception System Robustness Against Attacks and Natural Perturbations
Open this publication in new window or tab >>Enhancing Perception System Robustness Against Attacks and Natural Perturbations
2026 (English)Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Deep learning has led to major progress in computer vision, but modern Deep Neural Networks (DNNs) are still highly vulnerable to input perturbations, which limits their robustness in safety-critical applications. This challenge becomes even more critical in real-world industrial environments, such as autonomous machinery operating on construction sites, where visual data is influenced by unpredictable weather conditions, variable lighting, and physical wear and degradation. In addition, data scarcity, privacy constraints, and domain shift prevent the direct application of conventional large-scale training pipelines. 

 

This thesis addresses these challenges by proposing a comprehensive, multi-level framework that strengthens model-level robustness against adversarial attacks, enhances data-level robustness to natural environmental perturbations, and improves adaptive learning under distributed and data-constrained conditions, enabling reliable deployment of visual perception models in complex, safety-critical environments.

 

The first contribution focuses on the robustness of model-level attacks against adversarial attacks. A meta-heuristic search method is proposed to automatically discover activation functions that increase resistance to adversarial perturbations without requiring adversarial training. A hybrid search strategy further improves convergence efficiency, yielding Convolutional Neural Networks (CNNs) that outperform standard architectures under adversarial attacks while maintaining competitive clean-data accuracy.

 

The second contribution introduces ConstScene, a large-scale semantic segmentation dataset representing real and synthetic construction-site imagery under diverse weather and sensor degradation conditions. Experiments reveal significant performance drops when models trained on clean data are exposed to perturbed inputs, demonstrating the need for environment-specific robustness benchmarks.

 

The third contribution introduces an integrated framework that combines Federated Learning (FL) for decentralized collaborative training with Few-Shot Learning (FSL) for sample-efficient domain adaptation, supported by server-side Hyperparameter Optimization (HPO). The proposed approach enables effective model adaptation across distributed construction sites without sharing raw data, significantly improving robustness across heterogeneous client datasets.

 

In general, this thesis proposes three contributions to enhance robustness in perception systems: model-level robustness against adversarial attacks, introducing the ConstScene dataset for benchmarking performance under real-world degradations and data-level robustness against natural perturbations, and an integrated framework enabling decentralized, sample-efficient model adaptation across heterogeneous environments.

Place, publisher, year, edition, pages
Västerås: Mälardalen University, 2026. p. 225
Series
Mälardalen University Press Dissertations, ISSN 1651-4238 ; 452
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-74467 (URN)978-91-7485-736-8 (ISBN)
Public defence
2026-01-23, Gamma, Mälardalens universitet, Västerås, 10:00 (English)
Opponent
Supervisors
Available from: 2025-11-25 Created: 2025-11-21 Last updated: 2026-01-02Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Salimi, MaghsoodAfshar, SaraCicchetti, AntonioSirjani, Marjan

Search in DiVA

By author/editor
Salimi, MaghsoodAfshar, SaraCicchetti, AntonioSirjani, Marjan
By organisation
Embedded Systems
Computer and Information Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
isbn
urn-nbn

Altmetric score

doi
isbn
urn-nbn
Total: 138 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf