https://www.mdu.se/

mdu.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Leveraging GANs to Generate Synthetic Log Files forSmart-Troubleshooting in Industry 4.0
Mälardalen University, School of Innovation, Design and Engineering, Innovation and Product Realisation.ORCID iD: 0009-0009-9081-5476
Mälardalen University, School of Innovation, Design and Engineering, Innovation and Product Realisation.ORCID iD: 0000-0002-2833-7196
Mälardalen University, School of Innovation, Design and Engineering, Embedded Systems.ORCID iD: 0000-0002-8027-0611
2024 (English)Conference paper, Published paper (Refereed)
Abstract [en]

In this paper, we tackle the challenge of generatingsynthetic log files using generative adversarial networks to sup-port smart-troubleshooting experimentation. Log files are criticalfor implementing monitoring systems for smart-troubleshooting,as they capture valuable information about the activities andevents occurring within the monitored system. Analyzing theselogs is crucial for effective smart-troubleshooting, enhancing theoverall efficiency, reliability, and security of smart manufacturingprocesses. However, accessing public log data is difficult due toprivacy concerns and the need to protect sensitive information.Moreover, for the purpose of effective troubleshooting, it isessential to have datasets that include fault, error, and failure logsas well as standard logs. In recent years, synthetic log files haveemerged as a promising solution to augment limited real-worlddatasets and facilitate the development and evaluation of anomalydetection techniques. Building on this concept of synthetic data,we have developed a specific log generation technique and datasettailored for testing smart-troubleshooting techniques in heteroge-neous connected systems environments, such as industrial cyber-physical systems and the internet of things. First, we propose amethodology that generates synthetic log files based on generativeadversarial networks. Later, we instantiate this methodologyusing different Generative Adversarial Network implementationsand present a validation and a comprehensive comparativeanalysis of their performance. Eventually, we provide a robustdataset for anomaly detection and threat analysis in cyberspacesecurity. Based on the results of our comparison, CTGAN hasshown superior performance in generating high-quality syntheticlog files.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE) , 2024.
Keywords [en]
Generative Adversarial Network, Synthetic Data, Log Files, Industry 4.0, Smart-Troubleshooting.
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:mdh:diva-69262DOI: 10.1109/SEAA64295.2024.00079ISI: 001413352200069Scopus ID: 2-s2.0-85218629352OAI: oai:DiVA.org:mdh-69262DiVA, id: diva2:1918139
Conference
50th Euromicro Conference Series on Software Engineering and Advanced Applications (SEAA) 2024
Available from: 2024-12-04 Created: 2024-12-04 Last updated: 2025-12-03Bibliographically approved
In thesis
1. Smart-troubleshooting in Industry 4.0 leveraging log files and product information
Open this publication in new window or tab >>Smart-troubleshooting in Industry 4.0 leveraging log files and product information
2025 (English)Licentiate thesis, comprehensive summary (Other academic)
Abstract [en]

Connected internet of things devices are becoming more powerful, yet the chal-lenge of effectively managing them to prevent failures remains ongoing. In somescenarios, such as with devices produced by a single company, it is possibleto use the same fault detection provided by the manufacturer and follow theinstruction for resolving the threat. In other cases, such as with devices producedby different companies, the heterogeneity of devices necessitates a more detailedand complex approach to fault detection. Log files, which are records of eventsor processes generated by a device’s software or hardware, are crucial for moni-toring device behavior. It is important to consider the diversity of log files anddata to detect threats, identify their root causes, and provide effective solutions.In the realm of troubleshooting interconnected internet of things devices, currentsolutions predominantly address homogeneous device environments, whichlimits their scalability and adaptability to diverse device types and configura-tions. Instead, a more flexible approach is needed; one that can accommodatea variety of connected devices while minimizing reliance on specific companyinstructions. One such method is Smart-troubleshooting which involves a 4-stepcycle which include prevention, detection and diagnosis, recovery, and evolutionof threats. Given these premises, the ultimate goal of this research is to definea smart-troubleshooting approach based on log files and product information.By leveraging a generalized methodology, this approach seeks to enhance themanagement of internet of things systems in complex, multi-manufacturer en-vironments. This thesis focuses on a systematic review of log files and thestate of the art in troubleshooting methodologies. During the research, thescarcity of publicly available log files for troubleshooting purposes was identified. Consequently, a method was proposed for generating synthetic log filesusing generative adversarial networks. The proposed methodology leveragesthese log files along with product information to enhance smart-troubleshooting.To validate the approach and gather industry feedback, questionnaires and in-terviews was conducted. Following this, machine learning algorithms will beemployed to implement and refine the proposed method. By leveraging a gener-alized methodology, this approach seeks to improve the management and faultdetection of internet of things systems in complex, heterogeneous environments.

Place, publisher, year, edition, pages
Eskilstuna: Mälardalen University, 2025
Series
Mälardalen University Press Licentiate Theses, ISSN 1651-9256 ; 369
Keywords
Smart-troubleshooting, log analysis, resilience, cyber-physical systems, anomaly detection, machine learning, Industry 4.0.
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:mdh:diva-69307 (URN)978-91-7485-694-1 (ISBN)
Presentation
2025-01-23, C3-003, Mälardalens universitet, Eskilstuna, 09:30 (English)
Opponent
Supervisors
Available from: 2024-12-12 Created: 2024-12-06 Last updated: 2025-10-10Bibliographically approved

Open Access in DiVA

No full text in DiVA

Other links

Publisher's full textScopus

Authority records

Partovian, SaniaFlammini, FrancescoBucaioni, Alessio

Search in DiVA

By author/editor
Partovian, SaniaFlammini, FrancescoBucaioni, Alessio
By organisation
Innovation and Product RealisationEmbedded Systems
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 185 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf