Automatic Transformation of natural language requirements to PLCOPen XML using Large Language Models (LLM)
2025 (English)Independent thesis Advanced level (degree of Master (Two Years)), 30 credits / 45 HE credits
Student thesisAlternative title
Automatic Transformation of natural language requirements to Programmable Logic Controller Open Extensible Markup Language using Large Language Models (LLM) (English)
Abstract [en]
The transformation from requirements engineering (RE) into design artifacts presents a valuable opportunity for software development in contexts such as industrial automation. This thesis explores the feasibility of using Natural Language Processing (NLP) through Large Language Models (LLMs) to convert textual requirements into structured outputs, such as Function Block Diagrams (FBDs) in PLCopen XML format. The research aims to mitigate the challenges of manual transformation processes, including inefficiencies, ambiguity, and the growing complexity of software systems.
Leveraging offline, portable, and open-source LLMs, the study evaluates their ability to generate accurate and semantically aligned structured FBD outputs. Experiments were conducted using synthetic datasets with varying levels of complexity and interdependence. Results demonstrate that larger models excel in producing high-quality outputs, particularly when supported by prompt engineering techniques. However, these models face challenges in handling specific nuances. Smaller models, while faster, exhibit limitations in managing complex or ambiguous requirements.
This thesis contributes to the field by providing insights into the practical deployment of AI-driven solutions for RE (NLP4RE), emphasizing the importance of balancing automation with human oversight. The work highlights future research opportunities, including optimizing resource-efficient models, expanding datasets to real-world scenarios, and posting all the code used in public repositories to enhance reproducibility.
Place, publisher, year, edition, pages
2025. , p. 81
Keywords [en]
Automatic transformation, natural language processing, NLP, large language models, LLM, requirements engineering, RE, PLCopen XML, function block diagrams, FBD, software automation, industrial automation, prompt engineering, AI-driven requirements transformation, text-to-code generation, semantic analysis, computational constraints, domain-specific AI, hybrid AI models, clustering methods, structured output generation, model fine-tuning, resource-efficient AI, synthetic datasets, ambiguity resolution, AI in railway industry, engineering AI, software design automation, automated requirements processing, AI-driven software development, explainability in AI, privacy-preserving AI, NLP4RE, software verification, AI model evaluation, structured data generation, AI-assisted software engineering, text-to-XML transformation, model performance metrics, BLEU score, METEOR score, ROUGE score, AI for industrial applications, machine learning for requirements engineering, AI-powered software pipelines, automated software documentation.
National Category
Electrical Engineering, Electronic Engineering, Information Engineering Computer Systems
Identifiers
URN: urn:nbn:se:mdh:diva-70073OAI: oai:DiVA.org:mdh-70073DiVA, id: diva2:1935490
External cooperation
Alstom, Västerås, Sweden
Subject / course
Computer Science
Presentation
2025-01-30, Universitetsplan 1, 722 20 Västerås, Sweden, Västerås, 23:01 (English)
Supervisors
Examiners
2025-02-172025-02-062025-02-17Bibliographically approved