AutoFactory Dataset to Support AI in Manufacturing Systems.

Data Brief

Systems Engineering and Applications Laboratory, Cadi Ayyad University, ENSA, BP 2390, Marrakech, 40000, Marrakech-Safi, Morocco.

Published: October 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Automating code generation in manufacturing systems requires Artificial Intelligence (AI) models capable of interpreting textual requirement specifications. One of the main challenges is the absence of publicly available, domain-specific datasets suitable for training such models. This article presents AutoFactory, an open-source dataset that includes manually written and LLM-augmented requirement specifications, annotated by domain experts for Named Entity Recognition (NER) tasks using the BIO format. AutoFactory enables the extraction and classification of key industrial components, including actuators, pre-actuators, sensors, effectors, and others. These elements correspond to real-world input and output signals in manufacturing systems. The dataset is grounded in realistic scenarios built with Factory I/O and includes 3D scenes, tag tables, and screenshots captured from multiple camera perspectives. To enhance linguistic diversity while preserving semantic intent, all specifications were expanded using large language models and evaluated through semantic similarity analysis (average cosine similarity above 0.92), complemented by manual validation to ensure consistency. To facilitate the annotation process, a custom labeling tool was developed. It runs locally to preserve data privacy and provides a user-friendly interface. The tool is adaptable to a wide range of NER tasks and allows researchers to customize labels for different domains by adding or modifying tags. To assess the practical value of the dataset, transformer-based models such as BERT and DistilBERT were fine-tuned on the annotated requirement specifications. The highest-performing model reached an F1-score of 0.95, confirming the dataset's ability to support accurate identification of key components in manufacturing systems. This combination of human expertise and AI-based augmentation provides a robust foundation for training AI systems to interpret manufacturing behaviors from text. AutoFactory contributes to ongoing research efforts aimed at enabling intelligent systems to assist in generating control code from requirement specifications, reducing reliance on manual programming in industrial automation.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12363462PMC
http://dx.doi.org/10.1016/j.dib.2025.111938DOI Listing

Publication Analysis

Top Keywords

manufacturing systems
16
requirement specifications
16
ner tasks
8
systems
6
manufacturing
5
specifications
5
autofactory
4
autofactory dataset
4
dataset support
4
support manufacturing
4

Similar Publications

This article explores two complementary strategies for addressing the affordability and access challenges facing advanced therapies. As high development costs and limited market access have led to the withdrawal of several therapies, the article examines how these barriers create 'valleys of death' that prevent innovation from reaching patients. Through the case of Glybera and other examples, it outlines a rehabilitative approach focused on reforming current systems through improved reimbursement schemes, regulatory streamlining, and more efficient manufacturing.

View Article and Find Full Text PDF

Accelerating Transition State Search and Ligand Screening for Organometallic Catalysis with Reactive Machine Learning Potential.

J Chem Theory Comput

September 2025

State Key Laboratory of Fine Chemicals, Frontiers Science Center for Smart Materials Oriented Chemical Engineering, Department of Pharmaceutical Sciences, Institute of Chemical Process Systems Engineering, School of Chemical Engineering, Dalian University of Technology, Dalian 116024, China.

Organometallic catalysis lies at the heart of numerous industrial processes that produce bulk and fine chemicals. The search for transition states and screening for organic ligands are vital in designing highly active organometallic catalysts with efficient reaction kinetics. However, identifying accurate transition states necessitates computationally intensive quantum chemistry calculations.

View Article and Find Full Text PDF

Plastic waste continues to be a major environmental challenge, worsened by energy-intensive conventional recycling methods that require highly pure feedstocks. In this review, emerging electrochemical upcycling technologies are critically examined, focusing on the electro-oxidation transformation of polyethylene terephthalate (PET) into valuable chemical products. Key reaction pathways and target products are outlined to clarify the selective electrochemical reforming of PET.

View Article and Find Full Text PDF

Modern electronic systems are evolving toward miniaturized designs, flexible architectures, and high-power-density requirements. However, progress in developing electrical insulation materials that integrate mechanical robustness, flexibility, and thermal stability remains a critical challenge. This study introduces a novel nacre-inspired aramid-vermiculite nanopaper featuring a 3D interconnected layered network, designed for use in flexible electrical insulating applications.

View Article and Find Full Text PDF

Urinary catheters: state of the art and future perspectives - a narrative review.

Mater Today Bio

October 2025

University of Maribor, Faculty of Medicine, Institute of Biomedical Sciences, Taborska Ulica 8, SI-2000, Maribor, Slovenia.

Catheter associated urinary tract infection (CAUTI) is the most frequent healthcare associated infection, arising from microbial adhesion to catheter surfaces, biofilm development, and the growing problem of antimicrobial resistance. Many publications have addressed CAUTI epidemiology, biofilm biology, or biomaterials for catheters in isolation, yet there is little literature that connects these areas into a coherent translational perspective. This review seeks to fill that gap by combining an overview of biofilm pathophysiology with recent advances in material based innovations for catheter design, including nanostructured and responsive coatings, sensor enabled systems, additive manufacturing, and three dimensional printing.

View Article and Find Full Text PDF