
Ramon Perez
Abstract
Asset management organizations rely on accurate and consistent failure codes to understand equipment reliability, optimize maintenance strategies, and prioritize high-value interventions. Structured failure data enables engineering and maintenance teams to identify recurring issues, quantify risk, allocate capital effectively, and focus resources on the most impactful reliability improvements. However, in many legacy Maintenance Management Systems, historical corrective maintenance work orders were recorded without structured failure classifications. In this case, more than 20 years of work order history lacked standardized failure codes, creating a significant data gap. Manually reviewing and back-classifying tens of thousands of records would require thousands of labor hours, introduce subjectivity and inconsistency, and still result in incomplete coverage.
This presentation describes a Phase 3 initiative that leverages Large Language Models (LLMs) to automatically classify failure types in power generation maintenance records. The objective was to determine whether modern generative AI systems could accurately interpret free-text corrective work orders and assign structured failure codes aligned to an established failure hierarchy. Data sources included corrective work orders extracted from a Maintenance Management System (Maximo), a multi-level failure classification hierarchy (e.g., failure class, component, cause, mitigation), and domain-specific technical documentation. To improve contextual accuracy and reduce hallucinations, a Retrieval-Augmented Generation (RAG) framework was implemented so that relevant hierarchical definitions and technical references were dynamically supplied to the model at inference time. A few-shot learning strategy was also applied to guide the model with representative labeled examples.
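The RAG and few-shot strategy described above can be illustrated with a minimal sketch: retrieve the hierarchy definitions most relevant to a work order, then assemble them with labeled examples into a classification prompt. All function names, the toy failure hierarchy, and the keyword-overlap scoring below are illustrative assumptions, not the project's actual implementation (which would use an embedding-based retriever and the hierarchy from the CMMS).

```python
# Illustrative sketch of RAG + few-shot prompt assembly.
# The hierarchy, codes, and scoring are hypothetical examples.

def retrieve_definitions(work_order_text, hierarchy, top_k=2):
    """Rank failure-hierarchy definitions by naive keyword overlap
    with the work order text and return the top_k best matches."""
    words = set(work_order_text.lower().split())
    scored = [
        (len(words & set(definition.lower().split())), code, definition)
        for code, definition in hierarchy.items()
    ]
    scored.sort(reverse=True)
    return [(code, definition) for _, code, definition in scored[:top_k]]

def build_prompt(work_order_text, hierarchy, few_shot_examples):
    """Combine retrieved definitions and labeled examples into one prompt."""
    context = "\n".join(
        f"{code}: {definition}"
        for code, definition in retrieve_definitions(work_order_text, hierarchy)
    )
    shots = "\n".join(
        f"Work order: {text}\nFailure code: {label}"
        for text, label in few_shot_examples
    )
    return (
        "Classify the work order using these failure definitions:\n"
        f"{context}\n\nExamples:\n{shots}\n\n"
        f"Work order: {work_order_text}\nFailure code:"
    )

# Hypothetical hierarchy and labeled example:
hierarchy = {
    "BRG-WEAR": "Bearing degradation due to wear or lubrication loss",
    "SEAL-LEAK": "Seal failure causing fluid leakage",
    "ELEC-SHORT": "Electrical short circuit in motor windings",
}
examples = [("Pump bearing running hot, grease depleted", "BRG-WEAR")]
prompt = build_prompt("Oil leaking from pump shaft seal", hierarchy, examples)
```

The assembled prompt supplies only the retrieved slice of the hierarchy, which keeps the context window focused and reduces the model's opportunity to hallucinate codes that were never defined.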
The selected model, Anthropic Claude 3.5 Sonnet, was evaluated using majority-vote methodologies to increase robustness and confidence in final predictions. Multiple inference passes were performed per work order, and consensus-based selection was used to stabilize outputs. Model performance was assessed against a curated evaluation dataset using standard classification metrics. Results demonstrate that LLMs can reliably assign structured failure codes, achieving a 92% F1-score across evaluation datasets. The approach significantly reduces manual effort while maintaining high classification quality, enabling the rapid creation of a historically complete failure dataset that would otherwise take years to produce manually. Beyond retrospective data enrichment, the methodology also shows promise as a quality assurance tool for future human-coded work orders and as a potential replacement for, or augmentation of, traditional failure reporting processes within the CMMS. The presentation discusses methodology design decisions, evaluation results, observed limitations, and areas for improvement, including expanding datasets with more diverse equipment failures, refining prompting and retrieval strategies, and exploring integration with diagnostic tools and industry reporting frameworks such as GADS. Overall, the findings suggest that LLM-driven classification can accelerate asset intelligence initiatives, enhance data-driven maintenance decision-making, and provide scalable, repeatable quality assurance for both historical and future maintenance records in power generation environments.
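The majority-vote step can be sketched in a few lines: run several inference passes per work order and keep the most frequent label, along with an agreement fraction that can flag low-confidence records for review. The `classify_once` callable below is a stand-in for an actual LLM API call, and the simulated responses are hypothetical.

```python
from collections import Counter

def consensus_label(work_order_text, classify_once, n_passes=5):
    """Collect n_passes predictions for one work order and return
    (majority label, fraction of passes that agreed with it)."""
    votes = Counter(classify_once(work_order_text) for _ in range(n_passes))
    label, count = votes.most_common(1)[0]
    return label, count / n_passes

# Simulate a model that is occasionally inconsistent across passes:
responses = iter(["BRG-WEAR", "BRG-WEAR", "SEAL-LEAK", "BRG-WEAR", "BRG-WEAR"])
label, agreement = consensus_label("Pump bearing hot", lambda _: next(responses))
# label == "BRG-WEAR", agreement == 0.8
```

A natural extension, in the quality-assurance spirit described above, is to route any work order whose agreement falls below a threshold to a human reviewer rather than accepting the consensus label automatically.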
Keywords: AI, LLM
Biography of the presenter
Ramon Perez is the Director of AI Solutions at Elder Research, an AI/ML consultancy, where he builds AI-enabled software products to solve challenging industrial problems. Ramon holds a bachelor's degree in engineering from Georgia Tech and master's degrees from Georgetown and Harvard.

