When a rare fault occurs at one plant, the diagnostic process is often isolated, forcing engineers to solve novel problems with only local data. This leads to extended downtime, repeated engineering effort, and preventable production losses across the corporate fleet. A collaborative multi-agent workflow automates this by allowing a local diagnostic agent to securely query a central, anonymized knowledge graph of historical failures, maintenance actions, and sensor signatures from all other sites. The operational upside comes from converting one plant's solved problem into a reusable corporate asset, dramatically reducing mean-time-to-repair for rare but recurring faults.




