This workflow automates the continuous, multi-modal monitoring of exam sessions by orchestrating specialized agents that analyze video feeds for gaze deviation or extra persons, audio streams for prohibited communication, and browser/desktop activity for unauthorized applications. Each agent outputs a real-time risk score, which an orchestrator fuses into a composite confidence level. This eliminates the unsustainable burden of manual, screen-by-screen human surveillance, converting it into an exception-based model where proctors only review pre-qualified, high-likelihood incidents, dramatically cutting operational cost per exam.




