Memory cache hit rate is a performance metric that measures the percentage of memory read requests successfully served from a fast-access cache layer versus requiring a slower primary storage lookup. It is calculated as (Cache Hits / (Cache Hits + Cache Misses)) * 100. A high hit rate indicates efficient cache utilization, reducing memory latency and improving overall agent throughput by minimizing expensive retrievals from primary vector databases or knowledge graphs.
