AI Latency Elevated

Description

Severity: Info

A significant increase in AI response latency has been detected.

This could impact user experience, slow down automated workflows, and indicate performance bottlenecks.

Remediation

Investigate potential causes such as increased computational load, external dependencies, or system performance issues.

Security Frameworks

EU AI Act

EU-AI-ACT-AIA-012: AIA-012: Automatic record-keeping (logging)

Technically allow for the automatic recording of events ('logs') over the lifetime of the system to ensure traceability of functioning appropriate to the intended purpose; for Annex III(1)(a) systems, logs must include the period of use, reference database, input data leading to a match, and identification of natural persons involved in result verification.

EU-AI-ACT-AIA-026: AIA-026: Deployer obligations for high-risk AI

Take appropriate technical and organisational measures to use the system in accordance with the instructions for use; assign human oversight to competent, trained, supported natural persons; ensure input data is relevant and sufficiently representative for the intended purpose (to the extent the deployer controls data); monitor operation; suspend use and inform provider/distributor/authorities where risk under Art 79(1) is identified or after a serious incident; keep automatically generated logs for ≥6 months; inform workers and representatives prior to workplace deployment; comply with GDPR DPIA obligations; for law-enforcement use, register in EU database; inform persons subject to decisions; cooperate with authorities.

EU-AI-ACT-AIA-072: AIA-072: Post-market monitoring by providers

Establish and document a post-market monitoring system proportionate to the nature of AI technologies and risks; actively and systematically collect, document and analyse data on performance throughout the lifetime of the high-risk system; evaluate continuous compliance with Section 2 requirements. Implement based on a post-market monitoring plan (template to be provided by the Commission).

OWASP AI 2025

LLM10:2025 Unbounded Consumption

Unbounded Consumption occurs when a Large Language Model (LLM) application allows users to conduct excessive and uncontrolled inferences, leading to risks such as denial of service (DoS), economic losses, model theft, and service degradation.

MITRE ATLAS

AML.T0029: Denial of ML Service

Adversaries may target machine learning systems with a flood of requests for the purpose of degrading or shutting down the service. Since many machine learning systems require significant amounts of specialized compute, they are often expensive bottlenecks that can become overloaded. Adversaries can intentionally craft inputs that require heavy amounts of useless compute from the machine learning system.

AML.T0034: Cost Harvesting

Adversaries may target different machine learning services to send useless queries or computationally expensive inputs to increase the cost of running services at the victim organization. Sponge examples are a particular type of adversarial data designed to maximize energy consumption and thus operating cost.

AI Latency Elevated

Description

Remediation

Security Frameworks

EU AI Act

EU-AI-ACT-AIA-012: AIA-012: Automatic record-keeping (logging)

EU-AI-ACT-AIA-026: AIA-026: Deployer obligations for high-risk AI

EU-AI-ACT-AIA-072: AIA-072: Post-market monitoring by providers

OWASP AI 2025

LLM10:2025 Unbounded Consumption

MITRE ATLAS

AML.T0029: Denial of ML Service

AML.T0034: Cost Harvesting

NIST AI RMF

NIST-600-MES-2.4: Measure 2.4

NIST-600-MNG-4.1: Manage 4.1

ISO/IEC 42001

ISO-42001-A.6.2.6: A.6.2.6: AI System Operation and Monitoring

ISO-42001-A.6.2.8: A.6.2.8: AI System Recording of Event Logs

OWASP Agentic AI 2026

ASI08:2026 Cascading Failures

Need help?