Replay Vulnerability

Updated: May 5, 2026

Description

Severity: High

The AI model is vulnerable to the replay attack, where previous interactions or outputs can be reused in new contexts, potentially causing unintended data leakage or the generation of inappropriate responses.

This vulnerability occurs when the model inadvertently reuses prior responses that may have been part of a confidential or sensitive conversation.

Example Attack

If attackers successfully perform a replay attack, they may be able to extract sensitive or confidential information from previous outputs. This could lead to data leakage, violation of privacy policies, or unauthorized access to personal or sensitive data. Additionally, replayed content may be used to manipulate the AI into generating harmful or unethical responses.

Remediation

Investigate and enhance the effectiveness of guardrails and output security mechanisms to prevent the model from inadvertently reusing previous responses inappropriately. Implement stricter output controls, such as preventing the model from repeating or referencing previous interactions unless explicitly permitted. Regular audits should be conducted to ensure the model does not reuse potentially sensitive information across different sessions.

Security Frameworks

OWASP AI 2025

LLM02:2025 Sensitive Information Disclosure

Sensitive information can affect both the LLM and its application context. This includes personal identifiable information (PII), financial details, health records, confidential business data, security credentials, and legal documents. Proprietary models may also have unique training methods and source code considered sensitive, especially in closed or foundation models.

MITRE ATLAS

AML.T0057: LLM Data Leakage

Adversaries may craft prompts that induce the LLM to leak sensitive information. This can include private user data or proprietary information. The leaked information may come from proprietary training data, data sources the LLM is connected to, or information from other users of the LLM.

Replay Vulnerability

Description

Example Attack

Remediation

Security Frameworks

OWASP AI 2025

LLM02:2025 Sensitive Information Disclosure

MITRE ATLAS

AML.T0057: LLM Data Leakage

NIST AI RMF

NIST-600-MES-2.7: Measure 2.7

NIST-600-MES-2.10: Measure 2.10

NIST-600-MNG-4.1: Manage 4.1

ISO/IEC 42001

ISO-42001-A.6.2.4: A.6.2.4: AI System Verification and Validation

ISO-42001-A.6.2.6: A.6.2.6: AI System Operation and Monitoring

ISO-42001-A.9.4: A.9.4: Intended Use of the AI System

OWASP Agentic AI 2026

ASI01:2026 Agent Goal Hijack

ASI06:2026 Memory and Context Poisoning

Need help?