AIR-OP-005

Foundation Model Versioning

Summary

Foundation model instability refers to unpredictable changes in model behaviour over time due to external factors such as version updates, system prompt modifications, or provider changes. Unlike inherent non-determinism (ri-6), this instability stems from upstream modifications that alter the model's fundamental behaviour patterns. Such variability can undermine testing, reliability, and trust when no version control or change notification mechanisms are in place.

Description

Model providers frequently improve and update their foundation models, which may involve retraining, fine-tuning, or architecture changes. These updates, if applied without explicit notification or without allowing version pinning, can lead to shifts in behaviour even when inputs remain unchanged.
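Where the provider supports it, pinning a dated model snapshot rather than a floating alias is the primary defence. Below is a minimal sketch using the OpenAI Python SDK; the snapshot identifier is illustrative, and other providers offer equivalent mechanisms under different names.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Floating alias: "gpt-4o" silently tracks whatever the provider currently
# designates as the default snapshot.
# Pinned snapshot: a dated identifier the provider commits not to change
# (illustrative name; check the provider's current model list).
PINNED_MODEL = "gpt-4o-2024-08-06"

response = client.chat.completions.create(
    model=PINNED_MODEL,
    messages=[{"role": "user", "content": "Summarise our refund policy."}],
    temperature=0,  # reduces sampling variability; does not eliminate it
)
print(response.choices[0].message.content)
```

Version updates are only one of several upstream factors that can shift behaviour: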

  • System Prompt Modifications: Many models operate with a hidden or implicit system prompt—a predefined set of instructions that guides the model’s tone, formatting, or safety behaviour. Changes to this internal prompt (e.g., for improved safety or compliance) can alter model outputs subtly or significantly, even if user inputs remain identical.

  • Context Window Effects: Model behaviour may vary depending on the total length and structure of input context, including position in the token window. Outputs can shift when prompts are rephrased, rearranged, or extended—even if core semantics are preserved.

  • Deployment Environment or API Changes: Changes in model deployment infrastructure (e.g., hardware, quantization, tokenization behaviour) or API defaults can also affect behaviour, particularly for latency-sensitive or performance-critical applications (a defensive sketch follows this list).
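A partial mitigation for the system prompt and API default concerns above is to make the request as explicit as the API allows: supply your own system message rather than relying on hidden defaults, and fix every sampling parameter the application depends on. A hedged sketch, again assuming an OpenAI-style chat API:

```python
from openai import OpenAI

client = OpenAI()

# An explicit system message reduces (but does not remove) dependence on
# the provider's hidden default prompt, which can change without notice.
SYSTEM_PROMPT = (
    "You are a support assistant for Acme Corp. "
    "Answer in plain English, in at most three sentences."
)

response = client.chat.completions.create(
    model="gpt-4o-2024-08-06",  # pinned snapshot (illustrative)
    temperature=0,              # fix sampling parameters explicitly
    top_p=1,
    max_tokens=256,
    messages=[
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": "How do I reset my password?"},
    ],
)
print(response.choices[0].message.content)
```

Provider-side safety and moderation layers still sit outside this request, so explicit prompting narrows the gap rather than closing it.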

Versioning Challenges

LLM versioning is uniquely difficult due to:

  • Scale and Complexity: Massive parameter counts make tracking changes challenging
  • Dynamic Updates: Continuous learning and fine-tuning blur discrete version boundaries
  • Multidimensional Changes: Updates span architecture, training data, and inference parameters
  • Resource Constraints: Running multiple versions simultaneously strains infrastructure
  • No Standards: Lack of accepted versioning practices across organizations (a home-grown manifest sketch follows this list)
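Given the last point, teams commonly define their own version manifest recording every upstream dimension the application depends on. The record below is a hypothetical sketch rather than any accepted standard; all field names are illustrative.

```python
import hashlib
import json
from dataclasses import asdict, dataclass

@dataclass(frozen=True)
class ModelManifest:
    """One snapshot of the upstream configuration the application was tested against."""
    provider: str               # e.g. "openai"
    model_id: str               # pinned snapshot identifier
    api_version: str            # API/SDK version used during qualification
    system_prompt_sha256: str   # hash of the system prompt actually sent
    temperature: float
    top_p: float

    def fingerprint(self) -> str:
        """Stable short hash of the manifest, suitable for logging with each call."""
        payload = json.dumps(asdict(self), sort_keys=True).encode()
        return hashlib.sha256(payload).hexdigest()[:16]

prompt = "You are a support assistant for Acme Corp."
manifest = ModelManifest(
    provider="openai",
    model_id="gpt-4o-2024-08-06",
    api_version="2024-10",
    system_prompt_sha256=hashlib.sha256(prompt.encode()).hexdigest(),
    temperature=0.0,
    top_p=1.0,
)
print(manifest.fingerprint())
```

Logging the fingerprint with every call makes it possible to say, after the fact, exactly which upstream configuration produced a given output.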

Relying on the model provider for evaluation, particularly for fast-evolving model types such as code generation, places the burden of behavioural consistency entirely on that provider. Any change introduced upstream, whether explicitly versioned or not, can impact downstream system reliability.

If foundation model behaviour changes over time, whether through lack of version pinning, absence of rigorous provider-side version control, or silent model updates, it can compromise system testing and reproducibility. This, in turn, may affect critical business operations and decisions taken on the basis of model output.
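A common counter-measure is an independent behavioural regression suite: a fixed set of golden prompts run on a schedule, with outputs compared against baselines captured when the system was qualified. The sketch below uses exact-match comparison and a hypothetical call_model adapter; real suites usually score semantic similarity instead, since exact match is brittle against benign variation.

```python
import json

def call_model(prompt: str) -> str:
    # Hypothetical adapter: wrap whichever pinned-model client is in use.
    raise NotImplementedError("wrap your pinned-model client here")

def run_regression(golden_path: str) -> list[str]:
    """Return the prompts whose current output has drifted from baseline.

    golden_path points at a JSON list of {"prompt": ..., "expected": ...}
    records captured at qualification time.
    """
    with open(golden_path) as f:
        cases = json.load(f)

    drifted = []
    for case in cases:
        actual = call_model(case["prompt"])
        if actual.strip() != case["expected"].strip():
            drifted.append(case["prompt"])
    return drifted

if __name__ == "__main__":
    for prompt in run_regression("golden_prompts.json"):
        print("drifted:", prompt)
```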

The model provider may alter the model or its configuration without explicit customer notification. Such silent changes can result in outputs that deviate from tested expectations. Even when mechanisms for version pinning are offered, the inherent non-determinism of these systems means that output variability remains a risk.
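Even with pinning in place, it is worth recording whatever backend-identity metadata the provider returns, so that silent changes are at least observable after the fact. OpenAI chat completions, for instance, expose a system_fingerprint field describing the backend configuration that served the request; availability varies by model and provider, so treat the sketch below as an assumption to verify against current documentation.

```python
import logging
from openai import OpenAI

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("model-calls")

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4o-2024-08-06",  # pinned snapshot (illustrative)
    seed=12345,                 # best-effort determinism where supported
    temperature=0,
    messages=[{"role": "user", "content": "Ping"}],
)

# A changed fingerprint under an unchanged model id signals that something
# shifted upstream. The field may be None for some models.
log.info("model=%s fingerprint=%s", response.model, response.system_fingerprint)
```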

Another source of instability is prompt perturbation. Recent research highlights how even minor variations in phrasing can significantly alter outputs and, in some cases, be exploited to attack model grounding or circumvent safeguards, thereby introducing further unpredictability and risk.
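The same sensitivity can be probed defensively: run a small set of semantically equivalent paraphrases and flag prompts whose answers diverge. The sketch below uses naive exact comparison and the same hypothetical call_model adapter; published evaluations typically measure divergence with embedding similarity or task-specific metrics.

```python
def call_model(prompt: str) -> str:
    # Hypothetical adapter: wrap whichever pinned-model client is in use.
    raise NotImplementedError("wrap your pinned-model client here")

def is_phrasing_stable(paraphrases: list[str]) -> bool:
    """Return True if all semantically equivalent phrasings yield one answer.

    Naive exact comparison; real checks usually score semantic similarity
    between outputs instead.
    """
    outputs = {call_model(p).strip().lower() for p in paraphrases}
    return len(outputs) == 1

variants = [
    "What is our maximum refund window, in days?",
    "In days, what is the longest period in which a refund is allowed?",
    "How many days does a customer have to request a refund?",
]
print("stable" if is_phrasing_stable(variants) else "sensitive to phrasing")
```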

Impact of Inadequate Versioning

Poor versioning practices exacerbate instability risks and create additional operational challenges:

  • Inconsistent Output: Models may produce different responses to identical prompts, leading to inconsistent user experiences and unreliable decision-making
  • Reproducibility Issues: Inability to replicate or trace past outputs complicates testing, debugging, and audits
  • Performance Variability: Unexpected shifts in model performance may introduce regressions or new biases while making genuine improvements difficult to assess
  • Compliance and Auditing: Inability to track and explain model changes creates compliance problems and difficulties in auditing AI-driven decisions (a minimal logging sketch follows this list)
  • Integration Challenges: Other systems that depend on specific model behaviours may break when models are updated without proper versioning
  • Security and Privacy: Difficulty tracking security vulnerabilities or privacy issues, with new problems potentially introduced during updates
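For the reproducibility and compliance points above, the minimum viable control is an append-only audit log that ties every output to the exact model identity that produced it. This is a hypothetical sketch; field names are illustrative, and a production system would write to durable, tamper-evident storage.

```python
import datetime
import json
import uuid

def audit_record(model_id: str, fingerprint: str, prompt: str, output: str) -> str:
    """Serialise one model invocation so a past decision can be traced
    to the model version and backend that produced it."""
    return json.dumps({
        "id": str(uuid.uuid4()),
        "timestamp": datetime.datetime.now(datetime.timezone.utc).isoformat(),
        "model_id": model_id,
        "backend_fingerprint": fingerprint,
        "prompt": prompt,
        "output": output,
    })

# A local file stands in for durable audit storage in this sketch.
with open("model_audit.log", "a") as f:
    f.write(audit_record("gpt-4o-2024-08-06", "fp_abc123",
                         "How do I reset my password?",
                         "Use the 'Forgot password' link on the sign-in page.") + "\n")
```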