Functional Safety for Intelligent Systems: Practitioner Basics
Engineering intelligent systems that operate reliably within complex, dynamic environments requires a disciplined approach to safety that transcends traditional component-level reliability. For professionals integrating AI, robotics, and advanced data processing into physical or critical digital workflows, understanding functional safety is not merely a technical checkbox; it is a foundational governance requirement. This article explores the core concepts of functional safety as they apply to intelligent systems, contextualising them within the European regulatory landscape and the practical realities of evidence generation and verification.
From Reliability to Functional Safety: A Paradigm Shift
Historically, safety engineering focused heavily on the reliability of individual components. The assumption was that if a part did not fail, the system would remain safe. While component reliability remains crucial, modern intelligent systems introduce complexities that this view cannot adequately address. A system can consist of perfectly reliable components yet still be unsafe if its logic, its interaction with the environment, or its response to unexpected conditions leads to hazardous outcomes. This is the domain of Functional Safety.
Functional safety is the part of the overall safety of a system or equipment that depends on a system or equipment operating correctly in response to its inputs, including the management of potential failures. It is not concerned with the intrinsic safety of a substance (like the chemical stability of a battery) but rather with the safety achieved by the correct execution of safety functions. For intelligent systems, this means ensuring that perception algorithms, decision-making logic, and actuation mechanisms collectively prevent or mitigate hazardous situations.
The distinction is critical for AI and robotics. A machine learning model might be highly accurate in training but can exhibit unpredictable behaviour in novel scenarios. Functional safety demands that we design systems that remain safe even when these components behave unexpectedly or when external conditions deviate from the norm. It shifts the focus from “will this component fail?” to “what happens if this component provides an incorrect output, and how does the system ensure safety regardless?”
The European Regulatory Context
In Europe, functional safety is not a suggestion; it is a legal requirement for a vast array of products placed on the market. The primary regulatory driver for machinery is the EU Machinery Regulation (EU) 2023/1230, which replaces the Machinery Directive (2006/42/EC) and explicitly integrates functional safety requirements for machinery control systems. For products bearing a CE mark under this regulation, meeting the essential health and safety requirements is mandatory; applying the relevant harmonised standards is the usual way to demonstrate this and provides a presumption of conformity.
The harmonised standards that underpin these regulations are largely derived from the IEC 61508 standard for generic functional safety. This standard provides the umbrella framework. However, for specific sectors, derivative standards apply:
- ISO 13849 for machinery safety (the PL – Performance Level – concept).
- ISO 26262 for automotive (the ASIL – Automotive Safety Integrity Level – concept).
- IEC 62304 for medical device software.
While these standards were developed for deterministic systems, their principles are increasingly being interpreted and adapted for systems incorporating AI and machine learning. The regulatory expectation is that if you claim your product is safe, you must provide objective evidence. For intelligent systems, this evidence generation is the central challenge.
Core Concepts: Hazards, Risks, and Safety Functions
The journey to functional safety begins with a rigorous analysis of what can go wrong. This is not a cursory glance; it is a systematic process mandated by standards and regulations.
Hazard Analysis and Risk Assessment (HARA)
The first step is identifying potential hazards associated with the system’s operation. A hazard is a potential source of harm. For an autonomous mobile robot in a warehouse, a hazard could be “collision with a human operator.” For an AI-driven medical diagnostic tool, a hazard could be “failure to detect a critical abnormality.”
Once hazards are identified, a risk assessment is performed. This involves evaluating the severity of the potential harm, the frequency or duration of exposure to the hazardous situation, and the possibility of avoiding or limiting the harm. In ISO 13849 these parameters (S, F, P) lead to a required Performance Level (PLr); in automotive, ISO 26262 rates severity, exposure, and controllability (S, E, C) to determine the ASIL.
Key Principle: The level of rigour required for safety measures (and the evidence needed to prove them) is directly proportional to the calculated risk. A low-risk application may require basic safety measures; a high-risk application demands a comprehensive, documented, and verified safety architecture.
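To illustrate how risk parameters translate into a required level of rigour, the sketch below encodes a simplified form of the ISO 13849-1 risk graph (severity S1/S2, frequency of exposure F1/F2, possibility of avoidance P1/P2) as a lookup table. It is an illustration only; the standard's own decision graph and its accompanying notes remain authoritative.

```python
# Simplified sketch of the ISO 13849-1 risk graph: maps the risk parameters
# S (severity), F (frequency/exposure) and P (possibility of avoidance)
# to a required Performance Level (PLr). Illustrative only; consult the
# standard's decision graph for the authoritative assignment.

RISK_GRAPH = {
    ("S1", "F1", "P1"): "a",
    ("S1", "F1", "P2"): "b",
    ("S1", "F2", "P1"): "b",
    ("S1", "F2", "P2"): "c",
    ("S2", "F1", "P1"): "c",
    ("S2", "F1", "P2"): "d",
    ("S2", "F2", "P1"): "d",
    ("S2", "F2", "P2"): "e",
}

def required_performance_level(severity: str, frequency: str, avoidance: str) -> str:
    """Return the PLr (a..e) for the given risk-graph parameters."""
    return RISK_GRAPH[(severity, frequency, avoidance)]

# Example: serious injury (S2), frequent exposure (F2), avoidance scarcely
# possible (P2) leads to PLr e, the most demanding level.
print(required_performance_level("S2", "F2", "P2"))  # -> "e"
```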
For intelligent systems, the risk assessment must account for the unique failure modes of AI. These include:
- Out-of-distribution inputs: The system encounters data unlike anything seen in training.
- Edge cases: Rare but critical scenarios (e.g., a pedestrian partially obscured by heavy rain and glare).
- Adversarial attacks: Deliberate manipulation of inputs to cause misclassification.
- Concept drift: The statistical properties of the target variable change over time.
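Two of these failure modes lend themselves to runtime monitoring. The sketch below, a minimal illustration assuming a feature-based perception pipeline and hand-picked thresholds, flags inputs that fall far outside the training distribution and signals when recent prediction statistics drift away from the validation baseline.

```python
import numpy as np

# Illustrative runtime monitors for two AI failure modes discussed above.
# The thresholds and statistics are assumptions chosen for the example.

class DistributionMonitor:
    def __init__(self, training_features: np.ndarray, z_threshold: float = 4.0):
        # Summarise the training distribution per feature dimension.
        self.mean = training_features.mean(axis=0)
        self.std = training_features.std(axis=0) + 1e-9
        self.z_threshold = z_threshold

    def is_out_of_distribution(self, features: np.ndarray) -> bool:
        # Flag inputs whose features lie far outside the training range.
        z_scores = np.abs((features - self.mean) / self.std)
        return bool(z_scores.max() > self.z_threshold)

def detect_drift(baseline_scores: np.ndarray, recent_scores: np.ndarray,
                 max_mean_shift: float = 0.1) -> bool:
    # Crude concept-drift indicator: has the mean model confidence shifted
    # by more than an agreed tolerance since validation?
    return abs(recent_scores.mean() - baseline_scores.mean()) > max_mean_shift

# Example usage with synthetic data.
rng = np.random.default_rng(0)
train = rng.normal(size=(1000, 8))
monitor = DistributionMonitor(train)
print(monitor.is_out_of_distribution(rng.normal(size=8)))  # typically False
print(monitor.is_out_of_distribution(np.full(8, 10.0)))    # True
```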
Defining Safety Functions
Following the risk assessment, the engineering team must define Safety Functions. These are specific functions that, when executed, bring the system to a safe state or maintain it in a safe state. Each safety function is assigned a required integrity level, expressed as a required Performance Level (PLr) or a Safety Integrity Level (SIL), depending on the standard.
Examples of safety functions in intelligent systems include:
- Safe Stop: An autonomous vehicle detecting an imminent collision executes a controlled emergency stop.
- Safe State Transition: A surgical robot detects a loss of sensor calibration and transitions to a “freeze” state, requiring manual intervention.
- Redundant Perception: A drone uses multiple sensor modalities (e.g., LiDAR and cameras). If the camera feed is ambiguous or degraded, the system relies on LiDAR to maintain safe navigation.
Crucially, the safety function must be independent or segregated from the basic control function if the basic control function is the source of the hazard. In AI systems, this often leads to the concept of a “safety cage” or “monitor” architecture. The AI performs the complex decision-making, but a separate, simpler, and highly reliable safety monitor validates the outputs against safety constraints.
The Verification Mindset: Evidence and Confidence
Functional safety is built on the principle that safety is not a feature that can be tested in isolation at the end of development. It must be designed in, and its effectiveness must be continuously verified. The “verification mindset” for intelligent systems requires a departure from traditional testing methods.
The Limits of Traditional Testing
Traditional software testing relies heavily on code coverage and executing specific test cases to verify expected behaviour. For intelligent systems, this is insufficient. You cannot write a test case for every possible pixel configuration an autonomous vehicle might encounter. You cannot exhaustively enumerate all potential inputs to a neural network.
Therefore, verification for AI must focus on:
- Process Rigour: Demonstrating that the development process itself (data collection, model selection, training, validation) adheres to strict quality standards (e.g., ISO 26262-6 for software development).
- Robustness Validation: Testing the system against perturbed inputs, adversarial examples, and simulated edge cases to understand its breaking points.
- Formal Methods (where applicable): Using mathematical proofs to verify that the safety logic (the monitor) behaves correctly under all modelled conditions.
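As one hedged illustration of robustness validation, the sketch below perturbs an input with noise and lighting changes and checks that the model's decision is stable. The perturbation magnitudes and the dummy model are assumptions chosen for the example, not recommended test parameters.

```python
import numpy as np

# Sketch of a robustness check: perturb an input with noise and brightness
# shifts and verify the model's decision does not flip. "model" is assumed
# to be any callable returning a class label.

def perturbations(image: np.ndarray):
    yield image + np.random.normal(0, 0.02, image.shape)  # sensor noise
    yield np.clip(image * 1.3, 0, 1)                      # glare / over-exposure
    yield np.clip(image * 0.6, 0, 1)                      # low light

def is_robust(model, image: np.ndarray) -> bool:
    reference = model(image)
    return all(model(p) == reference for p in perturbations(image))

# Usage: a dummy "model" that thresholds mean brightness, standing in for
# a real perception network.
dummy_model = lambda img: int(img.mean() > 0.5)
print(is_robust(dummy_model, np.random.rand(32, 32)))
```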
Evidence Hierarchy
Regulators and auditors look for a hierarchy of evidence. A safety claim is only as strong as the weakest link in this chain.
1. Requirements Evidence
Can you trace every safety requirement to its implementation? For AI, this is challenging. If a safety requirement is “the system shall not cross a virtual geofence,” how is this implemented in the neural network weights? The evidence here often involves architectural decisions (e.g., a hard-coded software limit that overrides the AI) rather than the AI model itself.
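A hedged sketch of what such an architectural decision can look like in code: the geofence requirement is enforced by a plain, easily reviewed check sitting between the AI planner and the actuators, so the requirement traces to this function rather than to opaque network weights. The geofence bounds and function names below are hypothetical.

```python
# Hypothetical enforcement of "the system shall not cross a virtual geofence"
# as a hard-coded check that overrides the AI planner.

GEOFENCE = {"x_min": 0.0, "x_max": 50.0, "y_min": 0.0, "y_max": 30.0}  # metres

def inside_geofence(x: float, y: float) -> bool:
    return (GEOFENCE["x_min"] <= x <= GEOFENCE["x_max"]
            and GEOFENCE["y_min"] <= y <= GEOFENCE["y_max"])

def filter_waypoint(ai_waypoint: tuple[float, float],
                    current_pose: tuple[float, float]) -> tuple[float, float]:
    # Reject any AI-proposed waypoint that would leave the permitted area
    # and hold position instead (the assumed safe fallback here).
    return ai_waypoint if inside_geofence(*ai_waypoint) else current_pose

print(filter_waypoint((60.0, 10.0), (45.0, 10.0)))  # held at current pose
```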
2. Design Evidence
This includes architecture diagrams, fault tree analyses (FTA), and failure mode and effects analyses (FMEA). For intelligent systems, we must add AI-specific risk analysis, which examines how data quality, model bias, or algorithmic opacity could lead to hazardous states.
3. Testing Evidence
Unit tests, integration tests, and system tests. For AI, this includes validation on hold-out datasets and stress testing. The concept of coverage is different. Instead of line coverage, we look at scenario coverage. Have we tested against a sufficient diversity of operational scenarios?
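A minimal sketch of scenario coverage, assuming a hand-maintained catalogue of required operational scenarios (the entries below are hypothetical): coverage is the fraction of that catalogue exercised by the test campaign, rather than the fraction of code lines executed.

```python
# Sketch of scenario coverage as opposed to line coverage: the fraction of
# a defined operational scenario catalogue that the test campaign has
# exercised. The catalogue entries are hypothetical examples.

REQUIRED_SCENARIOS = {
    "pedestrian_crossing_daylight", "pedestrian_crossing_night",
    "heavy_rain_glare", "partially_occluded_pedestrian",
    "narrow_aisle_oncoming_vehicle",
}

def scenario_coverage(executed: set[str]) -> float:
    covered = REQUIRED_SCENARIOS & executed
    return len(covered) / len(REQUIRED_SCENARIOS)

executed_tests = {"pedestrian_crossing_daylight", "heavy_rain_glare"}
print(f"Scenario coverage: {scenario_coverage(executed_tests):.0%}")  # 40%
```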
4. Operational Evidence
For systems deployed in the field, data on actual performance is invaluable. However, for safety-critical systems, we cannot wait for accidents to gather evidence. This is where simulation plays a massive role. High-fidelity simulation allows for the execution of millions of “virtual miles” or “virtual operations,” generating statistical evidence of safety performance that would be impossible to gather in the real world.
Simulation is not a substitute for real-world testing, but it is a critical component of the evidence portfolio. It allows for the exploration of rare events (hazards) in a controlled, repeatable manner.
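To give a feel for what statistical evidence from simulation looks like, the sketch below applies the widely used rule of three: if no hazardous failure is observed in N independent runs, an approximate 95% upper confidence bound on the per-run failure rate is about 3/N. The run counts are illustrative, and a real safety case must also argue that the simulated runs are representative and independent.

```python
import math

# Approximate 95% upper confidence bound on the per-run failure rate when
# k failures are observed in n simulated runs. For k = 0 this reduces to
# the "rule of three" (bound ~ 3/n). Illustrative statistics only.

def failure_rate_upper_bound(n_runs: int, n_failures: int = 0) -> float:
    if n_failures == 0:
        return 3.0 / n_runs  # rule of three
    # Simple one-sided normal approximation for small non-zero counts.
    p = n_failures / n_runs
    return p + 1.645 * math.sqrt(p * (1 - p) / n_runs)

print(failure_rate_upper_bound(1_000_000))     # ~3e-06 per run
print(failure_rate_upper_bound(1_000_000, 2))  # slightly higher
```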
Architectural Strategies for Intelligent Systems
How do we practically implement functional safety in a system dominated by non-deterministic AI? The industry is converging on specific architectural patterns.
The Monitor Pattern (Safety Cage)
This is the most common approach. The “AI” (the Controller) performs the complex task (e.g., path planning, object detection). A separate “Safety Monitor” observes the inputs and outputs of the Controller. The Monitor is typically:
- Simpler: It uses simpler algorithms (e.g., rule-based, geometric checks).
- Smaller: It has a smaller attack surface and is easier to verify.
- Decisive: It has the authority to intervene (e.g., trigger an emergency stop) if the Controller’s actions violate safety constraints.
Example: An AI path planner suggests a route. The safety monitor checks if that route keeps the robot at least 50cm from obstacles (based on sensor data). If the AI suggests a path closer than 50cm, the monitor blocks the command and triggers a stop.
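A minimal sketch of that monitor, assuming the planner emits a list of path points and the perception stack exposes a clearance measurement per point; the 0.5 m margin comes from the example above, and all names are hypothetical.

```python
# Minimal sketch of the monitor from the example above: the AI planner
# proposes a path, and an independent check vetoes it if any point comes
# closer than 0.5 m to a known obstacle.

MIN_CLEARANCE_M = 0.5

def path_is_safe(path, distance_to_nearest_obstacle) -> bool:
    # distance_to_nearest_obstacle: callable mapping a path point to the
    # measured clearance in metres, derived from sensor data.
    return all(distance_to_nearest_obstacle(p) >= MIN_CLEARANCE_M for p in path)

def execute(path, distance_to_nearest_obstacle, drive, emergency_stop):
    # The monitor, not the planner, holds the authority to stop the robot.
    if path_is_safe(path, distance_to_nearest_obstacle):
        drive(path)
    else:
        emergency_stop()

# Example: a path with one point at 0.3 m clearance is rejected.
clearance = {(1, 0): 1.4, (2, 0): 0.3}.get
execute([(1, 0), (2, 0)], lambda p: clearance(p, float("inf")),
        drive=lambda path: print("driving", path),
        emergency_stop=lambda: print("emergency stop"))
```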
Redundancy and Diversity
Using diverse sensors and algorithms reduces the probability of common-mode failures. If a camera-based AI fails due to glare, a LiDAR-based system might still function. However, diversity introduces complexity in data fusion. The safety architecture must decide how to handle conflicting inputs from diverse sources. This often leads to voting mechanisms (e.g., 2-out-of-3 voting) or prioritisation logic.
For software, diversity can mean using different algorithms or even different programming languages for the primary control and the safety monitor. This ensures that a bug in a specific library or compiler does not affect both the control and the safety mechanism.
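A small sketch of the 2-out-of-3 voting mentioned above, assuming each diverse channel reduces its detection to a boolean hazard flag; real fusion logic is considerably richer.

```python
from collections import Counter

# Sketch of a 2-out-of-3 (2oo3) voter over three diverse detection channels
# (e.g. camera, LiDAR, radar). A hazard is asserted only if at least two
# channels agree, which masks a single faulty or degraded channel.

def two_out_of_three(camera: bool, lidar: bool, radar: bool) -> bool:
    votes = Counter([camera, lidar, radar])
    return votes[True] >= 2

print(two_out_of_three(True, True, False))   # True: hazard confirmed
print(two_out_of_three(True, False, False))  # False: single-channel outlier
```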
Fail-Safe Design
Ultimately, if the system cannot guarantee safe operation, it must transition to a safe state. Defining that safe state is a critical design decision. For a robot, it might be stopping motion. For a medical device, it might be reverting to manual control or a low-power diagnostic mode. The transition to the safe state must be robust and predictable, even if the AI components are behaving erratically.
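As a sketch of this principle, the fragment below models the transition logic as a tiny state machine that defaults to the safe state whenever any health condition cannot be confirmed, independent of whatever the AI components are doing. States and health checks are illustrative assumptions.

```python
from enum import Enum, auto

# Sketch of a fail-safe transition: whenever any health condition cannot be
# confirmed, the system falls into its defined safe state.

class State(Enum):
    OPERATIONAL = auto()
    SAFE_STOP = auto()  # the defined safe state: all motion halted

def next_state(current: State, sensors_ok: bool, monitor_ok: bool,
               watchdog_ok: bool) -> State:
    if not (sensors_ok and monitor_ok and watchdog_ok):
        return State.SAFE_STOP  # default to safety on any doubt
    if current is State.SAFE_STOP:
        return State.SAFE_STOP  # leaving the safe state requires a separate,
                                # deliberate reset procedure
    return State.OPERATIONAL

print(next_state(State.OPERATIONAL, sensors_ok=True, monitor_ok=False,
                 watchdog_ok=True))  # State.SAFE_STOP
```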
Regulatory Nuances: EU vs. National Implementation
While the EU has harmonised regulations like the Machinery Regulation, the interpretation and enforcement can vary. Professionals operating across borders must be aware of this.
The Role of Notified Bodies
For high-risk machinery or products falling under the AI Act’s high-risk categories, involvement of a Notified Body is often mandatory. These are independent conformity assessment bodies designated by EU member states. While the criteria for designation are EU-wide, the specific expertise and interpretation nuances can differ between Notified Bodies in different countries (e.g., TÜV SÜD in Germany vs. Bureau Veritas in France). It is advisable to engage with a Notified Body early in the design process.
National Standards and Guidelines
Some member states have specific national standards or guidelines that supplement the harmonised standards. For example, Germany has a strong tradition of safety engineering driven by the professional associations responsible for DGUV regulations (German Social Accident Insurance). While these are primarily for workplace safety, they influence product design expectations.
In the context of autonomous systems, countries like the Netherlands and Sweden have been proactive in creating testing corridors and regulatory sandboxes. These allow for real-world testing under supervision but require specific safety cases tailored to the test environment. The evidence generated in these sandboxes is often scrutinised heavily by national authorities before broader deployment is permitted.
The AI Act Intersection
The EU AI Act introduces a further layer. For high-risk AI systems (which include many safety-critical applications), the Act mandates a risk management system, data governance, and technical documentation. Crucially, the AI Act references "harmonised standards" for the presumption of conformity, and these standards are expected to draw heavily on established functional safety principles (IEC 61508, ISO 13849, and related sector standards).
Practitioners should not view the AI Act and functional safety standards as separate silos. They are converging. A robust functional safety assessment will likely form the backbone of the technical documentation required for the AI Act.
Practical Steps for Practitioners
For teams building intelligent systems, integrating functional safety is a cultural and procedural shift.
1. Start with Safety
Safety cannot be an afterthought. It must be part of the initial concept. Conduct a preliminary Hazard Analysis and Risk Assessment (HARA) before writing significant amounts of code or training models. This defines the constraints the system must operate within.
2. Define the Safety Concept
Based on the HARA, define the safety functions and the required Performance Levels (PLr). Decide on the architecture. Will you use a monitor? Redundancy? What is the safe state? Document this in a Safety Requirements Specification (SRS).
3. Develop with Evidence in Mind
Every design decision, every line of code, and every training data selection should be traceable to a requirement. Use tools that support requirements traceability. For AI, this includes documenting the provenance of training data, the hyperparameters used, and the version of the model.
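One hedged way to make this traceability concrete is a machine-readable record that ties a trained model to its data, hyperparameters, and the safety requirements it supports. The field names and requirement IDs below are hypothetical placeholders.

```python
from dataclasses import dataclass, field, asdict
import hashlib
import json

# Sketch of a traceability record linking a trained model to its data
# provenance, hyperparameters and the safety requirements it touches.

@dataclass
class ModelRecord:
    model_version: str
    dataset_uri: str
    dataset_sha256: str
    hyperparameters: dict
    linked_requirements: list = field(default_factory=list)

    def to_json(self) -> str:
        return json.dumps(asdict(self), indent=2, sort_keys=True)

record = ModelRecord(
    model_version="detector-2.4.1",
    dataset_uri="s3://example-bucket/warehouse-scenes-v7",
    dataset_sha256=hashlib.sha256(b"manifest contents").hexdigest(),
    hyperparameters={"learning_rate": 1e-4, "epochs": 40},
    linked_requirements=["SR-012 obstacle detection", "SR-031 geofence"],
)
print(record.to_json())
```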
4. Verify Continuously
Do not wait for the end of the project to verify safety. Integrate verification into your CI/CD pipeline. Run regression tests on safety logic. Use simulation to test edge cases continuously. Conduct regular safety reviews involving independent experts.
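A minimal sketch of such safety regression tests in a pytest-style suite; the clearance check is redefined inline to keep the example self-contained, whereas in a real pipeline it would be imported from the safety monitor module.

```python
# Sketch of safety regression tests that run in CI on every change.

MIN_CLEARANCE_M = 0.5

def path_is_safe(clearances_m):
    return all(c >= MIN_CLEARANCE_M for c in clearances_m)

def test_rejects_path_violating_clearance():
    assert not path_is_safe([1.2, 0.8, 0.4])  # one point too close

def test_accepts_path_with_margin():
    assert path_is_safe([1.2, 0.8, 0.6])

def test_boundary_value_is_accepted():
    assert path_is_safe([MIN_CLEARANCE_M])    # exactly at the limit

# Run with: pytest safety_regression_test.py
```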
5. Plan for the Long Term
Intelligent systems, particularly those using machine learning, can degrade or drift over time. A safety concept must include mechanisms for monitoring performance in the field (e.g., data drift detection) and processes for updating models safely. The regulatory expectation is that the safety lifecycle does not end at the point of sale.
Functional safety for intelligent systems is a complex discipline, but it is not insurmountable. It requires a rigorous, evidence-based approach that respects the unique capabilities and limitations of AI. By adhering to established safety principles and adapting them to the context of intelligent systems, developers can build products that are not only innovative but also trustworthy and compliant with European regulations.
