What is Interrupt latency in Arm Cortex-M series?

Interrupt latency in Arm Cortex-M series microcontrollers refers to the time it takes for an interrupt request to be serviced after it is asserted. It is an important metric that impacts real-time response in embedded systems.

Contents

Interrupt Handling in Cortex-M

Arm Cortex-M microcontrollers have an interrupt-driven architecture. When an event like an external interrupt or exception occurs, the processor stops executing the main program and starts executing an Interrupt Service Routine (ISR) to handle the event. After the ISR finishes, normal program execution resumes.

The Cortex-M interrupt handling mechanism involves these key steps:

An interrupt request is asserted by a peripheral or exception.
The processor finishes executing the current instruction.
The context of the main program is saved to the stack.

The interrupt vector table is used to branch to the corresponding ISR.
The ISR executes to handle the interrupt.
The main program context is restored from the stack.

Program execution resumes after the interrupted instruction.

Sources of Interrupt Latency

There are several sources of delay that contribute to the overall interrupt latency in Cortex-M:

Interrupt Detection Delay

It takes a few clock cycles from when the interrupt signal is asserted to when it is detected by the core’s interrupt controller. This depends on synchronization logic and interrupt controller design.

Instruction Completion Delay

The processor has to complete executing the current instruction before handling the interrupt. Usually Cortex-M cores have a 3-5 stage pipeline, so this could take up to 5 clock cycles for longer instructions.

Context Saving Delay

The complete context of the main program needs to be saved from processor registers to the stack when an exception occurs. This normally takes 12 clock cycles in Cortex-M as there are 12 core registers to save.

Interrupt Entry Delay

The processor needs to fetch the address of the ISR handler from the interrupt vector table. This takes a few clock cycles.

First Instruction Fetch Delay

The first instruction of the ISR needs to be fetched from memory before it can be executed. Cortex-M cores have a Von Neumann architecture with unified instruction and data buses, so this competes with data accesses.

Measuring Interrupt Latency

Interrupt latency can be measured using the following methods:

Logic Analyzer

A logic analyzer connected to the interrupt signal and processor clock can precisely measure the time from interrupt assertion to the start of ISR execution.

Oscilloscope

An oscilloscope can also measure interrupt latency from the time the interrupt signal is asserted to when the ISR changes the logic level of an GPIO pin.

Timer Capture

A free running timer can be used to capture the time when the interrupt occurs and when the ISR starts executing. Their difference gives the interrupt latency.

In-situ Measurement

Software counters can directly measure the number of cycles elapsed between the interrupt assertion and the execution of the first ISR instruction.

Factors Affecting Interrupt Latency

There are several microarchitectural factors that affect interrupt latency in Cortex-M processors:

Interrupt Priorities

Higher priority interrupts have lower latency as they can preempt lower priority ones. Nested interrupts increase latency due to additional context saving.

Processor Pipeline Depth

Deeper pipelines lead to longer instruction completion delays. Cortex-M7 has a 5-stage pipeline vs 3 in Cortex-M3/M4/M0.

Clock Frequency

Faster clock speeds reduce absolute interrupt latency in clock cycle counts, but not the number of steps involved.

Memory Architecture

The Von Neumann architecture adds latency due to instruction fetches competing with data accesses. Cortex-M7 has separate instruction and data buses.

Interrupt Vector Table Location

Locating the vector table in faster memory like TCM RAM reduces first instruction fetch delay compared to external flash.

Compiler Optimizations

Compiler optimizations like function inlining of ISRs can reduce call/return overhead and improve latency.

Typical Interrupt Latency Figures

Here are some typical interrupt latency figures for different Cortex-M cores:

Cortex-M0 – 12 to 20 clock cycles

Cortex-M3 – 12 to 16 clock cycles
Cortex-M4 – 12 to 16 clock cycles
Cortex-M7 – 14 to 20 clock cycles

Exact latencies depend on the microcontroller, clock speed, interrupts enabled, compiler optimizations etc.

Design Considerations for Low Latency

Here are some design guidelines to reduce interrupt latency in a Cortex-M based system:

Use the simplest Cortex-M core that meets performance requirements

Run the core at the highest clock frequency possible
Minimize lower priority interrupts when possible
Use RAM based vector table for fast ISR entry

Optimize ISR code to minimize execution time
Reduce OS overhead for hard real-time tasks
Use compiler optimizations like function inlining

Use latency analysis tools to identify optimization opportunities

Conclusion

Interrupt latency is a key metric that impacts responsiveness in real-time embedded systems using Arm Cortex-M cores. A combination of hardware, software and compiler techniques can help optimize interrupt latency. Typical latency ranges from 12 to 20 clock cycles for most Cortex-M variants.