Memory Allocation Strategies: Static, Heap Caps, and Fragmentation
Hoomanely Context
At Hoomanely, our embedded platforms power an ecosystem of modular IoT products where reliability, uptime, and predictability matter more than raw throughput. Our System-on-Module architecture spans sensing, communication, and processing nodes, all operating continuously under diverse workloads. Memory allocation is therefore not a theoretical concern. It defines real-world reliability across devices deployed in customer environments.
This article reflects the memory strategies that guide how Hoomanely builds predictable, production-grade firmware.
Introduction
Memory allocation is one of the most fundamental yet misunderstood aspects of embedded systems design. A single poorly timed malloc call or an unpredictable burst of dynamic allocations can destabilize an otherwise well-architected device. On constrained systems without swap, memory is a finite and fragile resource. Even when total free memory appears sufficient, fragmentation can prevent the allocator from fulfilling a request due to lack of sufficiently large contiguous blocks.
This blog explores three crucial strategies for designing stable firmware: static allocation, heap caps, and fragmentation control. These concepts form the basis of long-running, predictable embedded systems used in IoT devices, industrial hardware, instrumentation, and edge compute systems.
1. Problem: Unbounded Allocation and Fragmentation
Dynamic allocation introduces two major risks:
1.1 Allocation Failure
The allocator cannot find a contiguous block due to:
- Temporary peak loads exhausting available heap
- Long-lived objects creating "holes" between freed blocks
- External fragmentation preventing large allocations despite sufficient total free memory
1.2 Long-term Fragmentation
Fragmentation builds slowly over time:
- Short-lived and long-lived allocations intermix on the same heap
- Variable-sized buffers come and go unpredictably
- Subsystems behave differently under varying workloads
- Heap disorder accumulates: holes pinned between long-lived allocations never heal
Devices expected to run for months or years cannot tolerate this drift. A device that passes a 24-hour stress test may still fail after 60 days of uptime due to accumulated fragmentation.
2. Strategy One: Static Allocation
Static allocation reserves memory at compile time and avoids the runtime allocator entirely. Memory layout is determined during linking, with objects placed in the .data or .bss sections.
Why Static Allocation is Essential
- Zero fragmentation: Memory layout is fixed at compile time and never changes
- Zero allocation overhead: No runtime allocator calls, no metadata bookkeeping
- Fully deterministic timing: Access latency is constant and predictable
- Perfect for timing-sensitive tasks: ISRs, real-time control loops, critical sections
Example
#define SENSOR_BUF_SIZE 512                      /* worst-case size, fixed at design time */
static uint8_t sensor_buffer[SENSOR_BUF_SIZE];   /* placed in .bss; no runtime allocation */

Limitations
- Cannot support variable-sized data structures
- May increase RAM usage when conservatively sized (worst-case dimensioning required)
- Harder to extend dynamically based on runtime conditions
- Requires accurate worst-case sizing at design time
Static allocation gives determinism but lacks flexibility. It's the foundation of predictable embedded systems.
3. Strategy Two: Heap Caps and Memory Pooling
Dynamic allocations are necessary in real-world systems, but they must be controlled.
3.1 Heap Caps
Heap caps set a maximum memory budget for a module or subsystem, providing memory isolation and preventing any single component from monopolizing resources.
Implementation Concept
A heap cap wraps the standard allocator with accounting (a minimal sketch follows the list below):
- Track current usage against a defined ceiling
- Return NULL when the cap would be exceeded
- Thread-safe operations via mutex protection
- Per-subsystem caps for isolation
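Here is a minimal sketch of that accounting in C, assuming POSIX mutexes for illustration; heap_cap_t, capped_malloc, and capped_free are illustrative names, and each block carries a small size header so frees can be charged back against the cap:

#include <stdlib.h>
#include <string.h>
#include <stddef.h>
#include <pthread.h>

#define CAP_HDR sizeof(max_align_t)   /* size header, padded to preserve alignment */

typedef struct {
    size_t used;             /* bytes currently charged to this cap */
    size_t limit;            /* hard ceiling for the subsystem */
    pthread_mutex_t lock;
} heap_cap_t;

void *capped_malloc(heap_cap_t *cap, size_t sz) {
    void *p = NULL;
    pthread_mutex_lock(&cap->lock);
    if (cap->used + sz <= cap->limit) {
        p = malloc(sz + CAP_HDR);
        if (p) {
            memcpy(p, &sz, sizeof sz);   /* stash the size for capped_free */
            cap->used += sz;
            p = (char *)p + CAP_HDR;
        }
    }
    pthread_mutex_unlock(&cap->lock);
    return p;   /* NULL when the cap would be exceeded or the heap is exhausted */
}

void capped_free(heap_cap_t *cap, void *p) {
    if (!p) return;
    char *base = (char *)p - CAP_HDR;
    size_t sz;
    memcpy(&sz, base, sizeof sz);
    pthread_mutex_lock(&cap->lock);
    cap->used -= sz;
    pthread_mutex_unlock(&cap->lock);
    free(base);
}

A thin per-module wrapper (for example, a one-argument capped_malloc bound to the network module's cap, as used in section 6) keeps call sites clean while preserving isolation. On an RTOS, the pthread mutex would be replaced by the platform's own primitive.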
Benefits
- Memory isolation: Subsystems cannot interfere with each other's allocation budget
- Predictable worst-case usage: Maximum consumption is known at design time
- Protection from subsystem interference: Network module cannot starve sensor module
- Easier debugging: Memory leaks contained to specific subsystems with clear boundaries
Hoomanely Example
Hoomanely devices run sensing loops, communication threads, and utility tasks concurrently. Heap caps keep these subsystems isolated so that no module consumes memory beyond its intended budget, ensuring predictable operation under varying loads. For example, a network stack might be capped at 16KB, while sensor processing gets 8KB, and utilities share 4KB.
3.2 Memory Pools
Pools provide pre-sized blocks for frequent allocation patterns, eliminating both fragmentation and allocation overhead. A pool is essentially an array of fixed-size blocks with a free list or bitmap indicating availability.
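Here is a minimal sketch of a 128-byte block pool using an intrusive free list (the next-pointer lives inside each block while it is free); pool128_alloc and pool128_free echo the names used in the hybrid example in section 6, and the block count is an illustrative choice:

#include <stddef.h>
#include <stdint.h>

#define BLOCK_SIZE  128
#define BLOCK_COUNT 32

/* Backing storage, reserved once; aligned so a pointer can live in each free block. */
static _Alignas(max_align_t) uint8_t pool_storage[BLOCK_COUNT][BLOCK_SIZE];
static void *free_list;   /* singly linked list threaded through the free blocks */

void pool128_init(void) {
    for (int i = 0; i < BLOCK_COUNT - 1; i++)
        *(void **)pool_storage[i] = pool_storage[i + 1];
    *(void **)pool_storage[BLOCK_COUNT - 1] = NULL;
    free_list = pool_storage[0];
}

void *pool128_alloc(void) {   /* O(1): pop the head of the free list */
    void *p = free_list;
    if (p) free_list = *(void **)p;
    return p;                 /* NULL when the pool is exhausted */
}

void pool128_free(void *p) {  /* O(1): push the block back onto the list */
    *(void **)p = free_list;
    free_list = p;
}

This sketch is single-threaded; production code would wrap the free-list operations in a critical section or mutex.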
Benefits
- O(1) allocation and deallocation: Constant-time operations via bitmap or free list
- No fragmentation: Fixed-size blocks never create holes or gaps
- Cache-friendly: Sequential memory layout improves spatial locality
- Best for protocol packets, queue nodes, structs: Common embedded patterns with known sizes
When to Use Pools
- Network packet buffers with known maximum transmission unit (MTU)
- Message queue nodes for inter-task communication
- Sensor data structures with fixed schemas
- State machine context objects with uniform size
Pools trade flexibility for predictability. They work best when allocation sizes cluster around known values (e.g., 64, 128, 256 bytes).
4. Strategy Three: Fragmentation Management
Even with caps and pools, fragmentation may still emerge when dynamic allocation spans multiple components or handles variable-sized data.
4.1 Fragmentation Sources
External fragmentation occurs when free memory exists but not in contiguous blocks:
- Mixing long-lived and short-lived objects on the same heap
- Highly variable buffer sizes (e.g., 32 bytes, then 1024 bytes, then 16 bytes)
- Frequent allocate/free patterns creating "swiss cheese" memory
- Bursty workloads where peak usage leaves permanent holes
Internal fragmentation occurs when allocators round up request sizes:
- Allocator overhead (metadata per block, typically 8-16 bytes)
- Alignment requirements (e.g., 8-byte or 16-byte boundaries for performance)
- Minimum allocation sizes enforced by the allocator
4.2 Mitigation Techniques
Use Arenas (Linear Allocators)
Arena allocators bump a pointer forward and free all memory at once, eliminating fragmentation for temporary workloads. They maintain a simple pointer to the next available byte and increment it on each allocation. When work completes, the entire arena resets.
Ideal use cases:
- Request-response patterns (allocate for request duration, then reset)
- JSON/CBOR parsing (parse, process, discard temporary structures)
- Temporary processing buffers (compute, then release)
- Per-connection state in network servers
Arenas convert many small frees into one bulk reset, dramatically simplifying memory management.
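Here is a minimal bump-pointer arena sketch; arena_t, arena_alloc, and arena_reset mirror the names used in the hybrid example in section 6 and are illustrative:

#include <stddef.h>
#include <stdint.h>
#include <stdalign.h>

typedef struct {
    uint8_t *base;    /* start of the backing buffer */
    size_t   size;    /* total capacity */
    size_t   offset;  /* next free byte */
} arena_t;

void arena_init(arena_t *a, void *buf, size_t size) {
    a->base = buf;
    a->size = size;
    a->offset = 0;
}

void *arena_alloc(arena_t *a, size_t sz) {
    /* Round the bump pointer up so every allocation is suitably aligned. */
    size_t aligned = (a->offset + alignof(max_align_t) - 1) & ~(alignof(max_align_t) - 1);
    if (aligned + sz > a->size) return NULL;   /* arena exhausted */
    a->offset = aligned + sz;
    return a->base + aligned;
}

void arena_reset(arena_t *a) {   /* one bulk "free" for everything allocated */
    a->offset = 0;
}

Typical usage pairs one arena with one workload phase: arena_init once at startup over a static buffer, arena_alloc freely while handling a request, arena_reset when the request completes.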
Avoid Allocations in ISRs
Allocations inside interrupt service routines introduce:
- Unpredictable delays: Allocator search time is non-deterministic
- Lock contention: If heap uses mutexes, ISR may block or cause priority inversion
- Reentrancy issues: Many allocators are not reentrant or ISR-safe
- RTOS complications: Violates real-time scheduling guarantees
Solution: Pre-allocate buffers or use lock-free ring buffers for ISR-to-task communication. ISRs should only signal tasks, not allocate memory.
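As a sketch of that pattern, here is a minimal lock-free single-producer/single-consumer ring buffer using C11 atomics; it assumes exactly one ISR producer and one task consumer, and the capacity is an illustrative power of two:

#include <stdatomic.h>
#include <stdbool.h>
#include <stdint.h>

#define RING_SIZE 64   /* power of two, so index wrap is a cheap mask */

typedef struct {
    uint8_t buf[RING_SIZE];
    atomic_uint head;   /* written only by the ISR */
    atomic_uint tail;   /* written only by the consumer task */
} spsc_ring_t;

/* Called from the ISR: never blocks, never allocates. */
bool ring_push(spsc_ring_t *r, uint8_t byte) {
    unsigned head = atomic_load_explicit(&r->head, memory_order_relaxed);
    unsigned tail = atomic_load_explicit(&r->tail, memory_order_acquire);
    if (head - tail == RING_SIZE) return false;   /* full: drop and count the overrun */
    r->buf[head & (RING_SIZE - 1)] = byte;
    atomic_store_explicit(&r->head, head + 1, memory_order_release);
    return true;
}

/* Called from task context. */
bool ring_pop(spsc_ring_t *r, uint8_t *out) {
    unsigned tail = atomic_load_explicit(&r->tail, memory_order_relaxed);
    unsigned head = atomic_load_explicit(&r->head, memory_order_acquire);
    if (head == tail) return false;               /* empty */
    *out = r->buf[tail & (RING_SIZE - 1)];
    atomic_store_explicit(&r->tail, tail + 1, memory_order_release);
    return true;
}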
Group Objects by Lifetime
Allocating objects with similar lifetimes together reduces fragmentation by keeping the heap segregated:
- Initialization objects: Allocated once during startup, never freed
- Session objects: Live for connection duration (seconds to minutes)
- Request objects: Allocated and freed per request (milliseconds)
Implementation strategy: Use separate heaps, arenas, or allocation strategies for each lifetime category. This prevents long-lived objects from fragmenting regions used by short-lived ones.
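As a sketch of this segregation, reusing the arena_t type from the arena sketch above (buffer sizes are illustrative):

static uint8_t init_buf[8192], session_buf[4096], request_buf[2048];
static arena_t init_arena;      /* boot-time objects, never reset */
static arena_t session_arena;   /* reset when the connection closes */
static arena_t request_arena;   /* reset after every request */

void lifetimes_init(void) {
    arena_init(&init_arena, init_buf, sizeof init_buf);
    arena_init(&session_arena, session_buf, sizeof session_buf);
    arena_init(&request_arena, request_buf, sizeof request_buf);
}

void on_request_done(void) {
    /* Short-lived objects vanish in bulk without disturbing
       the regions that hold session or boot-time state. */
    arena_reset(&request_arena);
}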
Prefer Fixed-size Data Structures
Bounded data structures prevent unbounded growth and fragmentation. Ring buffers, fixed-capacity arrays, and pre-allocated pools eliminate the need for dynamic resizing.
Example: Instead of a linked list with dynamic nodes, use a ring buffer with fixed capacity, like the SPSC ring sketched above. This prevents both memory exhaustion and fragmentation from variable-length linked structures.
5. Memory Strategy Mapping at Hoomanely
Hoomanely's SoM architecture runs multiple concurrent subsystems:
- Sensor loops: Periodic sampling and processing
- Communication tasks: Network protocol handling (MQTT, CoAP, HTTP)
- Monitoring utilities: Health checks, diagnostics, and watchdog functions
All of these run simultaneously over long durations, with devices expected to maintain months of continuous uptime in customer deployments.
Layered Memory Strategy
To keep memory behavior predictable across this heterogeneous workload:
- Critical paths use static allocation: Sensor buffers, ISR contexts, real-time state machines, configuration structures
- Frequent patterns use pools: Network packets, queue nodes, small messages, event structures
- Modules get isolated via heap caps: Network stack (16KB), application layer (12KB), utilities (4KB)
- Batch workloads use arenas: JSON parsing, temporary computations, request handling, protocol encoding/decoding
This layered strategy simplifies debugging and maintains allocator health over months of uptime by matching allocation strategy to access pattern.
6. Hybrid Strategy Example
A production-ready allocator combines multiple strategies based on allocation characteristics:
void* allocate_buffer(size_t sz) {
    if (sz <= 128) return pool128_alloc();   /* common case: O(1) block from the pool */
    void* p = capped_malloc(sz);             /* larger request: budgeted heap */
    if (p) return p;
    return arena_alloc(&temp_arena, sz);     /* fallback: temporary arena */
}

This hybrid allocator ensures:
- Typical cases served by pools: O(1) allocation, zero fragmentation for common sizes
- Larger requests constrained: Capped heap prevents runaway growth
- Temporary workloads isolated: Arena prevents long-term fragmentation from transient data
The key is routing allocations to the strategy best suited for their size and lifetime.
7. Monitoring and Observability
Effective memory management requires runtime visibility. Production systems should track:
Key Metrics
- Current usage: Active allocated bytes
- Peak usage: High water mark over device lifetime
- Failed allocations: Count of malloc returning NULL
- Largest free block: Indicator of external fragmentation
- Fragmentation ratio: 1 - (largest_free_block / total_free_memory), computed as in the sketch below
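A minimal sketch of computing that ratio; allocator_total_free and allocator_largest_free_block are hypothetical stand-ins for whatever introspection hooks your allocator actually provides:

#include <stddef.h>

extern size_t allocator_total_free(void);          /* hypothetical hook */
extern size_t allocator_largest_free_block(void);  /* hypothetical hook */

/* 0.0 means one contiguous free region; values near 1.0 mean badly fragmented. */
float fragmentation_ratio(void) {
    size_t total = allocator_total_free();
    if (total == 0) return 0.0f;   /* nothing free: report 0 rather than divide by zero */
    return 1.0f - (float)allocator_largest_free_block() / (float)total;
}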
Debug Techniques
- Heap walk: Periodically traverse allocator metadata to detect corruption
- Allocation tracking: Record caller addresses to identify leak sources
- Watermark checking: Place sentinel values around buffers to detect overruns (sketched after this list)
- Periodic snapshots: Compare heap state over time to detect gradual leaks
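As a sketch of watermark checking, sentinels are placed on both sides of a payload and verified periodically; the canary value and payload size are illustrative:

#include <stdint.h>
#include <stdbool.h>

#define CANARY 0xDEADBEEFu

typedef struct {
    uint32_t guard_lo;      /* sentinel before the payload */
    uint8_t  payload[256];
    uint32_t guard_hi;      /* sentinel after the payload */
} guarded_buf_t;

void guarded_init(guarded_buf_t *b) {
    b->guard_lo = CANARY;
    b->guard_hi = CANARY;
}

/* Call periodically, e.g. from a watchdog or diagnostics task:
   a changed sentinel means something wrote past the payload. */
bool guarded_intact(const guarded_buf_t *b) {
    return b->guard_lo == CANARY && b->guard_hi == CANARY;
}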
8. Key Takeaways
- Static allocation ensures deterministic execution with zero runtime overhead and zero fragmentation
- Heap caps isolate memory budgets and prevent subsystem interference through hard limits
- Pools eliminate fragmentation for common patterns with O(1) fixed-size allocation
- Arenas simplify temporary workloads by allowing bulk deallocation instead of individual frees
- Avoid allocations inside interrupts to maintain real-time guarantees and prevent priority inversion
- Group similar-lifetime objects together to reduce fragmentation through heap segregation
- Monitor fragmentation metrics to detect degradation before failure occurs
- Use hybrid strategies that combine multiple techniques for robust operation
Predictability is essential for long-running embedded devices like Hoomanely's IoT modules. Memory allocation strategy is not an optimization detail; it's a fundamental architectural decision that determines whether your device runs reliably for hours or years. The strategies outlined here form a hierarchy: static allocation as the foundation, pools and caps for controlled dynamism, and arenas for temporary workloads. Together, they enable predictable operation under diverse workloads over extended timescales.