RFC-0145: Remove the host-side runtime memory allocator


Start Date	2025-05-16
Description	Update the runtime-host interface to no longer make use of a host-side allocator
Authors	Pierre Krieger, Someone Unknown

Summary

Update the runtime-host interface so that it no longer uses the host-side allocator.

Prior Art

The API of these new functions was heavily inspired by the API used by the C programming language.

This RFC is mainly based on RFC-4 by @tomaka, which was never adopted, and this RFC supersedes it.

Changes from RFC-4

The original RFC required checking if an output buffer address provided to a host function is inside the VM address space range and to stop the runtime execution if that's not the case. That requirement has been removed in this version of the RFC, as in the general case, the host doesn't have exhaustive information about the VM's memory organization. Thus, attempting to write to an out-of-bounds region will result in a "normal" runtime panic.
Function signatures introduced by PPP#7 have been used in this RFC, as the PPP has already been properly implemented and documented. However, it has never been officially adopted, nor have its functions been in use.
Return values were harmonized to i64 everywhere where they represent either a positive outcome as a positive integer or a negative outcome as a negative error code.
ext_offchain_network_peer_id_version_1 now returns a result code instead of silently failing if the network status is unavailable.
Added new versions of ext_misc_runtime_version and ext_offchain_random_seed.
Addressed discussions from the original RFC-4 discussion thread.

Motivation

The heap allocation of the runtime is currently controlled by the host using a memory allocator on the host side.

The API of many host functions contains buffer allocations. For example, when calling ext_hashing_twox_256_version_1, the host allocates a 32-byte buffer using the host allocator, and returns a pointer to this buffer to the runtime. The runtime later has to call ext_allocator_free_version_1 on this pointer to free the buffer.

Even though no benchmark has been done, it is pretty obvious that this design is very inefficient. To continue with the example of ext_hashing_twox_256_version_1, it would be more efficient to instead write the output hash to a buffer allocated by the runtime on its stack and passed by pointer to the function. Allocating a buffer on the stack, in the worst case, consists simply of decreasing a number; in the best case, it is free. Doing so would save many VM memory reads and writes by the allocator, and would save a function call to ext_allocator_free_version_1.

Furthermore, the existence of the host-side allocator has become questionable over time. It is implemented in a very naive way: every allocation is rounded up to the next power of two, and once a piece of memory is allocated it can only be reused for allocations which also round up to the exactly the same size. So in theory it's possible to end up in a situation where we still technically have plenty of free memory, but our allocations will fail because all of that memory is reserved for differently sized buckets. That behavior is de-facto hardcoded into the current protocol and for determinism and backwards compatibility reasons, it needs to be implemented exactly identically in every client implementation.

In addition to that, runtimes make substantial use of heap memory allocations, and each allocation needs to go through the runtime <-> host boundary twice (once for allocating and once for freeing). Moving the allocator to the runtime side would be a good idea, although it would increase the runtime size. But before the host-side allocator can be deprecated, all the host functions that use it must be updated to avoid using it.

Stakeholders

Runtime developers, who will benefit from the improved performance and more deterministic behavior of the runtime code.

Explanation

New definitions

New Definition I: Runtime Optional Positive Integer

By a Runtime Optional Positive Integer we refer to an abstract value $r \in \mathcal{R}$ where $\mathcal{R} := {\bot} \cup {0, 1, \dots, 2^{32} - 1},$ and where $\bot$ denotes the absent value.

At the Host-Runtime interface this type is represented by a signed 64-bit integer $x \in \mathbb{Z}$ (thus $\mathbb{Z} := {-2^{63}, \dots, 2^{63} - 1}$).

We define the encoding function $\mathrm{Enc}{\mathrm{ROP}} : \mathcal{R} \to \mathbb{Z}$ and decoding function $\mathrm{Dec}{\mathrm{ROP}} : \mathbb{Z} \to \mathcal{R} \cup {\mathrm{error}}$ as follows.

For $r \in \mathcal{R}$,

$$ \mathrm{Enc}_{\mathrm{ROP}}(r) := \begin{cases} -1 & \text{if } r = \bot, \ r & \text{if } r \in {0, 1, \dots, 2^{32} - 1}. \end{cases} $$

For a signed 64-bit integer $x$,

$$ \mathrm{Dec}_{\mathrm{ROP}}(x) := \begin{cases} \bot & \text{if } x = -1, \ x & \text{if } 0 \le x < 2^{32}, \ \mathrm{error} & \text{otherwise.} \end{cases} $$

A valid Runtime Optional Positive Integer at the Host-Runtime boundary is any 64-bit signed integer $x$ such that $x \in {-1} \cup {0, 1, \dots, 2^{32} - 1}$. All other 64-bit integer values are invalid for this type.

Conforming implementations must not produce invalid values when encoding. Receivers must abort execution if decoding results in $\mathrm{error}$.

New Definition II: Runtime Optional Pointer-Size

The Runtime optional pointer-size has exactly the same definition as Runtime pointer-size (Definition 216) with the value of 2⁶⁴-1 representing a non-existing value (an absent value).

Memory safety

Pointers to input parameters passed to the host must reference readable memory regions. The host must abort execution if the memory region referenced by the pointer cannot be read, unless explicitly stated otherwise.

Pointers to output parameters passed to the host must reference writeable memory regions. The host must abort execution if the memory region referenced by the pointer cannot be written, in case it is performing the actual write, unless explicitly stated otherwise.

Changes to host functions

ext_storage_get

Existing prototype

(func $ext_storage_get_version_1
    (param $key i64) (result i64))

Polkadot Fellowship RFCs

New Definition I: Runtime Optional Positive Integer

New Definition II: Runtime Optional Pointer-Size