Partitions

Warning

This feature is currently experimental, and its interface is subject to change.

LLD’s partitioning feature allows a program (which may be an executable or a shared library) to be split into multiple pieces, or partitions. A partitioned program consists of a main partition together with a number of loadable partitions. The loadable partitions depend on the main partition in a similar way to a regular ELF shared object dependency, but unlike a shared object, the main partition and the loadable partitions share a virtual address space at link time, and each loadable partition is assigned a fixed offset from the main partition. This allows the loadable partitions to refer to code and data in the main partition directly without the binary size and performance overhead of PLTs, GOTs or symbol table entries.

Usage

A program that uses the partitioning feature must decide which symbols are going to be used as the “entry points” for each partition. An entry point could, for example, be the equivalent of the partition’s main function, or there could be a group of functions that expose the functionality implemented by the partition. The intent is that in order to use a loadable partition, the program will use dlopen/dlsym or similar functions to dynamically load the partition at its assigned address, look up an entry point by name and call it. Note, however, that the standard dlopen function does not allow specifying a load address. On Android, the android_dlopen_ext function may be used together with the ANDROID_DLEXT_RESERVED_ADDRESS flag to load a shared object at a specific address.

Once the entry points have been decided, the translation unit(s) containing the entry points should be compiled using the Clang compiler flag -fsymbol-partition=<soname>, where <soname> is the intended soname of the partition. The resulting object files are passed to the linker in the usual way.

The linker will then use these entry points to automatically split the program into partitions according to which sections of the program are reachable from which entry points, similarly to how --gc-sections removes unused parts of a program. Any sections that are only reachable from a loadable partition’s entry point are assigned to that partition, while all other sections are assigned to the main partition, including sections only reachable from loadable partitions.

The following diagram illustrates how sections are assigned to partitions. Each section is colored according to its assigned partition.

_images/partitions.svg

The result of linking a program that uses partitions is essentially an ELF file with all of the partitions concatenated together. This file is referred to as a combined output file. To extract a partition from the combined output file, the llvm-objcopy tool should be used together with the flag --extract-main-partition to extract the main partition, or -extract-partition=<soname> to extract one of the loadable partitions. An example command sequence is shown below:

# Compile the main program.
clang -ffunction-sections -fdata-sections -c main.c

# Compile a feature to be placed in a loadable partition.
# Note that this is likely to be a separate build step to the main partition.
clang -ffunction-sections -fdata-sections -fsymbol-partition=libfeature.so -c feature.c

# Link the combined output file.
clang main.o feature.o -fuse-ld=lld -shared -o libcombined.so -Wl,-soname,libmain.so -Wl,--gc-sections

# Extract the partitions.
llvm-objcopy libcombined.so libmain.so --extract-main-partition
llvm-objcopy libcombined.so libfeature.so --extract-partition=libfeature.so

In order to allow a program to discover the names of its loadable partitions and the locations of their reserved regions, the linker creates a partition index, which is an array of structs with the following definition:

struct partition_index_entry {
  int32_t name_offset;
  int32_t addr_offset;
  uint32_t size;
};

The name_offset field is a relative pointer to a null-terminated string containing the soname of the partition, the addr_offset field is a relative pointer to its load address and the size field contains the size of the region reserved for the partition. To derive an absolute pointer from the relative pointer fields in this data structure, the address of the field should be added to the value stored in the field.

The program may discover the location of the partition index using the linker-defined symbols __part_index_begin and __part_index_end.

Restrictions

This feature is currently only supported in the ELF linker.

The partitioning feature may not currently be used together with the SECTIONS or PHDRS linker script features, nor may it be used with the --section-start, -Ttext, -Tdata or -Tbss flags. All of these features assume a single set of output sections and/or program headers, which makes their semantics ambiguous in the presence of more than one partition.

The partitioning feature may not currently be used on the MIPS architecture because it is unclear whether the MIPS multi-GOT ABI is compatible with partitions.

The current implementation only supports creating up to 254 partitions due to implementation limitations. This limit may be relaxed in the future.