LLVM 10.0.0 Release Notes¶
- Introduction
- Non-comprehensive list of changes in this release
- Changes to the LLVM IR
- Changes to building LLVM
- Changes to the ARM Backend
- Changes to the MIPS Target
- Changes to the PowerPC Target
- Changes to the SystemZ Target
- Changes to the X86 Target
- Changes to the AMDGPU Target
- Changes to the AVR Target
- Changes to the WebAssembly Target
- Changes to the Windows Target
- Changes to the OCaml bindings
- Changes to the C API
- Changes to the Go bindings
- Changes to the DAG infrastructure
- Changes to LLDB
- External Open Source Projects Using LLVM 10
- Additional Information
Warning
These are in-progress notes for the upcoming LLVM 10 release. Release notes for previous releases can be found on the Download Page.
Introduction¶
This document contains the release notes for the LLVM Compiler Infrastructure, release 10.0.0. Here we describe the status of LLVM, including major improvements from the previous release, improvements in various subprojects of LLVM, and some of the current users of the code. All LLVM releases may be downloaded from the LLVM releases web site.
For more information about LLVM, including information about the latest release, please check out the main LLVM web site. If you have questions or comments, the LLVM Developer’s Mailing List is a good place to send them.
Note that if you are reading this file from a Subversion checkout or the main LLVM web page, this document applies to the next release, not the current one. To see the release notes for a specific release, please see the releases page.
Non-comprehensive list of changes in this release¶
- The ISD::FP_ROUND_INREG opcode and related code was removed from SelectionDAG.
- Enabled MemorySSA as a loop dependency. Since r370957 (D58311 [MemorySSA & LoopPassManager] Enable MemorySSA as loop dependency. Update tests.), the MemorySSA analysis is being preserved and used by a series of loop passes. The most significant use is in LICM, where instruction hoisting and sinking rely on aliasing information provided by MemorySSA instead of creating an AliasSetTracker as before. The LICM step of promoting variables to scalars still relies on the creation of an AliasSetTracker, but its use is now enabled only for loops with a small number of overall memory instructions. This choice was motivated by experimental results showing compile-time and run-time benefits of replacing the AliasSetTracker usage with MemorySSA, without any performance penalties. The fact that MemorySSA is now preserved by and available in a series of loop passes also opens up opportunities for its use in those passes.
- The BasicBlockPass, BBPassManager and all their uses were deleted in this revision.
- The LLVM_BUILD_LLVM_DYLIB and LLVM_LINK_LLVM_DYLIB CMake options are no longer available on Windows.
- As per the LLVM Language Reference Manual, getelementptr inbounds cannot change the null status of a pointer: it cannot produce a non-null pointer from a null base pointer, and likewise it cannot produce a null pointer from a non-null base pointer; if it does, the result is a poison value. Since r369789 (D66608 [InstCombine] icmp eq/ne (gep inbounds P, Idx..), null -> icmp eq/ne P, null), LLVM uses that for transformations. If the original source violates these requirements, this may result in code being miscompiled. If you are using the Clang front-end, the Undefined Behaviour Sanitizer -fsanitize=pointer-overflow check will now catch such cases.
- Windows Control Flow Guard: the -cfguard option now emits CFG checks on indirect function calls. The previous behavior is still available with the -cfguard-nochecks option. Note that this feature should always be used with optimizations enabled.
- Callbacks have been added to CommandLine Options. These can be used to validate or selectively enable other options.
- The function attributes no-frame-pointer-elim and no-frame-pointer-elim-non-leaf have been replaced by frame-pointer, which has 3 values: none, non-leaf, and all. The values indicate which functions should retain frame pointers.
- …
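The meaning of the three frame-pointer values can be modeled as a small decision function. The following is only an illustrative sketch in standalone C++, not LLVM's implementation: the name keepsFramePointer is hypothetical, and it assumes that a "leaf" function is one that makes no calls.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical model of the "frame-pointer" attribute, not LLVM code.
// framePointer is the attribute's string value; makesCalls says whether
// the function contains any call (assumption: a leaf function has none).
inline bool keepsFramePointer(const std::string &framePointer, bool makesCalls) {
  if (framePointer == "all")
    return true;        // every function retains its frame pointer
  if (framePointer == "non-leaf")
    return makesCalls;  // only functions that make calls retain it
  if (framePointer == "none")
    return false;       // no function is required to retain it
  throw std::invalid_argument("unknown frame-pointer value: " + framePointer);
}
```

Under this model, a leaf function with the value non-leaf may drop its frame pointer, while the same function with the value all keeps it.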
Changes to the LLVM IR¶
- Unnamed function arguments now get printed with their automatically generated name (e.g. “i32 %0”) in definitions. This may require front-ends to update their tests; if so there is a script utils/add_argument_names.py that correctly converted 80-90% of Clang tests. Some manual work will almost certainly still be needed.
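To see why existing tests may need updating, it helps to model how unnamed arguments are numbered. The sketch below is standalone C++ and does not use the LLVM API; the helper printedArgNames is hypothetical. It assumes only that unnamed values are numbered sequentially from 0 within a function, while named values keep their names.

```cpp
#include <string>
#include <vector>

// Hypothetical helper, not part of LLVM: given each argument's name
// (an empty string means "unnamed"), return the name shown when printing.
inline std::vector<std::string>
printedArgNames(const std::vector<std::string> &names) {
  std::vector<std::string> printed;
  unsigned nextSlot = 0; // unnamed values share one per-function counter
  for (const std::string &name : names) {
    if (name.empty())
      printed.push_back("%" + std::to_string(nextSlot++)); // e.g. %0, %1
    else
      printed.push_back("%" + name);                       // e.g. %x
  }
  return printed;
}
```

For a function with arguments (unnamed, x, unnamed), this model yields %0, %x and %1, matching the "i32 %0" form mentioned above.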
Changes to the ARM Backend¶
During this release …
Changes to the MIPS Target¶
- Improved support for octeon and added support for octeon+ MIPS-family CPU.
- min, max, umin, umax atomics are now supported on MIPS targets.
- PC-relative relocations are now generated for .eh_frame sections when possible. This allows MIPS binaries to be linked without having to pass the -Wl,-z,notext option.
- Fixed evaluation of J-format branch (j, jal, …) targets when the instruction is not in the first 256 MB region.
- Fixed expansion of the jal, sc, scs, ll, lld, la, lw, sw instructions. They now accept more types of expression as arguments, correctly handle load/store for the XGOT model, and expand using fewer instructions or registers.
- Initial MIPS support has been added to llvm-exegesis.
- _mcount calls are now generated using the proper MIPS ABI.
- Improved support of the GlobalISel instruction selection framework. This feature is still in an experimental state for MIPS targets though.
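The 256 MB region in the J-format item comes from the instruction encoding: a MIPS32 J-format instruction carries a 26-bit word index, and the upper four address bits are taken from the address of the instruction in the delay slot. A minimal sketch of that computation follows. It is standalone C++, not LLVM's code, and the function name jFormatTarget is hypothetical.

```cpp
#include <cstdint>

// Hypothetical helper, not LLVM code: compute the target of a MIPS32
// J-format instruction (j, jal) whose encoding is instr and which is
// located at address pc.
inline std::uint32_t jFormatTarget(std::uint32_t pc, std::uint32_t instr) {
  const std::uint32_t index = instr & 0x03FFFFFFu;      // low 26 bits: word index
  const std::uint32_t region = (pc + 4u) & 0xF0000000u; // 256 MB region of the delay slot
  return region | (index << 2);                         // byte address of the target
}
```

An evaluator that ignores the region bits is only correct when the instruction lies in the first 256 MB of the address space, which is the case the fix above addresses.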
Changes to the PowerPC Target¶
Optimization:
- Improved register pressure estimates in the loop vectorizer based on type
- Improved the PowerPC cost model for the vectorizer
- Enabled vectorization of math routines on PowerPC using MASSV (Mathematical Acceleration SubSystem) library
compiler-rt:
- Added/improved conversion functions from IBM long double to 128-bit integers
Codegen:
- Optimized memory access instructions in loops (pertaining to update-form instructions and address computation)
- Added options to disable hoisting instructions to hotter blocks based on statically or profile-based block hotness estimates
- Code generation improvements (particularly with floating point and vector code as well as handling condition registers)
- Various infrastructural improvements, code refactoring, and bug fixes
- Optimized handling of control flow based on multiple comparisons of the same values
Tools:
- llvm-readobj supports displaying file header, section headers, symbol table and relocation entries for XCOFF object files
- llvm-objdump supports disassembling physical sections for XCOFF object files
Changes to the SystemZ Target¶
- Added support for the -march=z15 and -mtune=z15 command line options (as aliases to the existing -march=arch13 and -mtune=arch13 options).
- Added support for the -march=native command line option.
- Added support for the -mfentry, -mnop-mcount, and -mrecord-mcount command line options.
- Added support for the GHC calling convention.
- Miscellaneous codegen enhancements, in particular to enable better reuse of condition code values and improved use of conditional move instructions.
Changes to the X86 Target¶
During this release …
- Vector types narrower than 128 bits (v2i32, v4i16, v2i16, v8i8, v4i8, and v2i8) are now stored in the lower bits of an xmm register, and the upper bits are undefined. Previously the elements were spread apart with undefined bits in between them.
- v32i8 and v64i8 vectors with AVX512F enabled but AVX512BW disabled will now be passed in ZMM registers for calls and returns. Previously they were passed in two YMM registers. The old behavior can be enabled by passing -x86-enable-old-knl-abi.
- -mprefer-vector-width=256 is now the default behavior for skylake-avx512 and later Intel CPUs. This tries to limit the use of 512-bit registers, which can cause a decrease in CPU frequency on these CPUs. The use of 512-bit vectors can be re-enabled by passing -mprefer-vector-width=512 to clang or -mattr=-prefer-256-bit to llc.
- Deprecated the mpx feature flag for the Intel MPX instructions. There were no intrinsics for this feature. This change only affects the results returned by getHostCPUFeatures on CPUs that implement the MPX instructions.
- The feature flag fast-partial-ymm-or-zmm-write which previously disabled vzeroupper insertion has been removed. It has been replaced with a vzeroupper feature flag which has the opposite polarity. So -vzeroupper has the same effect as +fast-partial-ymm-or-zmm-write.
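Because the replacement flag has the opposite polarity, existing feature strings need their sign flipped when they are migrated. The sketch below shows one way to express that rule in standalone C++. The helper migrateFeature is hypothetical, and the mapping of -fast-partial-ymm-or-zmm-write to +vzeroupper is inferred from the stated polarity change rather than spelled out above.

```cpp
#include <string>

// Hypothetical helper, not part of LLVM: rewrite a single feature entry
// (such as one element of an -mattr list) to the new flag.
inline std::string migrateFeature(const std::string &feature) {
  if (feature == "+fast-partial-ymm-or-zmm-write")
    return "-vzeroupper"; // equivalence stated in the notes above
  if (feature == "-fast-partial-ymm-or-zmm-write")
    return "+vzeroupper"; // inferred from the opposite polarity
  return feature;         // all other features pass through unchanged
}
```

Entries that do not mention the removed flag are returned unchanged, so the helper can be applied to every element of a feature list.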
Changes to the AVR Target¶
During this release …
Changes to the WebAssembly Target¶
During this release …
Changes to the Windows Target¶
- Fixed section relative relocations in .debug_frame in DWARF debug info
Changes to the C API¶
- The C DebugInfo API LLVMDIBuilderCreateTypedef is updated to include an extra argument AlignInBits, which propagates the alignment specified on a typedef to the debug information in LLVM IR.
Changes to the Go bindings¶
- The Go DebugInfo API CreateTypedef is updated to include an extra argument AlignInBits, which propagates the alignment specified on a typedef to the debug information in LLVM IR.
Changes to LLDB¶
- Improved support for building with MinGW
- Initial support for debugging Windows ARM and ARM64 binaries
External Open Source Projects Using LLVM 10¶
Zig Programming Language¶
Zig is a system programming language intended to be an alternative to C. It provides high level features such as generics, compile time function execution, and partial evaluation, while exposing low level LLVM IR features such as aliases and intrinsics. Zig uses Clang to provide automatic import of .h symbols, including inline functions and simple macros. Zig uses LLD combined with lazily building compiler-rt to provide out-of-the-box cross-compiling for all supported targets.
Additional Information¶
A wide variety of additional information is available on the LLVM web page, in particular in the documentation section. The web page also contains versions of the API documentation which are up-to-date with the Subversion version of the source code. You can access versions of these documents specific to this release by going into the llvm/docs/ directory in the LLVM tree.
If you have any questions or comments about LLVM, please feel free to contact us via the mailing lists.