Glibc Math Code Sees 4x Improvement On AMD Zen By Changing FMA Implementation
Merged this week into the GNU C Library (glibc) is a change that drops the library's ldbl-96 FMA implementation. In doing so, the developers found a 4x improvement in throughput and latency on AMD Zen 3 hardware…