x86 cpus' Guide

 Search (OK)

 

Statistics
Collections : 23367  cpu
Database :
11105  cpu
7855  motherboard
17110  graphic_card
For sale : 435  cpu
Pictures : 27220  photos
 Add to favorites
 Homepage
Site map Site map

Français English Español Dutch

 Log in
 Register

Processor of the day

Signetics_MP8080AI_top.jpg
Signetics 8080A-1 CerDIP

Most popular CPUs

Intel Core 2 Duo E8800 (ES)
Intel Core 2 Duo E6200 (ES)
Intel Core 2 Duo E7700
Intel Pentium II 266 (0,35µ)
Intel Core 2 Duo E7800

Intel Core i7-3770
Intel Core i5-3470
Intel Core i5-2400
Intel Core i5-4570
Intel Core i7-2600

Most powerful CPUs
Desktop PCs
AMD : Ryzen 9 7950X
Intel : Core i9-13900KF
Laptop PCs
AMD : Ryzen 9 6980HX
Intel : Core i9-12950HX
Servers
AMD : EPYC 9654P
Intel : Xeon Platinium 9282

Other articles > X86 Glossary > 3-operand Fused Multiply-Add instructions (FMA3)

The FMA instruction set is an extension to the 128 and 256-bit Streaming SIMD Extensions instructions in the x86 microprocessor instruction set to perform fused multiply–add (FMA) operations. There are two variants: - FMA4 is supported in AMD processors starting with the Bulldozer architecture. FMA4 was realized in hardware before FMA3. - FMA3 is supported in AMD processors starting with the Piledriver architecture and Intel starting with Haswell processors and Broadwell processors since 2014. FMA3 and FMA4 instructions have almost identical functionality, but are not compatible. Both contain fused multiply–add (FMA) instructions for floating-point scalar and SIMD operations, but FMA3 instructions have three operands, while FMA4 ones have four. The FMA operation has the form d = round(a · b + c), where the round function performs a rounding to allow the result to fit within the destination register if there are too many significant bits to fit within the destination. The four-operand form (FMA4) allows a, b, c and d to be four different registers, while the three-operand form (FMA3) requires that d be the same register as a, b or c. The three- operand form makes the code shorter and the hardware implementation slightly simpler, while the four-operand form provides more programming flexibility.


Used for : AMD APU, AMD Athlon, AMD EPYC, AMD FX, AMD Opteron, AMD Ryzen, AMD Sempron, Intel , Intel Celeron, Intel Core i3, Intel Core i5, Intel Core i7, Intel Core i9, Intel Core M, Intel Core i3, Intel Core i5, Intel Core i7, Intel Pentium, Intel Xeon