slide 13 of 23
Unpartitioned Ops with Correction
Can be expensive, especially for saturation
Speedup comes from:
More parallelism (e.g., 2-bit fields)
Vectorized field Load/Store
Reduced register pressure
No partial-value forwarding stalls