Follow-up to PR #2291 (nv/develop/deflate-strategy-locals-hoist). After the scalar-local hoist landed, tried two variants in deflate_quick.c to see whether packing the two uint32_t fields into a struct could coax the compiler into 64-bit load/store transfers.
- Apple M5, macOS 26.4.1, Apple clang 21.0.0
- AArch64 native, x86_64 cross-compile via
CMAKE_OSX_ARCHITECTURES=x86_64 - CMake Release,
-D BUILD_SHARED_LIBS=OFF