Make PREFERRED_SIZE, GEMM_DIVIDE_LIMIT and _RATE available to DYNAMIC_ARCH builds #5688

Merged
martin-frbg merged 6 commits into OpenMathLib:develop from martin-frbg:divlimit_dyn
Mar 20, 2026

Conversation

@martin-frbg
Collaborator

@martin-frbg martin-frbg commented Mar 19, 2026

Closes a performance gap compared to dedicated builds for CPUs that have non-default settings (mainly newer x86_64, Arm Neoverse and Fujitsu A64FX). Noticed while looking into numpy/numpy#30816.

@mattip
Contributor

mattip commented Mar 20, 2026

I think this is still needed to get the parameter through, right?

@martin-frbg
Collaborator Author

Yes, but it is "only" a performance-related fix (and only relevant to Neoverse V1 - also Fugaku if/when I add the related GEMM_DIVIDE_RATE passthrough)

martin-frbg added this to the 0.3.32 milestone Mar 20, 2026
martin-frbg changed the title from "[WIP,Testing] Make GEMM_DIVIDE_LIMIT available to DYNAMIC_ARCH builds" to "Make PREFERRED_SIZE, GEMM_DIVIDE_LIMIT and _RATE available to DYNAMIC_ARCH builds" Mar 20, 2026
martin-frbg merged commit a89142f into OpenMathLib:develop Mar 20, 2026
100 of 102 checks passed
