Fix cpu performance debug builds #651

vlovich · 2025-02-13T20:44:09Z

Fixes #649 but does not fix the 30% perf reduction from stock CPU llama.cpp cli as I haven't tracked that down.

I first did a cleanup commit that should be identical behavior as before but made the OS target detection logic a lot more robust in cross-compilation environments through the use of an enum instead of ad-hoc places that accidentally used cfg!(windows) / cfg!(target_os =) which pick up the host not the target. Hopefully that's OK. Couldn't find a crate that did this out of the box unfortunately but I didn't look that hard. The second commit that has the fix is a lot smaller.

The OS detection code was bothering me as it wasn't properly doing cross compilation (some places were and some weren't). Additionally, the OS detection was a bit haphazard. This is a pure cleanup that parses the information in TARGET up-front into an enum that is then checked instead of working with strings.

Workaround for rust-lang/cmake-rs#240

vlovich added 2 commits February 13, 2025 12:43

Fix CPU inference performance when building MSVC Rust debug

5a4dbd4

Workaround for rust-lang/cmake-rs#240

MarcusDunn approved these changes Feb 13, 2025

View reviewed changes

MarcusDunn merged commit 5c8e81b into utilityai:main Feb 14, 2025
2 of 5 checks passed

AsbjornOlling mentioned this pull request Feb 20, 2025

Windows performance in Release profile seems crippled when building dev Cargo profile (RelWithDebInfo is faster) #649

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix cpu performance debug builds #651

Fix cpu performance debug builds #651

vlovich commented Feb 13, 2025 •

edited

Loading

Fix cpu performance debug builds #651

Fix cpu performance debug builds #651

Conversation

vlovich commented Feb 13, 2025 • edited Loading

vlovich commented Feb 13, 2025 •

edited

Loading