CUDA on ??? GPUs
Find a file
Violet e3a99b7ee1
Some checks failed
ZLUDA / Build (Linux) (push) Has been cancelled
ZLUDA / Build (Windows) (push) Has been cancelled
ZLUDA / Build AMD GPU unit tests (push) Has been cancelled
ZLUDA / Run AMD GPU unit tests (push) Has been cancelled
Fix min.ftz.nan.f16 for ROCm 6.3.4 (#506)
This PR fixes compatibility issues with ROCm 6.3.4 by replacing the unavailable llvm.minimum.f16 intrinsic with an equivalent implementation using llvm.minnum.f16 combined with NaN handling logic. Same applies to max operations

This also adds `"amdgpu-ieee"="false"` attribute everywhere, which gives us better codegen for min/max everywhere
2025-09-12 16:48:47 -07:00
.cargo Try to make ZLUDA more robust on Windows (#442) 2025-07-28 13:20:04 -07:00
.devcontainer Create infrastructure for performance libraries (#363) 2025-05-01 22:37:18 +02:00
.github Enhance bug report template with ZLUDA version input (#494) 2025-09-03 16:20:34 -07:00
comgr Add more instructions, tighten generated assembly (#475) 2025-08-18 11:12:57 -07:00
compiler Random fixes (#504) 2025-09-12 13:52:33 -07:00
cuda_macros Random fixes (#504) 2025-09-12 13:52:33 -07:00
cuda_types Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
dark_api Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
detours-sys Redo logging to better log dark API and performance libraries (#372) 2025-06-09 15:29:14 -07:00
docs Rework the documentation and landing page (#474) 2025-09-03 14:11:46 -07:00
ext Random fixes (#504) 2025-09-12 13:52:33 -07:00
format Add more NVML and cuBLAS coverage (#481) 2025-08-26 12:55:12 -07:00
llvm_zluda Always use Unix line endings (#453) 2025-07-30 15:09:47 -07:00
ptx Fix min.ftz.nan.f16 for ROCm 6.3.4 (#506) 2025-09-12 16:48:47 -07:00
ptx_parser Random fixes (#504) 2025-09-12 13:52:33 -07:00
ptx_parser_macros Add pass test mechanism for insert_implicit_conversions (#477) 2025-08-22 13:01:39 -07:00
ptx_parser_macros_impl Add pass test mechanism for insert_implicit_conversions (#477) 2025-08-22 13:01:39 -07:00
ptxas Random fixes (#504) 2025-09-12 13:52:33 -07:00
xtask Implement kernel cache (#465) 2025-08-11 09:20:40 -07:00
zluda Random fixes (#504) 2025-09-12 13:52:33 -07:00
zluda_bindgen Add more NVML and cuBLAS coverage (#481) 2025-08-26 12:55:12 -07:00
zluda_blas Some fixes to BLASLt (#482) 2025-08-26 14:28:36 -07:00
zluda_blaslt Some fixes to BLASLt (#482) 2025-08-26 14:28:36 -07:00
zluda_cache Implement kernel cache (#465) 2025-08-11 09:20:40 -07:00
zluda_common Random fixes (#504) 2025-09-12 13:52:33 -07:00
zluda_dnn Check Rust formatting on pull requests (#451) 2025-07-30 14:55:09 -07:00
zluda_fft Check Rust formatting on pull requests (#451) 2025-07-30 14:55:09 -07:00
zluda_inject Rename zluda_dump to zluda_trace (#456) 2025-07-31 08:07:03 -07:00
zluda_ml Add more NVML and cuBLAS coverage (#481) 2025-08-26 12:55:12 -07:00
zluda_preload Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_redirect Make sure it is possible to log 32bit PhysX (#374) 2025-07-08 10:19:49 -07:00
zluda_sparse Check Rust formatting on pull requests (#451) 2025-07-30 14:55:09 -07:00
zluda_trace Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
zluda_trace_blas Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_blaslt Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_common Add nvml tracing (#476) 2025-08-18 23:09:15 -07:00
zluda_trace_dnn Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_fft Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_nvml Add nvml tracing (#476) 2025-08-18 23:09:15 -07:00
zluda_trace_sparse Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
.git-blame-ignore-revs Make blame ignore formatting commit (#502) 2025-09-09 13:12:31 -07:00
.gitattributes Fix linguist instructions 2024-08-07 13:29:03 +02:00
.gitignore Add support for shfl.sync.MODE.b32 (#409) 2025-07-16 17:23:11 -07:00
.gitmodules Fix host code and update to CUDA 12.4 (#299) 2024-12-02 00:29:57 +01:00
.rustfmt.toml Always use Unix line endings (#453) 2025-07-30 15:09:47 -07:00
Cargo.lock Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
Cargo.toml Add small compiler fixes and a fake ptxas binary (#491) 2025-09-03 12:23:01 -07:00
geekbench.svg Update README for version 4 (#315) 2024-12-31 17:33:59 +01:00
LICENSE-APACHE Update wording, add license 2020-11-24 23:23:53 +01:00
LICENSE-MIT Update wording, add license 2020-11-24 23:23:53 +01:00
README.md Update quick start link in README.md (#493) 2025-09-03 14:18:28 -07:00

ZLUDA is a drop-in replacement for CUDA on non-NVIDIA GPUs. ZLUDA allows running unmodified CUDA applications using non-NVIDIA GPUs with near-native performance