CUDA on ??? GPUs
Find a file
Andrzej Janik 6c811a55d2
Random fixes (#504)
This is a collection of random changes coming from the workload I'm working on. The most important change is better support for `.params`: PTX uses .param namespace both for some local variables and kernel args. This is a problem for us because those are different address spaces on AMDGPU. So far we've made an effort to convert to local and const namespaces whenever possible, but this commit tries to handle more patterns, which are impossible to track precisely, by converting to generic space.
2025-09-12 13:52:33 -07:00
.cargo Try to make ZLUDA more robust on Windows (#442) 2025-07-28 13:20:04 -07:00
.devcontainer Create infrastructure for performance libraries (#363) 2025-05-01 22:37:18 +02:00
.github Enhance bug report template with ZLUDA version input (#494) 2025-09-03 16:20:34 -07:00
comgr Add more instructions, tighten generated assembly (#475) 2025-08-18 11:12:57 -07:00
compiler Random fixes (#504) 2025-09-12 13:52:33 -07:00
cuda_macros Random fixes (#504) 2025-09-12 13:52:33 -07:00
cuda_types Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
dark_api Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
detours-sys Redo logging to better log dark API and performance libraries (#372) 2025-06-09 15:29:14 -07:00
docs Rework the documentation and landing page (#474) 2025-09-03 14:11:46 -07:00
ext Random fixes (#504) 2025-09-12 13:52:33 -07:00
format Add more NVML and cuBLAS coverage (#481) 2025-08-26 12:55:12 -07:00
llvm_zluda Always use Unix line endings (#453) 2025-07-30 15:09:47 -07:00
ptx Random fixes (#504) 2025-09-12 13:52:33 -07:00
ptx_parser Random fixes (#504) 2025-09-12 13:52:33 -07:00
ptx_parser_macros Add pass test mechanism for insert_implicit_conversions (#477) 2025-08-22 13:01:39 -07:00
ptx_parser_macros_impl Add pass test mechanism for insert_implicit_conversions (#477) 2025-08-22 13:01:39 -07:00
ptxas Random fixes (#504) 2025-09-12 13:52:33 -07:00
xtask Implement kernel cache (#465) 2025-08-11 09:20:40 -07:00
zluda Random fixes (#504) 2025-09-12 13:52:33 -07:00
zluda_bindgen Add more NVML and cuBLAS coverage (#481) 2025-08-26 12:55:12 -07:00
zluda_blas Some fixes to BLASLt (#482) 2025-08-26 14:28:36 -07:00
zluda_blaslt Some fixes to BLASLt (#482) 2025-08-26 14:28:36 -07:00
zluda_cache Implement kernel cache (#465) 2025-08-11 09:20:40 -07:00
zluda_common Random fixes (#504) 2025-09-12 13:52:33 -07:00
zluda_dnn Check Rust formatting on pull requests (#451) 2025-07-30 14:55:09 -07:00
zluda_fft Check Rust formatting on pull requests (#451) 2025-07-30 14:55:09 -07:00
zluda_inject Rename zluda_dump to zluda_trace (#456) 2025-07-31 08:07:03 -07:00
zluda_ml Add more NVML and cuBLAS coverage (#481) 2025-08-26 12:55:12 -07:00
zluda_preload Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_redirect Make sure it is possible to log 32bit PhysX (#374) 2025-07-08 10:19:49 -07:00
zluda_sparse Check Rust formatting on pull requests (#451) 2025-07-30 14:55:09 -07:00
zluda_trace Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
zluda_trace_blas Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_blaslt Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_common Add nvml tracing (#476) 2025-08-18 23:09:15 -07:00
zluda_trace_dnn Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_fft Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
zluda_trace_nvml Add nvml tracing (#476) 2025-08-18 23:09:15 -07:00
zluda_trace_sparse Force loading ZLUDA through LD_PRELOAD (#447) 2025-07-31 18:00:13 -07:00
.git-blame-ignore-revs Make blame ignore formatting commit (#502) 2025-09-09 13:12:31 -07:00
.gitattributes Fix linguist instructions 2024-08-07 13:29:03 +02:00
.gitignore Add support for shfl.sync.MODE.b32 (#409) 2025-07-16 17:23:11 -07:00
.gitmodules Fix host code and update to CUDA 12.4 (#299) 2024-12-02 00:29:57 +01:00
.rustfmt.toml Always use Unix line endings (#453) 2025-07-30 15:09:47 -07:00
Cargo.lock Progress compilation despite parsing errors (#495) 2025-09-08 14:35:29 -07:00
Cargo.toml Add small compiler fixes and a fake ptxas binary (#491) 2025-09-03 12:23:01 -07:00
geekbench.svg Update README for version 4 (#315) 2024-12-31 17:33:59 +01:00
LICENSE-APACHE Update wording, add license 2020-11-24 23:23:53 +01:00
LICENSE-MIT Update wording, add license 2020-11-24 23:23:53 +01:00
README.md Update quick start link in README.md (#493) 2025-09-03 14:18:28 -07:00

ZLUDA is a drop-in replacement for CUDA on non-NVIDIA GPUs. ZLUDA allows running unmodified CUDA applications using non-NVIDIA GPUs with near-native performance