Commit graph

200 commits

Author SHA1 Message Date
Andrzej Janik
0ceeb7c74b Attempt 15 2024-04-27 01:23:14 +00:00
Andrzej Janik
b4b630c130 Attempt 14 2024-04-27 03:15:09 +02:00
Andrzej Janik
ec8e702936 Attempt 13 2024-04-27 03:03:58 +02:00
Andrzej Janik
db8ca658d7 Attempt 12 2024-04-27 02:58:00 +02:00
Andrzej Janik
b2d293a239 Attempt 11 2024-04-27 02:52:27 +02:00
Andrzej Janik
1f2df2c809 Attempt 10 2024-04-27 02:49:25 +02:00
Andrzej Janik
000c8ecbde Attempt 9 2024-04-27 02:45:42 +02:00
Andrzej Janik
ec9a054dc2 Attempt 8 2024-04-27 02:40:59 +02:00
Andrzej Janik
2d1cd17dd8 Attempt 7 2024-04-27 02:39:45 +02:00
Andrzej Janik
830f5127d4 Attempt 6 2024-04-27 02:35:07 +02:00
Andrzej Janik
3f8d17c068 Attempt 5 2024-04-27 02:22:31 +02:00
Andrzej Janik
dd5d2a2a85 Attempt 4 2024-04-27 02:17:46 +02:00
Andrzej Janik
b68c10fa45 Attempt 3 2024-04-27 00:13:59 +00:00
Andrzej Janik
430ab77d47 Actions attempt 2 2024-04-27 02:09:01 +02:00
Andrzej Janik
e6cf145aa9 Github build on Linux, attempt #1 2024-04-27 00:01:04 +00:00
Andrzej Janik
660a789e72 Special case for dumping from NV CUDA 2024-04-26 23:16:50 +00:00
Andrzej Janik
5effd6b4f8 Add missing file 2024-04-26 22:14:52 +00:00
Andrzej Janik
81ea153780 Implement xtask zip on Linux 2024-04-26 22:14:39 +00:00
Andrzej Janik
954a225a86 Add support for zip packaging, only on Windows for now 2024-04-19 22:56:01 +02:00
Andrzej Janik
5d5f7cca75
Rewrite surface implementation to more accurately support unofficial CUDA semantics (#203)
This fixes black screen in some CompuBench tests (TV-L1 Optical Flow) and other apps that use CUDA surfaces incorrectly
2024-04-14 02:39:34 +02:00
Andrzej Janik
774f4bcb37
Implement sad instruction (#198) 2024-04-06 01:23:53 +02:00
Andrzej Janik
0d9ace2475
Fix buggy carry flags when mixing subc/sub.cc with addc/add.cc (#197) 2024-04-05 23:26:08 +02:00
NyanCatTW1
76bae5f91b
Implement mad.hi.cc (#196) 2024-04-05 19:12:59 +02:00
Andrzej Janik
b695f44c18
Support old PTX compression scheme (#188) 2024-03-29 02:03:23 +01:00
Andrzej Janik
7d4147c8b2
Add Blender 4.2 support (#184)
Redo primary context and fix various long-standing bugs around this API
2024-03-28 17:12:10 +01:00
Andrzej Janik
1ede61c696
Disable even more optional LLVM components (#179) 2024-03-17 14:53:15 +01:00
Andrzej Janik
f47a93a951
Fix reported build errors (#178) 2024-03-17 01:32:48 +01:00
Ikko Eltociear Ashimine
14a4016964
Update README.md (#166)
underying -> underlying
2024-03-08 01:35:05 +01:00
Seb Ospina
af0216b1a0
Fix adrenalin software link (#139)
The link that should be for AMD Adrenalin was pointing to ROCm linux info
2024-02-26 12:43:46 +01:00
Andrzej Janik
4a81dbffb5
Update llama.cpp support (#102)
Add sign extension support to prmt, allow set.<op>.f16x2.f16x2, add more BLAS mappings
2024-02-16 00:01:21 +01:00
Ikko Eltociear Ashimine
9f7be97ef6
Update README.md (#100)
uderlying -> underlying
2024-02-15 18:15:31 +01:00
Andrzej Janik
8d10f756a9
Add troubleshooting/debugging instructions (#91) 2024-02-15 13:25:52 +01:00
ManInDark
c884348427
Fixed typo in readme (#89) 2024-02-15 01:38:42 +01:00
Arna13
0c3bf2d9d0
Fixing typo in README.md (#63) 2024-02-13 21:57:51 +01:00
Sean McLemon
f2a44e0e05
Tidy up some English in ARCHITECTURE.md (#61) 2024-02-13 21:55:21 +01:00
Andrzej Janik
1b9ba2b233 Nobody expects the Red Team
Too many changes to list, but broadly:
* Remove Intel GPU support from the compiler
* Add AMD GPU support to the compiler
* Remove Intel GPU host code
* Add AMD GPU host code
* More device instructions. From 40 to 68
* More host functions. From 48 to 184
* Add proof of concept implementation of OptiX framework
* Add minimal support of cuDNN, cuBLAS, cuSPARSE, cuFFT, NCCL, NVML
* Improve ZLUDA launcher for Windows
2024-02-11 20:45:51 +01:00
Andrzej Janik
60d2124a16
Search for a new developer (#44) 2021-02-28 12:18:44 +01:00
Andrzej Janik
4d3e37befc
Update README.md (#42) 2021-02-22 01:32:04 +01:00
Andrzej Janik
a906c350f2
Make misc fixes (#41)
* Update ze_loader.lib to the newest version
* Export _ptsz/_ptds for which we have a legacy stream implementations
* Stop producing build logs if we are not looking at them anyway
2021-02-22 01:29:03 +01:00
Andrzej Janik
ab690c6491
Add zluda_redirect.dll to CI builds (#40) 2021-02-21 17:44:42 +01:00
Andrzej Janik
4ed9ef8edb
Improve CI (#39)
* Use official GPU driver packages for building on Linux
* Start building on Windows
* Start uploading artifacts
2021-02-21 14:44:58 +01:00
Andrzej Janik
36514bd6eb
Improve ZLUDA injection (#37)
Improve injector&redirector so it's no longer required to manually mess with files if the application links nvcuda.dll. Additionally inject into child processes
2021-02-20 21:40:19 +01:00
Andrzej Janik
972f612562
Fix signed integer conversion (#36)
This fixes the last remaining bug preventing end-to-end GeekBench run, so also update Geekbench results in README
2021-01-26 21:05:09 +01:00
Andrzej Janik
3e2e73ac33 Add script for replaying dumped kernel (#34)
zluda_dump can already create traces of GPU execution, this script can replay those traces.
Additionally, changed added just enough code in core ZLUDA to support simple PyCUDAexecution
2021-01-23 16:57:07 +01:00
Andrzej Janik
ff8135e8a3
Add a library for dumping kernels arguments before and after launch (#18) 2021-01-16 22:28:48 +01:00
Andrzej Janik
09f679693b
Prevent linker from stripping exports on Linux (#33) 2021-01-15 01:17:44 +01:00
Andrzej Janik
5cd9a5fbc4
Add empty implementation of cuDeviceGetLuid (#30)
This function is required by recent versions of CUDA runtime on Windows
2021-01-08 19:43:46 +01:00
Andrzej Janik
237a6c113a
Regenerate SPIR-V tests (#29)
In one of the previous commits we made a change to mark ld/st as aligned. This change was not propagated to test files
2021-01-08 19:06:11 +01:00
Andrzej Janik
078ae20c2c
Improve build procedure and instructions (#28)
Fixes issues pointed out in #27:
* spirv_tools-sys was build in non-test profiles
* By default ZLUDA dll has a wrong name
* We relied on third-party OpenCL installation on Windows
* We encouraged building debug configuration
* We didn't provide build information for developers (cmake, python, submodules)
2021-01-08 17:17:46 +01:00
Andrzej Janik
2c0e9b912f
Fix Windows ZLUDA injector (#26)
Fix various bugs in injector and redirector, make them more robust and enable building them by default
2021-01-03 18:45:48 +01:00