gpuowl cs2 vkfft

AMD Ryzen 9 7950X 16-Core testing with an ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS) and NVIDIA GeForce RTX 3080 10GB on Ubuntu 23.10 via the Phoronix Test Suite.

a:

  Processor: AMD Ryzen 9 7950X 16-Core @ 5.88GHz (16 Cores / 32 Threads), Motherboard: ASUS ROG STRIX X670E-E GAMING WIFI (1416 BIOS), Chipset: AMD Device 14d8, Memory: 2 x 16GB DRAM-6000MT/s G Skill F5-6000J3038F16G, Disk: 2000GB Samsung SSD 980 PRO 2TB + 4001GB Western Digital WD_BLACK SN850X 4000GB, Graphics: NVIDIA GeForce RTX 3080 10GB, Audio: NVIDIA GA102 HD Audio, Monitor: DELL U2723QE, Network: Intel I225-V + Intel Wi-Fi 6 AX210/AX211/AX411

  OS: Ubuntu 23.10, Kernel: 6.7.0-060700-generic (x86_64), Desktop: GNOME Shell 45.2, Display Server: X Server 1.21.1.7, Display Driver: NVIDIA 550.40.07, OpenGL: 4.6.0, OpenCL: OpenCL 3.0 CUDA 12.4.74, Compiler: GCC 13.2.0, File-System: ext4, Screen Resolution: 3840x2160

b:

  Same configuration as a.

c:

  Same configuration as a.

VkFFT 1.3.4
Test: FFT + iFFT R2C / C2R
Benchmark Score > Higher Is Better
a . 51046
b . 50803
c . 49951

VkFFT 1.3.4
Test: FFT + iFFT C2C 1D batched in half precision
Benchmark Score > Higher Is Better
a . 148147
b . 143220
c . 145097

VkFFT 1.3.4
Test: FFT + iFFT C2C Bluestein in single precision
Benchmark Score > Higher Is Better
a . 13286
b . 13225
c . 13358

VkFFT 1.3.4
Test: FFT + iFFT C2C 1D batched in double precision
Benchmark Score > Higher Is Better
a . 25136
b . 23693
c . 25627

VkFFT 1.3.4
Test: FFT + iFFT C2C 1D batched in single precision
Benchmark Score > Higher Is Better
a . 113874
b . 113952
c . 113935

VkFFT 1.3.4
Test: FFT + iFFT C2C multidimensional in single precision
Benchmark Score > Higher Is Better
a . 46283
b . 47650
c . 48385

VkFFT 1.3.4
Test: FFT + iFFT C2C Bluestein benchmark in double precision
Benchmark Score > Higher Is Better
a . 3724
b . 3763
c . 3758

VkFFT 1.3.4
Test: FFT + iFFT C2C 1D batched in single precision, no reshuffling
Benchmark Score > Higher Is Better
a . 116216
b . 116227
c . 116319

GpuOwl 7.5
Exponent: 57885161
Iterations / Second > Higher Is Better
a . 723.24
b . 729.39
c . 728.86

GpuOwl 7.5
Exponent: 77936867
Iterations / Second > Higher Is Better
a . 532.10
b . 536.19
c . 532.20

GpuOwl 7.5
Exponent: 332220523
Iterations / Second > Higher Is Better
a . 115.73
b . 116.65
c . 115.78

ProjectPhysX OpenCL-Benchmark 1.2
Operation: FP64 Compute
TFLOPs/s > Higher Is Better
a . 0.528
b . 0.531
c . 0.527

ProjectPhysX OpenCL-Benchmark 1.2
Operation: FP32 Compute
TFLOPs/s > Higher Is Better
a . 32.87
b . 32.92
c . 32.80

ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT64 Compute
TIOPs/s > Higher Is Better
a . 3.231
b . 3.222
c . 3.225

ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT32 Compute
TIOPs/s > Higher Is Better
a . 16.86
b . 16.67
c . 16.92

ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT16 Compute
TIOPs/s > Higher Is Better
a . 14.57
b . 14.56
c . 14.56

ProjectPhysX OpenCL-Benchmark 1.2
Operation: INT8 Compute
TIOPs/s > Higher Is Better
a . 12.11
b . 12.08
c . 12.17

ProjectPhysX OpenCL-Benchmark 1.2
Operation: Memory Bandwidth Coalesced Read
GB/s > Higher Is Better
a . 702.72
b . 702.78
c . 702.84

ProjectPhysX OpenCL-Benchmark 1.2
Operation: Memory Bandwidth Coalesced Write
GB/s > Higher Is Better
a . 721.83
b . 721.79
c . 721.72

Counter-Strike 2
Resolution: 1920 x 1080
Frames Per Second > Higher Is Better
a . 308.0
b . 311.4
c . 309.8

Counter-Strike 2
Resolution: 1920 x 1200
Frames Per Second > Higher Is Better
a . 291.7
b . 292.7
c . 293.7

Counter-Strike 2
Resolution: 2560 x 1440
Frames Per Second > Higher Is Better
a . 221.9
b . 221.3
c . 221.1

Counter-Strike 2
Resolution: 3840 x 2160
Frames Per Second > Higher Is Better
a . 121.4
b . 121.6
c . 122.8
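
Because runs a, b and c share one hardware and software configuration, the differences between them reflect run-to-run variation rather than a configuration change. The following is a minimal Python sketch, not part of the Phoronix Test Suite, of one way to quantify that variation from the result rows. The helper names `parse_rows` and `spread_percent` are hypothetical, and the choice of metric (range divided by mean) is an assumption made for illustration; the sample values are copied from the VkFFT double-precision 1D batched result.

```python
import re

# Matches result rows such as "a . 25136"; any trailing ASCII bar is ignored.
ROW = re.compile(r"^([a-z]) \. ([0-9.]+)")


def parse_rows(text):
    """Return {run_label: value} for the result rows found in text (hypothetical helper)."""
    rows = {}
    for line in text.splitlines():
        match = ROW.match(line.strip())
        if match:
            rows[match.group(1)] = float(match.group(2))
    return rows


def spread_percent(rows):
    """Range (max - min) across runs as a percentage of their mean (assumed metric)."""
    values = list(rows.values())
    mean = sum(values) / len(values)
    return 100.0 * (max(values) - min(values)) / mean


# Values taken from the VkFFT "C2C 1D batched in double precision" result.
sample = """\
VkFFT 1.3.4
Test: FFT + iFFT C2C 1D batched in double precision
Benchmark Score > Higher Is Better
a . 25136
b . 23693
c . 25627
"""

rows = parse_rows(sample)
print(rows)
print(f"spread: {spread_percent(rows):.2f}% of mean")
```

Applied to each result block in turn, a helper like this makes it easier to see which tests were stable across the three runs (for example the memory bandwidth results) and which varied by several percent.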