Nexus/GPUs: Difference between revisions
Jump to navigation
Jump to search
(Created page with "There are several different types of [https://www.nvidia.com/en-us/ NVIDIA] GPUs in the Nexus cluster that are available to be scheduled. They are listed below in order of newest to oldest architecture, and then alphabetically. {| class="wikitable sortable" ! Name ! [https://www.nvidia.com/en-us/technologies Architecture] ! [https://developer.nvidia.com/cuda-toolkit CUDA] Cores ! GPU Memory ! Memory Bandwidth ! FP32 Performance ! [https://blogs.nvidia.com/blog/2020/...") |
No edit summary |
||
Line 7: | Line 7: | ||
! GPU Memory | ! GPU Memory | ||
! Memory Bandwidth | ! Memory Bandwidth | ||
! FP32 Performance | ! FP32 Performance (TFLOPS) | ||
! [https:// | ! [https://developer.nvidia.com/blog/accelerating-ai-training-with-tf32-tensor-cores/ TF32] Performance ([https://developer.nvidia.com/blog/accelerating-inference-with-sparsity-using-ampere-and-tensorrt Dense / Spare TOPS]) | ||
|- | |- | ||
| GeForce RTX 3070 | | GeForce RTX 3070 | ||
| Ampere | | Ampere | ||
| | | 5888 | ||
| | | 8GB GDDR6 | ||
| | | 448GB/s | ||
| | | 20.3 | ||
| | | 20.3/40.6 | ||
|- | |- | ||
| GeForce RTX 3090 | | GeForce RTX 3090 | ||
| Ampere | | Ampere | ||
| | | 10496 | ||
| | | 24GB GDDR6X | ||
| | | 936GB/s | ||
| | | 35.6 | ||
| | | 35.6/71 | ||
|- | |- | ||
| RTX A4000 | | RTX A4000 | ||
| Ampere | | Ampere | ||
| | | 6144 | ||
| | | 16GB GDDR6 | ||
| | | 448GB/s | ||
| | | 19.2 | ||
| | | not officially published | ||
|- | |- | ||
| RTX A5000 | | RTX A5000 | ||
| Ampere | | Ampere | ||
| | | 8192 | ||
| | | 24GB GDDR6 | ||
| | | 768GB/s | ||
| | | 27.8 | ||
| | | not officially published | ||
|- | |- | ||
| RTX A6000 | | RTX A6000 | ||
| Ampere | | Ampere | ||
| | | 10752 | ||
| | | 48GB GDDR6 | ||
| | | 768GB/s | ||
| | | 38.7 | ||
| | | 77.4/154.8 | ||
|- | |- | ||
| GeForce RTX 2080 Ti | | GeForce RTX 2080 Ti | ||
| Turing | | Turing | ||
| | | 4352 | ||
| | | 11GB GDDR5X | ||
| | | 616GB/s | ||
| | | 13.4 | ||
| | | n/a | ||
|- | |- | ||
| GeForce GTX 1080 Ti | | GeForce GTX 1080 Ti | ||
| Pascal | | Pascal | ||
| | | 3584 | ||
| | | 11GB GDDR5X | ||
| | | 484GB/s | ||
| | | 11.3 | ||
| n/a | | n/a | ||
|- | |- | ||
| GeForce GTX Titan Xp | | GeForce GTX Titan Xp | ||
| Pascal | | Pascal | ||
| | | 3840 | ||
| | | 12GB GDDR5X | ||
| | | 548GB/s | ||
| | | 12.1 | ||
| n/a | | n/a | ||
|- | |- | ||
| Quadro P6000 | | Quadro P6000 | ||
| Pascal | | Pascal | ||
| | | 3840 | ||
| | | 24GB GDDR5X | ||
| | | 432GB/s | ||
| | | 12.6 | ||
| n/a | | n/a | ||
|- | |- | ||
| Tesla P100 | | Tesla P100 | ||
| Pascal | | Pascal | ||
| | | 3584 | ||
| | | 16GB CoWoS HBM2 | ||
| | | 732GB/s | ||
| | | 9.3 | ||
| n/a | | n/a | ||
|- | |- | ||
| GeForce GTX Titan X | | GeForce GTX Titan X | ||
| Maxwell | | Maxwell | ||
| | | 3072 | ||
| | | 12GB GDDR5 | ||
| | | 336GB/s | ||
| | | 6.7 | ||
| n/a | | n/a | ||
|- | |- | ||
| Tesla M40 | | Tesla M40 | ||
| Maxwell | | Maxwell | ||
| | | 3072 | ||
| | | 12GB GDDR5 | ||
| | | 288GB/s | ||
| | | 6.8 | ||
| n/a | | n/a | ||
|- | |- | ||
| Tesla K80 | | Tesla K80 | ||
| Kepler | | Kepler | ||
| | | 4992 | ||
| | | 24GB GDDR5 | ||
| | | 480GB/s | ||
| | | 8.7 | ||
| n/a | | n/a | ||
|- | |- | ||
|} | |} |
Revision as of 19:01, 28 September 2023
There are several different types of NVIDIA GPUs in the Nexus cluster that are available to be scheduled. They are listed below in order of newest to oldest architecture, and then alphabetically.
Name | Architecture | CUDA Cores | GPU Memory | Memory Bandwidth | FP32 Performance (TFLOPS) | TF32 Performance (Dense / Spare TOPS) |
---|---|---|---|---|---|---|
GeForce RTX 3070 | Ampere | 5888 | 8GB GDDR6 | 448GB/s | 20.3 | 20.3/40.6 |
GeForce RTX 3090 | Ampere | 10496 | 24GB GDDR6X | 936GB/s | 35.6 | 35.6/71 |
RTX A4000 | Ampere | 6144 | 16GB GDDR6 | 448GB/s | 19.2 | not officially published |
RTX A5000 | Ampere | 8192 | 24GB GDDR6 | 768GB/s | 27.8 | not officially published |
RTX A6000 | Ampere | 10752 | 48GB GDDR6 | 768GB/s | 38.7 | 77.4/154.8 |
GeForce RTX 2080 Ti | Turing | 4352 | 11GB GDDR5X | 616GB/s | 13.4 | n/a |
GeForce GTX 1080 Ti | Pascal | 3584 | 11GB GDDR5X | 484GB/s | 11.3 | n/a |
GeForce GTX Titan Xp | Pascal | 3840 | 12GB GDDR5X | 548GB/s | 12.1 | n/a |
Quadro P6000 | Pascal | 3840 | 24GB GDDR5X | 432GB/s | 12.6 | n/a |
Tesla P100 | Pascal | 3584 | 16GB CoWoS HBM2 | 732GB/s | 9.3 | n/a |
GeForce GTX Titan X | Maxwell | 3072 | 12GB GDDR5 | 336GB/s | 6.7 | n/a |
Tesla M40 | Maxwell | 3072 | 12GB GDDR5 | 288GB/s | 6.8 | n/a |
Tesla K80 | Kepler | 4992 | 24GB GDDR5 | 480GB/s | 8.7 | n/a |