Nexus/GPUs: Difference between revisions

From UMIACS
Jump to navigation Jump to search
(Created page with "There are several different types of [https://www.nvidia.com/en-us/ NVIDIA] GPUs in the Nexus cluster that are available to be scheduled. They are listed below in order of newest to oldest architecture, and then alphabetically. {| class="wikitable sortable" ! Name ! [https://www.nvidia.com/en-us/technologies Architecture] ! [https://developer.nvidia.com/cuda-toolkit CUDA] Cores ! GPU Memory ! Memory Bandwidth ! FP32 Performance ! [https://blogs.nvidia.com/blog/2020/...")
 
No edit summary
Line 7: Line 7:
! GPU Memory
! GPU Memory
! Memory Bandwidth
! Memory Bandwidth
! FP32 Performance
! FP32 Performance (TFLOPS)
! [https://blogs.nvidia.com/blog/2020/05/14/tensorfloat-32-precision-format/ TF32] Performance
! [https://developer.nvidia.com/blog/accelerating-ai-training-with-tf32-tensor-cores/ TF32] Performance ([https://developer.nvidia.com/blog/accelerating-inference-with-sparsity-using-ampere-and-tensorrt Dense / Spare TOPS])
|-
|-
| GeForce RTX 3070
| GeForce RTX 3070
| Ampere
| Ampere
|
| 5888
|
| 8GB GDDR6
|
| 448GB/s
|
| 20.3
|
| 20.3/40.6
|-
|-
| GeForce RTX 3090
| GeForce RTX 3090
| Ampere
| Ampere
|
| 10496
|
| 24GB GDDR6X
|
| 936GB/s
|
| 35.6
|
| 35.6/71
|-
|-
| RTX A4000
| RTX A4000
| Ampere
| Ampere
|
| 6144
|
| 16GB GDDR6
|
| 448GB/s
|
| 19.2
|
| not officially published
|-
|-
| RTX A5000
| RTX A5000
| Ampere
| Ampere
|
| 8192
|
| 24GB GDDR6
|
| 768GB/s
|
| 27.8
|
| not officially published
|-
|-
| RTX A6000
| RTX A6000
| Ampere
| Ampere
|
| 10752
|
| 48GB GDDR6
|
| 768GB/s
|
| 38.7
|
| 77.4/154.8
|-
|-
| GeForce RTX 2080 Ti
| GeForce RTX 2080 Ti
| Turing
| Turing
|
| 4352
|
| 11GB GDDR5X
|
| 616GB/s
|
| 13.4
|
| n/a
|-
|-
| GeForce GTX 1080 Ti
| GeForce GTX 1080 Ti
| Pascal
| Pascal
|
| 3584
|
| 11GB GDDR5X
|
| 484GB/s
|
| 11.3
| n/a
| n/a
|-
|-
| GeForce GTX Titan Xp
| GeForce GTX Titan Xp
| Pascal
| Pascal
|
| 3840
|
| 12GB GDDR5X
|
| 548GB/s
|
| 12.1
| n/a
| n/a
|-
|-
| Quadro P6000
| Quadro P6000
| Pascal
| Pascal
|
| 3840
|
| 24GB GDDR5X
|
| 432GB/s
|
| 12.6
| n/a
| n/a
|-
|-
| Tesla P100
| Tesla P100
| Pascal
| Pascal
|
| 3584
|
| 16GB CoWoS HBM2
|
| 732GB/s
|
| 9.3
| n/a
| n/a
|-
|-
| GeForce GTX Titan X
| GeForce GTX Titan X
| Maxwell
| Maxwell
|
| 3072
|
| 12GB GDDR5
|
| 336GB/s
|
| 6.7
| n/a
| n/a
|-
|-
| Tesla M40
| Tesla M40
| Maxwell
| Maxwell
|
| 3072
|
| 12GB GDDR5
|
| 288GB/s
|
| 6.8
| n/a
| n/a
|-
|-
| Tesla K80
| Tesla K80
| Kepler
| Kepler
|
| 4992
|
| 24GB GDDR5
|
| 480GB/s
|
| 8.7
| n/a
| n/a
|-
|-
|}
|}

Revision as of 19:01, 28 September 2023

There are several different types of NVIDIA GPUs in the Nexus cluster that are available to be scheduled. They are listed below in order of newest to oldest architecture, and then alphabetically.

Name Architecture CUDA Cores GPU Memory Memory Bandwidth FP32 Performance (TFLOPS) TF32 Performance (Dense / Spare TOPS)
GeForce RTX 3070 Ampere 5888 8GB GDDR6 448GB/s 20.3 20.3/40.6
GeForce RTX 3090 Ampere 10496 24GB GDDR6X 936GB/s 35.6 35.6/71
RTX A4000 Ampere 6144 16GB GDDR6 448GB/s 19.2 not officially published
RTX A5000 Ampere 8192 24GB GDDR6 768GB/s 27.8 not officially published
RTX A6000 Ampere 10752 48GB GDDR6 768GB/s 38.7 77.4/154.8
GeForce RTX 2080 Ti Turing 4352 11GB GDDR5X 616GB/s 13.4 n/a
GeForce GTX 1080 Ti Pascal 3584 11GB GDDR5X 484GB/s 11.3 n/a
GeForce GTX Titan Xp Pascal 3840 12GB GDDR5X 548GB/s 12.1 n/a
Quadro P6000 Pascal 3840 24GB GDDR5X 432GB/s 12.6 n/a
Tesla P100 Pascal 3584 16GB CoWoS HBM2 732GB/s 9.3 n/a
GeForce GTX Titan X Maxwell 3072 12GB GDDR5 336GB/s 6.7 n/a
Tesla M40 Maxwell 3072 12GB GDDR5 288GB/s 6.8 n/a
Tesla K80 Kepler 4992 24GB GDDR5 480GB/s 8.7 n/a