|CUDA Parallel Processing cores||3072|
|NVIDIA Tensor Cores||384|
|NVIDIA RT Cores||48|
|Frame Buffer Memory||16 GB GDDR6|
|Rays Cast||8 Giga Rays/Sec|
|Peak Single Precision (FP32) Performance||11.2 TFLOPS|
|Peak Half Precision (FP16) Performance||22.3 TFLOPS|
|Peak Integer Operation (INT8) Performance||178.4 TOPS|
|Deep Learning TeraFLOPS1||89.2 TFLOPS|
|Memory Bandwidth||448 GB/s|
|Max Power Consumption||265 W|
|Graphics Bus||PCI Express 3.0 x16|
|Display Connectors||DP 1.4 (4) + VirtualLink (1)|
|Form Factor||4.4” H x 10.5” L Dual Slot|
|Product Weight||972 g|
|NVIDIA® 3D Vision® and 3D Vision Pro||Support via 3 pin mini DIN|
|Frame lock||Compatible (with Quadro Sync II)|
|NVLink Interconnect||50 GB/s|
- Microsoft Windows 10 (64-bit)
- Microsoft Windows 8 and 8.1 (64-bit)
- Microsoft Windows 7 (64-bit)
- Linux® – Full OpenGL implementation, complete with NVIDIA and ARB extensions (64-bit)
3D Graphics Architecture
- Scalable geometry architecture
- Hardware tessellation engine
- NVIDIA® GigaThread™ engine with 5 async copy engines
- Shader Model 5.1 (OpenGL 4.5 and DirectX 12)
- Up to 32K x 32K texture and render processing
- Transparent multisampling and super sampling
- 16x angle independent anisotropic filtering
- 32-bit per-component floating point texture filtering and blending
- 64x full scene antialiasing (FSAA)/128x FSAA in SLI Mode
- Decode acceleration for MPEG-2, MPEG-4 Part 2 Advanced Simple Profile, H.264, HEVC, MVC, VC1, DivX (version 3.11 and later), and Flash (10.1 and later)
- Dedicated H.264 and HEVC Encoder1
- Blu-ray dual-stream hardware acceleration (supporting HD picture-in-picture playback)
- NVIDIA GPU Boost™ (automatically improves GPU engine throughput to maximize application performance)
Advanced Display Features
- Support for any combination of four connected displays
- Four DisplayPort 1.4 outputs (supporting resolutions such as 3840 x 2160 @ 120 Hz and 5120 x 2880 @ 60 Hz)
- DisplayPort to VGA, DisplayPort to DVI (single-link and dual-link) and DisplayPort to HDMI cables (resolution support based on dongle specifications)
- HDR support over DisplayPort 1.4 (SMPTE 2084/2086, BT. 2020) (4K @ 60 Hz 10b/12b HEVC Decode, 4K @ 60 Hz 10b HEVC Encode)
- HDCP 2.2 support over DisplayPort, DVI, and HDMI connectors
- 12-bit internal display pipeline (hardware support for 12-bit scanout on supported panels, applications, and connection)
- NVIDIA 3D Vision™ technology, 3D DLP, Interleaved, and other 3D stereo format support
- Full OpenGL quad buffered stereo support
- Underscan/overscan compensation and hardware scaling
- NVIDIA nView® multi-display technology
- Support for large-scale, ultra-high resolution visualization using the NVIDIA® SVS platform which includes NVIDIA® Mosaic, NVIDIA® Sync and NVIDIA® Warp/Blend technologies
NVIDIA CUDA Parallel Processing Architecture
- New RT (Ray Tracing) Core per SM
- Turing SM Architecture (streaming multi-processor design that delivers greater processing efficiency)
- Dynamic Parallelism (GPU dynamically spawns new threads without going back to the CPU)
- Mixed-precision (1-, 4-, 8-, 16-, 32- and 64-bit) computing
- API support includes: – CUDA C, CUDA C++, DirectCompute 5.0, OpenCL, Java, Python, and Fortran
- Error correction codes (ECC) on graphics memory
- Configurable up to 96 KB of RAM (dedicated shared memory per SM)
DisplayPort and HDMI Digital Audio
- Support for the following audio modes: – Dolby Digital (AC3), DTS 5.1, Multi-channel (7.1) LPCM, Dolby Digital Plus (DD+), and MPEG-2/MPEG-4 AAC
- DisplayPort Data rates of 48 KHz
- Word sizes of 16-bit, 20-bit, and 24-bit
- Mini-DisplayPort Latch mini-DisplayPort connector is designed with a custom retention mechanism to firmly secure its connection with the display cable.
The World’s First Ray Tracing GPU
Shatter the boundaries of what’s possible with NVIDIA® Quadro RTX™ 5000. Powered by the NVIDIA Turing™ architecture and the NVIDIA RTX™ platform, it fuses ray tracing, deep learning and advanced shading to supercharge next-generation workflows. Creative and technical professionals can make more informed decisions faster and tackle demanding design and visualization workloads with ease.
New RT Cores and Tensor Cores bring the power of real-time ray tracing and AI-enhanced workflows to millions of design and creative professionals. Combined with NVIDIA NVLink™ technology, RTX 5000 scales graphics memory and performance to drive the most demanding rendering, AI, and visual computing workloads. And the all-new VirtualLink® provides connectivity to next-generation, high-resolution VR HMDs to let you view your work in the most compelling virtual environments. Welcome to the future of professional visual computing.
BUILT FOR PROFESSIONALS
1. NVIDIA NVLink™
Link two GPUs with a high-speed interconnect to scale memory capacity to 32GB and drive higher performance with up to 50 GB/s of data transfer.
Be ready for next-generation of high-resolution VR head-mounted displays and enjoy simplified cabling with support for the industry standard VirtualLink connector.
3. Next-Gen Memory
Equipped with 16GB of ultra-fast GDDR6 memory to hold large datasets – complex designs for products, architectural walkthroughs, media assets and more.
4. NVIDIA Turing GPU Architecture
Armed with the all-new RTCore for ray tracing, 384 Tensor Cores for AI and 3072 CUDA cores for parallel computing, NVIDIA Turing is simply the world’s most advanced GPU.
Turing GPU Architecture
Based on state-of-the-art 12nm FFN (FinFET NVIDIA) high-performance manufacturing process customized for NVIDIA to incorporate 3072 CUDA cores, the Quadro RTX 5000 GPU is the most powerful computing platform for HPC, AI, VR and graphics workloads on professional desktops. The Turing GPU architecture enables the biggest leap in computer real-time graphics rendering since NVIDIA’s invention of programmable shaders in 2001. It includes 13.6 billion transistors on die size of 545 mm2. Able to deliver more than 11.2 TFLOPS of single-precision (FP32), 22.3 TFLOPS of half-precision (FP16), 44.6 TOPS of integer-precision (INT8), and 89.2 TFLOPs of tensor operation capability, it supports a wide range of compute-intensive workloads flawlessly.
New dedicated hardware-based ray-tracing technology allows the GPU for the first time to real-time render film quality, photorealistic objects and environments with physically accurate shadows, reflections, and refractions. The real-time ray-tracing engine works with NVIDIA OptiX, Microsoft DXR, and Vulkan APIs to deliver a level of realism far beyond what is possible using traditional rendering techniques. RT cores accelerate the Bounding Volume Hierarchy (BVH) traversal and ray casting functions using low number of rays casted through a pixel.
Enhanced Tensor Cores
New mixed-precision cores purpose-built for deep learning matrix arithmetic, delivering 8x TFLOPS for training, compared to previous generation. Quadro RTX 5000 utilizes 384 Tensor Cores; each Tensor Core performs 64 floating point fused multiply-add (FMA) operations per clock, and each SM performs a total of 1024 individual floating point operations per clock. In addition to supporting FP16/FP32 matrix operations, new Tensor Cores added INT8 (2048 integer operations per clock) and experimental INT4 and INT1 (binary) precision modes for matrix operations.
Advanced Shading Technologies
Mesh Shading: Compute-based geometry pipeline to speed geometry processing and culling on geometrically complex models and scenes. Mesh shading provides up to 2x performance improvement on geometry-bound workloads. Variable Rate Shading (VRS): Gain rendering efficiency by varying the shading rate based on scene content, direction of gaze, and motion. Variable rate shading provides similar image quality with 50% reduction in shaded pixels. Texture Space Shading: Object/texture space shading to improve the performance of pixel shader-heavy workloads such as depth-of-field and motion blur. Texture space shading provides greater throughput with increased fidelity by reusing pre-shaded texels for pixel-shader heavy VR workloads.
High Performance GDDR6 Memory
Built with Turing’s vastly optimized 16GB GDDR6 memory subsystem for the industry’s fastest graphics memory (448 GB/s peak bandwidth), Quadro RTX 5000 is the ideal platform for latency-sensitive applications handling large datasets. Quadro RTX 5000 delivers greater than 50% more memory bandwidth compared to previous generation.
NVIDIA GPU BOOST 4.0
Automatically maximize application performance without exceeding the power and thermal envelope of the card. Allows applications to stay within the boost clock state longer under higher temperature threshold before dropping to a secondary temperature setting base clock. This feature requires implementation by software applications and it is not a stand-alone utility. Please contact email@example.com for details on availability.
Advanced Streaming Multiprocessor (SM) Architecture
Combined shared memory and L1 cache improve performance significantly, while simplifying programming and reducing the tuning required to attain best application performance. Each SM contains 96 KB of L1/shared memory, which can be configured for various capacities depending on compute or graphics workload. For compute cases, up to 64 KB can be allocated to the L1 cache or shared memory, while graphics workload can allocate up to 48 KB for shared memory; 32 KB for L1 and 16 KB for texture units. Combining the L1 data cache with the shared memory reduces latency and provides higher bandwidth.
Double the throughput and reduce storage requirements with 16-bit floating point precision computing to enable the training and deployment of larger neural networks. With independent parallel integer and floating-point data paths, the Turing SM is also much more efficient on workloads with a mix of computation and addressing calculations.
Error Correcting Code (ECC) on Graphics Memory
Meet strict data integrity requirements for mission critical applications with uncompromised computing accuracy and reliability for workstations.
Pixel-level preemption provides more granular control to better support time-sensitive tasks such as VR motion tracking.
Preemption at the instruction-level provides finer grain control over compute tasks to prevent long-running applications from either monopolizing system resources or timing out.
H.264 and HEVC Encode/Decode Engines
Deliver faster than real-time performance for transcoding, video editing, and other encoding applications with two dedicated H.264 and HEVC encode engines and a dedicated decode engine that are independent of 3D/compute pipeline.
Single Instruction, Multiple Thread (SIMT)
New independent thread scheduling capability enables finer-grain synchronization and cooperation between parallel threads by sharing resources among small jobs.
Connect a pair of Quadro RTX 5000 cards with NVLink to increase the effective memory footprint and scale application performance by enabling GPU-to-GPU data transfers at rates up to 25 GB/s (bidirectional) for a total bandwidth of 50 GB/s.
NVIDIA® SLI® Technology
Leverage multiple GPUs to dynamically scale graphics performance, enhance image quality, expand display real estate, and assemble a fully virtualized system.
Scale Application Performance with NVLink Technology
Full-Scene Antialiasing (FSAA)
Dramatically reduce visual aliasing artifacts or “jaggies” with up to 64X FSAA (128x with SLI )for unparalleled image quality and highly realistic scenes.
32K Texture and Render Processing
Texture from and render to 32K x 32K surfaces to support applications that demand the highest resolution and quality image processing.
New open industry standard connectivity for next-generation VR headsets delivering four high-speed HBR3 DisplayPort lanes, USB3.1 data channel and up to 27 watts of power. The alternate mode of USB-C is optimized for latency and bandwidth demands to deliver increased display resolution and incorporate high-bandwidth cameras for tracking and augmented reality with VR headset.
Ability to render four separate views in a single pass to dramatically reduce the graphics pipeline workload and improving realism. With 2X the projection centers from previous generation, the Simultaneous Multi-Projection (SMP) engine performs up to 2X the geometry rendering workload. This allows for more flexibility with position-independent views to produce more creative scenes.
Support up to four 5K monitors @ 60Hz, or dual 8K displays per card. Quadro RTX 5000 supports HDR color for 4K @ 120Hz for 10/12b HEVC decode and up to 4K @ 60Hz for 10b HEVC encode. Each DisplayPort connector is capable of driving ultra-high resolutions of 4096×2160 @ 120 Hz with 30-bit color.
NVIDIA® Mosaic™ Technology
Transparently scale the desktop and applications across up to 4 GPUs and 16 displays from a single workstation while delivering full performance and image quality
NVIDIA® Quadro Sync II
Synchronize the display and image output of up to 32 displays from 8 GPUs (connected through two Sync II boards) in a single system, reducing the number of machines needed to create an advanced video visualization environment.
NVIDIA® nView® Advanced Desktop Software
Gain unprecedented end-user control of the desktop experience for increased productivity in single large display or multi-display environments.
Frame Lock Connector Latch
Each frame lock connector is designed with a self-locking retention mechanism to secure its connection with the frame lock cable to provide robust connectivity and maximum productivity.
OpenGL Quad Buffered Stereo Support
Provide a smooth and immersive 3D Stereo experience for professional applications.
Ultra High Resolution Desktop Support
Get more Mosaic topology choices with high resolution displays devices with a 32K Max desktop size.
Professional 3D Stereo Synchronization
Robust control of stereo effects through a dedicated connection to directly synchronize 3D stereo hardware to a Quadrographics card.
Turing Optimized Software
Deep learning frameworks such as Caffe2, MXNet, CNTK, TensorFlow, and others deliver dramatically faster training times and higher multi-node training performance. GPU accelerated libraries such as cuDNN, cuBLAS, and TensorRT delivers higher performance for both deep learning inference and High-Performance Computing (HPC) applications.
NVIDIA® CUDA® Parallel Computing Platform
Natively execute standard programming languages like C/C++ and Fortran, and APIs such as OpenCL, OpenACC and Direct Compute to accelerates techniques such as ray tracing, video and image processing, and computation fluid dynamics.
A single, seamless 49-bit virtual address space allows for the transparent migration of data between the full allocation of CPU and GPU memory.
NVIDIA® GPUDirect for Video
GPUDirect for Video speeds communication between the GPU and video I/O devices by avoiding unnecessary system memory copies and CPU overhead.
NVIDIA Enterprise-Management Tools
Maximize system uptime, seamlessly manage wide-scale deployments and remotely control graphics and display settings for efficient operations.
NVIDIA Packaging and Accessories
- NVIDIA Quadro RTX5000
- Quadro RTX Quick Start Guide
- Quadro Support Guide
- 1 DisplayPort to DVI Adapter
- 1 DisplayPort to HDMI Adapter
- 1 Auxiliary Power Cable(8-pin to dual 6-pin adapter)