Kepler GK110 Revision B Graphics Architecture
Technology & Specifications (reference)
To understand the GeForce GTX 780 you need to understand how NVIDIA releases their products. The GeForce GTX 780 is basically the GeForce GTX Titan which has GK110 Kepler GPU architecture. The chip is huge in size, it is still based on the 28nm fabrication node. But when I say huge, I mean it as it has 7.1 Billion transistors tucked away in a 45 mm × 45 mm 2397-pin S-FCBGA chip. That's a 561 mm² die size. The GPU on that 10.5 inches long dual-slot GeForce GTX 780 graphics card boasts an impressive 2304 CUDA (shader) cores and has 7100 Million transistors.
That shader amount is amongst the biggest differentials:
- GeForce GTX 780 has 2304 shader processors and 3 GB of graphics memory.
- GeForce GTX Titan has 2688 shader processors and 6 GB of graphics memory.
The product is obviously PCI-Express 3.0 ready and has a TDP of around 250W with a typical idle power draw of 15 Watts. That TDP is a maximum overall, and on average your GPU will not consume that amount of power. But let me first show you the GK110 GPU die:
NVIDIA GK110 Kepler architecture GPU - you can see 15 sets of SMX clusters, GTX 780 uses 12 of these.
The GK110 is based off the Kepler architecture, as such you will get the fairly standard pre-modelled SMX clusters of 192 shader processors per cluster. Out of the 15 available, there are 12 active SMX clusters for the GTX 780, times 192 shader processors which thus offers you 2496 shader processors. If it would have had the full 15 SMX clusters enabled, yeah that would have been 2880 shader processors. There is a distinct difference with extra double precision units, but we'll talk about that on the next page though. But how does 4.3 Teraflops of peak compute performance sound?
But to make a bold comparison, the GeForce GTX 680 which many of you guys have has 1536 of these shader processors. We'll get more in-depth into the architecture on the next page though. As far as the memory specs of the GK110 Kepler GPU are concerned, the boards feature a 384-bit memory bus connected to 3 GB of GDDR5 video buffer memory, aka VRAM aka your framebuffer aka graphics memory for the GTX 780.
On the memory controller side of things you'll see that the reference memory clock (effective data-rate) is now set at 6 GHz / Gbps. This boils down to an exotic memory bandwidth of no less than 288 GB/s on that 384-bit memory bus. Much like the GTX 680, the GPU core versus the shader processor domain is clocked at 1:1, meaning both the core and shader domain clock in at a 863 MHz reference base clock frequency. One clock to rule them all (well internally there are dozens of different other clocks really, but let's keep it simple, shall we?).
The graphics adapters are of course DirectX 11.1 and 12 ready. With Windows 8, 7 and Vista also being ready with game compatibility to take advantage of DirectCompute, multi-threading, hardware tessellation and the latest shader 5.0 extensions. For your reference here's a quick overview of some past generation high-end GeForce cards opposed to the new Kepler based GeForce GTX 780.
Single GPU based cards | GeForce GTX 480 |
GeForce GTX 580 |
GeForce GTX 680 |
GeForce GTX 780 |
GeForce GTX Titan |
GeForce GTX 780 Ti |
Stream (Shader) Processors | 480 | 512 | 1536 | 2304 | 2688 | 2880 |
Core Clock (MHz) | 700 | 772 | 1006 | 863 | 836 | 875 |
Shader Clock (MHz) | 1400 | 1544 | - | - | - | - |
Boost Clock | - | - | 1058 | 900 | 876 | 928 |
Memory Clock (effective MHz) | 3700 | 4000 | 6000 | 6000 | 6000 | 7000 |
Memory amount | 1536 | 1536 | 2048 | 3072 | 6144 | 3072 |
Memory Interface | 384-bit | 384-bit | 256-bit | 384-bit | 384-bit | 384-bit |
Memory Type | gDDR5 | gDDR5 | gDDR5 | gDDR5 | gDDR5 | gDDR5 |
HDCP | Yes | Yes | Yes | Yes | Yes | Yes |
Two Dual link DVI | Yes | Yes | Yes | Yes | Yes | Yes |
HDMI | Yes | Yes | Yes | Yes | Yes | Yes |
For Kepler overall, Nvidia kept their memory controllers GDDR5 compatible. Memory wise, Nvidia has nice large memory volumes available due to their architecture, we pass 2 GB as standard these days for most of Nvidia's series 600 graphics cards. The 3 GB on the GTX 780 however is plenty. The hardware engineers at Nvidia reworked the memory subsystem quite a bit, enabling much higher memory clock frequency speeds compared to previous generation GeForce GPUs. The result is memory speeds up-to 6 Gbps.
- The GTX 580 has six memory controllers (6x256MB) = 1536 MB of GDDR5 memory
- The GTX 680 has four memory controllers (4x512MB) = 2048 MB of GDDR5 memory
- The GTX 780 has six memory controllers (6x512MB) = 3072 MB of GDDR5 memory
- The GTX Titan has six memory controllers (6x1024MB) = 6144 MB of GDDR5 memory