Page 1
GeForce RTX 4080 and 4090 GPU announcement
Today is finally the day that NVIDIA will announce its first ADA GPU architecture graphics card for the consumer market. In this post a recap of what we can expect in the upcoming week's products wise. The business is revealing its new GPU design for next-generation gaming PCs. In the short term, two or three variants are expected, and each one will outperform the Ampere 3000 series somehow. For at least a week NVIDIA had been hinting at and ultimately teasing its announcement. The RTX 40 series, the new GeForce logo, and Ada architecture are all things the company has said to be on the horizon. NVIDIA's RTX 4000 GPUs will definitely provide impressive performance, but next-gen graphics cards may have to contend with alternatives like AMD's rDNA 3 and Intel's Arc Alchemist.
The RTX 40 Series GPUs feature a range of new technological innovations, including:
- Streaming multiprocessors with up to 83 teraflops of shader power — 2x over the previous generation.
- Third-generation RT Cores with up to 191 effective ray-tracing teraflops — 2.8x over the previous generation.
- Fourth-generation Tensor Cores with up to 1.32 Tensor petaflops — 5x over the previous generation using FP8 acceleration.
- Shader Execution Reordering (SER) that improves execution efficiency by rescheduling shading workloads on the fly to better utilize the GPU’s resources. As significant an innovation as out-of-order execution was for CPUs, SER improves ray-tracing performance up to 3x and in-game frame rates by up to 25%.
- Ada Optical Flow Accelerator with 2x faster performance allows DLSS 3 to predict movement in a scene, enabling the neural network to boost frame rates while maintaining image quality.
- Architectural improvements tightly coupled with custom TSMC 4N process technology results in an up to 2x leap in power efficiency.
- Dual NVIDIA Encoders (NVENC) cut export times by up to half and feature AV1 support. The NVENC AV1 encode is being adopted by OBS, Blackmagic Design DaVinci Resolve, Discord and more.
None of the GeForce RTX 40 GPUs support NVLink but could use the PCIe Gen4 x16 interface.
Ada Lovelace architecture - AD102 GPU has 76.3B transistors
This design, built on a unique TSMC 4N process, provides more raster, raytracing, and AI-accelerated computation performance over the previous generation Ampere. The AD102 GPU has 76.3 billion transistors and a surface area of 608.4 mm2. This indicates that the transistor density of 125.5 million per mm2 is 2.78x higher than Samsung fabbed GA102 Ampere GPU built on the 8N node.
NVIDIA Ada has something new called Shader Execution Reordering (SER), which is said to speed up raster operations and provide up to 25% improved gaming performance. Ada also is fitted with next-generation RT Cores (Gen3) as well as faster Tensor cores (Gen4). The latter can achieve up to 1400 TFLOPS, which is 4,375 times greater than Ampere's third-generation cores.
Shader Cores | L2 Cache | |
AD102 | 18,432 | 96MB |
---|---|---|
AD103 | 10,752 | 64MB |
AD104 | 7,680 | 48MB |
AD106 | 4,608 | 32MB |
AD107 | 3,072 | 32MB |
Team Green has shown the most powerful Lovelace GPU, which has up to 76 billion transistors and, like Hopper, is built on TSMC's 4N node. Regular shaders, as well as the raytracing and Tensor cores, have all been improved.
At its initial price of USD 1,499, the GeForce RTX 3090 was $1,000 less than the Nvidia Titan RTX. Unfortunately, we don't see this trend continuing, but the RTX 4090 will likely be priced between $1,499 and $1,999 depending on AIB designs, making it competitive with the RTX 3090 Ti, the current king of the RTX hill. We now turn our focus to the RTX 4080 and perhaps announced later RTX 4070, we had hope that their initial retail pricing of $699 and $499, respectively, would be maintained. However, the recent increase in the cost of silicon wafers may cause a 10% increase in the MSRP of RTX 4000 GPUs. 899 USD is the cheapest version.
The CUDA Core count is going to rise on all Nvidia hardware, the RTX 4090 graphics card will contain 16,384 Shading Cores. Below is an overview of what we think are the specs; these will be updated once more and official information arrives.
Speculated specs | GeForce RTX 4090 | GeForce RTX 4080 | GeForce RTX 4070 | ||
---|---|---|---|---|---|
Architecture | Ada (TSMC 4NM) | Ada (TSMC 4NM) | Ada (TSMC 4NM) | ||
GPU | AD102-300 | AD103-300 | AD104 | AD104-400 | AD104-250 |
SMs | 128 | 76 | 60 | 60 | 56 |
CUDA Cores | 16384 | 9728 | 7680 | 7680 | 7168 |
Base Clock | 2235 MHz | TBC | TBC | TBC | TBC |
Boost Clock | 2520 MHz | 2.51 GHz | 2.61 GHz | TBC | TBC |
Raytracing cores | 128 Gen3 | 76 Gen3 | 60 Gen3 | TBC | TBC |
Tensor Cores | 512 Gen4 | 304 Gen4 | 240 Gen4 | ||
Memory | 24 GB G6X | 16 GB G6X | 12GB G6X | 12GB G6X | 10GB G6X |
Memory Bus | 384-bit | 256-bit | 192-bit | 192-bit | 160-bit |
Memory Speed | 21 Gbps | 23 Gbps | 21 Gbps | 21 Gbps | 21 Gbps |
Bandwidth | 1008 GB/s | 736 GB/s | 504GB/s | 504 GB/s | 420 GB/s |
Socket Power | 12VHPWR | 12VHPWR | 12VHPWR | ||
PCIe | Gen4 x16 | Gen4 x16 | Gen4 x16 | ||
TGP | 450W | 340W | 320W | ||
Launch Date | October 12th, 2022 | November 2022 | TBA | TBA | |
Price | $1599 / 1959 EUR | $1199 / 1469 EUR | $899 / 1099 EUR |
** we calculated the RT and tensor cores; these specs are not yet final.
Although Nvidia kept the RTX 4000's specifications tightly under wraps, a few tidbits have leaked over time. It has been speculated that the flagship AD102 GPU die would be used in the upcoming RTX 4090 and RTX 4080 graphics cards, resulting in a ~70% increase in CUDA/Shader cores available compared to the RTX 3000 series equivalent.
A fully enabled version of the AD102 GPU would see well over 18K Shader cores. Nvidia's RTX 4000 Series graphics cards are built on TSMC's 4/5nm production node, promising improved performance over the RTX 3000 Series' 8nm GPUs. Nvidia can pack more transistors onto the GPU by using a more compact process node, increasing its processing speed. Since ray tracing and DLSS are still crucial technologies for GeForce graphics cards, Nvidia will certainly work to improve their efficiency.
NVIDIA GeForce RTX 4090
The new flagship GPU is the AD102-300, which has 16384 CUDA cores and a boost clock of up to 2520 MHz and 23-power phases. This translates to a single-precision performance of 82.6 TFLOPS, which is 2.3x higher than its predecessor, the RTX 3090. The flagship Ada-based SKU will include 24GB of GDDR6X memory, with a peak bandwidth of 1 TB/s. This new card will require at least 100W more power than the 3090 Ti.
NVIDIA confirms that this model will be available on October 12 for $1599.
NVIDIA RTX 4080 series
Also revealed are two 4080 graphics cards with the same model number but differing RAM and GPU specifications. The RTX 4080 16GB will have an AD103 GPU with 9728 CUDA cores, 16GB of GDDR6X memory clocked at 22.5 Gbps, and a 320W TDP. This model will be available in November for at least $1199 USD. What was intended to be the RTX 4070 is now the RTX 4080 12GB. This model includes an AD104 GPU and 7680 CUDA cores. This model has 12GB GDDR6X RAM and a TDP of 285W, as the name says. This SKU is priced at $899 USD by NVIDIA.
NVIDIA confirms that the cards will be available in November.
NVIDIA announcing DLSS3
It creates new frames and can boost framerates 4x. Multiple recent enhancements, such as temporal components and Reflex latency enhancements. Makes new frames without using the render graphics pipeline.
DLSS 3 expands on DLSS Super Resolution by incorporating Optical Multi Frame Generation to generate totally new frames and NVIDIA Reflex reduced latency technology for improved responsiveness. DLSS 3 is driven by the NVIDIA Ada Lovelace architecture's new fourth-generation Tensor Cores and Optical Flow Accelerator, which also powers the GeForce RTX 40 Series graphics cards. The DLSS Frame Generation convolutional autoencoder receives four inputs: current and previous game frames, an optical flow field generated by Ada's Optical Flow Accelerator, and game engine data like m
DLSS3 games
- A Plague Tale: Requiem
- Atomic Heart
- Black Myth: Wukong
- Bright Memory: Infinite
- Chernobylite
- Conqueror’s Blade
- Cyberpunk 2077
- Dakar Rally
- Deliver Us Mars
- Destroy All Humans! 2 – Reprobed
- Dying Light 2 Stay Human
- F1Ⓡ 22
- F.I.S.T.: Forged In Shadow Torch
- Frostbite Engine
- HITMAN 3
- Hogwarts Legacy
- ICARUS
- Jurassic World Evolution 2
- Justice
- Loopmancer
- Marauders
- Microsoft Flight Simulator
- Midnight Ghost Hunt
- Mount & Blade II: Bannerlord
- Naraka: Bladepoint
- NVIDIA Omniverse
- NVIDIA Racer RTX
- PERISH
- Portal with RTX
- Ripout
- S.T.A.L.K.E.R. 2: Heart of Chornobyl
- Scathe
- Sword and Fairy 7
- SYNCED
- The Lord of the Rings: Gollum
- The Witcher 3: Wild Hunt
- THRONE AND LIBERTY
- Tower of Fantasy
- Unity
- Unreal Engine 4 & 5
- Warhammer 40,000: Darktide
This article will be updated once new info arrives.