Ada Lovelace is NVIDIA's fourth-generation RTX architecture, manufactured on TSMC's custom 4N process. It introduces 4th-generation Tensor Cores with FP8 support, 3rd-generation ray tracing cores, and the Shader Execution Reordering (SER) engine for improved workload scheduling.
Ada Lovelace is NVIDIA's fourth-generation RTX architecture, manufactured on TSMC's custom 4N process. It introduces 3rd-generation ray tracing cores, 4th-generation Tensor Cores with FP8 support, and the Shader Execution Reordering (SER) engine for improved workload scheduling.
The RTX 4090 features the full AD102 GPU die with 128 Streaming Multiprocessors (SMs), each containing 128 CUDA cores for a total of 16,384. Its 512 Tensor Cores can perform FP8 matrix operations at up to 1,321 TOPS, making it exceptionally efficient for quantized LLM inference.
The memory subsystem uses a 384-bit bus connected to 24 GB of Micron GDDR6X running at 21 Gbps, delivering 1,008 GB/s of bandwidth. For AI inference, this bandwidth is the primary bottleneck β it directly determines how many tokens per second the GPU can generate during autoregressive decoding.