
Let’s put this in perspective, shall we? The current generation GeForce RTX 3090 is the fastest consumer card NVIDIA has ever introduced. It is built around a GA102 Ampere GPU with 72 power multiprocessors and 10,496 CUDA cores, along with 328 texture map units, 112 version output units, 328 Tensor cores, and 82 dedicated radiation detection cores.
- GeForce RTX 3090: 72 SMs, 10,496 CUDA cores
- GeForce RTX 3080: 68 SMs, 8,704 CUDA cores
- GeForce RTX 3070: 46 SMs, 5,888 CUDA cores
- GeForce RTX 3060 Ti: 38 SMs, 4,864 CUDA cores
The same player has now suggested that Lovelace could use an 18,432 CUDA core. Look…

It sounds more like speculation than inside information, but given the leaker’s record, it can speculate based on inside information. Unfortunately @ kopite7kimi does not elaborate on the alleged 12 * 6 structure, but it is interesting nonetheless.
This amounts to an increase in graphics processing groups (GPCs) to 12, compared to a full-fat GA102 Ampere GPU with 7 GPCs. If accurate, Lovelace could potentially accommodate 72 texture processing groups (TPCs) and 144 streaming multiprocessers, providing a whopping 18,432 CUDA cores.
This would be an absolute sample if the information is accurate. And who really knows this at this early stage. Remember that Turing was first launched in September 2018, while Ampere came out two years later. It is therefore an architecture that we will only see in the latter part of 2022.