GTX680 Quad SLI vs HD7970 Quad CrossfireX Review

👤by Richard Weatherstone Comments 📅19-04-12
Specifications (GTX680)


Firstly, comparing the GTX680 with it's forbear, the GTX580 we see that there are some significant changes. Immediately identifiable are the amount of stream processors which have tripled. from 512 to 1536. The ROPs have however dropped 25% but to compensate the core and memory clockspeeds have been massively increased, with the GTX680 being the clocked slightly above the 1GHz mark.

The battle everyone wants to see though is the GTX680 vs HD7970 and comparing the two cards against one another is a good place to start. On paper at least, the GTX680 appears to have the slight upper hand. Sacrificing a gigabyte of GDDR5 and running with the 256-bit interface, the GTX680 cannot hope to compete in total bandwidth available losing out to both the HD7970 and HD7950 by a fair margin despite having a much higher memory overclock. The tables are however turned when we study the texture fillrate where the GTX680 comes out on top.



It was almost two years to the day that we saw the introduction of NVIDIA's Fermi, NVIDIA's previous architectural change. The GF100 featured 512 stream processors in a 16x32 format built by TSMC in a 40nm process. Sporting OpenGL 4.0 and DirectX 11, enthusiasts climbed over each other to get the latest and greatest GPU from NVIDIA. While the performance was there, power efficiency was very poor with the cards consuming lots of power and running very hot.

Enter the 5 series 8 months later and many would say the GTX580 was the card the GTX480 should have been. It was more powerful thanks to the fully unlocked memory controller (6x64bit), ran cooler and thus became a firm favourite among the high end GPU market.

For a little over a year the GTX580 enjoyed sitting atop of the GPU pile (save for dual GPU variants) and it wasn't until the AMD HD7970 emerged in December 2011 that the GTX580 lost it's performance crown. NVIDIA however had not been resting on their laurels...

GTX680 GPU
The successor to Fermi is codenamed Kepler. Listening to feedback of Fermi from gamers around the world, NVIDIA sought to create a graphics card that was not only the best money could buy in terms of performance but crucially it had to run cool and quiet to make for a power efficient model. Kepler addresses these issues in two ways, GPU Boost which we will discuss further in our overclocking section of the review and a redesigned streaming multiprocessor.

Kepler Core
The GTX680 is equipped with a GK104 core which NVIDIA claim is their highest and most power efficient GPU to date. Fabricated on the 28nm process, every component was designed with power efficiency in mind to ensure the GPU gave the best performance-per-watt possible.

While the 28nm manufacturing process holds the bulk of the power saving features, the way the new architecture works is also key to furthering Kepler's power reduction. SMX, an evolution of the SM from Fermi is NVIDIA's new streaming multiprocessor:



The SMX now runs at the graphics closk speed rather than 2x that speed as previously but because the GK104 has 1536 CUDA cores (8xSMX), over Fermi's (GF110) 512 CUDA cores (16xSM), Kepler allows the GTX680 to operate at twice the performance per watt measurement compared to the GTX580.

The clock throughput of FMA2, SFU and texture operations have all been significantly increased and while some operations still retain the same speed as the GTX580, the GTX680's much higher core clockspeed ensures a substantial increase for all GPU operations.

To feed the new SMX, each unit has four warp schedulers, each capable of firing off two instructions per warp, every clock. With a redesigned scheduling function, the GTX680 is a lot less complex and much more streamlined allowing the compiler to determine which instructions will be issued and can provide this information direct to the hardware block rather than going around the houses using Multi-port decode and a register scoreboard along with the dependency check before the information gets issued. This method is both quicker and more importantly, more power efficient.



Perhaps the biggest change to the processor core is the omission of the old Shader Clock. The shader clock was getting tired as it was part of the old Tesla architecture which was implemented as an area optimisation. Because of the way Kepler executes instructions at a much more streamlined manner and at a higher clock rate, fewer copies of the execution unit need be made.

Another improvement over Fermi with the introduction of SMX is the redesigned Polymorph engine (2.0). The Polymorph engine takes card of the tessellation workload of the GTX680. The design of the new Polymorph engine is to ensure that the tessellation we see on screen has little (as possible) impact on the rendering performance. What you may find bizarre though is that NVIDIA have cut the Polymorph engine count in half to 8 from Fermi's 16 yet claim it is almost twice as quick. This is thanks to the improved core an memory clockpeed.

One area where NVIDIA are shouting from the rooftops is the power consumption area. With a max power draw of 195W compared to 250W of the HD7970, the GTX680 should save a few more yards of the Brazillian rainforests (and of course cut your electricity bill) than the HD7970. Price wise there is little to choose between them. The GTX680 is a little more expensive yet only because of the recent price drops of the HD7900 series. At launch, the HD7970 was significantly more expensive.

Overall, looking at the specifications you would be forgiven for thinking that Kepler is not the GPU everyone was hoping for. Sure it has faster clockspeeds but how does that transpire to performance when the core hardware (stream processor count, memory size) is not as impressive as AMD's?

Let's take a look at the competition...

22 pages 1 2 3 4 > »

Comments