FiringSquad: Home of the Hardcore Gamer - Games, Hardware, Reviews and NewsSubmit your own or view users' CPU overclocking results!

  
 Home   News   THE MATRIX   Deals   Hardware   Games   Features   Media   Products   Forums   FS China 
AddThis Social Bookmark Button

Home : Hardware : Video Cards : ATI RADEON X1800 XT 512MB/X1800 XL Performance Preview
» Join the Greatest Gaming Community NOW! (It's free)

Already a member? Login
 


Random Gallery >> 
Click to view high-res Image!
Medieval 2: Total War Review Screenshots [62] (3)


C&C:Renegade Review, wrist-slittingly good! (8) by McStu
Know Your Roots: Unreal Tournament Review [Preliminary #2] (7) by Discobiscuits
World in Conflict (PC) Review (6) by Kessandra
Clive Barker's Jericho Review (Round 2) (6) by jacobvandy
Puzzle Quest (PC) Preview (3) by Beefysworld
Defense of the Ancients Review (Warcraft III Map/Mod) [Preliminary #2] (6) by Discobiscuits
Monstru's Biostar Contest results (0) by Monstru
Far Cry (PC) Review by exe3 (4) by exe3
A Truely "Epic" Game: Unreal Tournament 3 Review (4) by Discobiscuits
Guitar Hero 3 - The thing that should not be (UPDATED) (5) by Beefysworld

More Blogs >>




ATI RADEON X1800 XT 512MB/X1800 XL Performance Preview
October 05, 2005   Brandon Sandman Bell > [View My Other Articles]
Product Info | User Reviews | Article Images(31) | Image Gallery | Comments | Forum Thread
New 3.0 shaders


The RADEON X1800 is ATI’s first shader model 3.0 graphics part. As we learned with the GeForce 6800 launch a year ago, shader model 3.0 brings with it support for more instructions, thus allowing developers to write more complex shader programs. In addition to this, another important feature that shader model 3.0 added was dynamic branching (flow control), allowing developers to add loops to their programs.

This particular feature was designed to make writing shaders easier for developers, one common example used was multiple light sources. In previous shader models, the developer would have to write a shader for each light. Dynamic branching makes it possible for the developer to write one shader, which then loops through a certain number of vertex lights and exits once all the lights have been processed. This helps to reduce shader count complexity. Another potential advantage to branching is reducing the variety of shaders used (i.e. many different shaders versus one).

Besides eased development, shader model 3.0 also presents potential performance improvements. For example, developers can use dynamic branching to skip large portions of code that are determined to be unnecessary, and thus help to speed up the shader.

Branching, if not used carefully however, can introduce slower performance. With RADEON X1800, ATI sought to improve branching and also improve texture fetching. After all, if a pixel shader needs to look up a texture value that is not located in the texture cache, it must look in graphics memory, which can introduce hundreds of cycles of latency.

To improve flow control, ATI breaks down the pixel processing workload into a large number of small threads. ATI refers to this as ultra-threading. These threads consist of small 4x4 blocks of pixels (16) on which the same shader code is executed.

ATI RADEON X1800 XT 512MB/X1800 XL Performance Preview [  @ 573 x 530 ] > View Full-Size in another window.


ATI RADEON X1800 XT 512MB/X1800 XL Performance Preview [  @ 574 x 406 ] > View Full-Size in another window.



Secondly, ATI adds dedicated flow control logic. The RADEON X1800 features an ultra-threading dispatch processor which acts as a central dispatch unit that tracks and distributes up to 512 threads across the RADEON X1800’s shader processors. Each of these shader processors consists of four pixel shaders, what has traditionally been referred to as “quads”. Each of these processors is autonomous and contains its own dedicated branch unit to help eliminate flow control overhead in the shader processors.



Whenever the dispatch processor determines that a core has become idle, it is assigned a new thread to execute. If the idle thread was waiting for data, it is temporarily suspended until that data becomes available, thus freeing its ALUs to work on other threads. ATI claims that this enables the Radeon X1800 pixel shader cores to maintain over 90% utilization in practice, with negligible idle time regardless of the shader code being run.

In closing, ATI feels that by breaking the pixel processing workload into smaller threads, the RADEON X1800 works more efficiently. Ultra-threading also hides the latency normally encountered with texture fetching. Meanwhile, the X1800’s dedicated flow control logic minimizes shader processor idle times and wasted cycles. All this adds up to improved flow control, which will become increasingly important as developers continue to implement branching in their code.


Back! New specs     512-bit ring bus? Next!
Blog + Share: Digg Del.icio.us Reddit SU furl • More: AddThis Social Bookmark Button
Send This Article to a Friend!  
Table of Contents
  Print Entire Article  

MATRIX CONTENT » RANDOM MEDIA BLOG More Blogs >>
No ratings yet
» Please rate this
I am an AMD AgentRead this Media-Blog entry!» The Orange Box Review in 500 Words! [Preliminary #2] (7)
by Swatt (14) Talk with this user on their Shout Box (My other blogs) Posted 12 months ago

Sponsored Links
:
[GO]


 Latest Headlines
Biostar $2500 Overclocking Champion Announced (3)
Wrath of Lich King breaks sales records (4)
GeForce GTX 260-based GX2 card coming Jan 9th? (2)
PhysX-accelerated Mirror's Edge coming in Jan (6)
NVIDIA posts WHQL-certified Big Bang II driver (5)
Today's News >>
Today's Siteseeing >>


 Table of Contents


FiringSquad is powered by... Back to Top Site MapContact UsAdvertise With Us Privacy StatementAbout Us  
News RSSSiteseeing RSSArticle RSS   © 1998-2008 FS Media, Inc. All Rights Reserved