Summary: Adding more RAM modules than you need can slow your system down. Ok, if that blurb doesn't interest you enough, how about CPU benchmarks using real-world digital photography suites and scientific computing benches that compare the Athlon64 3200+ against the recently built Opteron 246 and a pair of P4 rigs? That's all inside and a brief look at our Athlon64 build: you won't want to miss this article!
Yesterday, you learned everything there was to know about building a true high-performance system. With the dual Opteron complete, we'll have to compare it against a few systems. Intel elected not to offer us Xeon CPUs for Xeon/i875P in this round-up, and so we'll be comparing the Dual Opteron 246 against the Athlon 64 3200+ against a Pentium 4 for today. Don't worry, you won't be disappointd with the content when you're done. For the rest of you techies, before we go on with the numbers, Alexis is going add just a quick overview of the other systems we used in the test. Athlon 64 Construction
With the above requirements, we scoured the internet for potential cases and settled on the Chenbro SR205. Those of you hardcore case modders may be familiar with this case as it was one of the first to offer a front and rear 120mm fan, it even goes beyond that with a 80mm fan in front of the hard drives. The 120mm fan positions can also accommodate 92mm fans as well. As you all know, hard drives are becoming a more substantial source of heat in our systems. With a fan directed at the hard drives, the drives will be kept cool and hopefully have their MTBF's extended. This case has an ample supply of drive bays, with five 3.5" bays and three 5.25" bays. The 5.25" bays use drive rails. I think drive rails are nice, but they are often proprietary so if you lose the set that came with your case, you'll have to contact the manufacturer. This happened to us, and we are pleased to report that Chenbro was very prompt in returning our RMA request and was also very helpful. Clearly they take pride in customer service and stand behind their products. It also supports extended ATX motherboards which not all cases do, even if you don't have an extended ATX motherboard, this alone suggests that the case will be provide adequate room for peripherals and will make working in it a little easier. [image]
A unique feature of this case is the cardholder. At first this appears like a bulky air flow restrictor, but in fact it can be used to provide additional pressure on your PCI or AGP cards, preventing them from falling out during shipping or moving of the computer. This cardholder can accommodate just about any PCI card, except for the very low profile network adaptors. I think more cases should incorporate this to ensure that the cards stay put. [image]
This case is not perfect though. It is clearly designed for a business environment as removal of the side panels to access the interior involves first removing the top cover and then the side panel with a unique handle mechanism. It is not a screwless case either, but personally I prefer the security of old fashion screws, rather than friction or spring based retention mechanism that may fatigue over time. This is a steel case, so it is not a lightweight. The steel is of good quality with a dull grey finish, this is a less expensive steel than the polished shiny steel in the Supermicro case. It uses 1 mm thick steel, and the side panels exhibit minimal flex. In addition, this case won't get too much attention sitting on your desk, which may be a good or bad thing, depending upon your preferences. I would have also like to see some front USB and firewire ports as well. Mods
So no case is perfect, but the question becomes what are you going to do about it? Well, we're taking our dremel out. The plastic front panel of the Chenbro obscures the front 80mm fan intake. With the stock setup, this 80mm fan has a restricted intake path, reducing airflow and increasing noise. What we've done is to simply cut out an opening in the plastic panel to facilitate airflow. This is a common, previously described mod for the Chenbro SR205 case.
SIDEBAR: Does anyone really read these?
We're going with a 400w power supply for this single CPU setup. It never hurts to have extra power on tap. The most important thing when shopping for a power supply is not just the reported wattage, but the distribution of this power. Old cpus like the 486 used to run primarily off the +5v rail, in contrast the current generation of cpus use a +12v rail. So you need to make sure that the power supply has a robust +12v rail. Some power supplies even have a dedicated +12v rail for the processor. In fact, the new ATX12V version 1.3 protocol calls for more robust +12v rails as compared to prior standards. Alan: We made initial plans to go with a Fortron/Sparkle Power power supply, probably the best in the low-cost, high-performance price range. Fortron/SPI power supplies are among the best in the industry and you'll find them rebadged in black with a quiet fan as the Zalman PSU. In this system, we've used a SilenX.com 400W PSU. SilenX is a company in Southern California that has begun developing their own power supplies in-house. Normally, we'd be wary of an unproven design house, however SilenX is building it around the Fortron/SPI platform that we trust, and the manufactured is handled by Fortron - this is the same way PC Power and Cooling also custom designs their PSUs. Basically, SilenX worked with Fortron and contracted power supplies with better heatsinks, larger capacitors, upgraded MOSFETs, and AC line noise filters. This should improve stability of the power supply and SilenX.com also reports having a more stable 12V rail. After production, SilenX adds their own fan design with silicone dampening mounts to reduce the noise. One thing to consider is that the power supply fan is often not the primary source of your system noise, so unless you pair this Silenx power supply with silent system fans, your total system volume may not be much less. [image]
I had initially planned to use the 460W or 550W SilenX active PFC power supply. SilenX.com let us know that they're very conservative with their power ratings and so we've gone with their 400W Active PFC model. Going to an Active PFC with power factor > .99, will improve the efficiency of the unit when compared to a passive PFC (80% true efficiency as opposed to 76%). This means there will be just a little bit of extra power when it's needed. Since power supplies only draw power as needed, the Active PFC models should also reduce the current drawn from the wall. For anyone who leaves their system on 24/7, the small reduction in power use could potentially make a difference. Finally, active PFC models can handle voltage fluctuations better than their passive PFC models and also remove the need for 110/220V switching, something rarely important, but worth noting. At the end of the day, the improved efficiency and the environmental benefit from Active PFC makes it a worthwhile investment for $10. That said, SilenX.com does believe in their lower-end PSUs are were fully confident that their 400W PSU passive PFC would be capable of driving even our a high-performance system in our challenging test environments. Before I move on to the next point, it's worth noting that SilenX.com is currently fighting a battle against Ahanix, ExoticPC, and ColorCase who are also selling "SilenX" power supplies. The only SilenX PSUs that we have experience with are those sold through SIDEBAR: A front panel gameport is the ugliest case feature I've ever seen.
Motherboard
Alexis: The debate continues over the Via KT800 vs the nVidia nForce3 150 chipset. Benchmarks have both chipsets running neck and neck, so it really comes down to which motherboard has the features that you want. It's disappointing that Soundstorm hasn't made its way to the nForce3, and it seems like motherboard manufacturers aren't interested in getting Soundstorm support either. In the end, we just have to hope for a good software-based real-time Dolby Digital ICE to be possible.
Memory
You can't do much without enough memory, we used to think that you can never have too much memory, but we'll get to that a little later. Memory needs to be fast and stable. RAM that can be overclocked to crazy speeds but which gives you occasional errors will be useless in a workstation environment. The hardcore IT guys shoot for "five nines" reliability, or 99.999% uptime, makes sense since their job is on the line if stuff fails. Experience has shown that Corsair's reputation for making fast, reliable RAM has been well earned. They are not the lowest price memory, but they always perform as advertised and sometimes a little more. If you are pushing your system to the limit you need to minimize the variables limiting your overclock, one way to do this is to get the fastest ram possible.
SIDEBAR: If a critical medical device had 99.999% uptime, every patient on it would die.
Athlon 64 3200+
Athlon 64 3200+, enough said. We went with the stock heatsink and fan with Cooler Master Premium heatsink compound (Shin Etsu). The AMD stock heatsink is rather simple, constructed of aluminum without any copper core and only a standard fan. In our testing, it did prove more than adequate in keeping the cpu cool. For extreme overclocking with voltage bumps, a larger heatsink may be needed. We'd go with the Zalman's that are used on our Opteron system. I prefer the aluminum-copper Zalman's vs the all copper ones because of the weight factor. In theory, with tower cases, there may be uneven stress on the heatsink mount since the forces that hold it onto the cpu are perpendicular to the weight of the heatsink. With a desktop case this would not be a factor. I would definitely consider the weight of a heatsink if you are continually transporting your system around. All copper heatsinks do have a performance advantage not only for the increased heat conductivity of copper, but also due to the increased mass. Recall that the rated heat capacity of the heatsinks are measured per gram of mass, so a heavier heatsink will be able to absorb more heat from the CPU with a smaller change in temperature.
In order to answer the question of which CPU is best for gaming and work, we cannot rely on synthetic benchmarks. Instead, we need to get a good variety of real-world tests that can be interpreted and applied to other tasks. We also chose not to explore the overclocking potential of these setups because we wanted to show the minimum expected performance when buying these systems. For my gaming benchmarks, I was planning on going head to head against systems of FiringSquad past, but in the process of running my gaming benchmarks, NVIDIA released 52.16 WHQL drivers and ATI released Catalyst 3.8, invalidating my comparison. In the interest of time, I will leave the 3D benchmark articles to Brandon and Chris, since they'll be able to provide the most meaningful results for you. For our CPU tests, we're focusing on real-world benchmarks for the digital photographer (single and dual CPU) and one scientific computing comparison. If you look at the future of high-performance desktops, you'll likely find yourself looking at digital photography. This is a point where improvements in speed are noticeable. Digital photographers can take hundreds to thousands of pictures a day, but they aren't taking JPEGs. Instead they're capturing RAW images which contain all of the data they get back from the the shoot. They'll develop the images for JPEG for proofing, etc. This is done so that if the color balance is off or if the exposure is slightly off, it is possible to go back to the RAW "negative" and re-develop the image. With regular workloads in 500-1500 images, even small differences in performance can make a big difference in the long run. These numbers are not unrealistic -- I'm not a professional photographer, but I took 980 images this year at San Francisco Fleet Week (a public air show in the San Francisco Bay) images with my digital SLR - I'm glad there is no marginal cost with digital pictures. I need it now!
Unlike rendering animations where you can let it sit and one animation is a month-long project, photographers will need the end results right away so they can proof it, edit the images, and sell the images. They may be starting a new shoot the next day. Second, when it comes to 3D content creation, most of it is user-limited during the day (the CPU is idle when the artist is thinking about what he wants to draw). In the evening it's an overnight render of animation, which essentially means that if the render starts at the end of the workday, it just needs to be ready before the beginning of the workday the following day - it doesn't matter if it finishes at midnight or 15 minutes before the workday starts.
Capture One D-SLR (http://www.phaseone.com)
Ask any owner with a digital SLR what they look for in a computer and it'll be RAW processing performance. I don't mean "raw performance" in the marketing sense, but development of the unprocessed RAW files from the imaging sensor. That is, while the typical consumer digital camera saves images in JPEG, photography enthusiasts and professionals prefer to save RAW images. These "smaller-than-TIFF" files losslessly record exactly what the sensor sees and allow for greater flexibility in developing the film. Capture One D-SLR is the professional's choice for RAW image development and is a multithreaded application that takes advantage of multiple CPUs. NeatImage (http://www.neatimage.com)
Sometimes called the "Big-CCD-in-software," NeatImage is a revolutionary noise reduction application which removes noise while maintaining a surprising amount of detail. The algorithm is fine tuned and robust enough that it's possible to clean up images from webcam with good results to cleaning up images with flagship D-SLRs. If don't use the term revolutionary often, but this is truly revolutionary. We'll likely be including this benchmark in the future Adobe Photoshop 7.01 - The Fred Miranda Actions (http://www.fredmiranda.com)
Look around on the 'net and chances are you'll see people using PSBench for their digital imaging benchmarks. While PSBench has its merits, when is the last time you saw a PhotoShop user use the "Lens Flare" filter? Ok, how about the last time it wasn't used as a spoof at SomethingAwful? Lens flares in Photoshop are like the flashing "under construction" sign of the 'net. Likewise, how frequently do you need to rotate large 100MB images by 0.9 degrees? While that's certainly used more frequently than the lens flare, it's still not very common. Adobe Photoshop 7.01 - Camera RAW ( http://www.adobe.com/products/photoshop/cameraraw.html)
Although Capture One D-SLR is probably the most advanced RAW development software, Adobe Camera Raw is also very popular and provides a good camera-independent test. We evaluated the time it took to open up a D60 image at the native 6 mpixels at 8-bit color and the time it took to internally upsample to a 15 megapixel image at 16-bit color through Camera RAW. These images were processed in Adobe RGB colorspace.
MATLAB - N72 Script (http://www.mathworks.com)
MATLAB is your multipurpose scientific computing application. Every engineer and his brother has used Matlab at one point or another. It's a very flexible application used in high school to teach basic Newtonian physics and was used in industry to design the Joint Strike Fighter. This is also a single threaded application. Why? Mathworks has done their own studies and determined that for most Matlab tasks, a lot of computation is spent parsing and processing the script, something that isn't parallel at all. Parsing scripts isn't a very glamorous aspect of scientific computing, but it's very important to real-world use. Think of the car that does 0-60 in 4 seconds but requires you to refill the gas tank every 10 miles. There's no doubt that the car is fast, but no one would really use it.
This is only one slice of what is scientific computing and each task is going to have a different performance signature for every application. That said, we would much rather develop a set of benchmarks on commercial scientific computing applications that are more likely to represent how CPUs are used instead of home-brew software. (i.e. it's the same question about buying a graphics card to run 3DMark versus buying a graphics card to play games). Our version is Matlab 6.5 Release 13, and all scripts are in pcode. SIDEBAR: I've given up coming up with new random facts for now
Dual AMD Opteron 246 AMD Athlon 64 3200+ Intel Pentium 4 2.8C/800 (Hyper-Threading Enabled) Intel Pentium 4 3.0C/800 (Hyper-Threading Enabled) Tyan Thunder K8W (Opteron) ASUS K8V Deluxe (Athlon 64) Intel Desktop Board D875PBZ "Bonanza" (Pentium 4) 1 GB Corsair 2xCMX512RE-3200LL XMS DDR400 Registered ECC Ram (Opteron) 1 GB Corsair 2xCMX512-3200LL XMS Pro DDR400 (Athlon64) 1 GB Corsair 2xCMX512-3200LL XMS Pro DDR400 (P4 2.8) 1 GB Crucial 2x512MB DDR400 CL3 (P4 3.0) SuperMicro SP-450 PSU (Opteron) SilenX.com 400W 16 dB PSU (Athlon64) Forton/SPI 300W ATX PSU - standard MicronPC (Pentium 4) Monster Power HTS 3600 Power Conditioning Windows XP Professional SP1 We did not have time to test the Pentium 4 3.0GHz on Corsair RAM, but the CL3 RAM is what is stock on many store-bought PCs, but Crucial ram is probably better than most generic. Don't worry, read through the entire article before you complain… Final words before we show the numbers
Our focus on digital photography should be seen in two ways. If you're interested in digital photography, this will be an important evaluation of AMD versus Intel CPUs as well as a demonstration of the benefits of a dual processor system. If you're just interested in a "standard CPU" review, think of these as real-world synthetic tests of memory performance and FPU/SSE/MMX performance. All systems were running unfragmented hard drives. We had initially run these tests on a RAM drive to take the hard drive performance out of the equation, but found that for these tests, the hard drive performance was not significantly different between these ATA systems.
Capture One D-SLR
Photoshop 7.01 Open 6MP CRW
![]() The Athlon 64 3200+ is the surprising leader here. The fast memory latency appears to bring it even above the rest of the group. Notice also that the P4 2.8 GHz and P4 3.0 GHz perform identically. This is yet more proof that you shouldn't buy the fastest CPU if you're not ready to back it up with the fastest memory. ![]() Once again the three CPUs are very closely matched. The 5-second improvement with the dual Opteron 246 is significant given the frequency of how often this action is run, and really shows how well the Opteron takes to symmetrical processing. SIDEBAR: The state of Florida is bigger than England.
Neat Image
So far our digital camera tests have been somewhat uneventful. The P4, Opteron 246, and Athlon64 3200+ have been neck and neck. To keep you interested, this is yet another example that fast memory can be more important than a fast CPU. In our last test with Neat Image, things get considerably more interesting.
Matlab N72
Preliminary Conclusion
It seems like the Pentium 4 and Athlon 64 3200+ are very closely matched for RAW processing and the Fred Miranda scripts. However, the Athlon64 takes a considerable lead with Neat Image, the most CPU intensive application in our digital photography test suite. Equally important is the fact that NeatImage has become one of the "must have" applications for any digital photographer, amateur or professional.
It's not over
If we weren't FiringSquad, we probably would have ended the article there and patted ourselves on the back. Most reviews on the net test 2x512MB memory setup or 2x256MB. It's cheap, popular, and certainly a recommended budget configuration. But this is all about high-end systems right? What if you want more than 2 DIMMs? That's where things get interesting. Athlon 64
Although Athlon64 motherboards ship with 3 DIMM slots, they can only run 2 DIMMS (4 banks) at full speed. This limits you an Athlon64 system to 2x256MB (two single bank), 2x512 (two double bank), or 2x1024 MB (two high-density double bank) memory configurations. Should you go with 3x256MB, your memory speed will drop to DDR333 and going with 3x512MB of RAM will drop you all the way down to DDR200 (PC1600)! Registered Memory on Athlon 64 FX and Opteron
On the Athlon 64 FX and Opteron CPUs, there really isn't any difference in performance with additional DIMMs. Those CPUs are designed to handle 4 DIMMs (8 banks) without any problems, and since it's registered, adding DIMMs does not increase latency - you've already paid the toll at the register. Simply put, the registered DIMM design allows you to add additional memory to the system without adverse affecting system performance. And Intel?
The Intel i865PE Springdale and i875P Canterwood platforms claim to preserve full DDR400 clock speeds, but will add additional latency with greater than two DIMMS. Those of you with careful reading skills will notice that I used the word "claim." Lost in Translation? Here's the summary
On the Athlon64 and Pentium4 i875 platforms, the more memory you add, the slower the memory performance. In the case of the Athlon64, the drop occurs with a drop to 1.6GB/s bandwidth, half its peak. It can transfer less data for the same amount of time. In the case of the Pentium4, the performance drop is supposed to be an increase in latency, but maintains the DDR400 bandwidth of 3.2GB/sec. It's top speed is the same, but has slower acceleration.
Dual AMD Opteron 246 AMD Athlon 64 3200+ Intel Pentium 4 2.8C/800 (Hyper-Threading Enabled) Tyan Thunder K8W (Opteron) ASUS K8V Deluxe (Athlon 64) Intel Desktop Board D875PBZ "Bonanza" (Pentium 4) 2 GB Corsair 4xCMX512RE-3200LL XMS DDR400 Registered ECC Ram (Opteron) 2 GB Corsair 4xCMX512-3200LL XMS Pro DDR400 (Pentium 4) 1.5 GB Corsair 2xCMX512-3200LL XMS Pro DDR400 (Athlon64) SuperMicro SP-450 PSU (Opteron) SilenX.com 400W 16 dB PSU (Athlon64) Forton/SPI 300W ATX PSU - standard MicronPC (Pentium 4) Monster Power HTS 3600 Power Conditioning Windows XP Professional SP1 Comments
1.5GB represents the peak memory use on the Athlon64 3200+ using our 512MB Corsair XMS Pro 3200LL DIMMS. Since the Athlon64 Clawhammer uses a single channel 64-bit memory controller, we wouldn't expect any problems from using an odd number of DIMMs.
Capture One D-SLR
Photoshop Camera Raw
There's no point in commenting on these numbers anymore. It's pretty obvious what's going on. Fred Miranda
Neat Image
Matlab
SiSoft Memory Bandwidth
Closing Thoughts
There are two key teaching points in this article.
Conclusion
With today's ultra-high performance memory controllers, adding extra memory modules can actually cause a decrease in performance. On all non-registered systems, the use of 2 DIMM slots should be considered the optimal configuration. Final Verdict
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| © Copyright 2003 FS Media, Inc. |