
Interesting. At my department we are buying one possibly two Dual 2GHz opterons with 6GB mem and 30 250GB Hard drives (6TB Raid5) attached using 4 8506 cards. They are going to be used for data analysis with large (100GB+) data sets, so we need every MB per sec we can get. This is a purely IO bound application.backebergd wrote:Regarding the 3ware card, are you planning on RAID5? If you have the resources, do a benchmark between RAID5 through the 3ware card and JBOD with the 3ware card/software raid 5. My experience was that software RAID was almost twice as fast. We tried the 8506 with eight, six, and four 200GB SATA drives. That benchmark was actually done on a Xeon machine, but you should be able to see similar results on the dual Opteron.

If this work can be spread across more than one machine your best performance might be to limit yourself to 15/16 disks per box. Is all this stuff going in a 6U chassis?!haugboel wrote: Interesting. At my department we are buying one possibly two Dual 2GHz opterons with 6GB mem and 30 250GB Hard drives (6TB Raid5) attached using 4 8506 cards. They are going to be used for data analysis with large (100GB+) data sets, so we need every MB per sec we can get. This is a purely IO bound application.
I envision a few problems in this scenario. First of all, definitely put labels on both ends of every cable you use. Second, what about heat? How are you going to cool this monster set of drives? And then finally your question about cpu cycles. The real answer is I don't know your application. You say it's data bound, rather than cpu bound, so you'll have to do your own benchmarking to get the answer to that. Even if your performance is okay, think about the capacity you're losing:haugboel wrote:Do you think it still would pay off to use a JBOD configuration or does it take up too many CPU cycles with 30 drives? I was wondering that for best performance a two layered structure would be optimal. Ie: Use hardware Raid0 and combine the raid sets of different controllers using software Raid5. Effectively giving raid50. In this way data read - typically a 2-4 GB chunk of data - would spread out over all the controllers and PCI busses.
No links to the test results, but you can replicate everything I did yourself. Put the drives in, configure them how you want (press Alt-3 at boot, edit /etc/raidtab.conf, mkraid), then use multiple passes of hdparm, or even try out sample data with your actual applicaition.haugboel wrote:Do you have any links to the test results![]()
What would be the optimal filesystem, XFS? ReiserFS? (Reiser4)
We are going to have it in a tower (or should I call it a cubebackebergd wrote:If this work can be spread across more than one machine your best performance might be to limit yourself to 15/16 disks per box. Is all this stuff going in a 6U chassis?!
The TyanS2880GNR motherboard we will use has two PCI-X busses, there should be enough bandwith. Depending on how it goes we may opt for a quad opteron for the second system, if it comes with three or four pci busses.backebergd wrote: 2 8506-8s on one PCI bus is enough strain. I don't know what would happen with four. I know that dual channel-bonded Myrinet cards is enough to nearly chew up the bus capacity in a PC architecture.
The case is designed for being a file server. Cooling should be ok. We get the system already assembled and stress tested. It is going to be in a server room which has a wind tunnel like ventilationbackebergd wrote:I envision a few problems in this scenario. First of all, definitely put labels on both ends of every cable you use. Second, what about heat? How are you going to cool this monster set of drives? And then finally your question about cpu cycles. The real answer is I don't know your application. You say it's data bound, rather than cpu bound, so you'll have to do your own benchmarking to get the answer to that.haugboel wrote:Do you think it still would pay off to use a JBOD configuration or does it take up too many CPU cycles with 30 drives? I was wondering that for best performance a two layered structure would be optimal. Ie: Use hardware Raid0 and combine the raid sets of different controllers using software Raid5. Effectively giving raid50. In this way data read - typically a 2-4 GB chunk of data - would spread out over all the controllers and PCI busses.
Mentioning Reiser4, was just for the hell of itbackebergd wrote: No links to the test results, but you can replicate everything I did yourself. Put the drives in, configure them how you want (press Alt-3 at boot, edit /etc/raidtab.conf, mkraid), then use multiple passes of hdparm, or even try out sample data with your actual applicaition.
I'm personally prejudiced towards reiser3. It's very solid, and how much trouble are you going to have if your data gets toasted? I don't think reiser4 is quite ready for production data. XFS is supposed to have some performance enhancements over reiserfs in certain tests. You should try both to see which gets better raw numbers on your data and the way you do your read/write patterns. One warning I've heard from others is that if you do choose XFS, have your box on a UPS, because it doesn't recover from disaster very well. Again, this is just second hand, I've never played with XFS.
Thanks for the feedback.backebergd wrote:Which Tyan motherboard are you using? They have numbers, like 2885. Generally speaking, I've seen the Arima dual Opterons running better than the dual Opteron Tyan boards. Fewer problems in general. Also, if you're running DDR 333 or DDR 400, it pays to use vendor-approved memory. The cheap stuff can get you into trouble; it's worth the extra $20 /dimm or whatever the difference is.
Regarding the 3ware card, are you planning on RAID5? If you have the resources, do a benchmark between RAID5 through the 3ware card and JBOD with the 3ware card/software raid 5. My experience was that software RAID was almost twice as fast. We tried the 8506 with eight, six, and four 200GB SATA drives. That benchmark was actually done on a Xeon machine, but you should be able to see similar results on the dual Opteron.
And BTW, 4GB is a lot of ram, but it's not a monster machine anymore. The current quad opteron from Celestica supports 32GB if you use 2GB dimms.

Code: Select all
# bonnie++ -u root -s 16g -n 100:50k -d /some-directory
Version 1.03 ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
machina 16G 24959 65 24993 10 17894 5 21846 71 86172 11 211.8 0
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files:max /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
100:50:0 327 98 200883 100 41699 99 327 98 188339 100 390 49
machina,16G,24959,65,24993,10,17894,5,21846,71,86172,11,211.8,0,100:50:0,327,98,200883,100,41699,99,327,98,188339,100,390,49
doerrfleischfee wrote:Hello, may I ask you if you can tell me when you re encountering any problems regarding to 3ware escalade raid controllers an the tyan s2885 mobo?
My box has a 3ware 8506-8 attached to pci-x bus B on an K8W with four WD740GD discs in raid5.
On my way setting up this machine I am confronted with the following problems:
with kernel 2.6.0 (attached to pci-x bus 1 (133 MHz max)): decreased system stability, kernel module 3w-xxxx reporting fatal error "nothing to flush?", data loss
with kernel 2.6.1 (attached to pci-x bus 1):
system locks at boot when loading 3w-xxxx driver (compiled into kernel)
with kernel 2.6.0 (attached to pci-x bus 2):
3w-xxxx still producing "nothing to flush?"-error, but no data loss
with kernel 2.6.1 (attached to pci-x bus 2):
no big problems, but still bad performance:I have contacted 3ware in this but still no answer...Code: Select all
# bonnie++ -u root -s 16g -n 100:50k -d /some-directory Version 1.03 ------Sequential Output------ --Sequential Input- --Random- -Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks-- Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP machina 16G 24959 65 24993 10 17894 5 21846 71 86172 11 211.8 0 ------Sequential Create------ --------Random Create-------- -Create-- --Read--- -Delete-- -Create-- --Read--- -Delete-- files:max /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP 100:50:0 327 98 200883 100 41699 99 327 98 188339 100 390 49 machina,16G,24959,65,24993,10,17894,5,21846,71,86172,11,211.8,0,100:50:0,327,98,200883,100,41699,99,327,98,188339,100,390,49

If you check the whole archive(!) from November until now, the 3ware question comes up multiple times, and it seems they found a work around. Furthermore it should only be a problem for people with more than 3.5GB of memory. There are also problems with the AGP controller on the TYAN board. In the SUSE mailing list people upgrade to the latest (beta) bios. YMMV, but if I had known, I wouldn't have ordered a TYAN board.doerrfleischfee wrote:Please read the following complete thread http://lists.suse.com/archive/suse-amd6 ... /0096.html
I just did and I'm very concerned. I will also try to get answers from Tyan and 3Ware.doerrfleischfee wrote:Please read the following complete thread http://lists.suse.com/archive/suse-amd6 ... /0096.html
Code: Select all
Version 1.03 ------Sequential Output------ --Sequential Input- --Random-
-Per Chr- --Block-- -Rewrite- -Per Chr- --Block-- --Seeks--
Machine Size K/sec %CP K/sec %CP K/sec %CP K/sec %CP K/sec %CP /sec %CP
titan 16G 36201 99 94009 25 24543 7 21308 74 75273 11 250.8 0
------Sequential Create------ --------Random Create--------
-Create-- --Read--- -Delete-- -Create-- --Read--- -Delete--
files:max /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP /sec %CP
100:50:0 2389 31 138209 99 1835 17 2391 32 129041 100 289 3
titan,16G,36201,99,94009,25,24543,7,21308,74,75273,11,250.8,0,100:50:0,2389,31,138209,99,1835,17,2391,32,129041,100,289,3
Code: Select all
00:06.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8111 PCI (rev 07)
00:07.0 ISA bridge: Advanced Micro Devices [AMD] AMD-8111 LPC (rev 05)
00:07.1 IDE interface: Advanced Micro Devices [AMD] AMD-8111 IDE (rev 03)
00:07.2 SMBus: Advanced Micro Devices [AMD] AMD-8111 SMBus 2.0 (rev 02)
00:07.3 Bridge: Advanced Micro Devices [AMD] AMD-8111 ACPI (rev 05)
00:07.5 Multimedia audio controller: Advanced Micro Devices [AMD] AMD-8111 AC97 Audio (rev 03)
00:0a.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8131 PCI-X Bridge (rev 12)
00:0a.1 PIC: Advanced Micro Devices [AMD] AMD-8131 PCI-X APIC (rev 01)
00:0b.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8131 PCI-X Bridge (rev 12)
00:0b.1 PIC: Advanced Micro Devices [AMD] AMD-8131 PCI-X APIC (rev 01)
00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:19.0 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:19.1 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:19.2 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
00:19.3 Host bridge: Advanced Micro Devices [AMD] K8 NorthBridge
01:06.0 RAID bus controller: 3ware Inc 3ware 7000-series ATA-RAID (rev 01)
02:07.0 VGA compatible controller: nVidia Corporation NV11 [GeForce2 MX/MX 400] (rev b2)
02:09.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5703 Gigabit Ethernet (rev 02)
03:00.0 USB Controller: Advanced Micro Devices [AMD] AMD-8111 USB (rev 0b)
03:00.1 USB Controller: Advanced Micro Devices [AMD] AMD-8111 USB (rev 0b)
03:0c.0 FireWire (IEEE 1394): Texas Instruments TSB43AB22/A IEEE-1394a-2000 Controller (PHY/Link)
04:00.0 Host bridge: Advanced Micro Devices [AMD] AMD-8151 System Controller (rev 13)
04:01.0 PCI bridge: Advanced Micro Devices [AMD] AMD-8151 AGP Bridge (rev 13)
The knowledge base entry (#10964) also points to a PDF short paper on the problem. https://www.3ware.com/kbadmin/attachmen ... %20A_P.pdfFROM 3Ware's Knowledge Base:
Q10964 - Software Configuration: I am having trouble with 2880 and 2885 Tyan motherboards that support AMD Opteron chipsets.
3ware recommends installing the Escalade 7506/8506 series controllers with a 3ware recommended riser card and only in slots 1 or 2 on the Tyan S2880 and the Tyan S2885. These slots are located closest to the memory modules (DIMM sockets) on the motherboards. A 3ware recommended riser card from Adex is required (see attached tech brief). Please review the linked technical brief that describes issues that are known that affect interoperability between 3ware 66 MHz RAID controllers, and certain riser card and Motherboard combinations.
Placing a 66MHz 3ware Escalade RAID controller in slots 1 or 2 without a 3ware recommended riser card, or in slots 3,4 or 5 on these Tyan platforms may impact data integrity. The attached document describes these issues in more detail.

