CFLAGS Central (Part 1)

Säck · Posted: Wed Mar 03, 2004 9:19 am Post subject:

I tink i'll do a complet new install of my gentoo system, since i have played around a little bit too much and my hd is full.

i have a pentium 4-m and i'd like cflags settings that will work withouth problems.
my actual settings are:

CHOST="i686-pc-linux-gnu"
CFLAGS="-march=pentium4 -O3 -pipe -fomit-frame-pointer"

this has worked out in most of the cases pretty well, but not allways. Openoffice didn't compile, and strangely kde 3.2 korganizer doesn't work right. in a thread (i can't remember which one) that this might come from march=pentium4.

Well my next system should be a system that is optimized but STABLE!!
So I consider lowering my CFLAGS to

CFLAGS="-march=i686 -O2 -pipe -fomit-frame-pointer"

now my questions:
-is the change from -march=pentium4 to -march=i686 decrasing performance drastically.
-shouldn't i actually use -mcpu=i686 since my cpu isn't a pentium pro?
-is this going to result in a more stable system?

and my last question: when I do a stage 3 install, well what are the cflags of the i686 and the pentium 4 installation by default?

greets and thanks for your help
_________________
Remember: Gentoo Rocks

tapted · Posted: Thu Mar 04, 2004 10:56 am Post subject:

n3m0 · Posted: Sat Mar 06, 2004 2:20 pm Post subject:

sleek · n00b Joined: 09 Jan 2003 Posts: 71

For all those with an Intel Celeron (Coppermine) 600mhz CPU:

fishhead · Posted: Tue Mar 09, 2004 3:04 am Post subject:

KingPunk · Posted: Tue Mar 09, 2004 8:21 pm Post subject:

just thought i'd add my two point two cents

tapted · Posted: Tue Mar 09, 2004 10:23 pm Post subject:

KingPunk · Posted: Tue Mar 09, 2004 10:53 pm Post subject:

odd enough, i've compiled the whole system with it. rofl.

and they say it will in fact, make it run slower.
so, what would be the best to use?
like, if you were to get the cflags to run on a 2500+ barton, 333fsb, 512 L2,
... what would you run?

i want to get the absloute fastest system going. that way i can get
every edge over my friends box hes building. (we got a nice little
competition going :twisted:

..and he doesn't know how to do software
optimizations, via cflags, so yeah!)

so if i could get ahold of the "best" flags to use, without the need
for debugging, i just want my box to smoke. as long as it isn't menthol,
har har har :lol:

thanks much.
~KingPunk
_________________
When the FBI/CIA/NSA/FDA/and other three-letter government agencies come looking, you don't know me, you never saw me, never heard of me. get it? got it? good!
also: ALL YOUR POLLITICAL BASE ARE BELONG TO HILLARY IN '08!!

n3m0 · Posted: Wed Mar 10, 2004 8:03 pm Post subject:

punter · Guru Joined: 25 Nov 2002 Posts: 506

Gentree · Posted: Fri Mar 12, 2004 5:41 pm Post subject:

robmoss · Posted: Fri Mar 12, 2004 6:23 pm Post subject:

I was under the impression that -malign-double was very, very good indeed... when it works. I may have to test this.
_________________
Reality is for those who can't face Science Fiction.

emerge -U will kill your Gentoo
ecatmur, Lord of Portage Bash Scripts

n3m0 · Posted: Fri Mar 12, 2004 7:56 pm Post subject:

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381

n3m0: Aligning the functions to take the whole width of the cache would cause something called cache misses and also fill it up with usless data since when it needs something that is say only 8 bytes it causes the extra 56 bytes to be filled with junk thus filling your caches with junk to my understanding -falign-functions and -falign-jumps only compiles some of the code into the boundries and not all (not all meaning other code).

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381

As far as -malign-double its use is to compile code into a two word boundry instead of the default. This generally maims the alignment, it's not needed. On the other hand if you feel like you need to ride the really really wild side of gcc optimizations then try -mregparm=3 this controls how many registers are used to pass integer arguments from 1-3, which is a good thing but make sure you do that on a fresh install.

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381

If you want to have an ultra optimized system try out these flags:
CFLAGS="-march=athlon-xp -O3 -pipe -fomit-frame-pointer -momit-leaf-frame-pointer -ftracer -fno-crossjumping -falign-functions=16 -falign-loops=16 -falign-jumps=16 -fno-align-labels -mfpmath=sse,387 -maccumulate-outgoing-args -fmove-all-movables -freduce-all-givs"
#-fnew-ra ( use -fnew-ra with caution) All these flags optimize without an additonal increase in memory usage or drive space usage of what -O3 specifies.

neenee · Veteran Joined: 20 Jul 2003 Posts: 1786

i now use:

CFLAGS="-O2 -march=athlon-xp -pipe -fomit-frame-pointer -ftracer"

KingPunk · Posted: Tue Mar 16, 2004 12:58 am Post subject:

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381

tapted · Posted: Wed Mar 17, 2004 8:21 am Post subject:

I'll say it again: the consensus seems to be that -mfpmath=387,sse is bad...

According to

http://gcc.gnu.org/onlinedocs/gcc-3.3/gcc/Optimize-Options.html
and
http://gcc.gnu.org/onlinedocs/gcc-3.3/gcc/i386-and-x86-64-Options.html

it would also appear that -fomit-frame-pointer \implies -momit-leaf-frame-pointer

and -mfpmath=387 is the default for all but the Athlon x86-64 compiler

-ftracer is new in gcc3.3 and looks good.

-fno-crossjumping and -fno-align-labels are not mentioned directly -- perhaps someone knows benefits/disadvantages.

-maccumulate-outgoing-args also looks handy.

Here's a snip

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381

seppe · Posted: Wed Mar 17, 2004 3:06 pm Post subject:

What do you guys suggest for this cpu?

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381

First off I have to say don't listen to a good amount of people here. Some people seem to be giving bad advice. why? most likely they don't know what there talking about. (this isn't to anyone in particular). I really don't see why people are telling you to use -Os since your system is well within the limits of even -O3 and -O3 will add a few much needed flags to your compiles that your march flag specifies so to wrap this up heres what i recommend:
-march=pentium3 -O3 -pipe -fomit-frame-pointer -momit-leaf-frame-pointer -ftracer -fno-crossjumping -mfpmath=sse -maccumulate-outgoing-args -fmove-all-movables -freduce-all-givs that will give you a noticable increase in speed. Also -ffast-math is totaly up to you, but i don't recommend it since you'll get a 40% increase in speed in very very very rare occasions.

nmcsween · Guru Joined: 12 Nov 2003 Posts: 381