Re: speed performance: reference vs. pointer
Gennaro Prota wrote:
On 19 Dec 2006 13:02:28 -0500, James Kanze wrote:
[...]
Even when it does use a pointer, the reference version could be
faster, because the compiler knows that it won't change. In
general, the more information the compiler has to work with, the
better it can optimize
Can or could. Unfortunately real compilers may surprise you on this.
It's gotten to the point where almost nothing a real compiler
does will surprise me anymore. (Except, maybe, output readable
error messages. That would surprise me.)
A
recent case I came upon with gcc and my SHA class templates was
something along the lines of:
const int sz( 8 );
template< typename ... >
void f( byte_type( &a )[ sz ] )
{
...
return f_impl( a );
}
template< typename ...>
void f_impl( byte_type( & a )[ sz ] )
{
...
}
It is hard to believe but changing f_impl's parameter to type
byte_type * had a tremendous performance impact on the generated code.
Interesting. I would have expected as a first approximation
that the implementation treat it exactly as if you'd passed a
byte_type*, once the template parameters were deduced.
According to the standard, it could add bounds checking (since
the information is present, which it isn't with a pointer), but
somehow, I'd be rather surprised if this were the case.
Did you look at the generated code, to see what the difference
was.
And while I'm generally quite happy with gcc this is the sort of
things that should never happen; I can understand that the compiler is
to some extent a black-box but I don't want my code to become suddenly
half as fast as before --think for instance of the opposite change--
just because I slightly touched/improved it somewhere.
And yet, it's unavoidable in certain cases. Imagine an
optimization that is based on the fact that sz is a multiple of
the size of a machine word. Change to a pointer, and the
compiler cannot do it. And I can construct cases (I'm not
saying that they occur in real code) where such optimizations
could makes a significant difference.
And because of
these not so infrequent facts I'm unwillingly convincing myself that
benchmarks should be part of the regression tests.
A program has to be "fast enough". And regression tests should
verify that. Even if, in practice, it's rarely a problem. A
modification might slow the program down 10%, but in the
meantime, the machines it's running on have speeded up 50%.
--
James Kanze (GABI Software) email:james.kanze@gmail.com
Conseils en informatique orientie objet/
Beratung in objektorientierter Datenverarbeitung
9 place Simard, 78210 St.-Cyr-l'Icole, France, +33 (0)1 30 23 00 34
--
[ See http://www.gotw.ca/resources/clcm.htm for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]