Re: Why std::sort is ~20% faster then introsort coded by hand?
Hello,
ikarus wrote:
I'm comparing sorting algorithm for study goals.
But 'hand-coded' version is still slower.
Why? May be in production std::sort version it is used some SIMD
optimization internally or assembler optiomization?
Pseudocode (C++, may be bugs) of introsort version:
// CODE
template <typename T>
inline void swap(T *a, T *b)
{
T temp = *a;
*a = *b;
*b = temp;
}
First some stylistic critique: The swap of standard C++ is supposed to
be used differently, reusing that name with that interface is highly
confusing.
A possible cause for the 20% slowdown: For nontrivial classes T with a
proper swap this implementation of swap is suboptimal, you are doing 3
full copies of T instances, even involving 3 allocations and
deallocations. If there is a std::swap for T objects, that one should
be used. This alone could well suffice for a 20% difference, because
the number of swaps is in the same order as the number of comparisions
for many sorting algorithms, so a suboptimal swap could very well add
to the overall constant of the algorithm, e.g. if T is std::string. If
T is a simple type like int, then there probably is no big problem with
your swap. I have not seen the types you used for T. So you have to
try, whether this plays a role.
Bernd Strieder