Re: Concurrent code performance

From:

Marc <marc.glisse@gmail.com>

Newsgroups:

comp.lang.c++

Date:

Sun, 9 Oct 2011 13:29:19 +0000 (UTC)

Message-ID:

<j6s7jf$m6u$1@news-rocq.inria.fr>

Saeed Amrollahi wrote:

Hi All C++ developers

I learned (from theory and practice) the most important benefit of
qualified multi-threaded programming is better performance.
Here it is a (not well-written) concurrent (two worker threads)
and sequential version
of Summation of [0, 2000000[ interval.
Of course my concurrent code is not good. It's for exposition only.

// concurrent code
#include <thread>
#include <iostream>

using namespace std;
long long s1 = 0, s2 = 0;
void Sum(int first, int last, long long& res) // calculate the sum in
[first, last[ interval
{
  while (first < last)
    res += first++;
}

int main()
{
  long long r1 = 0, r2 = 0;
  thread t1(Sum, 0, 1000000, std::ref(r1));
  thread t2(Sum, 1000000, 2000000, std::ref(r2));

I would make Sum return a long long instead of updating a reference.
If you are unlucky, r1 and r2 end up on the same cache line, you have
false sharing and the concurrent version ends up slower than the
sequential one.

  t1.join(); t2.join();
  cout << r1 + r2 << '\n';

  return 0;
}

In addition the generated code for sequential_sum is about 6 KB, but
the size
of concurrent_code is about 45 KB.
Is there any problem in my measurement? Why the concurrent object code
is 7+ times bigger than sequential version? How do you explain it?

Your basic code is very simple: a loop, a couple additions, almost
nothing. It is not really surprising that code meant to handle
multiple threads is several times larger. Now it might be possible for
libstdc++ to factor more of that code into the shared library, but
that's a different, more complicated question.