Re: how to store list of varying types

From: "Nick Schultz" <nick.schultz@flir.com>
Newsgroups: microsoft.public.vc.mfc
Date: Mon, 30 Jun 2008 09:50:34 -0700
Message-ID: <OyBurFt2IHA.2208@TK2MSFTNGP04.phx.gbl>

I need to pass a class that will contain about 30 to 100 bytes of
information. The class also has 2 vectors: one holds the raw packets
(CAN bus supports 8-byte max packets) that make up the protocol packet, and
the other holds descriptions of the data fields in the payload (byte
position, length, name). The original implementation had the vectors storing
pointers to the objects; however, since we're passing data between processes,
those pointers won't be valid in the receiving process, correct?
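
A minimal sketch of one way around the pointer problem: flatten each field
description into a contiguous byte buffer that can be copied to the other
process, and rebuild the object on the receiving side. The FieldDesc struct,
the function names, and the wire layout (position, length, name length, name
bytes) are all assumptions made up for illustration; nothing in the thread
specifies them.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical field description: byte position, length, and name.
struct FieldDesc {
    std::uint8_t position;
    std::uint8_t length;
    std::string  name;
};

// Append one field to the flat buffer as:
// position, length, name length, name bytes. No pointers are stored.
inline void appendField(std::vector<std::uint8_t>& out, const FieldDesc& f) {
    assert(f.name.size() <= 255);  // the name length must fit in one byte
    out.push_back(f.position);
    out.push_back(f.length);
    out.push_back(static_cast<std::uint8_t>(f.name.size()));
    out.insert(out.end(), f.name.begin(), f.name.end());
}

// Read one field back starting at `offset`; advances `offset` past it.
// Throws std::out_of_range if the buffer is truncated.
inline FieldDesc readField(const std::vector<std::uint8_t>& in,
                           std::size_t& offset) {
    FieldDesc f;
    f.position = in.at(offset++);
    f.length   = in.at(offset++);
    const std::size_t n = in.at(offset++);
    if (offset + n > in.size())
        throw std::out_of_range("truncated field name");
    const std::ptrdiff_t first = static_cast<std::ptrdiff_t>(offset);
    const std::ptrdiff_t last  = static_cast<std::ptrdiff_t>(offset + n);
    f.name.assign(in.begin() + first, in.begin() + last);
    offset += n;
    return f;
}
```

The resulting buffer holds only values, so it can be handed to something like
WM_COPYDATA and decoded in the receiving process without any address being
shared.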

Also, I was told our systems use approximately 40% of the 1 Mbit/s bus
speed. According to the protocol, some (small) messages are anticipated to
be issued at 500-800 Hz, others range from 200-267 Hz, and some from
1 to 60 Hz.

This is my first real-world, nontrivial application (fresh out of college),
so I don't really have a feel for where or when optimizations are needed.
Thanks for your help!

Also, this backend "routing" process should be running at all times. Would
making it a Windows service be an appropriate solution? Are there any
precautions I should take?

Thanks

Nick

"Joseph M. Newcomer" <newcomer@flounder.com> wrote in message
news:p39b6492okqlgmg1t6d8u3t6e0n1hqom7q@4ax.com...

One way to handle this is to create a "router process" that handles all
communication. A process that wishes to receive messages posts a message to
the router process that tells what kind of messages it wants to receive.
When the router process receives a message of type "A" it passes a copy of
it to all the registered processes. For example, it could use PostMessage if
the content is small (two pointer-sized values), or it could sequentially
send WM_COPYDATA to each process. Or, because SendMessage is synchronous,
you could consider starting a new thread for each process, creating a UI
thread. The main data thread will do a PostThreadMessage to each thread
based on the desired registry of elements, and each thread does a
dequeue-and-SendMessage(WM_COPYDATA) of the data.
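
The registration part of such a router could be sketched roughly as below.
This is only the bookkeeping, not the Windows messaging: MessageType,
SubscriberId, and the Router class are hypothetical names, and SubscriberId
stands in for whatever identifies a client (an HWND or a thread ID, say).

```cpp
#include <cassert>
#include <map>
#include <set>
#include <vector>

typedef int MessageType;   // hypothetical packet-type identifier
typedef int SubscriberId;  // hypothetical stand-in for an HWND or thread ID

// Tracks which subscribers asked for which message types.
class Router {
public:
    // Register interest in one message type (duplicates are ignored).
    void subscribe(MessageType type, SubscriberId who) {
        subscribers_[type].insert(who);
    }

    // Remove a registration; unknown types or subscribers are ignored.
    void unsubscribe(MessageType type, SubscriberId who) {
        std::map<MessageType, std::set<SubscriberId> >::iterator it =
            subscribers_.find(type);
        if (it != subscribers_.end())
            it->second.erase(who);
    }

    // Everyone who should receive a copy of a message of this type.
    std::vector<SubscriberId> recipients(MessageType type) const {
        std::map<MessageType, std::set<SubscriberId> >::const_iterator it =
            subscribers_.find(type);
        if (it == subscribers_.end())
            return std::vector<SubscriberId>();
        return std::vector<SubscriberId>(it->second.begin(),
                                         it->second.end());
    }

private:
    std::map<MessageType, std::set<SubscriberId> > subscribers_;
};
```

When a packet arrives, the router would loop over recipients(type) and send
each one its own copy, for instance with WM_COPYDATA as described above.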

It makes no sense whatsoever to consider shared_ptr in this context because
there is nothing to share, or share it with. How big are your packets, for
example? I'd just copy the entire packet, and not worry about the overhead
of making a copy. Worrying about it would be a pointless waste of time most
of the time. How long does it take to copy 20 bytes? MEASURE it. Use the
high-resolution timer. How many tens of nanoseconds does it take?
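
A rough sketch of such a measurement follows. It uses std::chrono::steady_clock
as a portable stand-in for the Windows high-resolution timer
(QueryPerformanceCounter); the function name and the 256-byte limit are
assumptions for illustration. Averaging over many iterations is needed because
a single small copy is far below the timer's resolution.

```cpp
#include <cassert>
#include <chrono>
#include <cstddef>
#include <cstring>

// Average time, in nanoseconds, to copy `size` bytes, measured over
// `iterations` copies. `size` is limited to 256 bytes in this sketch.
inline double averageCopyNanoseconds(std::size_t size, int iterations) {
    assert(size <= 256 && iterations > 0);
    unsigned char src[256] = {0};
    unsigned char dst[256] = {0};
    volatile unsigned char sink = 0;  // discourages the optimizer from
                                      // throwing the copies away

    const std::chrono::steady_clock::time_point start =
        std::chrono::steady_clock::now();
    for (int i = 0; i < iterations; ++i) {
        src[0] = static_cast<unsigned char>(i);  // vary the input each pass
        std::memcpy(dst, src, size);
        sink = dst[0];
    }
    const std::chrono::steady_clock::time_point stop =
        std::chrono::steady_clock::now();
    (void)sink;

    const std::chrono::duration<double, std::nano> elapsed = stop - start;
    return elapsed.count() / iterations;
}
```

Calling averageCopyNanoseconds(20, 1000000) gives a per-copy figure that can
be compared against the message rates the bus actually has to sustain. The
number also includes loop overhead, so treat it as an upper bound.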

It is a common error to try to optimize code that never required
optimization.

Example: I have a system that uses PostMessage for interprocess
communication. A string is sent by putting the connection id and a byte
count in WPARAM, and 0 to 4 bytes of text in LPARAM. Typical messages were
20 to 100 bytes, so it could take 6-21 messages to pass it (a message with a
0 byte count was the "end of message" terminator).
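
The chunking scheme can be sketched without any Windows calls. In this
illustration each Chunk stands in for one posted message: first is the byte
count and second is up to 4 bytes of text packed into a 32-bit value. The
byte order inside the packed value, the names, and the omission of the
connection id are assumptions; the post does not give the actual layout.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <utility>
#include <vector>

// One simulated message: first = byte count (0 to 4), second = packed text.
typedef std::pair<std::uint32_t, std::uint32_t> Chunk;

// Split a string into 4-byte chunks followed by a zero-count terminator.
inline std::vector<Chunk> splitIntoChunks(const std::string& text) {
    std::vector<Chunk> chunks;
    for (std::size_t i = 0; i < text.size(); i += 4) {
        const std::uint32_t count = static_cast<std::uint32_t>(
            std::min<std::size_t>(4, text.size() - i));
        std::uint32_t packed = 0;
        for (std::uint32_t b = 0; b < count; ++b) {
            const unsigned char ch = static_cast<unsigned char>(text[i + b]);
            packed |= static_cast<std::uint32_t>(ch) << (8 * b);
        }
        chunks.push_back(Chunk(count, packed));
    }
    chunks.push_back(Chunk(0, 0));  // "end of message" terminator
    return chunks;
}

// Rebuild the string on the receiving side; stops at the terminator.
inline std::string joinChunks(const std::vector<Chunk>& chunks) {
    std::string text;
    for (std::size_t i = 0; i < chunks.size(); ++i) {
        if (chunks[i].first == 0)
            break;
        assert(chunks[i].first <= 4);
        for (std::uint32_t b = 0; b < chunks[i].first; ++b)
            text.push_back(
                static_cast<char>((chunks[i].second >> (8 * b)) & 0xFF));
    }
    return text;
}
```

With this layout a 20-byte string becomes five data chunks plus the
terminator, which matches the lower figure of 6 messages quoted above.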

This was a quick hack to get the program running. However, some years later,
we had a client that required "400 messages per minute" performance. This
was the Moment of Truth: I was going to have to rewrite this interface. But
FIRST, I decided to measure it. I cranked up the input data generator on
four machines all connected with 100Base-T Ethernet. I peaked out at 1400
messages/minute. So efficiency didn't matter; I had beaten the desired goal
by better than a factor of 3. That's good enough.

Premature optimization is usually a mistake. In the absence of performance
data, attempts at optimization are usually misdirected, resulting in overly
complex code that is harder to create, debug, and maintain than the simple
code, but which has no noticeable impact on the performance.
joe

On Fri, 27 Jun 2008 15:30:28 -0700, "Nick Schultz" <nick.schultz@flir.com>
wrote:

Hmm...

What would you recommend as a way of sending multiple copies of the same
packet from one process to potentially multiple processes? Also keep in mind
that not every process will always receive every packet; for example,
processes 1 & 2 only care about packet-type A and processes 2 & 3 only care
about packet-type B.

What I want is a backend process (perhaps a service) that manages a
connection to the bus, performs protocol parsing, etc.

Applications will hook into the backend by registering and requesting which
types of messages they want to receive. The backend then uses filters to
distribute packets to the applications. The original intent was to use
shared_ptrs to the packet objects so we don't have to waste memory and time
copying multiple objects; however, it now sounds like that is not an
option...

Thanks Joe for the input,

Nick

"Joseph M. Newcomer" <newcomer@flounder.com> wrote in message
news:4jma64poa58k345o72igegcm6sbrarokcl@4ax.com...

This will work in all kinds of contexts, but not for multiple applications.
joe

On Fri, 27 Jun 2008 11:51:07 -0700, "Nick Schultz"
<nick.schultz@flir.com>
wrote:

The main use for this application is that there can be multiple applications
interested in the same packet. Instead of making multiple copies of the same
packet, I can just create multiple shared_ptrs that point to one packet, and
when the last application is done with the packet, it will delete itself.

"Joseph M. Newcomer" <newcomer@flounder.com> wrote in message
news:vbvt54p0cvlvsheu97igbiqe2hbo3qa14d@4ax.com...

But what good does a shared_ptr do here? It is overkill.
joe
On Thu, 19 Jun 2008 09:37:36 -0700, "Nick Schultz"
<nick.schultz@flir.com>
wrote:

MFC Feature Pack includes TR1 which has shared_ptrs.

"Giovanni Dicanio" <giovanni.dicanio@invalid.com> wrote in message
news:Oym$PJe0IHA.2384@TK2MSFTNGP04.phx.gbl...

"Nick Schultz" <nick.schultz@flir.com> ha scritto nel messaggio
news:ekB5f1Y0IHA.4500@TK2MSFTNGP03.phx.gbl...

I planned on creating a ProtocolPacket class that represents an entire
packet, and contains a vector of dataElements. dataElement is a class that
contains a pointer to the data, its size (in bytes) and a char* that stores
its field name.

I would need more details, but in general I would say that in C++, I
prefer using std::vector as container (instead of raw pointer), and
std::wstring or some other string class instead of char*.

Moreover, there is a usual naming convention in C++, that class names start
with an upper-case letter (so, I would use DataElement instead of
dataElement).
Lower-case tends to be used for other cases, like class instances.
e.g.

  // Instantiate a DataElement
  DataElement dataElement;

So, I would define a class or a struct like this:

class DataElement
{
  public:

      std::vector< BYTE > Data;

      // You don't need a size-in-bytes field here,
      // because vector has a size() method for
      // that purpose.
      // So Data.size() gives you that size.

     // I assume that your "field names" here are ANSI only.
     // For Unicode, you may use std::wstring.
     std::string Name;
};

Then I would store all these DataElement's in a vector like this:

typedef std::vector< DataElement * > DataElementList;

DataElementList myDataElements;

Note that the vector stores *pointers* to DataElement instances.
If these pointers have shared-ownership semantics, I would wrap them in a
smart pointer like shared_ptr.
e.g.

typedef boost::shared_ptr< DataElement > DataElementSP;
typedef std::vector< DataElementSP > DataElementList;

In that way, you don't have to pay attention to DataElement destruction
(the shared_ptr smart pointer stores a reference count, and when it reaches
0, the object is automatically deleted).
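
A small sketch of that reference-counting behaviour, within a single process.
It uses std::shared_ptr from <memory> as a stand-in for boost::shared_ptr (or
the TR1 version), and unsigned char in place of BYTE so it compiles without
the Windows headers; makeElement is a hypothetical helper.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <string>
#include <vector>

struct DataElement {
    std::vector<unsigned char> Data;  // unsigned char stands in for BYTE
    std::string Name;
};

typedef std::shared_ptr<DataElement> DataElementSP;
typedef std::vector<DataElementSP> DataElementList;

// Hypothetical helper: create an element with a name and a zeroed payload.
inline DataElementSP makeElement(const std::string& name, std::size_t size) {
    DataElementSP element = std::make_shared<DataElement>();
    element->Name = name;
    element->Data.resize(size);
    return element;
}
```

If the same DataElementSP is pushed into two lists, both lists refer to one
object, use_count() reports how many owners remain, and the object is deleted
when the last owner goes away. As noted elsewhere in the thread, this sharing
stops at the process boundary.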

My original implementation called for malloc'ing the necessary space on the
heap,

In C++, you would use new[] instead of malloc(), or a robust container like
std::vector.

SomeType * p = new SomeType[ count ];

// Parentheses, not brackets: one vector holding 'count' elements.
std::vector< SomeType > v( count );

From vector, you can have the pointer to the first element using:

SomeType * pFirst = &v[0];

If you use new[], you must also delete (sooner or later) your data, using
delete[].
Instead, vector has a destructor that does cleanup.
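
As a rough illustration of using a vector where a raw buffer is expected:
fillBuffer below is a hypothetical C-style function that writes into a
caller-supplied buffer, and makeFilled is a made-up helper that hands it the
vector's storage.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical C-style API that fills a caller-supplied buffer.
inline void fillBuffer(unsigned char* buffer, std::size_t length,
                       unsigned char value) {
    std::memset(buffer, value, length);
}

// Build a vector of `count` bytes and let the C-style API fill it.
inline std::vector<unsigned char> makeFilled(std::size_t count,
                                             unsigned char value) {
    std::vector<unsigned char> v(count);  // one vector of `count` elements
    if (!v.empty())                       // &v[0] needs a non-empty vector
        fillBuffer(&v[0], v.size(), value);
    return v;  // no delete[] anywhere; the destructor frees the storage
}
```

The emptiness check matters because taking &v[0] on an empty vector is not
valid.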

Moreover, vector can safely grow its size if necessary (e.g. after a
.push_back( <new data> ); ), and its at() method checks the index, which
guards against buffer overruns (security enemy #1).
Instead, using raw new[], you may have lots of problems like off-by-one
index, or index completely out-of-range, corrupting nearby memory, etc.
It's not that you must not use new[]: you may use new[], but you (or those
who will maintain your code) must pay lots more attention, and the code is
less robust, more fragile, than using a robust C++ container class like
std::vector.
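
One caveat worth showing in code: the standard only guarantees a bounds check
for vector::at(), which throws std::out_of_range on a bad index, while
operator[] is not required to check anything. The helper name below is made
up for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Returns true if reading index `i` through at() was rejected.
inline bool readIsRejected(const std::vector<int>& v, std::size_t i) {
    try {
        (void)v.at(i);  // at() checks the index before accessing
        return false;
    } catch (const std::out_of_range&) {
        return true;    // index was past the end
    }
}
```

For a vector of three elements, index 2 is accepted and index 3 (the classic
off-by-one) is rejected instead of silently reading past the end.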

Note that there are also MFC versions of the classes I mentioned in this
post: you can use CString to store strings, and CArray template instead of
std::vector.
(AFAIK, MFC has no equivalent of smart pointer like shared_ptr...).

HTH,
Giovanni

Joseph M. Newcomer [MVP]
email: newcomer@flounder.com
Web: http://www.flounder.com
MVP Tips: http://www.flounder.com/mvp_tips.htm