Re: My OPE & the Euclicidean TSP
Time to put my visualization skills back together with graphs instead of
merely thinking about amorphous inheritance charts...
JSH wrote:
Well the first couple has moved from the starting city to two other
different cities, choosing them such that their physical distance
apart is the LEAST possible given all possible city choices, while the
other couple has gone to different cities for some other reason, so
what do we now know?
We know that the two travelers are near each other. That's about it.
Let me show you an example of where this fails. This is the weight table:
A B C D E F
A - 10 10 5 5 1
B 10 - 1 6 6 4
C 10 1 - 6 6 4
D 5 6 6 - 2 5
E 5 6 6 2 - 5
F 1 4 4 5 5 -
That should be Euclidean.
Start on A. Travelers look for the closest pair of nodes. These are B
and C. So the path has an A->B and an A->C route, costing 20 so far.
Next pair of close nodes: D and E. Since the distance is the same, it
doesn't matter whether the one on B goes to D or vice versa; the extra
cost is 12, to a cumulative of 32. One node left, F, which costs an
extra 10 for both to come to. The total is 42.
There is a shorter path (I'm fairly sure it's the shortest, but I'm not
going to make that claim). Go from A to B (10) to C (total 11) to D (17)
to E (19) to F (24) to A again (25). It is, however, better than the
path your algorithm gives.
Analytically, why does it fail? If you picture a group of nodes, two of
which are far out of the way but extremely close, the shortest tour must
necessarily travel between them so that it pays the travel cost of going
out there only twice instead of four times.
Your previous attempt to resolve a case where you had to visit nodes in
a certain sequence by declaring "try starting at every node" won't work
here again, because I could have an arbitrary number of remote node pairs.
I have a strong intuition that factoring in the distance to get to the
pair as well as their proximity won't work, but to be sure, I'd have to
play around with some graphical utilities. I haven't seen Patricia's
application, but if it had a GUI that allowed you to put nodes in, it
would ease the burden of trying to come up with the edge cases.
We know that the second couple is further away from each other as they
traveled FARTHER than the first and MUST eventually make up that
distance, as eventually they come back together, so we already know
that the second couple has already traveled further and will have to
travel still further to make up the distance than the first. It's
like a double whammy. They traveled to more distant cities, and are
farther apart so will have a greater distance to travel in coming back
together down the line.
Another easy graphical example: imagine a circle, and at several points,
you had pairs of nodes straddling the circle. Your algorithm will have
the travelers moving in pairs around the circle, which means the path is
approximately twice the circumference; the optimal path will go between
the nodes in a zig-zag-like fashion, at approximately the circumference.
My hope is that pondering that problem and how each local choice leads
to a global result: distance apart, will help understanding of how
this algorithm works, and why it works.
I'm tempted to say that this algorithm will perform poorly in real-world
circumstances: when nodes aggregate in clusters, it moves between
clusters in pairs whereas optimal solutions will only travel once
between clusters.
--
Beware of bugs in the above code; I have only proved it correct, not
tried it. -- Donald E. Knuth