Dijkstra's Algorithm

<Head><Title>Shortest Paths: Dijkstra's Algorithm</Title></Head>
<center><H1>Dijkstra's Algorithm</H1></center>

<H1>Notes:</H1>

<UL>
<LI>This algorithm is not presented in the same way that you'll find
      it in most texts because I'm ignoring directed vs. undirected graphs
      and I'm ignoring the loop invariant that you'll see in any book
      which plans to prove the correctness of the algorithm.
<P>
<LI>If we are dealing with a di-graph the algorithm will be the same
      with the only difference being the meaning attached to each edge.
<P>
<LI>The loop invariant is that at any stage we have partitioned the
      graph into three sets of vertices (S,Q,U): S, the vertices whose
      shortest paths we already know; Q, the ones we have "queued",
      knowing that we may deal with them now; and U, the remaining
      vertices.  This information is important for proving the correctness and
      analyzing the running time of the algorithm, but not for understanding
      it (in my humble opinion).
<P>
</UL>

<B>REF</B>: Cormen, Thomas H., Leiserson, Charles E., Rivest, Ronald L.,
     <I>Introduction to Algorithms</I>, MIT Press (1990), pages 527-531
<P>

<H1>Revision:</H1>
The current revision of this page was last modified on August 27, 1998.
<Hr>

We are given a graph G with a weight function wt:E(G)->R which maps each
edge to a real-valued weight, with wt(e) > 0 for all e in E(G).  NOTE: wt(e) > 0
is a VERY important assumption.  As I've presented the algorithm, a weight
of 0 is not actually a problem (it could be for some implementations) but
a negative weight can create a cycle of negative total weight, and the
algorithm would then loop forever.  For example,
<PRE>
     *---- 1 ---*
      \        /
      -10     1
         \   /
          \ /
           *
</PRE>
will generate paths which cycle endlessly, because each trip around the
triangle lowers the total weight by 8 (1 + 1 - 10 = -8).
<P>
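To see the problem concretely, here is a minimal C sketch that runs the
relax-and-requeue loop on the triangle above.  The vertex numbering, the
helper name run_capped and the cap on the number of pops are assumptions
made purely for illustration; without the cap the loop would never end.

```c
#define NV 3
#define INF 1000000   /* stands in for "infinity" */

/* Triangle from the figure: edges of weight 1, 1 and -10.
   The diagonal entries are unused. */
static const int wt[NV][NV] = {
    {   0, 1, -10 },
    {   1, 0,   1 },
    { -10, 1,   0 },
};

static int w[NV];     /* weight of the best path found so far */

/* Runs the loop from vertex 0 for at most max_pops pops.
   Returns 1 if the queue emptied within that budget, 0 otherwise. */
static int run_capped(int max_pops)
{
    int queued[NV] = { 1, 0, 0 };
    w[0] = 0; w[1] = INF; w[2] = INF;

    for (int pops = 0; pops < max_pops; pops++) {
        int v = -1;
        for (int i = 0; i < NV; i++)          /* pop the smallest w */
            if (queued[i] && (v < 0 || w[i] < w[v])) v = i;
        if (v < 0) return 1;                  /* queue is empty */
        queued[v] = 0;
        for (int u = 0; u < NV; u++)
            if (u != v && w[v] + wt[v][u] < w[u]) {
                w[u] = w[v] + wt[v][u];       /* keeps improving forever */
                queued[u] = 1;
            }
    }
    return 0;                                 /* budget exhausted */
}
```

Calling run_capped with any budget returns 0 on this graph, and the origin's
own weight ends up negative, which no real shortest path could produce.
<P>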

You need an origin vertex (where all the paths are starting from, or,
more typically in games, where the paths are ending).
<P>

Augment the label of each vertex with a real value, initially infinity,
which is the weight of the shortest path from the origin to this vertex
found so far.  Also, augment each vertex with a "pointer" to its
parent in the shortest weighted path found so far, initially
pointing nowhere.
<P>

You need a priority queue which is sorted based on the weight of the
shortest path from the origin to the vertex.  When an element is inserted
into the priority queue and it already exists, the previous copy must be
removed and the new one inserted at the right position [*].
<P>

<PRE>
Take the origin vertex, set the weight of the shortest path to 0 and
push it onto the priority queue.

while the priority queue is not empty, pop an entry &lt;v,w_v,p_v&gt; where
  v is the vertex, w_v and p_v are the augmented labels of v.
  foreach edge e=(v,u) in G, where u has augmented labels w_u, p_u.
      if wt(e) + w_v < w_u then
          set p_u to v
          set w_u to wt(e) + w_v
          add &lt;u, w_u, p_u&gt; to the priority queue.
</PRE>
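<P>
The pseudo-code above might be sketched in C as follows.  This is only an
illustration under stated assumptions: the five-vertex graph, the adjacency
matrix representation and the names shortest_paths and pq_pop are all
invented here, and the "priority queue" is a plain linear scan over a flag
array, which trivially keeps at most one copy of each vertex as [*] requires.

```c
#include <float.h>

#define NV 5                 /* number of vertices in the toy graph (assumption) */
#define INF DBL_MAX          /* stands in for "infinity" */
#define NONE (-1)            /* parent pointer "pointing nowhere" */

/* wt[v][u] > 0 is the weight of edge (v,u); 0 means "no edge". */
static const double wt[NV][NV] = {
    {0, 4, 1, 0, 0},
    {4, 0, 2, 5, 0},
    {1, 2, 0, 8, 0},
    {0, 5, 8, 0, 3},
    {0, 0, 0, 3, 0},
};

static double w[NV];         /* weight of the shortest path found so far */
static int    p[NV];         /* parent on that path */
static int    queued[NV];    /* 1 if the vertex is currently in the queue */

/* Pop the queued vertex with the smallest w; NONE if the queue is empty. */
static int pq_pop(void)
{
    int best = NONE;
    for (int v = 0; v < NV; v++)
        if (queued[v] && (best == NONE || w[v] < w[best]))
            best = v;
    if (best != NONE)
        queued[best] = 0;
    return best;
}

static void shortest_paths(int origin)
{
    for (int v = 0; v < NV; v++) {
        w[v] = INF;
        p[v] = NONE;
        queued[v] = 0;
    }
    w[origin] = 0;
    queued[origin] = 1;

    int v;
    while ((v = pq_pop()) != NONE) {
        for (int u = 0; u < NV; u++) {
            if (wt[v][u] > 0 && wt[v][u] + w[v] < w[u]) {
                p[u] = v;
                w[u] = wt[v][u] + w[v];
                queued[u] = 1;   /* re-queueing replaces any previous copy */
            }
        }
    }
}
```

After shortest_paths(0), w[4] holds the weight of the path 0, 2, 1, 3, 4,
and following p[] from vertex 4 walks that path backwards.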

<BR>
<H1>Comments:</H1>

Technically, [*] must be true because otherwise it would be impossible to
prove the correctness of the algorithm.  I've played with it and personally
think that you can get better performance (your priority queue becomes
almost O(1), at the cost of extra elements building up in the queue)
by relaxing this restriction and allowing multiple copies of the same
vertex to accumulate in the queue.
<P>

The net effect will be that some sets of vertices may have to be cycled
through the queue several times (read "more cost").  It is my opinion
that this additional cost is considerably less than the O(log n) [n is
the size of the priority queue] time required to implement this algorithm
as described above.
<P>

However, I have not attempted to analyze the worst case effect of relaxing
[*] and would say to implement it in this way only if you trust my guess.
<P>
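For comparison, here is a rough C sketch of the relaxed variant, where
duplicates are simply allowed to pile up in a binary heap.  The toy graph,
the heap layout and every name in it are assumptions for illustration.  It
also skips an entry whose stored weight is already out of date when popped;
that check is a common refinement added here, not something the text above
prescribes.

```c
#include <float.h>

#define NV 5                       /* toy graph size (assumption) */
#define INF DBL_MAX
#define HEAP_CAP (NV * NV + 1)     /* one push per relaxed edge, plus the origin */

typedef struct { double key; int v; } entry;

static entry heap[HEAP_CAP];
static int   heap_n;

static void heap_push(double key, int v)
{
    int i = heap_n++;
    /* sift up: move parents down until the new key fits */
    while (i > 0 && heap[(i - 1) / 2].key > key) {
        heap[i] = heap[(i - 1) / 2];
        i = (i - 1) / 2;
    }
    heap[i].key = key;
    heap[i].v = v;
}

static entry heap_pop(void)
{
    entry top = heap[0];
    entry last = heap[--heap_n];
    int i = 0;
    /* sift down: pull the smaller child up until `last` fits */
    for (;;) {
        int c = 2 * i + 1;
        if (c >= heap_n) break;
        if (c + 1 < heap_n && heap[c + 1].key < heap[c].key) c++;
        if (heap[c].key >= last.key) break;
        heap[i] = heap[c];
        i = c;
    }
    heap[i] = last;
    return top;
}

/* wt[v][u] > 0 is the weight of edge (v,u); 0 means "no edge". */
static const double wt[NV][NV] = {
    {0, 4, 1, 0, 0},
    {4, 0, 2, 5, 0},
    {1, 2, 0, 8, 0},
    {0, 5, 8, 0, 3},
    {0, 0, 0, 3, 0},
};

static double w[NV];
static int    p[NV];

static void shortest_paths_lazy(int origin)
{
    for (int v = 0; v < NV; v++) { w[v] = INF; p[v] = -1; }
    heap_n = 0;
    w[origin] = 0;
    heap_push(0, origin);

    while (heap_n > 0) {
        entry e = heap_pop();
        if (e.key > w[e.v]) continue;          /* stale duplicate: ignore it */
        for (int u = 0; u < NV; u++) {
            if (wt[e.v][u] > 0 && wt[e.v][u] + w[e.v] < w[u]) {
                p[u] = e.v;
                w[u] = wt[e.v][u] + w[e.v];
                heap_push(w[u], u);            /* old copies of u stay in the heap */
            }
        }
    }
}
```

The trade-off is visible in HEAP_CAP: the heap has to be sized for the
duplicates, in exchange for never searching the queue for an existing copy.
<P>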

Also, note that if you were going to implement a heuristic for directing
the search, you wouldn't insert the weight of the shortest path from the
source node to this node.  Instead, you would insert the weight of the
shortest path from the source node to this node plus the heuristic's
estimate of the cost to the destination node.  I won't be discussing this
anymore until the very end of the document.
<P>

<BR>
<H1>Tile/Grid Implementation:</H1>

<DL>
<DT><B>Def</B>: What I mean by "a game with a grid/tile map"
<DD>
A game where the playing area is divided into
     squares with one or more "objects" on each square.  These objects include
     walls, doors, etc., which are assumed to occupy the entire square.
</DL>

If you want to apply what I'm going to say where walls do not occupy the
entire square, you'll need a function wt({x,y}, {x',y'}) which gives the
cost of moving from (x,y) to (x',y'); otherwise it's the same.
<P>

In a game with a grid map, you need a function (or a table or whatever)
which I'll call wt(x,y) which gives you the "cost" of moving onto a
specified grid location (x,y).  Note: "moving *onto*".  If you are
writing a dungeon-based game and you have a teleporter at (a,b) that you don't
want the monsters to hit, make wt(a,b) = infinity, where infinity is
some arbitrarily large number.  The same applies to walls: if you have a
wall square at (a,b) then wt(a,b) = infinity.
<P>

For any square, there are at most 8 squares around it.  This is easy to
implement with an array that gives the offsets for each square.
<P>
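One way to write such an offset array in C is sketched below.  The names
dx, dy and neighbours, and the rectangular width-by-height bounds check, are
assumptions for illustration; the ordering of the offsets is arbitrary.

```c
/* Offsets to the 8 squares surrounding (x, y), clockwise from north,
   assuming y grows downward. */
static const int dx[8] = {  0,  1, 1, 1, 0, -1, -1, -1 };
static const int dy[8] = { -1, -1, 0, 1, 1,  1,  0, -1 };

/* Writes the in-bounds neighbours of (x, y) on a width-by-height map into
   out[][2] and returns how many there are (at most 8). */
static int neighbours(int x, int y, int width, int height, int out[8][2])
{
    int n = 0;
    for (int i = 0; i < 8; i++) {
        int nx = x + dx[i], ny = y + dy[i];
        if (nx >= 0 && nx < width && ny >= 0 && ny < height) {
            out[n][0] = nx;
            out[n][1] = ny;
            n++;
        }
    }
    return n;
}
```

A corner square ends up with 3 neighbours, an edge square with 5 and an
interior square with all 8.
<P>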

We'll assume that you don't want the shortest paths for the entire world
(which could be arbitrarily large) but instead want shortest paths for
only a limited area, say of width and height 2*DELTA+1.
<P>

We will be storing elements in two arrays in positions [0..2*DELTA], where
map coordinate x corresponds to array position (x - o_x + DELTA), and
likewise for y.  To make the pseudo-code more readable, let:
<PRE>
    f_x(x) = x - o_x + DELTA
    f_y(y) = y - o_y + DELTA
</PRE>
be two functions which translate the coordinates.
<P>
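In C the translation and its inverse could look like the sketch below.  The
helper names to_index, to_map and in_window, and the value chosen for DELTA,
are assumptions for illustration; each helper takes the matching origin
coordinate (o_x or o_y) as its second argument.

```c
#define DELTA 3   /* window half-width; value chosen only for illustration */

/* Map coordinate -> array index, relative to origin coordinate o.
   This is f_x when o is o_x and f_y when o is o_y. */
static int to_index(int c, int o) { return c - o + DELTA; }

/* Array index -> map coordinate (the inverse of to_index). */
static int to_map(int i, int o) { return i + o - DELTA; }

/* True if map coordinate c lies inside the window centred on o. */
static int in_window(int c, int o)
{
    return c - o >= -DELTA && c - o <= DELTA;
}
```

With an origin coordinate of 10, map coordinates 7 through 13 land on array
indices 0 through 6, and anything outside that range fails in_window.
<P>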

The following two arrays are used to store the "augmented" information
discussed in the earlier explanation.  These store the following information:
<PRE>
w[x'][y'] The weight of the current shortest path from (o_x, o_y) to
          (o_x + x' - DELTA, o_y + y' - DELTA).
p[x'][y'] The parent node of that shortest path

int w[2*DELTA+1][2*DELTA+1];
int p[2*DELTA+1][2*DELTA+1][2];
</PRE>

<BR>
<H2>Algorithm: (origin at (o_x, o_y))</H2>
<PRE>
   set each w[i][j] = infinity
   set each p[i][j] = (infinity, infinity)

   set w[f_x(o_x)][f_y(o_y)] = 0
   push (o_x, o_y) onto the priority queue, say pq.                     [**]

   while (pop (x,y) from pq is possible)
                                                                        [***]
       foreach square (x',y') adjacent to (x,y) and where
           |x'-o_x| <= DELTA and |y'-o_y| <= DELTA
           if w[f_x(x)][f_y(y)] + wt(x',y') < w[f_x(x')][f_y(y')] then
              { we have found a new shortest path }
              w[f_x(x')][f_y(y')] = w[f_x(x)][f_y(y)] + wt(x',y')
              p[f_x(x')][f_y(y')] = (x,y)
              push (x',y') onto the priority queue                      [**]
</PRE>
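<P>
As a rough illustration, the pseudo-code above could be written in C like
this.  The 7x7 map, the cost of 1 per open square, the value of INF and the
linear-scan queue are all assumptions made to keep the sketch short; the
neighbour bounds check is applied to the square being entered.

```c
#define DELTA 2
#define SIZE (2 * DELTA + 1)
#define INF 1000000              /* "some arbitrarily large number" */
#define MAP_W 7
#define MAP_H 7

/* Hypothetical map: '#' is a wall, '.' is open floor costing 1 to enter. */
static const char *map[MAP_H] = {
    ".......",
    ".......",
    "...#...",
    "...#...",
    "...#...",
    ".......",
    ".......",
};

static int wt(int x, int y)
{
    if (x < 0 || x >= MAP_W || y < 0 || y >= MAP_H) return INF;
    return map[y][x] == '#' ? INF : 1;
}

static int w[SIZE][SIZE];
static int p[SIZE][SIZE][2];
static int queued[SIZE][SIZE];   /* 1 if the square is in the queue */
static int o_x, o_y;

static int f_x(int x) { return x - o_x + DELTA; }
static int f_y(int y) { return y - o_y + DELTA; }

static const int dx[8] = {  0,  1, 1, 1, 0, -1, -1, -1 };
static const int dy[8] = { -1, -1, 0, 1, 1,  1,  0, -1 };

/* Pop the queued square with the smallest w; returns 0 when the queue is empty. */
static int pq_pop(int *x, int *y)
{
    int bi = -1, bj = -1;
    for (int i = 0; i < SIZE; i++)
        for (int j = 0; j < SIZE; j++)
            if (queued[i][j] && (bi < 0 || w[i][j] < w[bi][bj])) { bi = i; bj = j; }
    if (bi < 0) return 0;
    queued[bi][bj] = 0;
    *x = bi + o_x - DELTA;       /* translate indices back to map coordinates */
    *y = bj + o_y - DELTA;
    return 1;
}

static void grid_paths(int ox, int oy)
{
    int x, y;
    o_x = ox; o_y = oy;
    for (int i = 0; i < SIZE; i++)
        for (int j = 0; j < SIZE; j++) {
            w[i][j] = INF;
            p[i][j][0] = p[i][j][1] = INF;
            queued[i][j] = 0;
        }
    w[f_x(o_x)][f_y(o_y)] = 0;
    queued[f_x(o_x)][f_y(o_y)] = 1;

    while (pq_pop(&x, &y)) {
        for (int k = 0; k < 8; k++) {
            int nx = x + dx[k], ny = y + dy[k];
            if (nx - o_x < -DELTA || nx - o_x > DELTA) continue;
            if (ny - o_y < -DELTA || ny - o_y > DELTA) continue;
            if (w[f_x(x)][f_y(y)] + wt(nx, ny) < w[f_x(nx)][f_y(ny)]) {
                /* we have found a new shortest path */
                w[f_x(nx)][f_y(ny)] = w[f_x(x)][f_y(y)] + wt(nx, ny);
                p[f_x(nx)][f_y(ny)][0] = x;
                p[f_x(nx)][f_y(ny)][1] = y;
                queued[f_x(nx)][f_y(ny)] = 1;
            }
        }
    }
}
```

Because a wall costs INF to enter, the comparison never succeeds for a wall
square, so walls keep w = INF and are never queued.
<P>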

You now have the shortest paths from (o_x,o_y) to every square in
the range { o_x - DELTA, o_y - DELTA, o_x + DELTA, o_y + DELTA }.  The
following will print the path from some point (x,y) back to (o_x, o_y):
<P>
<PRE>
do
     print (x,y)
     (x,y) = p[f_x(x)][f_y(y)]
while (x,y) != (infinity, infinity)
</PRE>
and any square, (x,y), which is not reachable from (o_x, o_y) will have:
<PRE>
p[f_x(x)][f_y(y)] = (infinity, infinity)
w[f_x(x)][f_y(y)] = infinity
</PRE>
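<P>
The path-following loop above might look like this in C.  Instead of
printing, this sketch collects the squares into an array so the caller can
do what it likes with them; the names trace_path and clear_parents, the
origin at (5,5) and the small DELTA are assumptions for illustration, and
p[] is expected to have been filled in by the search.

```c
#define DELTA 1
#define SIZE (2 * DELTA + 1)
#define INF 1000000

static int o_x = 5, o_y = 5;            /* hypothetical origin */
static int p[SIZE][SIZE][2];            /* parent pointers from the search */

static int f_x(int x) { return x - o_x + DELTA; }
static int f_y(int y) { return y - o_y + DELTA; }

/* Reset every parent to (infinity, infinity), as the algorithm does first. */
static void clear_parents(void)
{
    for (int i = 0; i < SIZE; i++)
        for (int j = 0; j < SIZE; j++)
            p[i][j][0] = p[i][j][1] = INF;
}

/* Follow parent pointers from (x, y) back to the origin, storing each
   square in path[][2].  Returns the number of squares stored (at most max). */
static int trace_path(int x, int y, int path[][2], int max)
{
    int n = 0;
    do {
        if (n == max) break;
        path[n][0] = x;
        path[n][1] = y;
        n++;
        int i = f_x(x), j = f_y(y);     /* index before x and y change */
        x = p[i][j][0];
        y = p[i][j][1];
    } while (x != INF || y != INF);
    return n;
}
```

Note that an unreachable square yields a path of length 1 (just itself), so
a caller would normally check w for infinity before tracing.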

<BR>
<H2>Sample Code:</H2>

I have written a simple example of implementing this algorithm for a grid
map game.  The source code is zipped (pkzip).  The code itself is quite
simple.<P>

I have been able to compile and run the code on several UNIX boxes and DOS
with both the Microsoft C compiler (version 7, I think) and GCC (2.6.3 with
the version of DJGPP current as of June 1, 1995).  It should be quite easy
to port to any system.<P>

Included in the source code is a single map which should give you the gist
of what sort of input it expects and you should read the file sp.c which
explains how to use the demo.
<P>

This source code is released with the understanding that I retain ownership
of the code.  You are free to make use of it in any way you see fit and
by transferring the code, you accept all responsibility for any damages
which result from using it.
<P>

Having accepted that, you may get the source code:<BR>
<a href=sp_demo.zip>Sample C source code, simple grid map</a>
<P>
<I>NOTE</I> A modified version of the same code is linked
a little further down in the document.  The second version can use a simple
heuristic to find the path quickly.
<P>
<H2>Adding a Heuristic</H2>
If you are more interested in finding the shortest path between two nodes
(and not all shortest paths to a node), you may be interested in applying
a heuristic function to the search.
<P>
<I>Author's Note:</I> My apologies on this section.  It is not overly
clear and is a little too brief.  If you don't follow what I'm saying,
you may wish to look at the source code I've supplied and then compare
the algorithm in this page to the actual source code.  It should be
fairly clear.
<P>
A <I>heuristic function</I> is a fancy way of saying a function which guesses
at an answer.  In this case, our heuristic function will be guessing
at the length of the shortest path from any node to the destination node.
<P>
<B>IMPORTANT</B>: Your function must never return more than the
real shortest distance.  If you are using a real-valued Cartesian playing
field, you would want to use the straight-line distance function
sqrt(dist(x)^2 + dist(y)^2).
Using a heuristic function of 0 will get you back to the original algorithm.
<P>
The effect of adding a good heuristic function would be to direct the search
so that very few bad paths will be considered.  I'm not going to bother
much with the theory that this is a shortest path or even why it works.
If you're avidly reading this part, you probably just want to get a
short path as quickly as possible and are willing to trust a little magic.
<P>
To add the heuristic to my algorithm, let <I>h((x,y), (dest_x, dest_y))</I>
be a function which estimates the shortest path between two points.
<P>
In the algorithm, in each line that has ** beside it, add the value of the
heuristic function for the node being pushed to that node's priority in
the queue.  At the point marked with *** add the line:
<Br>
if (x,y) = (dest_x, dest_y) then done.
<P>
And voila!  You have a function which can rapidly find the shortest path
between any two points.
<Hr>
The following source code uses exactly the same algorithm with a heuristic
function that corresponds to the shortest distance on a grid.
<pre>
  H(x_1, y_1, x_2, y_2) = max( |x_2 - x_1|, |y_2 - y_1| )
</pre>
which is the shortest possible distance on a grid between the points
(x_1, y_1) and (x_2, y_2).  The only changes to the original source code
are to add this value to the priority when entries are added to the
priority queue and a simple if statement which stops processing when we
have found the node for which we are searching.
<P>
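To make those two changes concrete, here is a small self-contained C sketch
(not the linked source code).  The 7x7 map, the names search, h and
expanded, and the use_h switch are assumptions for illustration; for
brevity the window covers the whole toy map, so no coordinate translation is
needed.  The heuristic is only safe here because every step costs at
least 1.

```c
#include <stdlib.h>   /* abs */

#define MAP_W 7
#define MAP_H 7
#define INF 1000000

/* Hypothetical map: '#' is a wall, '.' is open floor costing 1 to enter. */
static const char *map[MAP_H] = {
    ".......",
    "...#...",
    "...#...",
    "...#...",
    "...#...",
    "...#...",
    ".......",
};

static int w[MAP_W][MAP_H];
static int queued[MAP_W][MAP_H];
static int expanded;             /* squares popped by the last search */

static int wt(int x, int y)
{
    if (x < 0 || x >= MAP_W || y < 0 || y >= MAP_H) return INF;
    return map[y][x] == '#' ? INF : 1;
}

/* H from the text: max(|x_2 - x_1|, |y_2 - y_1|). */
static int h(int x, int y, int tx, int ty)
{
    int ax = abs(tx - x), ay = abs(ty - y);
    return ax > ay ? ax : ay;
}

/* Returns the weight of the shortest path from (ox,oy) to (tx,ty), or INF. */
static int search(int ox, int oy, int tx, int ty, int use_h)
{
    for (int x = 0; x < MAP_W; x++)
        for (int y = 0; y < MAP_H; y++) { w[x][y] = INF; queued[x][y] = 0; }
    w[ox][oy] = 0;
    queued[ox][oy] = 1;
    expanded = 0;

    for (;;) {
        int bx = -1, by = -1, bkey = 0;
        for (int x = 0; x < MAP_W; x++)
            for (int y = 0; y < MAP_H; y++) {
                /* [**]: priority is the path weight plus the estimate */
                int key = w[x][y] + (use_h ? h(x, y, tx, ty) : 0);
                if (queued[x][y] && (bx < 0 || key < bkey)) { bx = x; by = y; bkey = key; }
            }
        if (bx < 0) return INF;                      /* queue empty: unreachable */
        queued[bx][by] = 0;
        expanded++;
        if (bx == tx && by == ty) return w[bx][by];  /* [***]: done */
        for (int dx = -1; dx <= 1; dx++)
            for (int dy = -1; dy <= 1; dy++) {
                int nx = bx + dx, ny = by + dy;
                if ((dx == 0 && dy == 0) || wt(nx, ny) >= INF) continue;
                if (w[bx][by] + wt(nx, ny) < w[nx][ny]) {
                    w[nx][ny] = w[bx][by] + wt(nx, ny);
                    queued[nx][ny] = 1;
                }
            }
    }
}
```

Running search with use_h set to 0 and then to 1 on the same endpoints
returns the same path weight; comparing expanded after each call shows how
much of the map the heuristic allowed the search to skip.
<P>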
To compare the different techniques, you can use two command line options.
<B>-a</B> will request that processing stop when the destination has been
located and <B>-h</B> will request that the heuristic be applied.  Thus
running the program with -a and then with -a -h will give you a good
comparison between Dijkstra's and Dijkstra's with the heuristic.
<P>
<B>Disclaimer</B> I have not compiled this code on any machine except
a DEC Alpha (OSF).  The <I>zip</I> file was created using that system's
zip command.  I don't guarantee that anyone else will be able to get and
compile this code.  For this reason, I'm continuing to provide the original
source code (just in case).
<P>
<a href=sp_new.zip>Sample C source code, Dijkstra's with a heuristic</a>
<P>
<H2> A* vs. Dijkstra </H2>
<P>
If you are familiar with the A* algorithm, you might notice that adding
the heuristic to Dijkstra's algorithm gives you something very similar.
Basically, Dijkstra's algorithm with a heuristic is equivalent to A*
except for a couple of technical facts:
<P>
<ul>
  <li> A* will generally use less memory
  <li> You must know the entire search space to use Dijkstra's algorithm
</ul>
<p>
For people that are writing tile based games, I would actually
recommend that they use Dijkstra's algorithm because it is easier to
code and will be about as efficient as A*.
<P>
<HR>
Feel free to send any comments, criticism et al to me.
<ADDRESS>
Chris Palmer<BR>
crpalmer+@cs.cmu.edu
</ADDRESS>