Network Working Group                             B. Noble*, G. Nguyen+,
Request for Comments:  DRAFT                M. Satyanarayanan*, R. Katz+
1 Sep 1996                                   *Carnegie Mellon University
Informational                        +University of California, Berkeley


                         Mobile Network Tracing


Status of this Memo

   This memo provides information for the Internet community.  This memo
   does not specify an Internet standard of any kind.  Distribution of
   this memo is unlimited.

   This is an interim version which differs from the June 27, 1996
   version in Section 4 It has been substantially revised in response to
   comments from readers, including Mary Baker, Chane Fullmer, and Geoff
   Kuenning.  We have also added the material present in the Appendices.
   Revisions to the remainder of the RFC are in progress, and should be
   completed within a month.


Abstract

   Mobile networks are both poorly understood and difficult to
   experiment with.  This RFC argues that mobile network tracing
   provides both tools to improve our understanding of wireless
   channels, as well as to build realistic, repeatable testbeds for
   mobile software and systems.  The RFC is a status report on our
   work tracing mobile networks.  Our goal is to begin discussion on a
   standard format for mobile network tracing as well as a testbed for
   mobile systems research.  We present our format for collecting mobile
   network traces, and tools to produce from such traces analytical
   models of mobile network behavior.

   We also describe a set of tools to provide network modulation based
   on collected traces.  Modulation allows the emulation of wireless
   channel latency, bandwidth, loss, and error rates on private,
   wired networks.  This allows system designers to test systems in a
   realistic yet repeatable manner.


1. Introduction

   How does one accurately capture and reproduce the observed behavior
   of a network?  This is an especially challenging problem in mobile


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 1]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   computing because the network quality experienced by a mobile host
   can vary dramatically over time and space.  Neither long-term average
   measures nor simple analytical models can capture the variations
   in bandwidth, latency, and signal degradation observed by such a
   host.  In this RFC, we describe a solution based on network tracing.
   Our solution consists of two phases:  trace recording and trace
   modulation.

   In the trace recording phase, an experimenter with an instrumented
   mobile host physically traverses a path of interest to him.  During
   the traversal, packets from a known workload are generated from
   a static host.  The mobile host records observations of both
   packets received from the known workload as well as the device
   characteristics during the workload.  At the end of the traversal,
   the list of observations represents an accurate trace of the
   observed network behavior for this traversal.  By performing multiple
   traversals of the same path, and by using different workloads, one
   can obtain a trace family that collectively characterizes network
   quality on that path.

   In the trace modulation phase, mobile system and application software
   is subjected to the network behavior observed in a recorded trace.
   The mobile software is run on a LAN-attached host whose kernel is
   modified to read a file containing the trace (possibly postprocessed
   for efficiency,) and to delay, drop or otherwise degrade packets in
   accordance with the behavior described by the trace.  The mobile
   software thus experiences network quality indistinguishable from that
   recorded in the trace.  It is important to note that trace modulation
   is fully transparent to mobile software --- no source or binary
   changes have to be made.

   Trace-based approaches have proved to be of great value in areas
   such as file system design [2, 10, 11] and computer architecture.
   [1, 5, 13] Similarly, we anticipate that network tracing will prove
   valuable in many aspects of mobile system design and implementation.
   For example, detailed analyses of traces can provide insights into
   the behavior of mobile networks and validate predictive models.  As
   another example, it can play an important role in stress testing
   and debugging by providing the opportunity to reproduce the network
   conditions under which a bug was originally uncovered.  As a third
   example, it enables a system under development to be subjected to
   network conditions observed in distant real-life environments.  As
   a final example, a set of traces can be used as a benchmark family
   for evaluating and comparing the adaptive capabilities of alternative
   mobile system designs.

   Our goal in writing this RFC is to encourage the development
   of a widely-accepted standard format for network traces.  Such


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 2]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   standardization will allow traces to be easily shared.  It will also
   foster the development and widespread use of trace-based benchmarks.
   While wireless mobile networks are the primary motivation for this
   work, we have made every effort to ensure that our work is applicable
   to other types of networks.  For example, the trace format and some
   of the tools may be valuable in analyzing and modeling ATM networks.

   The rest of this RFC is organized as follows.  We begin by examining
   the properties of wireless networks and substantiating the claim
   that it is difficult to model such networks.  Next, in Section 3, we
   describe the factors that should be taken into account in designing
   a trace format.  We present the details of a proposed trace format
   standard in Section 4.  Section 5 presents a set of tools that
   we have built for the collection, analysis and replay of traces.
   Finally, we conclude with a discussion of related and future work.


2. Modeling Wireless Networks

   Wireless channels are particularly complex to model, because of their
   inherent dependence on the physical properties of radio waves (such
   as reflections from "hard" surfaces, diffraction around corners, and
   scattering caused by small objects) and the site specific geometries
   in which the channel is formed.  They are usually modeled as a time-
   and distance-varying signal strength, capturing the statistical
   nature of the interaction among reflected radio waves.  The signal
   strength can vary by several orders of magnitude (+ or - 20-30
   dB) within a short distance.  While there have been many efforts
   to obtain general models of radio propagation inside buildings
   and over the wide area, these efforts have yielded inherently
   inaccurate models that can vary from actual measurements by an order
   of magnitude or more.

   Signal-to-noise ratio, or SNR, is a measure of the received signal
   quality.  If the SNR is too low, the received signal will not be
   detected at the receiver, yielding bit errors and packet losses.
   But SNR is not the only effect that can lead to losses.  Another
   is inter-symbol interference caused by delay spread, that is,
   the delayed arrival of an earlier transmitted symbol that took a
   circuitous propagation path to arrive at the receiver, thereby
   (partially) canceling out the current symbol.  Yet another problem is
   doppler shift, which causes frequency shifts in the arrived signal
   due to relative velocities of the transmitter and the receiver,
   thereby complicating the successful reception of the signal.  If
   coherent reception is being used, receiver synchronization can be
   lost.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 3]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   More empirically, it has been observed that wireless channels adhere
   to a two state error model.  In other words, channels are usually
   well behaved but occasionally go into a bad state in which many burst
   errors occur within a small time interval.

   Developers of network protocols and mobility algorithms must
   experiment with realistic channel parameters.  It is highly desirable
   that the wireless network be modeled in a thoroughly reproducible
   fashion.  This would allow an algorithm and its variations to
   be evaluated in a controlled and repeatable way.  Yet the above
   discussion makes it clear that whether analytical models are used or
   even actual experimentation with the network itself, the results will
   be either inaccurate or unlikely to be reproducible.  A trace-based
   approach alleviates these problems.


3. Desirable Trace Format Properties

   In designing our trace format, we have been guided by several
   principles.  First traces should be compact.  Second, the format
   should be extensible.  Third, they should be self-describing.
   Finally, they must be easy to manage without unduly sacrificing the
   other goals.  This section describes how each of these principles has
   affected our design.

   Our primary design goal is compactness of trace format.  Capturing
   network activity, even for relatively inactive networking
   sessions, is expensive in space.  This rules out an ASCII or other
   human-readable trace format.  However, we do not carry this goal to
   extremes.  We leave room for expansion in bitfields, and try to be
   conservative in selecting field sizes.  In the assumption that modern
   machines are optimized for word-alignment accesses, we often require
   padding to 32-bit boundaries.

   While we have found several interesting uses for network traces, it
   is certain that more will evolve over time.  As network tracing is
   used in new ways, it may be necessary to add new data to the trace
   format.  Rather than force the trace format to be redesigned, we
   have structured the format to be extensible.  There is a built-in
   mechanism to add to the kinds of data that can be recorded in network
   traces.

   This extensibility is of little use if the tool set needs to change
   incompatibly as the trace format is extended.  Recognizing this,
   we have made the format -- particularly the extensible portions --
   self-describing.  Thus, old versions of tools can continue to work
   with extended traces, if perhaps in a less than optimal way.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 4]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   In our experience with other tracing systems, management of trace
   files is often difficult at best.  Common problems include the need
   to manage multiple trace files as a unit, not easily being able to
   extract the salient features of large trace files, and difficulty
   cataloging trace data.  To help cope with file management, we have
   designed the the traces to be split or merged easily.  To ease
   cataloging, we've chosen to store some descriptive information as
   ASCII strings, allowing easy identification of trace files.


4. Trace Format

   This section describes the format for network traces.  We begin by
   presenting the physical layout of the atomic types in a trace, and
   then outline the basic abstractions that are key to the trace format:
   the record, as well as the track, a collection of related records.
   We then describe the records at the beginning and end of a trace,
   the header and trailer.  The bulk of the section describes the three
   kinds of record tracks:  packet, device, and general.  These also
   make up the bulk of the actual trace.  We conclude the section with
   a discussion of two special purpose records:  the annotation and the
   trace data loss records.

   To help illustrate the format, we often refer to a hypothetical
   example of a laptop tracing TCP traffic that it sends and receives
   over two different physical network interfaces.  The first is a
   2 Mb/s WaveLan radio ethernet, and the second is a 28.8 Kb/s Metricom
   radio modem.  There are any number of TCP connections to any number
   of hosts, some over the WaveLan and some over Metricom.


4.1. Basic Abstractions

   In this section, we begin by describing the fundamental data types,
   how they are composed into records, and how the records themselves
   are composed into related groups of records called tracks.


4.1.1. Fundamental Types


struct tr_string_t {
    u_int32_t      s_size;
    unsigned char  *s_string;
};

                    Figure 1: The atomic string type


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 5]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


              +------------+-----------------------------+
              | u_int8_t   | 1-byte integer, unsigned    |
              | int8_t     | 1-byte integer, signed      |
              | u_int16_t  | 2-byte integer, unsigned    |
              | int16_t    | 2-byte integer, signed      |
              | u_int32_t  | 4-byte integer, unsigned    |
              | int32_t    | 4-byte integer, signed      |
              | u_int64_t  | 8-byte integer, unsigned    |
              | int64_t    | 8-byte integer, signed      |
              +------------+-----------------------------+
                     Table 1: Basic integral types


   There are several atomic data types from which trace records are
   composed.  All of these data types are stored in a trace in network
   byte-order, which is to say most significant byte first.

   There are four basic integral types; 8, 16, 32, and 64 bits.  Each
   of these may be signed or unsigned.  All of them are stored most
   significant byte first.  They appear in Table 1.  These basic types
   can often be formed into variable-length arrays; When we use them, we
   include an explicit size field expressed in bytes.  When this size
   is not a multiple of four bytes, we require that the field be padded
   to the nearest 4-byte boundary in the physical trace representation.
   This padding is implicit in all variably-sized fields.

   There is a single data type for ASCII strings:  tr_string_t.
   These are variable-length arrays of unsigned characters, are
   null-terminated, and may contain any non-NULL ASCII character,
   including newlines.  As with numeric arrays, we always explicitly
   include the size of the string in bytes, and the string is padded
   to the nearest 4-byte boundary.  Because of this, we define a
   tr_string_t to contain the size field, as shown in Figure 1.


4.1.2. Records

   A record is the smallest unit of trace data, and is a composition of
   one or more of the fundamental data types discussed in the previous
   section.  There are several different types of records, each of which
   is discussed in Sections 4.2 through 4.7.  However, there are three
   required fields that begin each record:  a magic number, the size of
   the record, and the time at which the record was generated.  This
   structure is depicted in Figure 2.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 6]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


                          +------------------+
                          | Magic Number     |
                          | Size of Record   |
                          | Timestamp        |
                          +------------------+
                          | Other Fields     |
                          |     ...          |
                          +------------------+

                        Figure 2: Record format


   The first field in each record is a magic word, a 32 bit pattern that
   both identifies the record's type and lends some confidence that the
   record is well formed.  A record's magic word, along with at most one
   previous record in the trace, completely defines the layout of that
   record.  We describe how layout can be determined in Section 4.1.3.
   The full list of allocated magic numbers, along with available and
   reserved ranges and the mechanism by which new ones are allocated,
   appears in Appendix A.

   The second field of each record is the record's size in bytes.  This
   servers two purposes.  First, the size can be matched against known
   constraints on records of that type, lending further confidence that
   the record is well-formed.  For example, all records are expected to
   be an even multiple of four bytes in length.  Second, if a record
   type is unknown to a given piece of processing software, the software
   can skip the unknown record and continue with the remainder of the
   trace.

   The final field required of all records is the time at which it
   was generated.  These timestamps are pairs of u_int32_t integers.
   The first of these measures the number of seconds since midnight,
   January 1, 1970.  The second is the number of additional nanoseconds.
   This is identical to the POSIX time standard.  We have chosen to
   use this single timestamp format because it is convenient, covers
   most anticipated systems' notions of time, and provides sufficient
   granularity for measuring network events.  Timestamps are all taken


struct tr_time_t {
    u_int32_t  tt_sec;  /* Seconds since 00:00 1/1/70 */
    u_int32_t  tt_nsec; /* Additional nsecs */
};
                       Figure 3: Timestamp format


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 7]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   from the tracing host's local clock, and all records appear in
   a trace in time-order.  The trace includes facilities to record
   timezone and daylight savings time information so that traces from
   multiple hosts may be compared directly.

   Records may have one or more variably-sized components, each of which
   has an explicit size field, as the first field of the variably-sized
   component.  These components are all padded to the nearest 4-byte
   boundary.  In in our descriptions of individual record types, we
   present records as structures, and the variably-sized components as
   separate structures pointed to by the enclosing records.  However,
   the physical representation of such structures in the trace
   are flattened; we have built a library to handle marshaling and
   unmarshaling of records.


4.1.3. Tracks

   Many of the record types have both fixed, required fields as
   well as a set of optional fields.  It is these options that
   provide extensibility in our trace format.  However, to provide a
   self-describing trace, we need some compact way of determining which
   optional fields are present in a given record.  To do this, we group
   related sets of packets into tracks.  For example, we might record
   the TCP packets arriving over the WaveLAN interface in one track, and
   the TCP packets arriving over the Metricom interface in another.

   A track is a header followed by some number of related records; the
   header completely describes the format of the individual records.
   Figure 4 illustrates the two tracks our example scenario might
   capture.

   Since records appear in the trace in time-order, records from
   separate tracks are typically interleaved with one another.  The only
   requirement is that the header for each individual track appears
   before any of the track's records.  Figure 5 shows an example of how
   records from the tracks in Figure 4 would be interleaved in the trace
   file.

   All entries in a track are presumed to have the same set of optional
   fields present.  To compactly describe these optional fields, track
   headers contain property lists.  An entry in a property list is a
   record that defines:  whether this optional field is recorded once
   for the entire track, or once in each track entry; the type and size
   of the data recorded; and which optional field is being recorded.
   The layout of a property list entry is illustrated in Figure 6.  A
   track header explicitly names the number of property list entries
   present, and then records each property list sequentially.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 8]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


       +----------++----------++----------++----------++----------+
       | WaveLAN  || WaveLAN  || WaveLAN  || WaveLAN  || WaveLAN  |
       | Header   || Packet 1 || Packet 2 || Packet 3 || Packet 4 |
       | Time: 1  || Time: 2  || Time: 4  || Time: 6  || Time: 12 |
       +----------++----------++----------++----------++----------+

                     (a) WaveLAN track header, entries


       +----------++----------++----------++----------++----------+
       | Metricom || Metricom || Metricom || Metricom || Metricom |
       | Header   || Packet 1 || Packet 2 || Packet 3 || Packet 4 |
       | Time: 3  || Time: 5  || Time: 7  || Time: 9  || Time: 11 |
       +----------++----------++----------++----------++----------+

                     (b) Metricom track header, entries

             Figure 4: Two TCP tracks collected on one host
    +----------++----------++----------++----------++----------+-
    | WaveLAN  || WaveLAN  || Metricom || WaveLAN  || Metricom |
    | Header   || Packet 1 || Header   || Packet 2 || Packet 1 | ***
    | Time: 1  || Time: 2  || Time: 3  || Time: 4  || Time: 5  |
    +----------++----------++----------++----------++----------+-

                  Figure 5: Interleaved track records

#define PLIST_HEADER_ONLY   0x1

#define PLIST_TYPE_UINT8    0x1
#define PLIST_TYPE_INT8     0x2
#define PLIST_TYPE_UINT16   0x3
#define PLIST_TYPE_INT16    0x4
#define PLIST_TYPE_UINT32   0x5
#define PLIST_TYPE_INT32    0x6
#define PLIST_TYPE_UINT64   0x7
#define PLIST_TYPE_INT64    0x8
#define PLIST_TYPE_STRING   0x9

struct tr_plist_ent_t {
    u_int8_t     pl_flags;   /* header-only? */
    u_int8_t     pl_type;    /* Type of recorded data */
    u_int16_t    pl_size;    /* Size of data in bytes */
    u_int32_t    pl_name;    /* Which option recorded */
    u_int8_t     *pl_value;  /* Only if header prop. */
};

                    Figure 6: Property list entries


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996    [Page 9]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   Some data concerning a track may only need to be measured once for
   the entire track.  For example, if we recorded the maximum segment
   size used by the laptop's TCP implementation, we would only need to
   record it once; recording it for every TCP packet would be wasteful.
   We call such optional fields ``header-only options'', and they appear
   directly in the track header's property list.  Such property list
   entries have the PLIST_HEADER_ONLY bit set.

   Each optional field may be recorded in an array of any one of the
   valid atomic data types described in Section 4.1.1.  The type used
   for a particular option is encoded in the pl_type field of the
   property list.  The number of bytes used by each entry is encoded
   in pl_size.  If that number is not a multiple of four, the physical
   representation of the field must be padded to the nearest 4-byte
   boundary.

   The pl_name field encodes which option this entry represents.  There
   are a number of such optional fields defined to date; they are
   described throughout this document.  A brief list of the defined
   options along with the procedure for defining new ones in Appendix B.

   Those options that are per-entry appear in each track entry in the
   same order in which they were defined.  No space is occupied in track
   entries by header-only options.  To illustrate how property lists
   work, consider the WaveLAN track being recorded by our hypothetical
   laptop.  There are three options that are recorded in this track:
   PKT_SEQUENCE, the TCP sequence number of each packet, TCP_MSS,
   the maximum segment size this implementation of TCP is using, and
   TCP_DPORT, the destination port of the packet.  An illustration
   of the track header's property list appears in Figure 7, and the
   optional values recorded in each track entry in Figure 8.

   In our example property list, there are three entries.  The
   first property list entry names a per-entry option, measured in
   u_int32_t's.  Since there are four bytes used per entry, there should
   be one such number per entry.  The optional field being recorded is
   the sequence number from each TCP packet, one packet per track entry.
   The second property list entry names a header-only option, also
   measured in a single u_int32_t.  Since it is a header-only option,
   the value of the MSS used by this host's implementation of TCP occurs
   directly after the other fields of the property list.  The final
   property list entry names a per-entry option, measured in a single
   u_int16_t, that captures the destination port for each packet.

   Each entry record in our example WaveLAN track thus has two optional
   fields, the sequence number of the TCP packet, and the destination
   port of the packet.  The first option is a four-byte, unsigned
   integer, and appears as the first recorded option in each entry.  The


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 10]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


     32      24       16        8       0
     +--------+--------+--------+-------+
     |                              0x3 |  /* 3 entries total */
     +==================================+  /* plist entry #1 */
     |    0x0 |    0x5 |            0x4 |  /* flags, type, size */
     +--------+--------+--------+-------+
     |                     PKT_SEQUENCE |  /* name, but no value */
     +==================================+  /* plist entry #2 */
     |    0x1 |    0x5 |            0x4 |  /* flags, type, size */
     +--------+--------+--------+-------+
     |                          TCP_MSS |  /* name */
     +--------+--------+--------+-------+
     |                      <MSS VALUE> |  /* value, (header-only) */
     +==================================+  /* plist entry #3 */
     |    0x0 |    0x3 |            0x2 |  /* flags, type, size */
     +--------+--------+--------+-------+
     |                        TCP_DPORT |  /* name, but no value */
     +==================================+

           Figure 7: Example property list for WaveLAN track


     32      24       16        8       0
     +==================================+
     |                <SEQUENCE NUMBER> | /* First option */
     |-----------------+----------------|
     |  <DPORT NUMBER> |      <padding> | /* Destination port */
     +==================================+
        Figure 8: Example track-entry options for WaveLAN track


   second option is a two-byte unsigned integer, and appears directly
   after the sequence number.  Since this option uses a number of bytes
   that isn't a multiple of four, it must have two extra bytes of
   padding to fill it out.  This padding is implicit, and per-option.
   Thus, if there were two options in a row, each a single two-byte
   number, each option would occupy four bytes:  two for the value, and
   two for the padding.  This is to allow options to be interpreted
   completely independently in the event that a program reading the
   trace doesn't understand some of the options.


4.2. Trace Headers and Trailers

   Trace files begin with a trace header, and end with a trace
   trailer.  The formats of these appear in Figure 9.  The header gives
   the time the trace began, the version number of the trace, the


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 11]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


      #define NTR_VERSION   1    /* The version in this RFC */

      #define TR_DST_NONE   0    /* not on dst */
      #define TR_DST_USA    1    /* USA style dst */
      #define TR_DST_AUST   2    /* Australian style dst */
      #define TR_DST_WET    3    /* Western European dst */
      #define TR_DST_MET    4    /* Middle European dst */
      #define TR_DST_EET    5    /* Eastern European dst */
      #define TR_DST_CAN    6    /* Canada */

      struct tr_header_t {
          u_int32_t        h_magic;
          u_int32_t        h_size;
          struct tr_time_t h_timestamp;  /* starting time */
          u_int32_t        h_version;    /* Version # of trace */
          int16_t          h_minwest;    /* Minutes west of GMT */
          u_int16_t        h_dsttype;    /* Type of daylt. sav.*/
          tr_string_t      h_date;       /* Date collected */
          tr_string_t      h_desc;       /* description of trace */
          u_int32_t        h_numagent;   /* # hosts traced */
          tr_string_t      *h_agent;     /* DNS name [numagent] */
          u_int32_t        *h_agent_ip;  /* IP addr [numagent] */
      };

      struct tr_trailer_t {
          u_int32_t         f_magic;
          u_int32_t         f_size;
          struct tr_time_t  f_timestamp; /* end time */
          tr_string_t       f_date;      /* Date end written */
      };


               Figure 9: Trace header and trailer records


   timezone/daylight savings time information for timestamps in that
   trace, as well as the ASCII representation for the date the trace
   was taken.  We also collect a text description of the trace to make
   cataloging traces simpler.

   The trace header concludes with an array describing the hosts from
   which the trace was collected.  Because our traces can be merged with
   one another, there may be records from more than one host in the
   trace.

   While it is true that a single trace may contain records collected
   from more than one machine, we require all timestamps to be from


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 12]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   the same timezone, and to use the same daylight savings correction.
   Thus, before merging traces taken on machines with different
   timezones, they must be post-processed.

   The trace trailer contains only the time at which the trace ended; it
   serves primarily as a marker to show the trace is complete.


4.3. Packet Tracks

   Measuring packet activity is the main focus of mobile network
   tracing.  Packet activity is recorded in tracks, with a packet header
   and a set of packet entries.  A single track is meant to capture the
   activity of a single protocol, traffic from a single router, or some
   other subset of the total traffic seen by a machine.  The decision
   of how best to subdivide packet activity is left to the individual
   experimenter, though we encourage experimenters to make use of tools
   already available.  The required portions of packet headers and
   entries are presented in Figure 10.


      struct tr_pkt_hdr_t {
          u_int32_t           ph_magic;
          u_int32_t           ph_size;
          struct tr_time_t    ph_timestamp;
          u_int32_t           ph_trackmagic; /* magic # of track ent. */
          u_int32_t           ph_ipaddr;     /* host generating track */
          u_int32_t           ph_dev_type;   /* device collected from */
          u_int32_t           ph_protocol;   /* protocol */
          u_int32_t           ph_plistsz;    /* Entries in plist */
          struct tr_plist_ent_t *ph_plist;   /* variable size */
      };

      #define TR_PKTINBOUND    0x1
      #define TR_PKTOUTBOUND   0x2

      struct tr_pkt_ent_t {
          u_int32_t        pe_magic;
          u_int32_t        pe_size;
          struct tr_time_t pe_timestamp;
          u_int32_t        pe_flags;
          u_int32_t        pe_psize;    /* packet size */
          u_int8_t         *pe_vlist;   /* determined by plist/size */
      };


               Figure 10: Packet header and entry records


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 13]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   Packet track headers identify which host generated the trace records
   for that track, as well as the time at which the track began.  It
   records the device on which these packets are received or sent, and
   the protocol used to ship the packet; these allow interpretation of
   device-specific or protocol-specific options.  The header concludes
   with the property list for the track, as described in Section 4.1.3.

   The track header also includes the field ph_trackmagic.  Each of a
   track's entry records has as its magic number the track header's
   ph_trackmagic field.  Consider as an example the hypothetical WaveLAN
   TCP track header and entry shown in Figure 11.  In Figure 11(a), we
   see that this record is a packet header record, and that it is the
   header defining track 0xeda80001.  Thus, whenever we see a record
   whose magic number is 0xeda80001, such as in Figure 11(b), we know
   that it is a packet entry, and its header is that from Figure 11(a).

   A packet entry is generated for every traced packet.  It contains
   the size of the traced packet, the time at which the packet was sent
   or received, flags to indicate whether the packet was inbound or
   outbound, and the list of property measurements as specified in the
   track header.


+------------------------+
|        TR_PKTHDR_MAGIC | /* Identifies as packet header */
|------------------------|
|          <header size> | /* Size of header record */
|------------------------|
|                        |
|            <timestamp> | /* Time record was generated */
|------------------------|
|             0xeda80001 | /* Magic # used by entries of this track */
|------------------------|
|          ****          | /* Header continues... */

                (a) Header for the WaveLAN TCP track

     +------------------------+
     |             0xeda80001 | /* Magic #: identifies this entry */
     |------------------------|
     |         ****           | /* Entry continues */

                (b) An entry in the WaveLAN TCP track

              Figure 11: How track entries are identified


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 14]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   +---------------+-----------------------------------------------+
   | ADDR_SEND     | Address of sending host                       |
   | ADDR_RECV     | Address of receiving host                     |
   | ADDR_LINK     | Address of one-hop router                     |
   | PKT_SEQUENCE  | Sequence number of packet                     |
   | PKT_SENTTIME  | Time packet was sent                          |
   | PKT_HOPS      | Number of hops packet took                    |
   | SOCK_SENDPORT | Sending socket ports                          |
   | SOCK_RECVPORT | Receiving socket ports                        |
   | IP_PROTO      | Protocol number of an IP packet               |
   | ICMP_PINGTIME | Roundtrip time of an ICMP ECHO/REPLY pair     |
   | ICMP_KIND     | Type and code of an ICMP packet               |
   | ICMP_ID       | The id field of an ICMP packet                |
   | TCP_MSS       | The id field of an ICMP packet                |
   | TCP_DPORT     | TCP destination port                          |
   | TCP_SPORT     | TCP source port                               |
   | TCP_ACK       | TCP ack number                                |
   | TCP_WINDOW    | TCP window value                              |
   | TCP_TIMESTAMP | TCP timestamp option                          |
   | PROTO_FLAGS   | Protocol-specific flags                       |
   | PROTO_ERRLIST | Protocol-specific status/error words          |
   +---------------+-----------------------------------------------+
          Table 2: Current optional fields for packet entries


   The options we have defined to date are in Table 2.  Several of these
   have played an important role in our early experiments.  ADDR_SEND
   identifies the senders of traffic during the experiment.  We can
   determine network performance using either PKT_SENTTIME for one-way
   traffic between two hosts with closely synchronized clocks, or round
   trip ICMP ECHO traffic and the ICMP_PINGTIME option.  Tracking
   PKT_SEQUENCE numbers sheds light on both loss rates and patterns.
   Section 5 discusses how these measurements are used.  Appendix B
   lists the bit patterns used in the trace to correspond to these
   fields, as well as the typical sizes in which they are represented.


4.4. Device Tracks

   Our trace format captures details of the devices which carry network
   traffic.  To date, we've found this most useful for correlating
   lost packets with various signal parameters provided by wireless
   devices.  The required portions of device header and entry records
   appear in Figure 12, and are quite simple.  Device track headers
   identify the host generating the track's records, the time at which
   the observation starts, and the type of device that is being traced.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 15]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


      struct tr_dev_hdr_t {
          u_int32_t           dh_magic;
          u_int32_t           dh_size;
          struct tr_time_t    dh_timestamp;
          u_int32_t           dh_trackmagic; /* Magic # of track ent. */
          u_int32_t           dh_ipaddr;     /* host generating track */
          u_int32_t           dh_dev_type;   /* device described */
          u_int32_t           dh_plistsz;    /* Entries in plist */
          struct tr_plist_ent_t *dh_plist;   /* Variable size */
      };

      struct tr_dev_ent_t {
          u_int32_t        de_magic;
          u_int32_t        de_size;
          struct tr_time_t de_timestamp;
          u_int8_t         *de_vlist;  /* determined by plist/size */
      };


               Figure 12: Device header and entry records


   Each entry contains the time of the observation, and the list of
   optional characteristics.

   These optional characteristics, listed in Table 3, are concerned with
   the signal parameters of the wireless interfaces we have available.
   Interpreting these parameters is heavily device-dependent.  Because
   of this, we have taken the position that we need different options
   for each different kind of device.  For example, the WaveLAN measures
   signal-to-noise ratios on a scale from 0 to 63, while the Metricom
   radio uses a totally different encoding.  Rather than try to convert
   all these to a common currency, we instead measure them in native
   units, and depend on post-processing to provide comparable numbers.
   We describe how we've used device observations in Section 5.


4.5. General Tracks

   We use general tracks to record things that don't fit clearly in
   either the packet or device model.  At the moment, physical location
   of a mobile host is the only attribute tracked in general trace
   records.  The required portions of the general header and entry
   records are shown in Figure 13.  In addition to the property list,
   general headers have only the IP address of the host generating the
   record and the time at which observations began.  General entries
   have only a timestamp, and the optional fields.  We have only


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 16]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


  +-----------------+--------------------------------------------------+
  | DEV_MAJID       | Major number of device (header only)             |
  | DEV_MINID       | Minor number of device (header only)             |
  | DEV_STATUS      | Device specific status registers                 |
  +-----------------+--------------------------------------------------+
  | WVLN_SIGTONOISE | Signal to noise ratio reported by WaveLAN        |
  | WVLN_SIGQUALITY | Signal quality reported by WaveLAN               |
  | WVLN_SILENCELVL | WaveLAN silence level                            |
  +-----------------+--------------------------------------------------+
  | MCOM_SIGTONOISE | Signal to noise ratio reported by Metricom       |
  | MCOM_SIGQUALITY | Signal quality reported by Metricom              |
  | MCOM_SILENCELVL | Metricom silence level                           |
  +-----------------+--------------------------------------------------+
          Table 3: Current optional fields for packet entries


   experimented with physical location, and have not yet settled on a
   standard, portable way to describe it.  As such, the options we use
   are not part of this RFC.


      struct tr_gen_hdr_t {
          u_int32_t             gh_magic;
          u_int32_t             gh_size;
          struct tr_time_t      gh_timestamp;
          u_int32_t             gh_trackmagic;
          u_int32_t             gh_ipaddr;
          u_int32_t             gh_plistsz;
          struct tr_plist_ent_t *gh_plist; /* Variable size */
      };

      struct tr_gen_ent_t {
          u_int32_t        ge_magic;
          u_int32_t        ge_size;
          struct tr_time_t ge_timestamp;
          u_int32_t        *ge_vlist; /* Variable size */
      };


              Figure 13: General header and entry records


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 17]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


4.6. Annotations

   While general tracks are designed primarily for periodically measured
   data, an experimenter may occasionally want to embed arbitrary,
   one-time-only data into a trace.  We include annotation records to
   provide for this.  Such records are not part of a track; they stand
   alone.  The structure of an annotation record is shown in Figure 14.
   Annotations include the time at which the annotation was inserted in
   the trace, the host which inserted the annotation, and the data of
   the annotation itself.  Note that, by definition, the annotation data
   is header only, and appears directly after the single property list
   entry.


4.7. Lost Trace Data

   It is possible that, during collection, some trace records may be
   lost due to trace buffer overflow or other reasons.  Rather than
   throw such traces away, or worse, ignoring the lost data, we've
   included a loss record to count the types of other records which are
   lost in the course of trace collection.  Loss records are shown in
   Figure 15.

   A single loss record has the time at which it was generated, the
   host that generated it, and the number of distinct record types
   being reported lost, l_lossentsz.  This is followed by an array
   of pairs of 4-byte integers:  the magic number of the record type
   reported as lost, and a count.  This array has l_lossentsz entries.
   We define loss counts of -1 to mean that it is unknown how many
   records of a given type were lost.  We define a special magic number
   NTR_UNKNOWN_MAGIC to specify that we've lost some records of unknown
   types.


      struct tr_annote_t {
          u_int32_t        a_magic;
          u_int32_t        a_size;
          struct tr_time_t a_timestamp;
          u_int32_t        a_ipaddr;
          struct tr_plist_ent_t a_plent; /* single, hdr plist ent. */
      };


                     Figure 14: Annotation records


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 18]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


      struct tr_lossent_t {
          u_int32_t   le_magic; /* Magic # of records lost */
          int32_t     le_lost;  /* how many (-1 == don't know) */
      };

      struct tr_loss_t {
          u_int32_t        l_magic;
          u_int32_t        l_size;
          struct tr_time_t l_timestamp;
          u_int32_t        l_ipaddr;
          u_int32_t        l_lossentsz;  /* How many lossents */
          struct tr_lossent_t l_lossarr; /* Array of lossents */
      };


                        Figure 15: Loss records


5. Software Components

   In this section, we describe the set of tools that have been built to
   date for mobile network tracing.  We believe many of these tools are
   widely applicable to network tracing tasks, but some have particular
   application to mobile network tracing.  We begin with an overview of
   the tools, their applicability, and the platforms on which they are
   currently supported, as well as those they are being ported to.  This
   information is summarized in Table 4.

   We have made every effort to minimize dependencies of our software on
   anything other than protocol and device specifications.  As a result,
   we expect ports to other BSD-derived systems to be straightforward;
   ports to other UNIX systems may be more complicated, but feasible.


                  +--------------+--------------+--------------+
                  | Collection   | Modulation   | Analysis     |
      +-----------+--------------+--------------+--------------+
      | NetBSD    | In Progress  | Supported    | Supported    |
      | BSDI      | Supported    | Planned      | Supported    |
      +-----------+--------------+--------------+--------------+
This table summarizes the currently supported platforms for the tracing
tool suites, and the platforms to which ports are underway.

                       Table 4: Tool Availability


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 19]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   There are three categories into which our tracing tools can be
   placed:  trace collection, trace modulation, and trace analysis.
   Trace collection tools are used for generating new traces.  They
   record information about the general networking facilities, as well
   as data specific to mobile situations:  mobile host location, base
   station location, and wireless device characteristics.  These tools
   are currently supported on BSDI, and are being ported to NetBSD. We
   describe these tools in Section 5.1.

   Trace modulation tools emulate the performance of a traced wireless
   network on a private wired network.  The trace modulation tools,
   discussed in Section 5.2, are currently supported on NetBSD
   platforms.  They are geared toward replaying low speed/quality
   networks on faster and more reliable ones, and are thus most
   applicable to reproducing mobile environments.

   In Section 5.3, we conclude with a set of trace processing and
   analysis tools, which are currently supported on both NetBSD and
   BSDI platforms.  Our analyses to date have focused on properties of
   wireless networks, and are most directly applicable to mobile traces.
   The processing tools, however, are of general utility.


5.1. Trace Collection Tools

   The network trace collection facility comprises two key components:
   the trace agent and the trace collector.  They are shown in
   Figure 16.


                                     +-----------+  write to disk
                                     | Trace     | ==============>
                                     | Collector |
                                     +-----------+
                                             A
     ========================================|===== kernel boundary
     +-----------------+                     |
     | Transport Layer |                     |
     |-----------------|             +------------------+
     |  Network Layer  |------------>| Trace   +------+ |
     |-----------------|             | Agent   |buffer| |
     |  NI |  NI |  NI |------------>|         +------+ |
     +-----------------+             +------------------+
 This figure illustrates the components of trace collection.  The NI's
                        are network interfaces.

               Figure 16: Components of trace collection


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 20]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   The trace agent resides in the kernel where it can obtain data that
   is either expensive to obtain or inaccessible from the user level.
   The agent collects and buffers data in kernel memory; the user-level
   trace collector periodically extracts data from this kernel buffer
   and writes it to disk.  The buffer amortizes the fixed costs of data
   transfer across a large number of records, minimizing the impact of
   data transfer on system performance.  The trace collector retrieves
   data through a pseudo-device, ensuring that only a single -- and
   therefore complete -- trace file is being generated from a single
   experiment.  To provide simplicity and efficiency, the collector does
   not interpret extracted data; it is instead processed off-line by the
   post-processing and analysis tools described in Sections 5.2 and 5.3.

   There are three sorts of data collected by the tracing tools:
   network traffic, network device characteristics, and mobile host
   location.  The first two are collected in much the same way; we
   describe the methodology in Section 5.1.1.  The last is collected
   in two novel ways.  These collection methods are addressed in
   Section 5.1.2.


5.1.1. Traffic and Device Collection

   The trace agent exports a set of function calls for traffic and
   device data collection.  Traffic data is collected on a per-packet
   basis.  This is done via a function called from device drivers with
   the packet and a device identifier as arguments.  For each packet,
   the trace record contains the source and destination address options.
   Since our trace format assembles related packets into tracks, common
   information, such as the destination address, is recorded in the
   track header to reduce the record size for each packet entry.  We
   also record the size of each packet.

   Information beyond packet size and address information is typically
   protocol-dependent.  For transport protocols such as UDP and TCP,
   for example, we record the source and destination port numbers; TCP
   packet records also contain the sequence number.  For ICMP packets,
   we record their type, code and additional type-dependent data.  As
   explained in Section 5.2.3, we record the identifier, sequence number
   and time stamp for ICMP ECHOREPLY packets.

   Before appending the record to the trace buffer, we check to see if
   it is the first record in a track.  If so, we create a new packet
   track header, and write it to the buffer prior the packet entry.

   Our trace collection facility provides similar mechanisms to record
   device-specific data such as signal quality, signal level, and noise
   level.  Hooks to these facilities can be easily added to the device


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 21]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   drivers to invoke these tracing mechanisms.  The extensible and
   self-describing features of our trace format allow us to capture a
   wide variety of data specific to particular network interfaces.

   For wireless network devices, we record several signal quality
   measurements that the interfaces provide.  Although some interfaces,
   such as NCR's WaveLAN, can supply this of information for every
   packet received, most devices average their measurements over a
   longer period of time.  As a result, we only trace these measurements
   periodically.  It is up to the device drivers to determine the
   frequency at which data is reported to the trace agent.

   When devices support it, we also trace status and error events.
   The types of errors, such as CRC or buffer overflow, allow us to
   determine causes for some observed packet losses.  For example, we
   can attribute loss to either the wireless channel or the network
   interface.


5.1.2. Location Tracing

   At first thought, recording the position of a mobile host seems
   straightforward.  It can be approximated by recording the base
   station (BS) with which the mobile host is communicating.  However,
   due to the large coverage area provided by most radio interfaces,
   this information provides a loose approximation at best.  In
   commercial deployments, we may not be able to reliably record the
   base station with which a mobile host communicates.  This section
   outlines our collection strategy for location information in both
   outdoor and indoor environments.

   The solution that we have considered for wide-area, outdoor
   environments makes use of the Global Positioning System (GPS). The
   longitude and latitude information provided by the GPS device is
   recorded in a general track.

   Indoor environments require a different approach because the
   satellite signals cannot reach a GPS device inside a building.  We
   considered deploying an infrared network similar to the Active Badge
   [14] or the ParcTab [12]; however, this significant addition to the
   wireless infrastructure is not an option for most research groups.

   As an alternative, we have developed a graphical tool that displays
   the image of a building map and expects the user to "click" their
   location as they move; the coordinates on the map are recorded in one
   or more general tracks.  The header of such tracks can also record
   the coordinates of the base stations if they are known.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 22]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   An extension can be easily added to this tool to permit multiple
   maps.  As the user requests that a new map be loaded into the
   graphical tracing tool, a new location track is created along with
   an annotation record that captures the file name of that image.
   Locations of new base stations can be recorded in this new track
   header.  Each location track should represent a different physical
   and wireless environment.


5.2. Trace Modulation Tools

   A key tool we have built around our trace format is PaM, the Packet
   Modulator.  The idea behind PaM is to take traces that were collected
   by a mobile host and distill them into modulation traces.  These
   modulation traces capture the networking environment seen by the
   traced host, and are used by a PaM kernel to delay, drop, or corrupt
   incoming and outgoing packets.  With PaM, we've built a testbed that
   can repeatably, reliably mimic live systems under certain mobile
   scenarios.

   There are three main components to PaM. First, we've built a kernel
   capable of delaying, dropping, and corrupting packets to match the
   characteristics of some observed network.  Second, we've defined a
   modulation trace format to describe how such a kernel should modulate
   packets.  Third, we've built a tool to generate modulation traces
   from certain classes of raw traces collected by mobile hosts.


5.2.1. Packet Modulation

   The PaM modulation tool has been placed in the kernel between the IP
   layer and the underlying interfaces.  The tool intercepts incoming
   and outgoing packets, and may choose to drop it, corrupt it, or delay
   it.  Dropping an incoming or outgoing packet is easy, simply don't
   forward it along.  Similarly, we can corrupt a packet by flipping
   some bits in the packet before forwarding it.

   Correctly delaying a packet is slightly more complicated.  We model
   the delay a packet experiences as the time it takes the sender to put
   the packet onto the network interface plus the time it takes for the
   last byte to propagate to the receiver.  The former, the transmission
   time, is the size of the packet divided by the available bandwidth;
   the latter is latency.

   Our approach at delay modulation is simple -- we assume that the
   actual network over which packets travel is much faster and of better
   quality than the one we are trying to emulate, and can thus ignore
   it.  We delay the packet according to our latency and bandwidth


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 23]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   targets, and then decide whether to drop or corrupt it.  We take
   care to ensure that packet modulation does not unduly penalize
   other system activity, using the internal system clock to schedule
   packets.  Since this clock is at a large granularity compared to
   delay resolution, we try to keep the average error in scheduling to
   a minimum, rather than scheduling each packet at exactly the right
   time.


5.2.2. Modulation Traces

   To tell the PaM kernel how the modulation parameters change over
   time, we provide it with a series of modulation-trace entries.  Each
   of these entries sets loss and corruption percentages, as well as
   network latency and inter-byte time, which is 1/bandwidth.  These
   entries are stored in a trace file, the format of which is much
   simpler than record-format traces, and is designed for efficiency in
   playback.  The format of modulation traces is shown in Figure 17.


      struct tr_rep_hdr_t {
          u_int32_t        rh_magic;
          u_int32_t        rh_size;
          u_int32_t        rh_time_fmt;         /* nsec or used */
          struct tr_time_t rh_ts;
          char             rh_date[TR_DATESZ];
          char             rh_agent[TR_NAMESZ];
          u_int32_t        rh_ip;
          u_int32_t        rh_ibt_ticks;        /* units/sec, ibt */
          u_int32_t        rh_lat_ticks;        /* units/sec, lat */
          u_int32_t        rh_loss_max;         /* max loss rate */
          u_int32_t        rh_crpt_max;         /* max corrupt rate */
          char             rh_desc[0];          /* variable size */
      };

      struct tr_rep_ent_t {
          u_int32_t         re_magic;
          struct tr_time_t  re_dur;          /* duration of entry */
          u_int32_t         re_lat;          /* latency */
          u_int32_t         re_ibt;          /* inter-byte time */
          u_int32_t         re_loss;         /* loss rate */
          u_int32_t         re_crpt;         /* corrupt rate */
      };


                   Figure 17: Modulation trace format


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 24]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   Modulation traces begin with a header that is much like that found in
   record-format trace headers.  Modulation headers additionally carry
   the units in which latency and inter-byte time are expressed, and the
   maximum values for loss and corruption rates.  Individual entries
   contain the length of time for which the entry applies as well as the
   latency, inter-byte time, loss rate, and corruption rate.


5.2.3. Trace Transformation

   How can we generate these descriptive modulation traces from the
   recorded observational traces described in Section 4?  To ensure a
   high-quality modulation trace, we limit ourselves to a very narrow
   set of source traces.  As our experience with modulation traces is
   limited, we use a simple but tunable algorithm to generate them.

   Our basic strategy for determining latency and bandwidth is
   tied closely to our model of packet delays:  delay is equal to
   transmission time plus latency.  We further assume that packets which
   traversed the network near one another in time experienced the same
   latency and bandwidth during transit.  Given this, we look for two
   packets of different size that were sent close to one another along
   the same path; from the transit times and sizes of these packets, we
   can determine the near-instantaneous bandwidth and latency of the
   end-to-end path covered by those packets.  If traced packet traffic
   contains sequence numbers, loss rates are fairly easy to calculate.
   Likewise, if the protocol is capable of marking corrupt packets,
   corruption information can be stored and then extracted from recorded
   traces.

   Using timestamped packet observations to derive network latency
   and bandwidth requires very accurate timing.  Unfortunately, the
   laptops we have on hand have clocks that drift non-negligibly.  We
   have chosen not to use protocols such as NTP [9] for two reasons.
   First, they produce network traffic above and beyond that in the
   known traced workload.  Second, and perhaps more importantly, they
   can cause the clock to speed up or slow down during adjustment.  Such
   clock movements can play havoc with careful measurement.

   As a result, we can only depend on the timestamps of a single
   machine to determine packet transit times.  So, we use the ICMP ECHO
   service to provide workloads on traced machines; the ECHO request
   is timestamped on it's way out, and the corresponding ECHOREPLY is
   traced.  We have modified the ping program to alternate between small
   and large packets.  Traces that capture such altered ping traffic can
   then be subject to our transformation tool.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 25]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   The tool itself uses a simple sliding window scheme to generate
   modulation entries.  For each window position in the recorded trace,
   we determine the loss rate, and the average latency and bandwidth
   experienced by pairs of ICMP ECHO packets.  The size and granularity
   of the sliding window are parameters of the transformation; as we
   gain experience both in analysis and modulation of wireless traces,
   we expect to be able to recommend good window sizes.

   Unfortunately, our wireless devices do not report corrupt
   packets; they are dropped by the hardware without operating system
   notification.  However, our modulation system will also coerce any
   such corruptions to an increased loss rate, duplicating the behavior
   in the original network.


5.3. Trace Analysis Tools

   A trace is only as useful as its processing tools.  The requirements
   for such tools tools include robustness, flexibility, and
   portability.  Having an extensible trace format places additional
   emphasis on the ability to work with future versions.  To this end,
   we provide a general processing library as a framework for users to
   easily develop customized processing tools; this library is designed
   to provide both high portability and good performance.

   In this section, we first present the trace library.  We then
   describe a set of tools for simple post-processing and preparing the
   trace for further analyses.  We conclude with a brief description of
   our analysis tools that are applied to this minimally processed data.


5.3.1. Trace Library

   The trace library provides an interface that applications can use
   to simplify interaction with network traces, including functions
   to read, write, and print trace records.  The trace reading and
   writing functions manage byte swapping as well as optional integrity
   checking of the trace as it is read or written.  The library employs
   a buffering strategy that is optimized to trace I/O. Trace printing
   facilities are provided for both debugging and parsing purposes.


5.3.2. Processing Tools

   The processing tools are generally the simplest set of tools we have
   built around the trace format.  By far the most complicated one is
   the modulation-trace transformation tool described in Section 5.2.3;
   the remainder are quite simple in comparison.  The first such tool is


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 26]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   a parser that prints the content of an entire trace.  With the trace
   library, it is less than a single page of C code.  For each record,
   it prints the known data fields along with their textual names,
   followed by all the optional properties and values.

   Since many analysis tasks tend to work with records of the same type,
   an enhanced version of the parser can split the trace data by tracks
   into many files, one per track.  Each line of the output text files
   contains a time stamp followed by the integer values of all the
   optional data in a track entry; in this form traces are amenable to
   further analysis be scripts written in an interpreted language such
   as perl.

   We have developed a small suite of tools providing simple functions
   such as listing all the track headers and changing the trace
   description as they have been needed.  With the trace library, each
   such tool is trivial to construct.


5.3.3. Analysis Tools

   Analysis tools depend greatly on the kind of information an
   experimenter wants to extract from the trace; our tools show our own
   biases in experimentation.  Most analyses derive common statistical
   descriptions of traces, or establish some correlation between the
   trace data sets.

   As early users of the trace format and collection tools, we have
   developed a few analysis tools to study the behavior of the wireless
   networks at our disposal.  We have been particularly interested
   in loss characteristics of wireless channels and their relation
   to signal quality and the position of the mobile host.  In this
   section, we briefly present some of these tools to hint at the kind
   of experimentation possible with our trace format.

   Loss characteristics are among the most interesting aspects of
   wireless networks, and certainly among the least well understood.
   To shed light on this area, we have created tools to extract the
   loss information from collected traces; in addition to calculating
   the standard parameters such as the packet loss rate, the tool also
   derives transitional probabilities for a two-state error model.

   This has proven to be a simple yet powerful model for capturing the
   burstiness observed in wireless loss rates due to fading signals.
   To help visualize the channel behavior in the presence of mobility,
   our tool can replay the movement of the mobile host while plotting
   the loss rate as it changes with time.  It also allows us to zoom


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 27]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   in the locations along the path and obtain detailed statistics over
   arbitrary time intervals.

   Our traces can be further analyzed to understand the relationship
   between channel behavior and the signal quality.  For wireless
   devices like the NCR WaveLAN, we can easily obtain measurements of
   signal quality, signal strength, and noise level.  We have developed
   a simple statistical tool to test the correlation between measured
   signal and the loss characteristics.  Variations of this test are
   also possible using different combinations of the three signal
   measurements and the movement of the host.

   The question of just how mobile such mobile hosts are can also be
   investigated through our traces.  Position data are provided by
   traces that either involved GPS or user-supplied positions with our
   trace collection tools.  This data is valuable for comparing and
   validating various mobility prediction algorithms.  Given adequate
   network infrastructure and good signal measurements, we can determine
   the mobile location within a region that is significantly smaller
   than the cell size.  We are developing a tool to combine position
   information and signal measurement from many traces to identify the
   "signal quality" signature for different regions inside a building.
   Once this signature database is completed and validated, it can be
   used to generate position information for other traces that contain
   only the signal quality information.


6. Related Work

   The previous work most relevant to mobile network tracing falls into
   two camps.  The first, chiefly exemplified by tcpdump [7] and the
   BSD Packet Filter, or BPF [8], collect network traffic data.  The
   second, notably Delayline [6], and the later Probe/Fault Injection
   Tool [4], and the University of Lancaster's network emulator [3],
   provide network modulation similar to PaM.

   There are many systems that record network packet traffic; the de
   facto standard is tcpdump, which works in concert with a packet
   filter such as BPF. The packet filter is given a small piece of code
   that describes packets of interest, and the first several bytes
   of each packet found to be interesting is copied to a buffer for
   tcpdump to consume.  This architecture is efficient, flexible, and
   has rightly found great favor with the networking community.

   However, tcpdump captures only traffic data.  It records neither
   information concerning mobile networking devices nor mobile host
   location.  Rather than adding separate software components to a host
   running tcpdump to capture this additional data, we have chosen to


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 28]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   follow an integrative approach to ease trace file administration.
   We have kept the lessons of tcpdump and BPF to heart; namely
   copying only the information necessary, and transferring data up
   to user level in batches.  It may well pay to investigate either
   incorporating device and location information directly into BPF,
   or taking the flexible filtering mechanism of BPF and including it
   in our trace collection software.  For the moment, we do not know
   exactly what data we will need to explore the properties of mobile
   networks, and therefore do not exclude any data.

   There are three notable systems that provide packet modulation
   similar to PaM. The earliest such work is Delayline, a system
   designed to emulate wide-area networks atop local-area ones; a goal
   similar to PaM's.  The most striking difference between Delayline
   and PaM is that Delayline's emulation takes place entirely at the
   user-level, and requires applications to be recompiled against a
   library emulating the BSD socket system and library calls.  While
   this is a portable approach that works well in the absence of
   kernel-level source access, it has the disadvantage that not all
   network traffic passes through the emulation layer; such traffic
   may have a profound impact on the performance of the final system.
   Delayline also differs from PaM in that the emulated network uses a
   single set of parameters for each emulated connection; performance
   remains fairly constant, and cannot change much over time.

   The Lancaster network emulator was designed explicitly to model
   mobile networks.  Rather than providing per-host modulation, it
   uses a single, central server through which all network traffic
   from instrumented applications passes.  While this system also does
   not capture all traffic into and out of a particular host, it does
   allow modulation based on multiple hosts sharing a single emulated
   medium.  There is a mechanism to change the parameters of emulation
   between hosts, though it is fairly cumbersome.  The system uses a
   configuration file that can be changed and re-read while the system
   is running.

   The system closest in spirit to PaM is the Probe/Fault Injection
   Tool.  This system's design philosophy allows an arbitrary protocol
   layer -- including device drivers -- to be encapsulated by a layer
   below to modulate existing traffic, and a layer above to generate
   test traffic.  The parameters of modulation are provided by a script
   in an interpreted language, presently Tcl, providing considerable
   flexibility.  However, there is no mechanism to synthesize such
   scripts -- they must be explicitly designed.  Furthermore, the
   use of an interpreted language such as Tcl limits the use of PFI
   to user-level implementations of network drivers, and may have
   performance implications.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 29]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


7. Future Work

   This work is very much in its infancy; we have only begun to explore
   the possible uses for mobile network traces.  We have uncovered
   several areas of further work.

   The trace format as it stands is very IP-centric.  While one could
   imagine using unknown IP addresses for non-IP hosts, while using
   header-only properties to encode other addressing schemes, this is
   cumbersome at best.  We are looking into ways to more conveniently
   encode other addressing schemes, but are content to focus on IP
   networks for the moment.

   Two obvious questions concerning wireless media are the following.
   How does a group of machines perform when sharing the same bandwidth?
   How asymmetric is the performance of real-world wireless channels?
   While we do have tools for merging traces taken from multiple
   hosts into a single trace file, we've not yet begun to examine
   such multiple-host scenarios in depth.  We are also looking into
   instrumenting wireless base stations as well as end-point hosts.

   Much of our planned work involves the PaM testbed.  First and
   foremost, many wireless channels are known to be asymmetric;
   splitting the replay trace into incoming and outgoing modulation
   entries is of paramount importance.  We would like to extend PaM to
   handle multiple emulated interfaces as well as applying different
   modulation parameters to packets from or to different destinations.
   One could also imagine tracing performance from several different
   networking environments, and switching between such environments
   under application control.  For example, consider a set of traces
   showing radio performance at various altitudes; an airplane simulator
   in a dive would switch from high-altitude modulation traces to
   low-altitude ones.

   Finally, we are anxious to begin exploring the properties of
   real-world mobile networks, and subjecting our own mobile system
   designs to PaM to see how they perform.  We hope others can make use
   of our tools to do the same.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 30]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


A. Magic Numbers

   The magic numbers that all record types other than those for
   tr_pkt_ent_t, tr_dev_ent_t, and tr_gen_ent_t are known and fixed
   for all traces.  All of these magic numbers begin with the prefix
   0xC2B00000, and they appear in Table 5.  For the three entry property
   types, the numbers are selected at trace creation time; they must
   be selected from a range of numbers that are prefixed by the first
   two bytes of the corresponding header magic number.  They cannot end
   in either 0x0000 or 0xFFFF, nor can they have the same magic number
   as the corresponding header record.  These rules are summarized in
   Table 8.

   Devices are identified in device header records, and protocols in
   packet header records.  The magic numbers for these identifications
   are shown in Figures 6 and 7, respectively.  The former are all
   prefaced with 0xC2D5, the latter with 0xC2D0.


B. Optional Fields

   A number of optional fields have been defined to date in the network
   tracing format, and surely more will be added as the format seems
   more diverse uses.  This appendix lists the identifying names of the
   options defined so far, as well as the methodology for assigning new
   names.

   The options are partitioned into several classifications:  general
   packet attributes, socket attributes, protocol attributes, general
   device attributes, WaveLan attributes, Metricom attributes, and
   generic attributes.  Adding a new classification, adding options
   within an existing classification, is currently the sole domain
   of the authors.  However, there is a classification to allow
   extension by users of the trace package to add their own options


                     +---------------+------------+
                     | HEADER_MAGIC  | 0xC2B1382B |
                     | TRAILER_MAGIC | 0xC2B8B772 |
                     | PKTHDR_MAGIC  | 0xC2B7A5D5 |
                     | DEVHDR_MAGIC  | 0xC2B6E8A2 |
                     | GENHDR_MAGIC  | 0xC2BC19FE |
                     | ANNOTE_MAGIC  | 0xC2BFF7A0 |
                     | LOSS_MAGIC    | 0xC2BA4A24 |
                     +---------------+------------+
             Table 5: Magic Numbers for Known Record Types


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 31]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


        +---------------+------------+---------------------+
        | DEV_EE_MAGIC  | 0xC2D5A3D2 | Intel EtherExpress  |
        | DEV_EP_MAGIC  | 0xC2D514F9 | 3Com EtherLink III  |
        | DEV_WL_MAGIC  | 0xC2D52C3C | ATT WaveLAN 900 MHz |
        | DEV_WG_MAGIC  | 0xC2D56315 | ATT WaveLAN 2.5 GHz |
        | DEV_MC_MAGIC  | 0xC2D55B19 | MetriCom radio      |
        | DEV_CD_MAGIC  | 0xC2D5B1FB | CDPD Cellular modem |
        | DEV_SL_MAGIC  | 0xC2D5FBFA | SLIP line           |
        +---------------+------------+---------------------+
                Table 6: Magic Numbers for Device Types


                 +-------------------+------------+
                 | PROTO_IP_MAGIC    | 0xC2D003BF |
                 | PROTO_TCP_MAGIC   | 0xC2D0B19C |
                 | PROTO_UDP_MAGIC   | 0xC2D0413D |
                 | PROTO_ICMP_MAGIC  | 0xC2D004E1 |
                 +-------------------+------------+
                  Table 7: Magic Numbers for Protocols


    +--------------+----------------------------------------------+
    | tr_pkt_ent_t | 0xC2B70001-0xC2B7A5D4, 0xC2B7A5D6-0xC2B7FFFE |
    | tr_dev_ent_t | 0xC2B60001-0xC2B6E8A1, 0xC2B6E8A3-0xC2B6FFFE |
    | tr_gen_ent_t | 0xC2BC0001-0xC2BC19FD, 0xC2BC19FF-0xC2BCFFFE |
    +--------------+----------------------------------------------+
            Table 8: Magic Number Ranges for Packet, Device,
                       and General Entry Records

             +----------------+-----------------------+
             | General packet | 0xA0540001-0xA054FFFE |
             | Socket         | 0xA0551101-0xA05511FE |
             | Protocol/gen   | 0xA0300001-0xA030FFFE |
             | Protocol/IP    | 0xA0310001-0xA031FFFE |
             | Protocol/TCP   | 0xA0320001-0xA032FFFE |
             | Protocol/UDP   | 0xA0330001-0xA033FFFE |
             | Protocol/ICMP  | 0xA0340001-0xA034FFFE |
             | Other protocol | 0xA0350001-0xA04FFFFE |
             | General device | 0xA0600001-0xA060FFFE |
             | WaveLan        | 0xA0610001-0xA061FFFE |
             | Metricom       | 0xA0620001-0xA062FFFE |
             | Generic        | 0xA0EF0001-0xA0EFFFFE |
             | Extension      | 0xA0F60001-0xA0F6FFFE |
             +----------------+-----------------------+
               Table 9: Ranges for Option Classifications

.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 32]
RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   without contacting the authors.  The ranges allocated to individual
   classifications are shown in Table 9.  The assigned values for
   options defined to date appear in Tables 10, 11, and 12.  Note that
   the options int the latter have been assigned numbers from the
   extension range, since we have not yet convinced ourselves of their
   general utility, nor have we settled on a format for them.


                  +---------------+------------+
                  | ADDR_SEND     | 0xA054B078 |
                  | ADDR_RECV     | 0xA054976E |
                  | ADDR_LINK     | 0xA054E821 |
                  | PKT_SEQUENCE  | 0xA0549C30 |
                  | PKT_SENTTIME  | 0xA0540267 |
                  | PKT_HOPS      | 0xA0542F7E |
                  | SOCK_SENDPORT | 0xA05511D9 |
                  | SOCK_RECVPORT | 0xA0551125 |
                  | PROTO_FLAGS   | 0xA03055E0 |
                  | PROTO_ERRLIST | 0xA03089C1 |
                  | IP_PROTO      | 0xA0315CA6 |
                  | ICMP_PINGTIME | 0xA034F638 |
                  | ICMP_KIND     | 0xA034E9D3 |
                  | ICMP_ID       | 0xA0347F10 |
                  | TCP_MSS       | 0xA032CFED |
                  | TCP_DPORT     | 0xA032651E |
                  | TCP_SPORT     | 0xA0328F8A |
                  | TCP_ACK       | 0xA032B064 |
                  | TCP_WINDOW    | 0xA032A7CE |
                  | TCP_TIMESTAMP | 0xA032371C |
                  +---------------+------------+
                     Table 10: Packet Option Names

.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 33]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


                 +-----------------+------------+
                 | DEV_MAJID       | 0xA0609E91 |
                 | DEV_MINID       | 0xA0601B32 |
                 | DEV_STATUS      | 0xA0600A79 |
                 | WVLN_SIGTONOISE | 0xA0612D1E |
                 | WVLN_SIGQUALITY | 0xA06108CD |
                 | WVLN_SILENCELVL | 0xA0610EE3 |
                 | MCOM_SIGTONOISE | 0xA0623364 |
                 | MCOM_SIGQUALITY | 0xA062FE7C |
                 | MCOM_SILENCELVL | 0xA062B149 |
                 +-----------------+------------+
                     Table 11: Device Option Names

.

                  +---------------+------------+
                  | MH_LOC_X      | 0xA0F6B6F9 |
                  | MH_LOC_Y      | 0xA0F624A4 |
                  | MH_LOC_LAT    | 0xA0F669B7 |
                  | MH_LOC_LON    | 0xA0F60851 |
                  +---------------+------------+
                     Table 12: Generic Option Names

.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 34]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


Acknowledgements

   The authors wish to thank Dave Johnson, who provided early pointers
   to related work and helped us immeasurably in RFC formatting.  We
   also wish to thank those who offered comments on early drafts of the
   document:  Mary Baker, Mike Davis, Barbara Denny, Chane Fullmer,
   Geoff Kuenning, Mark Lewis, and Hui Zhang.  Finally, we would like to
   thank Bruce Maggs and Chris Hobbs, our first customers!

   This research was supported by the Air Force Materiel Command
   (AFMC) and ARPA under contract numbers F196828-93-C-0193 and
   DAAB07-95-C-D154, and the State of California MICRO Program.
   Additional support was provided by AT&T, Hughes Aircraft, IBM Corp.,
   Intel Corp., and Metricom.  The views and conclusions contained here
   are those of the authors and should not be interpreted as necessarily
   representing the official policies or endorsements, either express
   or implied, of AFMC, ARPA, AT&T, Hughes, IBM, Intel, Metricom,
   Carnegie Mellon University, the University of California, the State
   of California, or the U.S. Government.


Security Considerations

   This RFC raises no security considerations.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 35]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


Authors' Addresses

   Questions about this document can be directed to the authors:

    Brian D. Noble
    Computer Science Department
    Carnegie Mellon University
    5000 Forbes Avenue
    Pittsburgh, PA  15213-3891

    Phone:  +1-412-268-7399
    Fax:    +1-412-268-5576
    E-mail: bnoble@cs.cmu.edu


    Giao T. Nguyen
    Room 473 Soda Hall #1776 (Research Office)
    University of California, Berkeley
    Berkeley, CA  94720-1776

    Phone:  +1-510-642-8919
    Fax:    +1-510-642-5775
    E-mail: gnguyen@cs.berkeley.edu


    Mahadev Satyanarayanan
    Computer Science Department
    Carnegie Mellon University
    5000 Forbes Avenue
    Pittsburgh, PA  15213-3891

    Phone:  +1-412-268-3743
    Fax:    +1-412-268-5576
    E-mail: satya@cs.cmu.edu


    Randy H. Katz
    Room 231 Soda Hall #1770 (Administrative Office)
    University of California, Berkeley
    Berkeley, CA  94720-1770

    Phone:  +1-510-642-0253
    Fax:    +1-510-642-2845
    E-mail: randy@cs.berkeley.edu


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 36]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


References

    [1] Chen, J. B., and Bershad, B. N.  The Impact of Operating System
        Structure on Memory System Performance.  In Proceedings of the
        14th ACM Symposium on Operating System Principles (Asheville,
        NC, December 1993).

    [2] Dahlin, M., Mather, C., Wang, R., Anderson, T., and Patterson,
        D.  A Quantitative Analysis of Cache Policies for Scalable
        Network File Systems.  In Proceedings of the 1994 ACM SIGMETRICS
        Conference on Measurement and Modeling of Computer Systems
        (Nashville, TN, May 1994).

    [3] Davies, N., Blair, G. S., Cheverst, K., and Friday, A.  A
        Network Emulator to Support the Development of Adaptive
        Applications.  In Proceedings of the 2nd USENIX Symposium on
        Mobile and Location Independent Computing (April 10-11 1995).

    [4] Dawson, S., and Jahanian, F.  Probing and Fault Injection of
        Dependable Distributed Protocols.  The Computer Jouranl 38, 4
        (1995).

    [5] Gloy, N., Young, C., Chen, J. B., and Smith, M. D.  An Analysis
        of Dynamic Branch Prediction Schemes on System Workloads.  In
        The Proceedings of the 23rd Annual International Symposium on
        Computer Architecture (May 1996).

    [6] Ingham, D. B., and Parrington, G. D.  Delayline:  A Wide-Area
        Network Emulation Tool.  Computing Systems 7, 3 (1994).

    [7] Jacobson, V., Leres, C., and McCanne, S.  The Tcpdump Manual
        Page.  Lawrence Berkeley Laboratory, Berkeley, CA.

    [8] McCanne, S., and Jacobson, V.  The BSD Packet Filter:  A New
        Architecture for User-level Packet Capture.  In Proceedings of
        the 1993 Winter USENIX Technical Conference (San Deigo, CA,
        January 1993).

    [9] Mills, D. L.  Improved Algorithms for Synchronizing Computer
        Network Clocks.  IEEE/ACM Transactions on Networking 3, 3 (June
        1995).

   [10] Mummert, L. B., Ebling, M. R., and Satyanarayanan, M.
        Exploiting Weak Connectivity for Mobile File Access.  In
        Proceedings of the 15th Symposium on Operating System Prinicples
        (Copper Mountain, CO, December 1995).


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 37]

RFC DRAFT               Mobile Network Tracing                1 Sep 1996


   [11] Nelson, M. N., Welch, B. B., and Ousterhout, J. K.  Caching in
        the Sprite Network File System.  ACM Transactions on Computer
        Systems 6, 1 (February 1988).

   [12] Schilit, B., Adams, N., Gold, R., Tso, M., and Want, R.  The
        PARCTAB Mobile Computing System.  In Proceedings of the 4th IEEE
        Workshop on Workstation Operating Systems (Napa, CA, October
        1993), pp. 34--39.

   [13] Uhlig, R., Nagle, D., Stanley, T., Mudge, T., Sechrest, S., and
        Brown, R.  Design Tradeoffs for Software-Managed TLBs.  ACM
        Transactions on Computer Systems 12, 3 (August 1994).

   [14] Want, R., Hopper, A., Falcao, V., and Gibbons, J.  The Active
        Badge Location System.  ACM Transactions on Information Systems
        10, 1 (January 1992), 91--102.


Noble, Nguyen, Satyanarayanan, and Katz   Expires 1 Nov 1996   [Page 38]