Newsgroups: comp.ai.jair.announce
Path: cantaloupe.srv.cs.cmu.edu!das-news.harvard.edu!news2.near.net!MathWorks.Com!europa.eng.gtefsd.com!news.umbc.edu!haven.umd.edu!ames!kronos.arc.nasa.gov!jair-ed
From: minton@ptolemy.arc.nasa.gov
Subject: New Article, Pattern Matching and Discourse Processing...
Message-ID: <1994Aug29.231619.12113@ptolemy-ethernet.arc.nasa.gov>
Originator: jair-ed@polya.arc.nasa.gov
Lines: 56
Sender: usenet@ptolemy-ethernet.arc.nasa.gov (usenet@ptolemy.arc.nasa.gov)
Nntp-Posting-Host: polya.arc.nasa.gov
Organization: NASA/ARC Information Sciences Division
Date: Mon, 29 Aug 1994 23:16:19 GMT
Approved: jair-ed@ptolemy.arc.nasa.gov

JAIR is pleased to announce publication of the following article:

Kitani, T., Eriguchi, Y. and Hara, M. (1994)
  "Pattern Matching and Discourse Processing in Information Extraction
   from Japanese Text", Volume 2, pages 89-110.
   Postscript: volume2/kitani94a.ps (465K)

   Abstract: Information extraction is the task of automatically picking
   up information of interest from an unconstrained text.  Information of
   interest is usually extracted in two steps.  First, sentence level
   processing locates relevant pieces of information scattered throughout
   the text; second, discourse processing merges coreferential
   information to generate the output.  In the first step, pieces of
   information are locally identified without recognizing any
   relationships among them.  A key word search or simple pattern search
   can achieve this purpose.  The second step requires deeper knowledge
   in order to understand relationships among separately identified
   pieces of information.  Previous information extraction systems
   focused on the first step, partly because they were not required to
   link up each piece of information with other pieces.  To link the
   extracted pieces of information and map them onto a structured output
   format, complex discourse processing is essential.  This paper reports
   on a Japanese information extraction system that merges information
   using a pattern matcher and discourse processor.  Evaluation results
   show a high level of system performance which approaches human
   performance.
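
The two steps named in the abstract can be illustrated with a minimal
sketch. This is not the authors' system: the paper works on Japanese text
with its own pattern matcher and discourse processor, whereas the patterns,
function names, and the "attach to the most recent company" merging rule
below are all hypothetical stand-ins chosen for an English toy example.

```python
import re

# Hypothetical toy patterns (assumption: not taken from the paper).
PATTERNS = {
    "company": re.compile(r"\b([A-Z][a-z]+ (?:Corp|Inc))\b"),
    "product": re.compile(r"\bnew (\w+)"),
}


def extract_local(sentences):
    """Step 1: sentence-level processing.

    Finds pieces of information one sentence at a time, without
    recognizing any relationship among them.
    """
    pieces = []
    for index, sentence in enumerate(sentences):
        # Within a sentence, company matches are listed before products.
        for slot, pattern in PATTERNS.items():
            for match in pattern.finditer(sentence):
                pieces.append(
                    {"sentence": index, "slot": slot, "value": match.group(1)}
                )
    return pieces


def merge_discourse(pieces):
    """Step 2: a naive stand-in for discourse processing.

    Assumption: a product with no company of its own refers to the most
    recently mentioned company, so the two are merged into one record.
    """
    records = {}
    current = None
    for piece in pieces:  # pieces arrive in sentence order
        if piece["slot"] == "company":
            current = piece["value"]
            records.setdefault(current, {"company": current, "products": []})
        elif current is not None:
            records[current]["products"].append(piece["value"])
    return list(records.values())


sentences = [
    "Acme Corp announced a new printer.",
    "It also plans a new scanner.",
    "Globex Inc showed a new modem.",
]
merged = merge_discourse(extract_local(sentences))
```

In this toy run the second sentence names no company, so step 1 alone
leaves "scanner" unattached; only the merging step links it to Acme Corp,
which is the kind of linking the abstract attributes to discourse
processing.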

The PostScript file is available via:
   
 -- comp.ai.jair.papers

 -- World Wide Web: The URL for our World Wide Web server is
       http://www.cs.washington.edu/research/jair/home.html

 -- Anonymous FTP from either of the two sites below:
      CMU:   p.gp.cs.cmu.edu        directory: /usr/jair/pub/volume2
      Genoa: ftp.mrg.dist.unige.it  directory:  pub/jair/pub/volume2

 -- Automated email: Send mail to jair@cs.cmu.edu or jair@ftp.mrg.dist.unige.it
    with the subject AUTORESPOND and the body GET VOLUME2/KITANI94A.PS
    (either upper or lowercase is fine).
    Note: Your mailer might find this file too large to handle.

 -- JAIR Gopher server: At p.gp.cs.cmu.edu, port 70. 

For more information about JAIR, check out our WWW or FTP sites, or
send electronic mail to jair@cs.cmu.edu with the subject AUTORESPOND
and the message body HELP, or contact jair-ed@ptolemy.arc.nasa.gov.
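
For readers who script their mail, an AUTORESPOND request of the kind
described above could be composed roughly as follows. This is only a
sketch: the function name build_autorespond is hypothetical, the message
is built but not sent, and a From address would have to be supplied by
whatever mail setup is in use.

```python
from email.message import EmailMessage


def build_autorespond(body, to="jair@cs.cmu.edu"):
    """Build (but do not send) a request for the JAIR mail server.

    The subject is always AUTORESPOND; the body carries the command,
    for example HELP or a GET line as given in the announcement.
    """
    message = EmailMessage()
    message["To"] = to
    message["Subject"] = "AUTORESPOND"
    message.set_content(body)
    return message


help_request = build_autorespond("HELP")
print(help_request)
```

A GET request is built the same way by passing the GET line as the body.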


