Package jangada

Jangada is an API for signature block extraction and reply-to extraction from email messages.

See:
          Description

Class Summary
Demo2 How to use the Signature file detector.
Demo3 An example of how to extract the reply-lines(quotes) of a message or, a message with the reply-lines removed.
Demo4 This class is another example of how to extract the reply-lines(quotes) of a message ; Or a message with the reply-lines removed.
Demo5 An example of how to extract the signature blocks of an email message.
Demo6 Example of the usage of ReplyTo annotator and SigFileAnnotator.
LocalStringAnnotator An abstract annotator that is based on marking up substrings within a string, using the CharAnnotation class from Minorthird.
ReplyToAnnotator Annotator for Extracting the reply lines from email messages.
SigFileAnnotator  
SigFileDetector Classify an incoming email message as having or not a Signature File.
SigFilePredictor Signature File extraction Algorithm It follows the description in "Learning to Extract Signature and Reply Lines from Email", V.R.Carvalho and W.W.Cohen, CEAS (Conference of Email and Anti-Span), 2004 *
SigFilePredictor.WindowRepresentation Inner class to represent the message as a sequence of features - using window features (neighbor lines)
 

Package jangada Description

Jangada is an API for signature block extraction and reply-to extraction from email messages. Written by Vitor R. Carvalho, jangada follows the ideas from the following paper: "Learning to Extract Signature and Reply Lines from Email", Vitor R. Carvalho and William W. Cohen, CEAS (Conference of Email and Anti-Span), Mountain View, CA, July, 2004

Jangada was built using the machinery(learning algorithms, text annotation classes, etc) provided by Minorthird. Please see http://minorthird.sourceforge.net/ for more details.