|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectinfo.ephyra.nlp.SentenceExtractor
public class SentenceExtractor
Extracts sentences and text fragments from an HTML document.
| Field Summary | |
|---|---|
private static java.lang.String |
NON_STRUC_TAGS
Regular expression that describes non-structuring tags, i.e. |
| Constructor Summary | |
|---|---|
SentenceExtractor()
|
|
| Method Summary | |
|---|---|
static java.lang.String[] |
getSentencesFromHtml(java.lang.String html)
Extracts sentences from an HTML document |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
private static final java.lang.String NON_STRUC_TAGS
| Constructor Detail |
|---|
public SentenceExtractor()
| Method Detail |
|---|
public static java.lang.String[] getSentencesFromHtml(java.lang.String html)
html - the HTML document
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||