|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
Parser-Interface of the Web search engine.
The parser reads any given text or HTML input (represented by the Reader) and extracts
all words and links from that source.
The result must be returned as Object that implements the ISDocumentInterface
String,
HTMLEditorKit,
StringTokenizer| Method Summary | |
boolean |
isStopword(java.lang.String who)
Decides whether the given token is claimed as stopword or not. |
ISDocumentInterface |
parse(java.io.Reader input)
Performs the input analysis. |
java.lang.String |
stem(java.lang.String who)
Applies the Porter stemming algorithm and returns the resulting word stem. |
| Method Detail |
public ISDocumentInterface parse(java.io.Reader input)
input - the input of the parser (e.g., text file or HTTP connection), represented by the Reader
public boolean isStopword(java.lang.String who)
who - The String to be checked.
public java.lang.String stem(java.lang.String who)
who - The word to be stemmed.
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||