|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
public interface ISDBCrawlerInterface
Interface of the main Crawler class of the Web search engine. This class is used to start and stop the Crawler, to reset the engine and to control crawling parameters. Additionally, it provides the functionality to store crawled documents into the database using the built-in database interface block.
| Field Summary |
|---|
| Fields inherited from interface ISSearch.ISCrawlerInterface |
|---|
RUNNING, STOPPED |
| Method Summary | |
|---|---|
void |
closeDB()
Closes the database connection of the built-in database interface. |
ISDBinterface |
getDBInterface()
Returns the built-in database interface of the crawler |
boolean |
openDB()
Initializes the internal database interface and opens its database connection |
boolean |
store(java.net.URL link,
ISDocumentInterface doc)
Stores the crawled document and its URL into the database |
| Methods inherited from interface ISSearch.ISCrawlerInterface |
|---|
addLink, getBest, getContentType, getCrawlingDepth, getCurrentDocument, getCurrentURL, getMaxQueueSize, getNextURL, getQueueSize, getState, getTimeout, isDataStructureEmpty, isVisited, reset, robotSafe, runParser, setCrawlingDepth, setCurrentDocument, setQueueMaxSize, setState, setTimeout, start, stop |
| Methods inherited from interface java.lang.Runnable |
|---|
run |
| Method Detail |
|---|
boolean store(java.net.URL link,
ISDocumentInterface doc)
link - the URL of the crawled documentdoc - extracted terms and links from the document
boolean openDB()
void closeDB()
ISDBinterface getDBInterface()
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||