|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface IHttpDocumentFilter
Filter a document after the document content is downloaded.
It is highly recommended to overwrite the toString()
method
to representing this filter properly in human-readable form (e.g. logging).
It is a good idea to include specifics of this filter so crawler users
can know exactly why documents got accepted/rejected rejected if need be.
Method Summary | |
---|---|
boolean |
acceptDocument(HttpDocument document)
Whether to accept a HTTP document. |
Method Detail |
---|
boolean acceptDocument(HttpDocument document)
document
- the document to validate
true
if accepted, false
otherwise
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |