|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface IHttpHeadersFilter
Filter a document based on their HTTP headers, before the document content is downloaded.
It is highly recommended to overwrite the toString()
method
to representing this filter properly in human-readable form (e.g. logging).
It is a good idea to include specifics of this filter so crawler users
can know exactly why documents got accepted/rejected rejected if need be.
Method Summary | |
---|---|
boolean |
acceptDocument(String url,
HttpMetadata headers)
Whether to accept a URL HTTP headers. |
Method Detail |
---|
boolean acceptDocument(String url, HttpMetadata headers)
url
- the URL to accept/reject its headersheaders
- HTTP headers associated with the URL
true
if accepted, false
otherwise
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |