AbortChecker |
This class furnishes an abort signal whenever the job activity says it should.
|
AuthenticationCredentials |
This interface describes immutable classes which represents authentication information for all kinds of authentication.
|
CookieManager |
This class manages the database table into which we write cookies.
|
CookieManager.CookiesCacheClass |
Cache class for robots.
|
CookieManager.CookiesDescription |
This is the object description for a session key object.
|
CredentialsDescription |
This class describes credential information pulled from a configuration.
|
CredentialsDescription.SessionCredentialItem |
Session credential helper class
|
CredentialsDescription.SessionCredentialParameter |
Session credential parameter class
|
DataCache |
This class is a cache of a specific URL's data.
|
DataCache.DocumentData |
This class represents everything we need to know about a document that's getting passed from the
getDocumentVersions() phase to the processDocuments() phase.
|
DNSManager |
This class manages the database table into which we DNS entries for hosts.
|
DNSManager.DNSCacheClass |
Cache class for robots.
|
DNSManager.DNSInfo |
This is a cached data item.
|
DNSManager.HostDescription |
This is the object description for a robots host object.
|
FindHandler |
This class is used to discover links in a session login context
|
FormData |
This interface describes the form data gleaned from an HTML page.
|
FormDataAccumulator |
This class accumulates form data and allows overrides
|
FormDataElement |
This interface describes individual form data elements, for form submission.
|
IDiscoveredLinkHandler |
This interface describes the functionality needed by a link extractor to note a discovered link.
|
IHTMLHandler |
This interface describes the functionality needed by an HTML processor in order to handle an HTML document.
|
IMetaTagHandler |
This interface describes the functionality needed by a parser to handle metadata tags.
|
IRedirectionHandler |
This interface describes the functionality needed by an redirection processor in order to handle a redirection.
|
IThrottledConnection |
This interface represents an established connection to a URL.
|
IXMLHandler |
This interface describes the functionality needed by an XML processor in order to handle an XML document.
|
LinkParseState |
This class recognizes and interprets all links
|
LoginCookies |
This interface describes cookies obtained during sequential authentication.
|
LoginParameters |
This interface describes login parameters to be used to submit a page during sequential authentication.
|
MetaParseState |
This class recognizes and interprets all meta tags
|
PageCredentials |
This interface describes immutable classes which represents authentication information for page-based authentication.
|
RobotsManager |
This class manages the database table into which we write robots.txt files for hosts.
|
RobotsManager.HostDescription |
This is the object description for a robots host object.
|
RobotsManager.RobotsCacheClass |
Cache class for robots.
|
RobotsManager.RobotsData |
This is a cached data item.
|
ScriptParseState |
This class interprets the tag stream generated by the HTMLParseState class, and causes script sections to be skipped
|
SequenceCredentials |
This interface describes immutable classes which represents authentication information for sequence-based authentication.
|
ThrottleDescription |
This class describes complex throttling criteria pulled from a configuration.
|
ThrottleDescription.ThrottleItem |
Class representing an individual throttle item.
|
ThrottledFetcher.ConnectionPool |
Each connection pool has identical connections we can draw on.
|
ThrottledFetcher.ConnectionPoolKey |
Connection pool key
|
ThrottledFetcher.ExecuteMethodThread |
This thread does the actual socket communication with the server.
|
ThrottledFetcher.ThrottledConnection |
Throttled connections.
|
TrustsDescription |
This class describes trust information pulled from a configuration.
|
WebcrawlerConnector.CanonicalizationPolicies |
Class representing a list of canonicalization rules
|
WebcrawlerConnector.CanonicalizationPolicy |
Class representing a URL regular expression match, for the purposes of determining canonicalization policy
|
WebcrawlerConnector.DocumentURLFilter |
This class describes the url filtering information (for crawling and indexing) obtained from a digested DocumentSpecification.
|
WebcrawlerConnector.EvaluatorToken |
Evaluator token.
|
WebcrawlerConnector.FetchStatus |
|
WebcrawlerConnector.MappingRule |
Class representing a mapping rule
|
WebcrawlerConnector.MappingRules |
Class that represents all mappings
|
WebcrawlerConnector.ProcessActivityLinkHandler |
This class is the handler for links that get added into a IProcessActivity object.
|
WebURL |
Replacement class for java.net.URI, which is broken in many ways.
|