| AbortChecker |
This class furnishes an abort signal whenever the job activity says it should.
|
| AuthenticationCredentials |
This interface describes immutable classes which represents authentication information for all kinds of authentication.
|
| CookieManager |
This class manages the database table into which we write cookies.
|
| CookieManager.CookiesCacheClass |
Cache class for robots.
|
| CookieManager.CookiesDescription |
This is the object description for a session key object.
|
| CredentialsDescription |
This class describes credential information pulled from a configuration.
|
| CredentialsDescription.SessionCredentialItem |
Session credential helper class
|
| CredentialsDescription.SessionCredentialParameter |
Session credential parameter class
|
| DataCache |
This class is a cache of a specific URL's data.
|
| DataCache.DocumentData |
This class represents everything we need to know about a document that's getting passed from the
getDocumentVersions() phase to the processDocuments() phase.
|
| DNSManager |
This class manages the database table into which we DNS entries for hosts.
|
| DNSManager.DNSCacheClass |
Cache class for robots.
|
| DNSManager.DNSInfo |
This is a cached data item.
|
| DNSManager.HostDescription |
This is the object description for a robots host object.
|
| FindHandler |
This class is used to discover links in a session login context
|
| FormData |
This interface describes the form data gleaned from an HTML page.
|
| FormDataAccumulator |
This class accumulates form data and allows overrides
|
| FormDataElement |
This interface describes individual form data elements, for form submission.
|
| IDiscoveredLinkHandler |
This interface describes the functionality needed by a link extractor to note a discovered link.
|
| IHTMLHandler |
This interface describes the functionality needed by an HTML processor in order to handle an HTML document.
|
| IMetaTagHandler |
This interface describes the functionality needed by a parser to handle metadata tags.
|
| IRedirectionHandler |
This interface describes the functionality needed by an redirection processor in order to handle a redirection.
|
| IThrottledConnection |
This interface represents an established connection to a URL.
|
| IXMLHandler |
This interface describes the functionality needed by an XML processor in order to handle an XML document.
|
| LinkParseState |
This class recognizes and interprets all links
|
| LoginCookies |
This interface describes cookies obtained during sequential authentication.
|
| LoginParameters |
This interface describes login parameters to be used to submit a page during sequential authentication.
|
| MetaParseState |
This class recognizes and interprets all meta tags
|
| PageCredentials |
This interface describes immutable classes which represents authentication information for page-based authentication.
|
| RobotsManager |
This class manages the database table into which we write robots.txt files for hosts.
|
| RobotsManager.HostDescription |
This is the object description for a robots host object.
|
| RobotsManager.RobotsCacheClass |
Cache class for robots.
|
| RobotsManager.RobotsData |
This is a cached data item.
|
| ScriptParseState |
This class interprets the tag stream generated by the HTMLParseState class, and causes script sections to be skipped
|
| SequenceCredentials |
This interface describes immutable classes which represents authentication information for sequence-based authentication.
|
| ThrottleDescription |
This class describes complex throttling criteria pulled from a configuration.
|
| ThrottleDescription.ThrottleItem |
Class representing an individual throttle item.
|
| ThrottledFetcher.ConnectionPool |
Each connection pool has identical connections we can draw on.
|
| ThrottledFetcher.ConnectionPoolKey |
Connection pool key
|
| ThrottledFetcher.ExecuteMethodThread |
This thread does the actual socket communication with the server.
|
| ThrottledFetcher.ThrottledConnection |
Throttled connections.
|
| TrustsDescription |
This class describes trust information pulled from a configuration.
|
| WebcrawlerConnector.CanonicalizationPolicies |
Class representing a list of canonicalization rules
|
| WebcrawlerConnector.CanonicalizationPolicy |
Class representing a URL regular expression match, for the purposes of determining canonicalization policy
|
| WebcrawlerConnector.DocumentURLFilter |
This class describes the url filtering information (for crawling and indexing) obtained from a digested DocumentSpecification.
|
| WebcrawlerConnector.EvaluatorToken |
Evaluator token.
|
| WebcrawlerConnector.FetchStatus |
|
| WebcrawlerConnector.MappingRule |
Class representing a mapping rule
|
| WebcrawlerConnector.MappingRules |
Class that represents all mappings
|
| WebcrawlerConnector.ProcessActivityLinkHandler |
This class is the handler for links that get added into a IProcessActivity object.
|
| WebURL |
Replacement class for java.net.URI, which is broken in many ways.
|