public static class IncrementalIngester.PipelineAddFanout extends java.lang.Object implements IOutputAddActivity
Modifier and Type | Field and Description |
---|---|
protected IncrementalIngester.PipelineAddEntryPoint[] |
entryPoints |
protected IOutputHistoryActivity |
finalHistoryActivity |
protected IOutputQualifyActivity |
finalQualifyActivity |
_rcsid
CREATED_DIRECTORY, EXCEPTION, EXCLUDED_CONTENT, EXCLUDED_DATE, EXCLUDED_LENGTH, EXCLUDED_MIMETYPE, EXCLUDED_URL, HTTP_ERROR, IOEXCEPTION, UNKNOWN_SECURITY
Constructor and Description |
---|
PipelineAddFanout(IncrementalIngester.PipelineAddEntryPoint[] entryPoints,
IOutputHistoryActivity finalHistoryActivity,
IOutputQualifyActivity finalQualifyActivity) |
Modifier and Type | Method and Description |
---|---|
boolean |
checkDateIndexable(java.util.Date date)
Detect if a document date is acceptable downstream or not.
|
boolean |
checkDocumentIndexable(java.io.File localFile)
Pre-determine whether a document (passed here as a File object) is acceptable downstream.
|
boolean |
checkLengthIndexable(long length)
Pre-determine whether a document's length is acceptable downstream.
|
boolean |
checkMimeTypeIndexable(java.lang.String mimeType)
Detect if a mime type is acceptable downstream or not.
|
boolean |
checkNeedToReindex() |
boolean |
checkURLIndexable(java.lang.String uri)
Pre-determine whether a document's URL is acceptable downstream.
|
void |
noDocument()
Send NO document via the pipeline to the next output connection.
|
java.lang.String |
qualifyAccessToken(java.lang.String authorityNameString,
java.lang.String accessToken)
Qualify an access token appropriately, to match access tokens as returned by mod_aa.
|
void |
recordActivity(java.lang.Long startTime,
java.lang.String activityType,
java.lang.Long dataSize,
java.lang.String entityURI,
java.lang.String resultCode,
java.lang.String resultDescription)
Record time-stamped information about the activity of the output connector.
|
int |
sendDocument(java.lang.String documentURI,
RepositoryDocument document)
Send a document via the pipeline to the next output connection.
|
protected final IncrementalIngester.PipelineAddEntryPoint[] entryPoints
protected final IOutputHistoryActivity finalHistoryActivity
protected final IOutputQualifyActivity finalQualifyActivity
public PipelineAddFanout(IncrementalIngester.PipelineAddEntryPoint[] entryPoints, IOutputHistoryActivity finalHistoryActivity, IOutputQualifyActivity finalQualifyActivity)
public boolean checkNeedToReindex()
public boolean checkDateIndexable(java.util.Date date) throws ManifoldCFException, ServiceInterruption
IOutputCheckActivity
checkDateIndexable
in interface IOutputCheckActivity
date
- is the date of the document.ManifoldCFException
ServiceInterruption
public boolean checkMimeTypeIndexable(java.lang.String mimeType) throws ManifoldCFException, ServiceInterruption
IOutputCheckActivity
checkMimeTypeIndexable
in interface IOutputCheckActivity
mimeType
- is the mime type of the document.ManifoldCFException
ServiceInterruption
public boolean checkDocumentIndexable(java.io.File localFile) throws ManifoldCFException, ServiceInterruption
IOutputCheckActivity
checkDocumentIndexable
in interface IOutputCheckActivity
localFile
- is the local file to check.ManifoldCFException
ServiceInterruption
public boolean checkLengthIndexable(long length) throws ManifoldCFException, ServiceInterruption
IOutputCheckActivity
checkLengthIndexable
in interface IOutputCheckActivity
length
- is the length of the document.ManifoldCFException
ServiceInterruption
public boolean checkURLIndexable(java.lang.String uri) throws ManifoldCFException, ServiceInterruption
IOutputCheckActivity
checkURLIndexable
in interface IOutputCheckActivity
uri
- is the URL of the document.ManifoldCFException
ServiceInterruption
public int sendDocument(java.lang.String documentURI, RepositoryDocument document) throws ManifoldCFException, ServiceInterruption, java.io.IOException
sendDocument
in interface IOutputAddActivity
documentURI
- is the document's URI.document
- is the document data to be processed (handed to the output data store).java.io.IOException
- only if there's an IO error reading the data from the document.ManifoldCFException
ServiceInterruption
public void noDocument() throws ManifoldCFException, ServiceInterruption
noDocument
in interface IOutputAddActivity
ManifoldCFException
ServiceInterruption
public java.lang.String qualifyAccessToken(java.lang.String authorityNameString, java.lang.String accessToken) throws ManifoldCFException
qualifyAccessToken
in interface IOutputQualifyActivity
authorityNameString
- is the name of the authority to use to qualify the access token.accessToken
- is the raw, repository access token.ManifoldCFException
public void recordActivity(java.lang.Long startTime, java.lang.String activityType, java.lang.Long dataSize, java.lang.String entityURI, java.lang.String resultCode, java.lang.String resultDescription) throws ManifoldCFException
recordActivity
in interface IOutputHistoryActivity
startTime
- is either null or the time since the start of epoch in milliseconds (Jan 1, 1970). Every
activity has an associated time; the startTime field records when the activity began. A null value
indicates that the start time and the finishing time are the same.activityType
- is a string which is fully interpretable only in the context of the connector involved, which is
used to categorize what kind of activity is being recorded. For example, a web connector might record a
"fetch document" activity. Cannot be null.dataSize
- is the number of bytes of data involved in the activity, or null if not applicable.entityURI
- is a (possibly long) string which identifies the object involved in the history record.
The interpretation of this field will differ from connector to connector. May be null.resultCode
- contains a terse description of the result of the activity. The description is limited in
size to 255 characters, and can be interpreted only in the context of the current connector. May be null.resultDescription
- is a (possibly long) human-readable string which adds detail, if required, to the result
described in the resultCode field. This field is not meant to be queried on. May be null.ManifoldCFException