Class IncrementalIngester.PipelineCheckFanout
- java.lang.Object
-
- org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.PipelineCheckFanout
-
- All Implemented Interfaces:
IOutputCheckActivity
- Enclosing class:
- IncrementalIngester
public static class IncrementalIngester.PipelineCheckFanout extends java.lang.Object implements IOutputCheckActivity
This class describes the entry stage of multiple siblings in a check pipeline.
-
-
Field Summary
Fields Modifier and Type Field Description protected IncrementalIngester.PipelineCheckEntryPoint[]entryPoints-
Fields inherited from interface org.apache.manifoldcf.agents.interfaces.IOutputCheckActivity
_rcsid
-
-
Constructor Summary
Constructors Constructor Description PipelineCheckFanout(IncrementalIngester.PipelineCheckEntryPoint[] entryPoints)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description booleancheckDateIndexable(java.util.Date date)Detect if a document date is acceptable downstream or not.booleancheckDocumentIndexable(java.io.File localFile)Pre-determine whether a document (passed here as a File object) is acceptable downstream.booleancheckLengthIndexable(long length)Pre-determine whether a document's length is acceptable downstream.booleancheckMimeTypeIndexable(java.lang.String mimeType)Detect if a mime type is acceptable downstream or not.booleancheckURLIndexable(java.lang.String uri)Pre-determine whether a document's URL is acceptable downstream.
-
-
-
Field Detail
-
entryPoints
protected final IncrementalIngester.PipelineCheckEntryPoint[] entryPoints
-
-
Constructor Detail
-
PipelineCheckFanout
public PipelineCheckFanout(IncrementalIngester.PipelineCheckEntryPoint[] entryPoints)
-
-
Method Detail
-
checkDateIndexable
public boolean checkDateIndexable(java.util.Date date) throws ManifoldCFException, ServiceInterruptionDescription copied from interface:IOutputCheckActivityDetect if a document date is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.- Specified by:
checkDateIndexablein interfaceIOutputCheckActivity- Parameters:
date- is the date of the document.- Returns:
- true if the document with that date can be accepted by the downstream connection.
- Throws:
ManifoldCFExceptionServiceInterruption
-
checkMimeTypeIndexable
public boolean checkMimeTypeIndexable(java.lang.String mimeType) throws ManifoldCFException, ServiceInterruptionDescription copied from interface:IOutputCheckActivityDetect if a mime type is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.- Specified by:
checkMimeTypeIndexablein interfaceIOutputCheckActivity- Parameters:
mimeType- is the mime type of the document.- Returns:
- true if the mime type can be accepted by the downstream connection.
- Throws:
ManifoldCFExceptionServiceInterruption
-
checkDocumentIndexable
public boolean checkDocumentIndexable(java.io.File localFile) throws ManifoldCFException, ServiceInterruptionDescription copied from interface:IOutputCheckActivityPre-determine whether a document (passed here as a File object) is acceptable downstream. This method is used to determine whether a document needs to be actually transferred. This hook is provided mainly to support search engines that only handle a small set of accepted file types.- Specified by:
checkDocumentIndexablein interfaceIOutputCheckActivity- Parameters:
localFile- is the local file to check.- Returns:
- true if the file is acceptable by the downstream connection.
- Throws:
ManifoldCFExceptionServiceInterruption
-
checkLengthIndexable
public boolean checkLengthIndexable(long length) throws ManifoldCFException, ServiceInterruptionDescription copied from interface:IOutputCheckActivityPre-determine whether a document's length is acceptable downstream. This method is used to determine whether to fetch a document in the first place.- Specified by:
checkLengthIndexablein interfaceIOutputCheckActivity- Parameters:
length- is the length of the document.- Returns:
- true if the file is acceptable by the downstream connection.
- Throws:
ManifoldCFExceptionServiceInterruption
-
checkURLIndexable
public boolean checkURLIndexable(java.lang.String uri) throws ManifoldCFException, ServiceInterruptionDescription copied from interface:IOutputCheckActivityPre-determine whether a document's URL is acceptable downstream. This method is used to help filter out documents that cannot be indexed in advance.- Specified by:
checkURLIndexablein interfaceIOutputCheckActivity- Parameters:
uri- is the URL of the document.- Returns:
- true if the file is acceptable by the downstream connection.
- Throws:
ManifoldCFExceptionServiceInterruption
-
-