Class IncrementalIngester.PipelineCheckFanout
- java.lang.Object
  - org.apache.manifoldcf.agents.incrementalingest.IncrementalIngester.PipelineCheckFanout
- All Implemented Interfaces: IOutputCheckActivity
- Enclosing class: IncrementalIngester
public static class IncrementalIngester.PipelineCheckFanout extends java.lang.Object implements IOutputCheckActivity
This class describes the entry stage of multiple siblings in a check pipeline.
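The fan-out idea can be sketched as follows. This is a minimal, self-contained illustration and not the ManifoldCF source: `MimeCheck` and `FanoutSketch` are hypothetical names, and the rule that a check passes when at least one sibling accepts (a logical OR) is an assumption, since this page does not state how the sibling results are combined.

```java
// Hypothetical stand-in for one sibling entry point; not a ManifoldCF type.
interface MimeCheck {
    boolean checkMimeTypeIndexable(String mimeType);
}

// Sketch of a fan-out stage that forwards one check to several siblings.
final class FanoutSketch implements MimeCheck {
    private final MimeCheck[] entryPoints;

    FanoutSketch(MimeCheck[] entryPoints) {
        this.entryPoints = entryPoints;
    }

    // Assumption: the answer is true if at least one sibling accepts.
    @Override
    public boolean checkMimeTypeIndexable(String mimeType) {
        for (MimeCheck p : entryPoints) {
            if (p.checkMimeTypeIndexable(mimeType)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        MimeCheck pdfOnly = "application/pdf"::equals;
        MimeCheck textOnly = m -> m.startsWith("text/");
        FanoutSketch fanout = new FanoutSketch(new MimeCheck[] { pdfOnly, textOnly });
        System.out.println(fanout.checkMimeTypeIndexable("text/html")); // true
        System.out.println(fanout.checkMimeTypeIndexable("image/png")); // false
    }
}
```

Under this assumed rule, a fan-out with no entry points rejects everything, because no sibling can accept.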
-
-
Field Summary
Modifier and Type | Field
protected IncrementalIngester.PipelineCheckEntryPoint[] | entryPoints
-
Fields inherited from interface org.apache.manifoldcf.agents.interfaces.IOutputCheckActivity
_rcsid
-
-
Constructor Summary
Constructor
PipelineCheckFanout(IncrementalIngester.PipelineCheckEntryPoint[] entryPoints)
-
Method Summary
Modifier and Type | Method | Description
boolean | checkDateIndexable(java.util.Date date) | Detect if a document date is acceptable downstream or not.
boolean | checkDocumentIndexable(java.io.File localFile) | Pre-determine whether a document (passed here as a File object) is acceptable downstream.
boolean | checkLengthIndexable(long length) | Pre-determine whether a document's length is acceptable downstream.
boolean | checkMimeTypeIndexable(java.lang.String mimeType) | Detect if a mime type is acceptable downstream or not.
boolean | checkURLIndexable(java.lang.String uri) | Pre-determine whether a document's URL is acceptable downstream.
-
-
-
Field Detail
-
entryPoints
protected final IncrementalIngester.PipelineCheckEntryPoint[] entryPoints
-
-
Constructor Detail
-
PipelineCheckFanout
public PipelineCheckFanout(IncrementalIngester.PipelineCheckEntryPoint[] entryPoints)
-
-
Method Detail
-
checkDateIndexable
public boolean checkDateIndexable(java.util.Date date) throws ManifoldCFException, ServiceInterruption
Description copied from interface: IOutputCheckActivity
Detect if a document date is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.
- Specified by:
checkDateIndexable in interface IOutputCheckActivity
- Parameters:
date - is the date of the document.
- Returns:
true if the document with that date can be accepted by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkMimeTypeIndexable
public boolean checkMimeTypeIndexable(java.lang.String mimeType) throws ManifoldCFException, ServiceInterruption
Description copied from interface: IOutputCheckActivity
Detect if a mime type is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.
- Specified by:
checkMimeTypeIndexable in interface IOutputCheckActivity
- Parameters:
mimeType - is the mime type of the document.
- Returns:
true if the mime type can be accepted by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
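A caller might use the date and mime type checks as a gate before fetching, roughly as sketched here. Everything in this block is illustrative: `PreFetchCheck`, `PreFetchSketch`, `shouldFetch`, and `textSince` are hypothetical names, the acceptance rules are invented for the example, and the real interface methods additionally declare `ManifoldCFException` and `ServiceInterruption`.

```java
import java.util.Date;

// Hypothetical stand-in for two of the pre-fetch checks; not a ManifoldCF type.
interface PreFetchCheck {
    boolean checkDateIndexable(Date date);
    boolean checkMimeTypeIndexable(String mimeType);
}

final class PreFetchSketch {
    // Returns true only if both metadata checks pass, so the caller can
    // skip the fetch as soon as one of them fails.
    static boolean shouldFetch(PreFetchCheck check, Date modified, String mimeType) {
        return check.checkDateIndexable(modified)
            && check.checkMimeTypeIndexable(mimeType);
    }

    // Example rule set (invented): text documents dated on or after a cutoff.
    static PreFetchCheck textSince(final Date cutoff) {
        return new PreFetchCheck() {
            @Override
            public boolean checkDateIndexable(Date date) {
                return !date.before(cutoff);
            }

            @Override
            public boolean checkMimeTypeIndexable(String mimeType) {
                return mimeType.startsWith("text/");
            }
        };
    }

    public static void main(String[] args) {
        PreFetchCheck check = textSince(new Date(1000L));
        System.out.println(shouldFetch(check, new Date(2000L), "text/plain")); // true
        System.out.println(shouldFetch(check, new Date(2000L), "image/png"));  // false
        System.out.println(shouldFetch(check, new Date(500L), "text/plain"));  // false
    }
}
```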
-
checkDocumentIndexable
public boolean checkDocumentIndexable(java.io.File localFile) throws ManifoldCFException, ServiceInterruption
Description copied from interface: IOutputCheckActivity
Pre-determine whether a document (passed here as a File object) is acceptable downstream. This method is used to determine whether a document actually needs to be transferred. This hook is provided mainly to support search engines that only handle a small set of accepted file types.
- Specified by:
checkDocumentIndexable in interface IOutputCheckActivity
- Parameters:
localFile - is the local file to check.
- Returns:
true if the file is acceptable to the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
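To make the "small set of accepted file types" case concrete, here is one way a downstream check on a `File` could look. This is a sketch under stated assumptions: `ExtensionFilterSketch` is a hypothetical class, and deciding by file extension is only one possible rule; the page does not say how any real connector inspects the file.

```java
import java.io.File;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;

// Hypothetical file-type filter; not part of ManifoldCF.
final class ExtensionFilterSketch {
    private final Set<String> acceptedExtensions;

    // Extensions are given without the dot, e.g. "pdf", "txt".
    ExtensionFilterSketch(String... extensions) {
        this.acceptedExtensions = new HashSet<>();
        for (String ext : Arrays.asList(extensions)) {
            this.acceptedExtensions.add(ext.toLowerCase(Locale.ROOT));
        }
    }

    // Accepts the file only if its name ends in one of the accepted extensions.
    // Only the name is inspected, so the file does not need to exist.
    boolean checkDocumentIndexable(File localFile) {
        String name = localFile.getName();
        int dot = name.lastIndexOf('.');
        if (dot < 0 || dot == name.length() - 1) {
            return false; // no extension at all
        }
        String ext = name.substring(dot + 1).toLowerCase(Locale.ROOT);
        return acceptedExtensions.contains(ext);
    }

    public static void main(String[] args) {
        ExtensionFilterSketch filter = new ExtensionFilterSketch("pdf", "txt");
        System.out.println(filter.checkDocumentIndexable(new File("report.PDF")));  // true
        System.out.println(filter.checkDocumentIndexable(new File("archive.zip"))); // false
    }
}
```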
-
checkLengthIndexable
public boolean checkLengthIndexable(long length) throws ManifoldCFException, ServiceInterruption
Description copied from interface: IOutputCheckActivity
Pre-determine whether a document's length is acceptable downstream. This method is used to determine whether to fetch a document in the first place.
- Specified by:
checkLengthIndexable in interface IOutputCheckActivity
- Parameters:
length - is the length of the document.
- Returns:
true if the document length is acceptable to the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkURLIndexable
public boolean checkURLIndexable(java.lang.String uri) throws ManifoldCFException, ServiceInterruption
Description copied from interface: IOutputCheckActivity
Pre-determine whether a document's URL is acceptable downstream. This method is used to filter out, in advance, documents that cannot be indexed.
- Specified by:
checkURLIndexable in interface IOutputCheckActivity
- Parameters:
uri - is the URL of the document.
- Returns:
true if the URL is acceptable to the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
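As an illustration of a URL pre-check, the sketch below accepts only well-formed `http` and `https` URLs. The class name `UrlFilterSketch` and the scheme rule are assumptions made for the example; the page does not specify what any downstream connection actually accepts.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical URL filter; not part of ManifoldCF.
final class UrlFilterSketch {
    // Accepts the URL only if it parses and its scheme is http or https.
    static boolean checkURLIndexable(String uri) {
        final URI parsed;
        try {
            parsed = new URI(uri);
        } catch (URISyntaxException e) {
            return false; // malformed input cannot be indexed
        }
        String scheme = parsed.getScheme();
        if (scheme == null) {
            return false; // relative reference, no scheme to judge by
        }
        return scheme.equalsIgnoreCase("http") || scheme.equalsIgnoreCase("https");
    }

    public static void main(String[] args) {
        System.out.println(checkURLIndexable("https://example.com/a.html")); // true
        System.out.println(checkURLIndexable("ftp://example.com/a.html"));   // false
        System.out.println(checkURLIndexable("not a uri"));                  // false
    }
}
```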