Class WorkerThread.CheckActivity
- java.lang.Object
-
- org.apache.manifoldcf.crawler.system.WorkerThread.CheckActivity
-
- All Implemented Interfaces:
IOutputCheckActivity
- Direct Known Subclasses:
WorkerThread.OutputActivity
- Enclosing class:
- WorkerThread
protected static class WorkerThread.CheckActivity extends java.lang.Object implements IOutputCheckActivity
The check activity class
-
-
Field Summary
-
Fields inherited from interface org.apache.manifoldcf.agents.interfaces.IOutputCheckActivity
_rcsid
-
-
Constructor Summary
Constructors Constructor Description CheckActivity()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description boolean
checkDateIndexable(java.util.Date date)
Detect if a date is acceptable downstream or not.boolean
checkDocumentIndexable(java.io.File localFile)
Pre-determine whether a document (passed here as a File object) is acceptable downstream.boolean
checkLengthIndexable(long length)
Pre-determine whether a document's length is acceptable downstream.boolean
checkMimeTypeIndexable(java.lang.String mimeType)
Detect if a mime type is acceptable downstream or not.boolean
checkURLIndexable(java.lang.String url)
Pre-determine whether a document's URL is acceptable downstream.
-
-
-
Method Detail
-
checkDateIndexable
public boolean checkDateIndexable(java.util.Date date) throws ManifoldCFException, ServiceInterruption
Detect if a date is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.- Specified by:
checkDateIndexable
in interfaceIOutputCheckActivity
- Parameters:
date
- is the document's date- Returns:
- true if the document with that date can be accepted by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkMimeTypeIndexable
public boolean checkMimeTypeIndexable(java.lang.String mimeType) throws ManifoldCFException, ServiceInterruption
Detect if a mime type is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.- Specified by:
checkMimeTypeIndexable
in interfaceIOutputCheckActivity
- Parameters:
mimeType
- is the mime type of the document.- Returns:
- true if the mime type can be accepted by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkDocumentIndexable
public boolean checkDocumentIndexable(java.io.File localFile) throws ManifoldCFException, ServiceInterruption
Pre-determine whether a document (passed here as a File object) is acceptable downstream. This method is used to determine whether a document needs to be actually transferred. This hook is provided mainly to support search engines that only handle a small set of accepted file types.- Specified by:
checkDocumentIndexable
in interfaceIOutputCheckActivity
- Parameters:
localFile
- is the local file to check.- Returns:
- true if the file is acceptable by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkLengthIndexable
public boolean checkLengthIndexable(long length) throws ManifoldCFException, ServiceInterruption
Pre-determine whether a document's length is acceptable downstream. This method is used to determine whether to fetch a document in the first place.- Specified by:
checkLengthIndexable
in interfaceIOutputCheckActivity
- Parameters:
length
- is the length of the document.- Returns:
- true if the file is acceptable by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkURLIndexable
public boolean checkURLIndexable(java.lang.String url) throws ManifoldCFException, ServiceInterruption
Pre-determine whether a document's URL is acceptable downstream. This method is used to help filter out documents that cannot be indexed in advance.- Specified by:
checkURLIndexable
in interfaceIOutputCheckActivity
- Parameters:
url
- is the URL of the document.- Returns:
- true if the file is acceptable by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
-