Class WorkerThread.CheckActivity
- java.lang.Object
  - org.apache.manifoldcf.crawler.system.WorkerThread.CheckActivity
- All Implemented Interfaces:
IOutputCheckActivity
- Direct Known Subclasses:
WorkerThread.OutputActivity
- Enclosing class:
- WorkerThread
protected static class WorkerThread.CheckActivity extends java.lang.Object implements IOutputCheckActivity
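The check activity class. It implements IOutputCheckActivity, whose methods report whether a document's date, MIME type, content, length, or URL is acceptable downstream.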
-
Field Summary
-
Fields inherited from interface org.apache.manifoldcf.agents.interfaces.IOutputCheckActivity
_rcsid
-
Constructor Summary
Constructor   Description
CheckActivity()
-
Method Summary
Modifier and Type   Method   Description
boolean   checkDateIndexable(java.util.Date date)   Detect if a date is acceptable downstream or not.
boolean   checkDocumentIndexable(java.io.File localFile)   Pre-determine whether a document (passed here as a File object) is acceptable downstream.
boolean   checkLengthIndexable(long length)   Pre-determine whether a document's length is acceptable downstream.
boolean   checkMimeTypeIndexable(java.lang.String mimeType)   Detect if a mime type is acceptable downstream or not.
boolean   checkURLIndexable(java.lang.String url)   Pre-determine whether a document's URL is acceptable downstream.
-
Method Detail
-
checkDateIndexable
public boolean checkDateIndexable(java.util.Date date) throws ManifoldCFException, ServiceInterruption
Detect if a date is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.
- Specified by:
checkDateIndexable in interface IOutputCheckActivity
- Parameters:
date - is the document's date
- Returns:
true if the document with that date can be accepted by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkMimeTypeIndexable
public boolean checkMimeTypeIndexable(java.lang.String mimeType) throws ManifoldCFException, ServiceInterruption
Detect if a mime type is acceptable downstream or not. This method is used to determine whether it makes sense to fetch a document in the first place.
- Specified by:
checkMimeTypeIndexable in interface IOutputCheckActivity
- Parameters:
mimeType - is the mime type of the document.
- Returns:
true if the mime type can be accepted by the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
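The two metadata checks above are meant to run before a document is fetched. The following is a minimal sketch of that gating, not the ManifoldCF implementation: `MetadataCheck`, `SimpleMetadataCheck`, and `PreFetchSketch` are hypothetical stand-ins, the acceptance policy (a fixed MIME type set and a cutoff date) is an assumption, and the stand-in methods omit the `ManifoldCFException` and `ServiceInterruption` declarations of the real interface.

```java
import java.util.Date;
import java.util.Set;

// Hypothetical stand-in for the two metadata checks documented above.
// It is NOT the ManifoldCF interface: the real methods also declare
// ManifoldCFException and ServiceInterruption.
interface MetadataCheck {
    boolean checkDateIndexable(Date date);
    boolean checkMimeTypeIndexable(String mimeType);
}

// Assumed policy, for illustration only: accept a fixed set of MIME types
// and any date at or after a cutoff.
class SimpleMetadataCheck implements MetadataCheck {
    private final Set<String> acceptedMimeTypes;
    private final Date oldestAccepted;

    SimpleMetadataCheck(Set<String> acceptedMimeTypes, Date oldestAccepted) {
        this.acceptedMimeTypes = acceptedMimeTypes;
        this.oldestAccepted = oldestAccepted;
    }

    @Override
    public boolean checkDateIndexable(Date date) {
        return !date.before(oldestAccepted);
    }

    @Override
    public boolean checkMimeTypeIndexable(String mimeType) {
        return acceptedMimeTypes.contains(mimeType);
    }
}

class PreFetchSketch {
    // Returns true only when both metadata checks pass, i.e. when a fetch is worthwhile.
    static boolean worthFetching(MetadataCheck check, Date date, String mimeType) {
        return check.checkDateIndexable(date) && check.checkMimeTypeIndexable(mimeType);
    }

    public static void main(String[] args) {
        MetadataCheck check = new SimpleMetadataCheck(
            Set.of("text/plain", "application/pdf"), new Date(1_000L));
        System.out.println(worthFetching(check, new Date(2_000L), "text/plain"));
        System.out.println(worthFetching(check, new Date(2_000L), "image/png"));
    }
}
```

Because both checks only need metadata, a caller can skip the fetch entirely as soon as either one returns false.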
-
checkDocumentIndexable
public boolean checkDocumentIndexable(java.io.File localFile) throws ManifoldCFException, ServiceInterruption
Pre-determine whether a document (passed here as a File object) is acceptable downstream. This method is used to determine whether a document actually needs to be transferred. This hook is provided mainly to support search engines that only handle a small set of accepted file types.
- Specified by:
checkDocumentIndexable in interface IOutputCheckActivity
- Parameters:
localFile - is the local file to check.
- Returns:
true if the file is acceptable to the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
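Unlike the metadata checks, this check receives the document itself as a local file. A minimal sketch of that flow follows, assuming the caller spools content to a temporary file before asking. `LocalFileCheck` and `LocalFileCheckSketch` are hypothetical names, the suffix-based policy is an assumption chosen only to illustrate "a small set of accepted file types", and the stand-in omits the exceptions the real method declares.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

// Hypothetical stand-in for the File-based check documented above.
// The real method also declares ManifoldCFException and ServiceInterruption.
interface LocalFileCheck {
    boolean checkDocumentIndexable(File localFile);
}

class LocalFileCheckSketch {
    // Assumed policy, for illustration only: accept non-empty regular files
    // whose name ends with the given suffix.
    static LocalFileCheck suffixCheck(String acceptedSuffix) {
        return localFile -> localFile.isFile()
            && localFile.length() > 0
            && localFile.getName().endsWith(acceptedSuffix);
    }

    // Spools the content to a temporary file, runs the check on it,
    // and deletes the file afterwards.
    static boolean spoolAndCheck(LocalFileCheck check, String content, String suffix) {
        File temp = null;
        try {
            temp = File.createTempFile("doc", suffix);
            Files.write(temp.toPath(), content.getBytes(StandardCharsets.UTF_8));
            return check.checkDocumentIndexable(temp);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } finally {
            if (temp != null) {
                temp.delete();
            }
        }
    }

    public static void main(String[] args) {
        LocalFileCheck check = suffixCheck(".txt");
        System.out.println(spoolAndCheck(check, "hello", ".txt"));
        System.out.println(spoolAndCheck(check, "hello", ".bin"));
    }
}
```

The point of checking the local copy is that a rejected document never has to be transferred downstream.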
-
checkLengthIndexable
public boolean checkLengthIndexable(long length) throws ManifoldCFException, ServiceInterruption
Pre-determine whether a document's length is acceptable downstream. This method is used to determine whether to fetch a document in the first place.
- Specified by:
checkLengthIndexable in interface IOutputCheckActivity
- Parameters:
length - is the length of the document.
- Returns:
true if the length is acceptable to the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
-
checkURLIndexable
public boolean checkURLIndexable(java.lang.String url) throws ManifoldCFException, ServiceInterruption
Pre-determine whether a document's URL is acceptable downstream. This method is used to filter out, in advance, documents that cannot be indexed.
- Specified by:
checkURLIndexable in interface IOutputCheckActivity
- Parameters:
url - is the URL of the document.
- Returns:
true if the URL is acceptable to the downstream connection.
- Throws:
ManifoldCFException
ServiceInterruption
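The length and URL checks lend themselves to filtering a batch of candidates before anything is fetched. The sketch below illustrates this under stated assumptions: `LengthAndUrlCheck`, `Candidate`, and `CandidateFilterSketch` are hypothetical, the policy (a maximum length and a required URL prefix) is invented for the example, and the stand-in methods omit the exceptions declared by the real interface.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the length and URL checks documented above.
// The real methods also declare ManifoldCFException and ServiceInterruption.
interface LengthAndUrlCheck {
    boolean checkLengthIndexable(long length);
    boolean checkURLIndexable(String url);
}

// Hypothetical record of what is known about a document before fetching it.
final class Candidate {
    final String url;
    final long length;

    Candidate(String url, long length) {
        this.url = url;
        this.length = length;
    }
}

class CandidateFilterSketch {
    // Assumed policy, for illustration only: a maximum length and a required URL prefix.
    static LengthAndUrlCheck limits(long maxLength, String requiredPrefix) {
        return new LengthAndUrlCheck() {
            @Override
            public boolean checkLengthIndexable(long length) {
                return length <= maxLength;
            }

            @Override
            public boolean checkURLIndexable(String url) {
                return url.startsWith(requiredPrefix);
            }
        };
    }

    // Keeps only the URLs of candidates that pass both checks, preserving input order.
    static List<String> urlsWorthFetching(LengthAndUrlCheck check, List<Candidate> candidates) {
        List<String> kept = new ArrayList<>();
        for (Candidate c : candidates) {
            if (check.checkURLIndexable(c.url) && check.checkLengthIndexable(c.length)) {
                kept.add(c.url);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        LengthAndUrlCheck check = limits(1_000L, "https://example.org/");
        List<Candidate> candidates = List.of(
            new Candidate("https://example.org/a", 500L),
            new Candidate("https://example.org/big", 5_000L),
            new Candidate("https://other.example/b", 10L));
        System.out.println(urlsWorthFetching(check, candidates));
    }
}
```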
-