Class AsciiFoldingDataProcessor

java.lang.Object
de.cxp.ocs.preprocessor.impl.AsciiFoldingDataProcessor
All Implemented Interfaces:
DocumentPreProcessor

public class AsciiFoldingDataProcessor extends Object implements DocumentPreProcessor
Converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists for every searchable field.
  • Constructor Details

    • AsciiFoldingDataProcessor

      public AsciiFoldingDataProcessor()
  • Method Details

    • initialize

      public void initialize(FieldConfigAccess fieldConfig, Map<String,String> preProcessorConfig)
      Description copied from interface: DocumentPreProcessor
      DataPreProcessor MUST have a no-args constructor. To configure it afterwards, this method will be used.
      Specified by:
      initialize in interface DocumentPreProcessor
      Parameters:
      fieldConfig - field config access
      preProcessorConfig - custom string-to-string map that can be configured per DocumentPreProcessor.
    • process

      public boolean process(Document sourceDocument, boolean visible)
      Description copied from interface: DocumentPreProcessor
      Called for each source document.
      Specified by:
      process in interface DocumentPreProcessor
      Parameters:
      sourceDocument - source document
      visible - weather or not the record is currently marked for indexing.
      Returns:
      true if the record should be indexed, false otherwise.