An index pipeline transforms content into a document suitable for indexing by Solr through a series of modular operations called stages. The objects passed from stage to stage are PipelineDocument objects. Fusion provides many specialized index stages, as well as a JavaScript Index stage that allows custom processing via a JavaScript program. The general outline of the Extract/Transform/Load processing performed by an index pipeline is:
- Raw content is parsed into one or more PipelineDocument objects.
- Any number of intermediate stages operate on the document fields directly, or, in the case of specialized NLP tools, add annotations to a document.
- Finally, the PipelineDocument is sent to Solr for indexing.
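
The three steps above can be sketched in plain JavaScript. This is only a rough illustration of the flow, not Fusion's implementation: `SketchDocument`, `parseRawContent`, `lowercaseBody`, `addLength`, and `runPipeline` are hypothetical names made up for this example, and the "send to Solr" step is replaced by pushing documents onto an array.

```javascript
// Hypothetical stand-in for a pipeline document: an id plus named fields.
// This is NOT Fusion's PipelineDocument API, only an illustration.
class SketchDocument {
  constructor(id) {
    this.id = id;
    this.fields = new Map(); // field name -> array of values
  }
  addField(name, value) {
    if (!this.fields.has(name)) this.fields.set(name, []);
    this.fields.get(name).push(value);
  }
  getFirst(name) {
    const values = this.fields.get(name);
    return values && values.length > 0 ? values[0] : undefined;
  }
  setField(name, value) {
    this.fields.set(name, [value]);
  }
}

// Step 1: parse raw content into one or more documents
// (here, one document per non-empty line).
function parseRawContent(raw) {
  return raw
    .split("\n")
    .map((line) => line.trim())
    .filter((line) => line.length > 0)
    .map((line, i) => {
      const doc = new SketchDocument(`doc-${i + 1}`);
      doc.addField("body", line);
      return doc;
    });
}

// Step 2: intermediate stages. Each takes a document and returns it,
// or returns null to drop the document from the pipeline.
function lowercaseBody(doc) {
  doc.setField("body", doc.getFirst("body").toLowerCase());
  return doc;
}

function addLength(doc) {
  doc.addField("body_length", doc.getFirst("body").length);
  return doc;
}

// Step 3: run every document through the stages in order, then hand the
// survivors to a sink (an array standing in for the Solr indexing step).
function runPipeline(raw, stages, sink) {
  for (let doc of parseRawContent(raw)) {
    for (const stage of stages) {
      doc = stage(doc);
      if (doc === null) break;
    }
    if (doc !== null) sink.push(doc);
  }
  return sink;
}

const indexed = runPipeline("Hello World\n\nFusion", [lowercaseBody, addLength], []);
```

Because each stage is a function from document to document, stages can be reordered or swapped without touching the parsing or indexing steps, which is the modularity the outline describes.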
Available index pipeline stages are listed below:
Document transformation
Document filtering and enrichment
Field transformation
- Date Parsing
- Field Mapping
- Filter Short Fields
- Find and Replace
- GeoIP Lookup
- Regex Field Extraction
- Regex Field Filter
- Regex Field Replacement
- Resolve Multivalued Fields
- Solr Dynamic Field Name Mapping