The Fusion machine learning indexing stage uses a trained machine learning model to analyze one or more fields of a PipelineDocument and stores the results in a new field of either the PipelineDocument or the Context object. To use this stage, you must first train a machine learning model. There are two ways to train a model:
- Use a Fusion AI job that trains a model, like Logistic Regression or Random Forest.
- Train a model using Spark’s MLlib API outside of Fusion, and upload this model into Fusion’s blob store. Complete details are available in Machine Learning Models in Fusion.
TIP: When specifying field names, multiple field names are supported, in this format: field1:weight,field2:weight,field3:weight
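As an illustration of the weighted field format in the tip, the following is a minimal Python sketch that builds and parses such a string. The helper names `format_field_weights` and `parse_field_weights` and the field names `title` and `body` are hypothetical and not part of Fusion. The sketch also assumes weights are numeric, which the format description does not state.

```python
def format_field_weights(weights):
    """Build a 'field1:weight,field2:weight' string from a dict.

    Hypothetical helper: assumes field names contain no ':' or ','.
    """
    parts = []
    for field, weight in weights.items():
        if ":" in field or "," in field:
            raise ValueError(f"unsupported character in field name: {field!r}")
        parts.append(f"{field}:{weight}")
    return ",".join(parts)


def parse_field_weights(spec):
    """Parse a 'field1:weight,field2:weight' string back into a dict.

    Assumption: weights are numeric, so they are converted to float.
    """
    result = {}
    for item in spec.split(","):
        # Split on the last ':' so the weight is always the final part.
        field, _, weight = item.strip().rpartition(":")
        if not field:
            raise ValueError(f"expected 'field:weight', got {item!r}")
        result[field] = float(weight)
    return result


# Example with hypothetical field names.
spec = format_field_weights({"title": 2.0, "body": 1.0})
print(spec)
print(parse_field_weights(spec))
```

Generating the string from a mapping keeps field names and weights paired, which helps avoid a misplaced colon or comma when the list of fields grows.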
Although this stage is available without a Fusion AI license, it is only effective after running the Fusion AI jobs mentioned above.