Job configuration specifications
Default job name | COLLECTION_NAME_head_tail |
Input | Raw or aggregated signals (the COLLECTION_NAME_signals or COLLECTION_NAME_signals_aggr collections by default) |
Output | ● Rewrites for underperforming queries (the _query_rewrite_staging collection by default) ●Analytics tables (the COLLECTION_NAME_job_reports collection by default) |
COLLECTION_NAME_signals
collection) or aggregated (the _signals_aggr
collection).count_i
is the field that records the count of raw signals and aggr_count_i
is the field that records the count after aggregation.mainType
/Main Event Type parameter)filterType
/Filtering Event Type parameter)<input-collection>_job_reports
collection. The head/tail job adds a set of analytics results tables to the collection. You can find these table names in the doc_type_s
field of each document:
overall_distribution
summary_stat
queries_ordered
tokens_ordered
queryLength
tail_reasons
tail_rewriting
overall_distribution
)summary_stat
)topQ
)trafficPerc
)lastTraffic
)trafficCount
)queries_ordered
)tokens_ordered
)queryLength
)tail_reasons
)spelling | The query contains one or more misspellings; we can apply spelling suggestions based on the matching head. |
number | The query contains an attribute search on a specific dimension. To normalize these queries we can parse the number to deal with different formatting, and/or pay attention to unit synonyms or enrich the product catalog. For example, “3x5” should be converted to “3’ X 5’” to match the dimension field. |
other-specific | The query contains specific descriptive words plus a head query, which means the user is searching for a very specific product or has a specific requirement. We can boost on the specific part for better relevancy. |
other-extra | This is similar to ‘other-specific’ but the descriptive part may lead to ambiguity, so it requires boosting the head query portion of the query instead of the specific or descriptive words. |
rare-term | The user is searching for a rare item; use caution when boosting. |
re-wording | The query contains a sequence of terms in a less-common order. Flipping the word order to a more common one can change a tail query to a head query, and allows for consistent boosting on the last term in many cases. |
stopwords | Query contains stopwords plus head query. We would need to drop stopwords. |
keywordsBlobName
) field.
tail_rewriting
)tailQuery_orig
field), Managed Fusion tries to find its closest matching head queries (the headQuery_orig
field), then suggests a query rewrite (the suggested_query
field) which would improve the query. The rewrite suggestions in this table can be implemented in a variety of ways, including utilizing rules editor or configuring a query parser that rewrites tail queries.