Q3 Search Algorithm Test Report
Comparison of v1-baseline-us vs. v2-titleboost-us vs. vector
Generated on: 2025-08-01 | 200 Queries Analyzed
Executive Summary & LLM Judgement
WINNER
v2-titleboost-us
LLM Analyst Conclusion: The 'v2-titleboost-us' configuration demonstrates a statistically significant improvement across multiple key relevance metrics, including nDCG@10, Precision, and Recall, when compared to both the baseline and vector models. Although the 'vector' model shows strength in some areas, 'v2-titleboost-us' provides the most balanced and consistent uplift in search quality. It is therefore recommended for deployment.
Key Metrics Summary
Metric | v1-baseline-us | v2-titleboost-us | vector | Winner |
---|---|---|---|---|
Mean nDCG@10 | 0.2983 | 0.3355 | 0.2566 | v2-titleboost-us |
Mean RR | 0.4375 | 0.4907 | 0.4165 | v2-titleboost-us |
Mean Precision@5 | 0.2690 | 0.3080 | 0.2430 | v2-titleboost-us |
Queries with Zero Results | 1 | 1 | 0 | vector |