After introducing Cohere Rerank v3.5 into our pipeline, the relevance of the required chunks improved significantly, which directly reduced hallucination responses from the downstream LLMs, and the latency was quite good, making it acceptable for the enterprise-grade application.

