Adaptive Recursive Retrieval for Cost-Efficient Factual Accuracy in Small Language Models
Inference-time adaptive, verified recursive retrieval enables small language models to achieve reliable factual grounding on complex queries without large-model cost.