SFR-DeepResearch: Towards Effective Reinforcement Learning for Autonomously Reasoning Single Agents
Xuan-Phi Nguyen, Shrey Pandit, Revanth Gangi, Austin Xu, Silvio Savarese, Caiming Xiong, and Shafiq Joty. 2025.
PDF BibTex Slides