Direct Judgement Preference Optimization
Peifeng Wang, Austin Xu, Yilun Zhou, Caiming Xiong, and Shafiq Joty. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP-25), 2025.