RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment
Kevin Yang, Dan Klein, Asli Celikyilmaz, Nanyun Peng, and Yuandong Tian, in Proceedings of the Twelfth International Conference on Learning Representations (ICLR), 2024.
Abstract
Bib Entry
@inproceedings{yang2024rlcd,
title = {RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment},
author = {Yang, Kevin and Klein, Dan and Celikyilmaz, Asli and Peng, Nanyun and Tian, Yuandong},
booktitle = {Proceedings of the Twelfth International Conference on Learning Representations (ICLR)},
year = {2024}
}
Related Publications