RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment

Share this page:

RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment

Kevin Yang, Dan Klein, Asli Celikyilmaz, Nanyun Peng, and Yuandong Tian, in Proceedings of the Twelfth International Conference on Learning Representations (ICLR), 2024.

Download the full text

Abstract

Bib Entry

@inproceedings{yang2024rlcd,
  title = {RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment},
  author = {Yang, Kevin and Klein, Dan and Celikyilmaz, Asli and Peng, Nanyun and Tian, Yuandong},
  booktitle = {Proceedings of the Twelfth International Conference on Learning Representations (ICLR)},
  year = {2024}
}

Related Publications