Share this page:

Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization

Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, and Aditya Grover, in Data-centric Machine Learning Research (DMLR) Workshop at The International Conference on Machine Learning (ICML), 2024.

Abstract


Bib Entry

@inproceedings{bansal2024alignment,
  author = {Bansal, Hritik and Suvarna, Ashima and Bhatt, Gantavya and Peng, Nanyun and Chang, Kai-Wei and Grover, Aditya},
  title = {Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization},
  booktitle = {Data-centric Machine Learning Research (DMLR) Workshop at The International Conference on Machine Learning (ICML)},
  year = {2024}
}

Related Publications