ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)

2 views

Yannic Kilcher

12 days ago

ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)

ORPO: Monolithic Preference Optimization without Reference Model (Paper Explained)