The Hidden Infinity in Preference Learning
09 July 2024
An illustration of how length normalization aids learning from model-annotated data
Using LESS Data to Tune Models
04 April 2024
Data Selection in the Era of LLMs
How to Scale Hyperparameters as Batch Size Increases
22 January 2024
Understanding Optimization using Stochastic Differential Equations