Introduction November 23, 2023 less than 1 minute read I set up this website to record and share personal learning blog. TODO Review Notes Share on X Facebook LinkedIn Bluesky Previous Next
Smooth marginal aware preference learning August 1, 2025 4 minute read Learning from Preferences with Stability: A Deep Dive into Marginal-Aware Fine-Tuning