Papers

MallowsPO: Fine-Tune Your LLM with Preference Dispersions

ICLR

2025

Haoxian Chen, Hanyang Zhao, Henry Lam, David Yao, Wenpin Tang

Direct Preference Optimization (DPO) has recently emerged as a popular approach to improve reinforcement learning from human feedback (RLHF), leading to better techniques to fine-tune large language models (LLM). A weakness of DPO, however, lies in its lack of capability to characterize the diversity of human preferences. Inspired by Mallows' theory of preference ranking, we develop in this paper a new approach, the *MallowsPO*. A distinct feature of this approach is a *dispersion index*, which reflects the dispersion of human preference to prompts. We show that existing DPO models can be reduced to special cases of this dispersion index, thus unified with MallowsPO. More importantly, we demonstrate empirically how to use this dispersion index to enhance the performance of DPO in a broad array of benchmark tasks, from synthetic bandit selection to controllable generation and dialogues, while maintaining great generalization capabilities. MallowsPO is also compatible with other SOTA offline preference optimization methods, boosting nearly 2\% extra LC win rate when used as a plugin for fine-tuning Llama3-Instruct.

MallowsPO: Fine-Tune Your LLM with Preference Dispersions

Circuit Transformer: A Transformer That Preserves Logical Equivalence

Tree-Wasserstein Distance for High Dimensional Data with a Latent Feature Hierarchy

CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification

SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix

Distilling Dataset into Neural Field

Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning

Spiking Vision Transformer with Saccadic Attention

Online Selection Problems against Constrained Adversary

Bayesian Pose Graph Optimization via Bingham Distributions and Tempered Geodesic MCMC

Diffusion Feature Field for Text-based 3D Editing with Gaussian Splatting

Diffusion Feedback Helps CLIP See Better

SAPE: Spatially-Adaptive Progressive Encoding for Neural Optimization

CPR for CSPs: A Probabilistic Relaxation of Constraint Propagation

Feature-distributed sparse regression: a screen-and-clean approach

Image Editing As Programs with Diffusion Models

Learning values across many orders of magnitude

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

InsightBench: Evaluating Business Analytics Agents Through Multi-Step Insight Generation

State Relevance for Off-Policy Evaluation

ProtPainter: Draw or Drag Protein via Topology-guided Diffusion

Tree of Preferences for Diversified Recommendation

Efficient Interpolation between Extragradient and Proximal Methods for Weak MVIs

Optimizing (L0​,L1​)-Smooth Functions by Gradient Methods

REMI: Reconstructing Episodic Memory During Internally Driven Path Planning

NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals

Gradient-Free Generation for Hard-Constrained Systems

Variable KD-Tree Algorithms for Spatial Pattern Search and Discovery

One Hundred Neural Networks and Brains Watching Videos: Lessons from Alignment

Active Labeling: Streaming Stochastic Gradients

UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models

Learning View-invariant World Models for Visual Robotic Manipulation

DRESSing Up LLM: Efficient Stylized Question-Answering via Style Subspace Editing

Predicting the Energy Landscape of Stochastic Dynamical System via Physics-informed Self-supervised Learning

Preprocessors Matter! Realistic Decision-Based Attacks on Machine Learning Systems

Learning the structure of manifolds using random projections

A Biased Graph Neural Network Sampler with Near-Optimal Regret

Federated Class-Incremental Learning: A Hybrid Approach Using Latent Exemplars and Data-Free Techniques to Address Local and Global Forgetting

Score-based Self-supervised MRI Denoising

CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models

CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes

One Stone with Two Birds: A Null-Text-Null Frequency-Aware Diffusion Models for Text-Guided Image Inpainting

StableGuard: Towards Unified Copyright Protection and Tamper Localization in Latent Diffusion Models

LiteReality: Graphic-Ready 3D Scene Reconstruction from RGB-D Scans

Process Reward Model with Q-value Rankings

HaDeMiF: Hallucination Detection and Mitigation in Large Language Models

Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs

XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation

Gaussian Processes in Reinforcement Learning

A unifying framework for vector-valued manifold regularization and multi-view learning

Optimizing $(L_{0}, L_{1})$ -Smooth Functions by Gradient Methods