Policy-Based Trajectory Clustering in Offline Reinforcement Learning

Conference on Uncertainty in Artificial Intelligence (UAI 2026), 2026

Recommended citation: Xinqi Wang, Hao Hu, Simon S. Du. (2026). "Policy-Based Trajectory Clustering in Offline Reinforcement Learning." Conference on Uncertainty in Artificial Intelligence (UAI). https://arxiv.org/abs/2506.09202

[Download paper here](/files/policy-trajectory-clustering.pdf)