publications

2025

  1. Diffusion RLHF
    Fine-Tuning Diffusion Generative Models via Rich Preference Optimization
    Hanyang Zhao ,  Haoxian Chen ,  Yucheng Guo , and 5 more authors
    arXiv preprint arXiv:2503.11720, 2025
  2. Diffusion RLHF
    Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
    Hanyang Zhao ,  Haoxian Chen ,  Ji Zhang , and 2 more authors
    arXiv preprint arXiv:2502.01819, 2025
    Short version in DeLTa Workshop at ICLR 2025.
  3. ICLR 2025
    RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
    Hanyang Zhao* ,  Genta Indra Winata* ,  Anirban Das* , and 4 more authors
    International Conference on Learning Representations, 2025
  4. ICLR 2025
    MallowsPO: Fine-Tune Your LLM with Preference Dispersions
    Haoxian Chen* ,  Hanyang Zhao* ,  Henry Lam , and 2 more authors
    International Conference on Learning Representations, 2025
    Short version in Pluralistic Alignment Workshop at NeurIPS 2024.

2024

  1. NAACL 2025
    Worldcuisines: A massive-scale benchmark for multilingual and multicultural visual question answering on global cuisines
    Genta Indra Winata ,  Frederikus Hudi ,  Patrick Amadeus Irawan , and 8 more authors
    2025 Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics , 2024
  2. JAIR
    Preference tuning with human feedback on language, speech, and vision tasks: A survey
    Genta Indra Winata* ,  Hanyang Zhao* ,  Anirban Das* , and 4 more authors
    Journal of Artificial Intelligence Research, 2024
  3. Diffusion Models
    Score-based Diffusion Models via Stochastic Differential Equations–a Technical Tutorial
    Wenpin Tang ,  and  Hanyang Zhao
    arXiv preprint arXiv:2402.07487, 2024
  4. Diffusion Models
    Contractive diffusion probabilistic models
    Wenpin Tang ,  and  Hanyang Zhao
    arXiv preprint arXiv:2401.13115, 2024

2023

  1. NeurIPS 2023
    Policy optimization for continuous reinforcement learning
    Hanyang Zhao ,  Wenpin Tang ,  and  David Yao
    Advances in Neural Information Processing Systems, 2023