Academic Homepage

Hongzheng Yang

Researcher at The Chinese University of Hong Kong working on world models, large language models, RL post-training, and AI alignment.

Research

Aligning model behavior through explicit control mechanisms

World Models and Generative Control

Training and aligning generative systems with explicit signals for temporal consistency, preference, and controllable behavior.

LLMs and RL Post-Training

Improving reasoning and robustness under noisy supervision through post-training objectives, entropy control, and reward design.

AI Alignment and Reliability

Developing faithful mechanisms for uncertainty calibration, safety concept control, and behavior alignment beyond benchmark accuracy.

Publications

Publications

For the most up-to-date list, please see my Google Scholar profile.

  1. 2026
    Scaling Language Model Reliability via Determinantal Point Process Prompt Sampling Z. Lin, D. Zhu, H. Yang, V. A. Nguyen Preprint, 2026
  2. 2026
    Concept Concentration for Faithful Representation Intervention H. Yang*, Y. Chen*, Z. Qin, T. Liu, C. Xiao, K. Zhang, B. Han ICML 2026
  3. 2026
    RSTFA: Efficient Training-Free Human Preference Alignment via Rejection Sampling for Text-to-Image Diffusion Models H. Yang, J. C. L. Li, K. Liu, W. Ma, M. Xu, Y. Zhao, L. M. Po IEEE Transactions on Image Processing, 2026
  4. 2025
    From Exploration to Exploitation: A Two-Stage Entropy RLVR Approach for Noise-Tolerant MLLM Training D. Xu*, H. Yang*, Y. Zhao, P. Zhang, J. Chen, W. Ma, Z. Hou, M. Wu, X. Li, S. Hu, Z. Guan, J. C. L. Li, L. M. Po CVPR 2026
  5. 2023
    Uncertainty Estimation for Safety-Critical Scene Segmentation via Fine-Grained Reward Maximization H. Yang*, C. Chen*, Y. Chen, H. C. Yip, Q. Dou NeurIPS 2023
  6. 2022
    DLTTA: Dynamic Learning Rate for Test-Time Adaptation on Cross-Domain Medical Images H. Yang, C. Chen, M. Jiang, Q. Liu, J. Cao, P. A. Heng, Q. Dou IEEE Transactions on Medical Imaging, 2022

* indicates equal contribution.

Academic Service

Professional activities

  • Reviewer: ICML, NeurIPS
  • Reviewer: ECCV, CVPR
  • Reviewer: COLM

Recognition

Honors and awards

  • Champion, MICCAI FeTS Challenge, Generalization "In The Wild" track
  • Bachelor's Thesis with Distinction
  • Honor Student Scholarship for all academic years, Beihang University

Teaching

Teaching experience

2023-2024 Fall

Fundamentals of Artificial Intelligence

CSCI 3230 & ESTR 3108

2022-2023 Fall

Fundamentals of Artificial Intelligence

CSCI 3230 & ESTR 3108