Jiangchao Yao


Research Focus

[Topics evolve stage by stage, following the trending challenges in AI]


Efficient Pretraining, Serving and Adaptation

  • Naive pretraining on Chest X-ray Data: [UniChest (TMI'24)]

  • Training acceleration from the perspective of data selection: [DivBS (ICML'24)]

  • Efficient Serving: [LoRKD (CVPR'24)]

  • Adaptation: [RD (MICCAI'24)]


Generalized Imbalanced Learning

Imbalanced learning is an old topic in machine learning; although it has been studied for (at least) two decades, it still lacks a solid foundation spanning theory to algorithm. The reason we re-focus on this problem is that the generalization of algorithms in a broad sense has recently drawn more attention, especially in the context of the pretraining paradigm. The underlying evaluation metric, a holistic measure over each class, each task, and each domain, coincidentally resembles the fine-grained measures used in imbalanced learning. This motivates us to extend imbalanced learning to boost typical paradigms such as self-supervised learning, weakly-supervised learning, and generative modeling for better generalization. The following taxonomy is organized mainly by the aspect each research work considers, although in practice these imbalance types may mix.
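As a small illustration of such a fine-grained measure (a sketch of our own for this page, not taken from any particular paper), the macro-averaged per-class accuracy below weights every class equally, so strong performance on head classes cannot hide failures on tail classes:

    import numpy as np

    def per_class_accuracy(y_true, y_pred, num_classes):
        # Macro average: compute accuracy within each class, then take the mean.
        accs = []
        for c in range(num_classes):
            mask = (y_true == c)
            if mask.sum() == 0:  # skip classes absent from the evaluation set
                continue
            accs.append((y_pred[mask] == c).mean())
        return float(np.mean(accs))

    # Toy example: 90 head-class samples vs. 10 tail-class samples.
    y_true = np.array([0] * 90 + [1] * 10)
    y_pred = np.zeros(100, dtype=int)              # always predicts the head class
    print((y_pred == y_true).mean())               # overall accuracy: 0.90
    print(per_class_accuracy(y_true, y_pred, 2))   # per-class (balanced) accuracy: 0.50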


Noise Robust Machine Learning

Perturbations are ubiquitous in real-world data, and a proper amount can actually robustify the training of machine learning models; this is common in training practices such as label smoothing, dropout, and data augmentation with randomness. However, when the perturbation is excessive or deliberate, or even emerges only during serving, special designs should be considered to reduce its negative impact. Motivated by this belief, we have developed a range of methods as references along this direction.
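As a minimal sketch of the benign side of such perturbation (an illustration of generic label smoothing, not of a specific method from our papers), the snippet below softens one-hot targets before computing the cross-entropy loss:

    import numpy as np

    def smooth_labels(y_true, num_classes, eps=0.1):
        # Put (1 - eps) on the true class and spread eps uniformly over all classes.
        onehot = np.eye(num_classes)[y_true]
        return (1.0 - eps) * onehot + eps / num_classes

    def cross_entropy(probs, targets):
        # Mean cross-entropy between predicted probabilities and (soft) targets.
        return float(-(targets * np.log(probs + 1e-12)).sum(axis=1).mean())

    y_true = np.array([0, 2, 1])
    probs = np.array([[0.7, 0.2, 0.1],
                      [0.1, 0.1, 0.8],
                      [0.2, 0.6, 0.2]])
    print(cross_entropy(probs, np.eye(3)[y_true]))         # hard one-hot targets
    print(cross_entropy(probs, smooth_labels(y_true, 3)))  # smoothed targets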