Education

Ph.D in Computer Science, University of California, Santa Cruz, 2019–Now
M.S. in Data Science, Brown University, 2018–2019 (Graduate in advance)
B.S. in Honors Science (Mathematics and Applied Mathematics), Xi’an Jiaotong University, 2014–2018
High school in Honors Youth (Gifted Young), Xi’an Jiaotong University, 2012–2014

Work Experience

Research Intern, Bytedance AI Lab (June, 2023 – September, 2023)
Project topic: Trustworthy Large Language Model with Limited Human Efforts
Paper 1: Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
[Under Review]
Paper 2: Human-Instruction-Free LLM Self-Alignment with Limited Samples
[Under Review]

Student Researcher, Google Research Brain Team (June, 2022 – September, 2022)
Project topic: Class-imbalanced learing, Distributional robustness optimziation
Paper: Distributionally Robust Post-hoc Classifiers under Prior Shifts
[First authored paper published in ICLR 2023]

Student Researcher, Google Research Brain Team (March, 2022 – May, 2022)
Project topic: Learning with crowd-sourced noisy labels
Paper: To Aggregate or Not? Learning with Separate Noisy Labels
[First authored paper published in KDD 2023]

Selected Publications

Large Language Models

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting
Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu
(Under Review) [Category: LLM Evaluation; LLM Alignment; In-Context Learning; Supervised Fine-Tuning]
Human-Instruction-Free LLM Self-Alignment with Limited Samples
Hongyi Guo, Yuanshun Yao, Wei Shen, Jiaheng Wei, Xiaoying Zhang, Zhaoran Wang, Yang Liu
(Under Review) [paper] [Category: LLM Alignment; In-Context Learning]

Trustworthy Machine Learning

To Aggregate or Not? Learning with Separate Noisy Labels
Jiaheng Wei*, Zhaowei Zhu*, Tianyi Luo, Ehsan Amid, Abhishek Kumar, Yang Liu
KDD – ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
[paper] [Category: Label Noise]
Distributionally Robust Post-hoc Clasifiers under Prior Shifts
Jiaheng Wei, Harikrishna Narasimhan, Ehsan Amid, Wensheng Chu, Yang Liu, Abhishek Kumar
ICLR – International Conference on Learning Representations, 2023
[paper] [code] [Category: Long-Tailed Learning, Group-Dro]
To Smooth or Not? When Label Smoothing Meets Noisy Labels
Jiaheng Wei, Hangyu Liu, Tongliang Liu, Gang Niu, Masashi Sugiyama, Yang Liu
ICML (Long Presentation) – International Conference on Machine Learning, 2022
[paper] [code] [Category: Label Noise]
Learning with Noisy Labels Revisited: A Study Using Real-World Human Annotations
Jiaheng Wei*, Zhaowei Zhu*, Hao Cheng, Tongliang Liu, Gang Niu, Yang Liu
ICLR – International Conference on Learning Representations, 2022
[paper] [data] [code] [Category: Label Noise]
When Optimizing f-divergence is Robust with Label Noise
Jiaheng Wei and Yang Liu
ICLR – International Conference on Learning Representations, 2021
[paper] [code] [Category: Label Noise]
Fairness Improve Learning from Noisily Labeled Long-Tailed Data
Jiaheng Wei, Zhaowei Zhu, Gang Niu, Tongliang Liu, Sijia Liu, Masashi Sugiyama, Yang Liu
(Under Review) [paper] [Category: Long-Tailed Learning]

Incentives in Machine Learning

Sample Elicitation
Jiaheng Wei*, Zuyue Fu*, Yang Liu, Xingyu Li, Zhuoran Yang, Zhaoran Wang
AISTATS – International Conference on Artificial Intelligence and Statistics, 2021
[paper] [code] [Category: Information Elicitation]
Auditing for Federated Learning: A Model Elicitation Approach
Yang Liu, Rixing Lou, Jiaheng Wei [Alphabetical order] DAI – Distributed AI, 2023
[paper] [Category: Information Elicitation]

Other Papers

DuelGAN: A Duel between Two Discriminators Stabilizes the GAN Training
Jiaheng Wei*, Minghao Liu*, Jiahao Luo, Andrew Zhu, James Davis, Yang Liu
ECCV – European Conference on Computer Vision, 2022
[paper] [code] [Category: Deep Generative Models]

(*: denotes equal contributions)

Talks

May, 2023 I gave an invited talk at AI-Time.
Apr, 2023 I gave a invited talk at the TMLR Young Scientist Seminar.
Mar, 2023 I gave a short talk in the Crowd Science workshop at WSDM 2023.
Oct, 2022 I gave an invited talk from Domain Adaptation Team at University of Toronto.
Aug, 2022 I gave an invited talk at AI-Time.
Jul, 2022 I gave an oral presentation at ICML 2022 (Deep Learning: Robustness).
Jun, 2022 I gave an invited talk at AI-Time.
Nov, 2021 I gave a short talk in the Weakly Supervised Learning (WSL) workshop at ACML 2021.

Conference Reviews

Reviewer:
AAAI 2020, AISTATS 2021, CVPR 2021, ICCV 2021, NeurIPS 2021, ICLR 2022, CVPR 2022, ICML 2022, IJCAI 2023, KDD 2023, TMLR, TPAMI, NeurIPS 2023, etc.
Organizer:
A hands-on tutorial on learning with noisy labels at IJCAI 2023.
1st Learning and Mining with Noisy Labels Challenge at IJCAI-ECAI 2022 [link].

Teaching Experience

Teaching Assistant at UCSC CSE 242: Machine Learning. (Fall, 2021)
Teaching Assistant at UCSC CSE 290-T: Computing for Society. (Spring, 2021)
Teaching Assistant at UCSC CSE 142: Machine Learning. (Spring, 2020)

Honor and Awards

The only recipient of Jack Baskin and Peggy Downes-Baskin Fellowship. (2023)
Travel Grant from International Conference on Machine Learning (ICML). (2022)
The First Prize of National Mount Everest Programme Scholarship in 2016-2017 Academic Year. (2017)
The Second Prize of National Mount Everest Programme Scholarship in 2015-2016 Academic Year. (2016)
The Second Prize in the 7th Chinese Mathematical Competitions. (2016)
Xi’an Jiaotong Siyuan Scholarship in 2015-2016 Academic Year. (2015)
Xi’an Jiaotong Siyuan Scholarship in 2014-2015 Academic Year. (2014)

Jiaheng Wei