Bio
I am a final-year PhD student at the School of CS, the University of Sydney (USYD), under the supervision of Prof. Dacheng Tao. Before that, I received BEng from SASEE, Beihang University and MPhil in Computer Science from USYD in 2019 and 2022, respectively. My research interests include RL post-training, LLM reasoning, and data-centric AI.
Selected Publications
* indicates co-first authors
Revisiting LLM Reasoning via Information Bottleneck [paper]
Shiye Lei, Zhihao Cheng, Kai Jia, and Dacheng Tao
arXiv preprint, 2025Offline Behavioral Data Selection
Shiye Lei, Zhihao Cheng, and Dacheng Tao
SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2026Image Captions are Natural Prompts for Training Data Synthesis [paper][code]
Shiye Lei*, Hao Chen*, Sen Zhang, Bo Zhao, and Dacheng Tao
International Journal of Computer Vision (IJCV), 2025Offline Behavior Distillation [paper][code][poster]
Shiye Lei, Sen Zhang, and Dacheng Tao
Advances in Neural Information Processing Systems (NeurIPS), 2024A Comprehensive Survey of Dataset Distillation [paper]
Shiye Lei and Dacheng Tao
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024Attentive Learning Facilitates Generalization of Neural Networks [paper][code]
Shiye Lei, Fengxiang He, Haowen Chen, and Dacheng Tao
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2024Understanding Deep Learning via Decision Boundary [paper]
Shiye Lei, Fengxiang He, Yancheng Yuan, and Dacheng Tao
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2023
Teaching Assistant
- COMP5318: Machine Learning and Data Mining, 2024 S2 @ USYD
Academic Service
Conference reviewer: ICML, NeurIPS, ICLR, AISTATS, CVPR, ICCV, AAAI, ACM MM, etc.
Journal reviewer: JMLR, Springer Machine Learning, Neurocomputing, etc.
