Yongyi is a postdoctoral researcher in the CBS-NTT Program in Physics of Intelligence at Harvard University, working with Hidenori Tanaka. His research focuses on understanding the foundations and principles of deep learning, spanning research areas such as deep learning theory, science of deep learning, and mechanistic interpretability. Besides, he is also broadly interested in many other research topics in AI and theory, including efficient implementation of AI algorithms and learning on graphs.
Yongyi received his Ph.D. from the University of Michigan, advised by Wei Hu. Before that, he received his Bachelor of Science from Fudan University, under the advisedment of Xipeng Qiu, David Wipf and Zengfeng Huang.
Besides academic research, Yongyi also has a passion for mathematics, Chinese classical literature and Xiaoxue. Feel free to contact if you would like to connect.
I recently published an npm package mouseless, that helps to define high-level keyboard interactions in UI development.
I’ve added a page to collect some problems I’ve encountered during my research that I haven’t yet solved. See Problems. If you have any insights on (or just want to discuss about) any of them, I would greatly appreciate hearing from you.
SNIP: An Adaptive Mixed Precision Framework for Subbyte Large Language Model Training Yunjie Pan, Yongyi Yang, Hanmei Yang, Scott Mahlke
ASPLOS 2026
mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations Yongyi Yang*, Jianyang Gao*
arxiv preprint
An Equivariance Toolbox for Learning Dynamics Yongyi Yang, Liu Ziyin
arxiv preprint
Topological Invariance and Breakdown in Learning Yongyi Yang, Tomaso Poggio, Isaac Chuang, Liu Ziyin
arxiv preprint
Provable Low-Frequency Bias of In-Context Learning of Representations Yongyi Yang, Hidenori Tanaka, Wei Hu
arxiv preprint
New Evidence of the Two-Phase Learning Dynamics of Neural Networks Zhanpeng Zhou, Yongyi Yang, Mahito Sugiyama, Junchi Yan
arxiv preprint
RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm Yongyi Yang, Jianyang Gao, Wei Hu
arxiv preprint
ICLR: In-Context Learning of Representations Core Francisco Park*, Andrew Lee*, Ekdeep Singh Lubana*, Yongyi Yang*, Maya Okawa, Kento Nishi, Martin Wattenberg, Hidenori Tanaka
ICLR 2025
Swing-by Dynamics in Concept Learning and Compositional Generalization Yongyi Yang, Core Francisco Park, Ekdeep Singh Lubana, Maya Okawa, Wei Hu, Hidenori Tanaka
ICLR 2025
Practical and Asymptotically Optimal Quantization of High-Dimensional Vectors in Euclidean Space for Approximate Nearest Neighbor Search Jianyang Gao, Yutong Gou, Yuexuan Xu, Yongyi Yang, Cheng Long, Raymond Chi-Wing Wong
SIGMOD 2025
arXiv preprint arXiv:2409.09913 (September, 2024)
HERTA: A High-Efficiency and Rigorous Training Algorithm for Unfolded Graph Neural Networks Yongyi Yang, Jiaming Yang, Wei Hu, Michał Dereziński
arxiv preprint
Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity Zhanpeng Zhou, Yongyi Yang, Xiaojiang Yang, Junchi Yan, Wei Hu
Neurips 2023
Are Neurons Actually Collapsed? On the Fine-Grained Structure in Neural Representations Yongyi Yang, Jacob Steinhardt, Wei Hu
ICML 2023
Descent Steps of a Relation-Aware Energy Produce Heterogeneous Graph Neural Networks Hongjoon Ahn,Yongyi Yang, Quan Gan, David Wipf, Taesup Moon
Neurips 2022
Transformers from an Optimization Perspective Yongyi Yang, Zengfeng Huang, David Wipf
Neurips 2022
Why Propagate Alone? Parallel Use of Labels and Features on Graphs Yangkun Wang, Jiarui Jin, Weinan Zhang, Yongyi Yang, Jiuhai Chen, Quan Gan, Yong Yu, Zheng Zhang, Zengfeng Huang, David Wipf
ICLR 2022
Graph Neural Networks Inspired by Classical Iterative Algorithms Yongyi Yang , Tang Liu, Yangkun Wang, Jinjing Zhou, Quan Gan, Zhewei Wei, Zheng Zhang, Zengfeng Huang, David Wipf
ICML 2021, long talk
Implicit vs Unfolded Graph Neural Networks Yongyi Yang, Yangkun Wang, Tang Liu, Zengfeng Huang, David Wipf
JMLR vol. 26, 2025
arXiv preprint arXiv:2111.06592 (2021)
Relation of the Relations: A New Paradigm of the Relation Extraction Problem Zhijing Jin*, Yongyi Yang*, Xipeng Qiu, Zheng Zhang
arxiv preprint
(Last update: 05/28/2026)