Wenhao Zhu (朱文昊)
|
Ph.D. Candidate, Nanjing University
Natural Language Processing Research Group
Supervisor: Prof. Shujian Huang & Prof. Jiajun Chen
State Key Laboratory of Novel Software Technology
School of Computer Science
Email: zhuwh{at}smail.nju.edu.cn
[Twitter][Github][CV][Google Scholar]
|
About me
I am currently a final-year PhD candidate at School of Computer Science, Nanjing University, and a member of NJU-NLP Research Group, under the supervision of Prof. Shujian Huang and Prof. Jiajun Chen. Starting from October 2023, I have been fortunate to spend one-year visiting as a PhD student under the supervision of Prof. Alexandra Birch at StatMT Group, The University of Edinburgh. I have also had the privilege to collaborate with Prof. Lei Li, whose mentorship and insights have been invaluable to my research. Since September 2022, I have been working as a research intern at Shanghai AI Lab, working with Dr. Jingjing Xu and Prof. Lingpeng Kong. I am deeply grateful for the guidance and support from all my mentors throughout my academic journey. Before pursuing my PhD, I received my B.E. degree from the School of Management & Engineering at Nanjing University in June 2019.
My research interest lies in multilingual large language model (LLM) and machine translation (MT).
Pre-prints
-
Generalizing from Short to Long: Effective Data Synthesis for Long-Context Instruction Tuning
Wenhao Zhu, Pinzhen Chen, Hanxu Hu, Shujian Huang, Fei Yuan, Jiajun Chen, Alexandra Birch
arXiv 2502.15592
[Paper]
[Code]
-
BenchMaX: A Comprehensive Multilingual Evaluation Suite For Large Language Models
Xu Huang, Wenhao Zhu, Hanxu Hu, Conghui He, Lei Li, Shujian Huang, Fei Yuan
arXiv 2502.07346
[Paper]
[Code]
-
The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Wenhao Zhu, Shujian Huang, Fei Yuan, Cheng Chen, Jiajun Chen, Alexandra Birch
arXiv 2405.01345
[Paper]
[Code]
-
Extrapolating Large Language Models to Non-English by Aligning Languages
Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li
arXiv 2308.04948
[Paper]
[Code]
[Blog]
Published Papers
-
Multilingual Contrastive Decoding via Language-agnostic Layers Skipping
Wenhao Zhu, Sizhe Liu, Shujian Huang, Shuaijie She, Chris Wendler, Jiajun Chen
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Findings, 2024
[Paper]
[Code]
[Blog]
-
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
Yinquan Lu, Wenhao Zhu, Lei Li, Yu Qiao, Fei Yuan
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Findings, 2024
[Paper]
[Code]
-
Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners
Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
[Paper]
[Code]
-
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
Zixian Huang, Wenhao Zhu, Gong Cheng, Lei Li, Fei Yuan
In the Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS), 2024
[Paper]
[Code]
-
Question Translation Training for Better Multilingual Reasoning
Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch
In Proceedings of the 62th Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2024
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization
Shuaijie She , Wei Zou , Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen
In Proceedings of the 62th Annual Meeting of the Association for Computational Linguistics (ACL), 2024
[Paper]
[Code]
-
Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis
Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li
In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) - Findings, 2024
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
kNN-BOX: A Unified Framework for Nearest Neighbor Generation
Wenhao Zhu, Qianfeng Zhao, Yunzhe Lv, Shujian Huang, Siheng Zhao, Sizhe Liu, Jiajun Chen
In the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL) - Demo Track, 2024
[Paper]
[Slides]
[Code]
[Video]
-
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Kanzhi Cheng, Wenpo Song, Zheng Ma, Wenhao Zhu, Zixuan Zhu, Jianbing Zhang
In Proceedings of the 31th ACM International Conference on Multimedia (ACM-MM), 2023
[Paper]
[Code]
[Blog]
-
INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation
Wenhao Zhu, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen
In Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL), 2023
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
Lego-MT: Towards Detachable Models in Massively Multilingual Machine Translation
Fei Yuan, Yinquan Lu, Wenhao Zhu, Lingpeng Kong, Lei Li, Jingjing Xu
In Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023
[Paper]
[Code]
-
What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation
Wenhao Zhu, Shujian Huang, Yunzhe Lv, Xin Zheng, Jiajun Chen
In Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation
Wenhao Zhu, Shujian Huang, Tong Pu, Pingxuan Huang, Xu Zhang, Jian Yu, Wei Chen, Yanfeng Wang, Jiajun Chen
In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), 2022
[Paper]
[Dataset]
[Slides]
[Video]
[Blog]
-
Improving Bilingual Lexicon Induction on Distant Language Pairs
Wenhao Zhu, Zhihao Zhou, Shujian Huang, Zhenya Lin, Xiangsheng Zhou, Yaofeng Tu, Jiajun Chen
In the 15th China Conference on Machine Translation (CCMT), 2019. Best English Paper Award.
[Paper]
[Slides]
[Blog]
Talks
-
Research and Challenges of Multilingual Large Language Models, invited talk, at NLPCC'2024.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
-
Research Topic Selection in the Age of Large Language Models, invited talk, at MLNLP'2024
Wenhao Zhu
[Slides]
[Video]
-
Research Topic Selection in the Age of Large Language Models, invited talk, at CCMT'2023
Wenhao Zhu
[Slides]
[Video]
-
INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation &
What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation, invited talk, at AIS'2023
Wenhao Zhu
[Slides]
-
Analyzing Multilingual Machine Translation Ability of Large Language Models, invited talk, at HIT.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
-
K-Nearest-Neighobor Machine Translation, invited talk, at MLNLP'2022.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
-
K-Nearest-Neighobor Machine Translation, invited tutorial, at NLPCC'2022.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
Services
-
Area Chair
ACL Rolling Review (ARR), 2025
Annual Meeting of the Association for Computational Linguistics (ACL), 2025
-
Conference Reviewer / Journal Reviewer
Conferen on Language Modeling (COLM), 2025
ACL Rolling Review (ARR), 2023-2024
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Annual Meeting of the Association for Computational Linguistics (ACL), 2023-2024
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023-2024
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
-
Teaching Assistant / Teaching Support
Natural Language Understanding, Generation, and Machine Translation, UoE, Spring 2024
Machine Translation and Natural Language Generation, NJU, Spring 2023
Programming for Artificial Intelligence, NJU, Spring 2021
Advanced Programming, NJU, Autumn 2020
Experience
Correpondence
1024, Computer Science and Technology Building
Xianlin Campus of Nanjing University
163 Xianlin Avenue, Nanjing 210023, China
Last Update: Feb 2025
|