Wenhao Zhu (朱文昊)
About me
I am a final-year PhD student at the School of Computer Science, Nanjing University, and a member of the NJUNLP Group. Before that, I received my B.E. degree from the School of Management & Engineering, Nanjing University, in June 2019.
My research interests lie in multilingual large language models (LLMs) and machine translation (MT).
Experience
Pre-prints
-
The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights
Wenhao Zhu, Shujian Huang, Fei Yuan, Cheng Chen, Jiajun Chen, Alexandra Birch
arXiv 2405.01345
[Paper]
[Code]
-
Extrapolating Large Language Models to Non-English by Aligning Languages
Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li
arXiv 2308.04948
[Paper]
[Code]
[Blog]
Published Papers
-
Multilingual Contrastive Decoding via Language-agnostic Layers Skipping
Wenhao Zhu, Sizhe Liu, Shujian Huang, Shuaijie She, Chris Wendler, Jiajun Chen
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Findings, 2024
[Paper]
[Code]
-
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages
Yinquan Lu, Wenhao Zhu, Lei Li, Yu Qiao, Fei Yuan
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Findings, 2024
[Paper]
[Code]
-
Large Language Models are Good Spontaneous Multilingual Learners: Is the Multilingual Annotated Data Necessary?
Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang
In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
[Paper]
[Code]
-
MindMerger: Efficient Boosting LLM Reasoning in non-English Languages
Zixian Huang, Wenhao Zhu, Gong Cheng, Lei Li, Fei Yuan
In the Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS), 2024
[Paper]
[Code]
-
Question Translation Training for Better Multilingual Reasoning
Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch
In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2024
[Paper]
[Code]
[Slides]
[Blog]
-
MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization
Shuaijie She, Wei Zou, Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen
In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024
[Paper]
[Code]
-
Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis
Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li
In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) - Findings, 2024
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
kNN-BOX: A Unified Framework for Nearest Neighbor Generation
Wenhao Zhu, Qianfeng Zhao, Yunzhe Lv, Shujian Huang, Siheng Zhao, Sizhe Liu, Jiajun Chen
In the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL) - Demo Track, 2024
[Paper]
[Slides]
[Code]
[Video]
-
Research Development of Machine Translation and Large Language Model
Wenhao Zhu, Hao Zhou, Changjiang Gao, Sizhe Liu, Shujian Huang
In the 21st China National Conference on Computational Linguistics (CCL), 2023
[Paper]
-
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model
Kanzhi Cheng, Wenpo Song, Zheng Ma, Wenhao Zhu, Zixuan Zhu, Jianbing Zhang
In Proceedings of the 31st ACM International Conference on Multimedia (ACM-MM), 2023
[Paper]
[Code]
[Blog]
-
INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation
Wenhao Zhu, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
Lego-MT: Towards Detachable Models in Massively Multilingual Machine Translation
Fei Yuan, Yinquan Lu, Wenhao Zhu, Lingpeng Kong, Lei Li, Jingjing Xu
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023
[Paper]
[Code]
-
What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation
Wenhao Zhu, Shujian Huang, Yunzhe Lv, Xin Zheng, Jiajun Chen
In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023
[Paper]
[Code]
[Slides]
[Video]
[Blog]
-
FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation
Wenhao Zhu, Shujian Huang, Tong Pu, Pingxuan Huang, Xu Zhang, Jian Yu, Wei Chen, Yanfeng Wang, Jiajun Chen
In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), 2022
[Paper]
[Dataset]
[Slides]
[Video]
[Blog]
-
Improving Bilingual Lexicon Induction on Distant Language Pairs
Wenhao Zhu, Zhihao Zhou, Shujian Huang, Zhenya Lin, Xiangsheng Zhou, Yaofeng Tu, Jiajun Chen
In the 15th China Conference on Machine Translation (CCMT), 2019. Best English Paper Award.
[Paper]
[Slides]
[Blog]
Talks
-
Research and Challenges of Multilingual Large Language Models, invited talk, at NLPCC'2024.
Shujian Huang, Wenhao Zhu
[Slides]
-
Research Topic Selection in the Age of Large Language Models, invited talk, at MLNLP'2024
Wenhao Zhu
[Slides]
[Video]
-
Research Topic Selection in the Age of Large Language Models, invited talk, at CCMT'2023
Wenhao Zhu
[Slides]
[Video]
-
INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation &
What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation, invited talk, at AIS'2023
Wenhao Zhu
[Slides]
-
Analyzing Multilingual Machine Translation Ability of Large Language Models, invited talk, at HIT.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
-
K-Nearest-Neighbor Machine Translation, invited talk, at MLNLP'2022.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
-
K-Nearest-Neighbor Machine Translation, invited tutorial, at NLPCC'2022.
Shujian Huang, Wenhao Zhu
[Slides]
[Video]
Services
-
Program Committee / Conference Reviewer / Journal Reviewer
ACL Rolling Review (ARR), 2023-2024
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024
Annual Meeting of the Association for Computational Linguistics (ACL), 2023-2024
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023-2024
Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024
-
Teaching Assistant / Teaching Support
Natural Language Understanding, Generation, and Machine Translation, UoE, Spring 2024
Machine Translation and Natural Language Generation, NJU, Spring 2023
Programming for Artificial Intelligence, NJU, Spring 2021
Advanced Programming, NJU, Autumn 2020
Correspondence
1024, Computer Science and Technology Building
Xianlin Campus of Nanjing University
163 Xianlin Avenue, Nanjing 210023, China
Last Update: Nov 2024