Wenhao Zhu (朱文昊)

alt text 

Ph.D. Candidate, Nanjing University
Natural Language Processing Research Group
Supervisor: Prof. Shujian Huang & Prof. Jiajun Chen

State Key Laboratory of Novel Software Technology
School of Computer Science

Email: zhuwh{at}smail.nju.edu.cn
[Twitter][Github][CV][Google Scholar]

About me

I am now a final-year PhD student at the School of Computer Science in Nanjing University and a member of NJUNLP Group. Before that, I received the B.E. degree in School of Management & Engineering in June 2019 from Nanjing University.

My research interest lies in multilingual large language model (LLM) and machine translation (MT).

Experience

Pre-prints

  • The Power of Question Translation Training in Multilingual Reasoning: Broadened Scope and Deepened Insights

    Wenhao Zhu, Shujian Huang, Fei Yuan, Cheng Chen, Jiajun Chen, Alexandra Birch

    arXiv 2405.01345

    [Paper] [Code]

  • Extrapolating Large Language Models to Non-English by Aligning Languages

    Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li

    arXiv 2308.04948

    [Paper] [Code] [Blog]

Published Papers

  • Multilingual Contrastive Decoding via Language-agnostic Layers Skipping

    Wenhao Zhu, Sizhe Liu, Shujian Huang, Shuaijie She, Chris Wendler, Jiajun Chen

    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Findings, 2024

    [Paper] [Code]

  • LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

    Yinquan Lu, Wenhao Zhu, Lei Li, Yu Qiao, Fei Yuan

    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP) - Findings, 2024

    [Paper] [Code]

  • Large Language Models are Good Spontaneous Multilingual Learners: Is the Multilingual Annotated Data Necessary?

    Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang

    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

    [Paper] [Code]

  • MindMerger: Efficient Boosting LLM Reasoning in non-English Languages

    Zixian Huang, Wenhao Zhu, Gong Cheng, Lei Li, Fei Yuan

    In the Thirty-eighth Conference on Neural Information Processing Systems (NeurIPS), 2024

    [Paper] [Code]

  • Question Translation Training for Better Multilingual Reasoning

    Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch

    In Proceedings of the 62th Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2024

    [Paper] [Code] [Slides] [Blog]

  • MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization

    Shuaijie She , Wei Zou , Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen

    In Proceedings of the 62th Annual Meeting of the Association for Computational Linguistics (ACL), 2024

    [Paper] [Code]

  • Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

    Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li

    In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) - Findings, 2024

    [Paper] [Code] [Slides] [Video] [Blog]

  • kNN-BOX: A Unified Framework for Nearest Neighbor Generation

    Wenhao Zhu, Qianfeng Zhao, Yunzhe Lv, Shujian Huang, Siheng Zhao, Sizhe Liu, Jiajun Chen

    In the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL) - Demo Track, 2024

    [Paper] [Slides] [Code] [Video]

  • Research Development of Machine translation and Large Language Model

    Wenhao Zhu, Hao Zhou, Changjiang Gao, Sizhe Liu, Shujian Huang

    In the 21st China National Conference on Computational Linguistics (CCL), 2023

    [Paper]

  • Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model

    Kanzhi Cheng, Wenpo Song, Zheng Ma, Wenhao Zhu, Zixuan Zhu, Jianbing Zhang

    In Proceedings of the 31th ACM International Conference on Multimedia (ACM-MM), 2023

    [Paper] [Code] [Blog]

  • INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation

    Wenhao Zhu, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen

    In Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL), 2023

    [Paper] [Code] [Slides] [Video] [Blog]

  • Lego-MT: Towards Detachable Models in Massively Multilingual Machine Translation

    Fei Yuan, Yinquan Lu, Wenhao Zhu, Lingpeng Kong, Lei Li, Jingjing Xu

    In Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023

    [Paper] [Code]

  • What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation

    Wenhao Zhu, Shujian Huang, Yunzhe Lv, Xin Zheng, Jiajun Chen

    In Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023

    [Paper] [Code] [Slides] [Video] [Blog]

  • FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation

    Wenhao Zhu, Shujian Huang, Tong Pu, Pingxuan Huang, Xu Zhang, Jian Yu, Wei Chen, Yanfeng Wang, Jiajun Chen

    In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), 2022

    [Paper] [Dataset] [Slides] [Video] [Blog]

  • Improving Bilingual Lexicon Induction on Distant Language Pairs

    Wenhao Zhu, Zhihao Zhou, Shujian Huang, Zhenya Lin, Xiangsheng Zhou, Yaofeng Tu, Jiajun Chen

    In the 15th China Conference on Machine Translation (CCMT), 2019. Best English Paper Award.

    [Paper] [Slides] [Blog]

Talks

  • Research and Challenges of Multilingual Large Language Models, invited talk, at NLPCC'2024.

    Shujian Huang, Wenhao Zhu

    [Slides]

  • Research Topic Selection in the Age of Large Language Models, invited talk, at MLNLP'2024

    Wenhao Zhu

    [Slides] [Video]

  • Research Topic Selection in the Age of Large Language Models, invited talk, at CCMT'2023

    Wenhao Zhu

    [Slides] [Video]

  • INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation &

    What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation, invited talk, at AIS'2023

    Wenhao Zhu

    [Slides]

  • Analyzing Multilingual Machine Translation Ability of Large Language Models, invited talk, at HIT.

    Shujian Huang, Wenhao Zhu

    [Slides] [Video]

  • K-Nearest-Neighobor Machine Translation, invited talk, at MLNLP'2022.

    Shujian Huang, Wenhao Zhu

    [Slides] [Video]

  • K-Nearest-Neighobor Machine Translation, invited tutorial, at NLPCC'2022.

    Shujian Huang, Wenhao Zhu

    [Slides] [Video]

Services

  • Program Committee / Conference Reviewer / Journal Reviewer

    ACL Rolling Review (ARR), 2023-2024

    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2024

    Annual Meeting of the Association for Computational Linguistics (ACL), 2023-2024

    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023-2024

    Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024

  • Teaching Assistant / Teaching Support

    Natural Language Understanding, Generation, and Machine Translation, UoE, Spring 2024

    Machine Translation and Natural Language Generation, NJU, Spring 2023

    Programming for Artificial Intelligence, NJU, Spring 2021

    Advanced Programming, NJU, Autumn 2020

Correpondence

1024, Computer Science and Technology Building

Xianlin Campus of Nanjing University

163 Xianlin Avenue, Nanjing 210023, China


Last Update: Nov 2024