Wenhao Zhu (朱文昊)


Ph.D. Candidate, Nanjing University
Natural Language Processing Research Group
Supervisors: Prof. Shujian Huang & Prof. Jiajun Chen

State Key Laboratory of Novel Software Technology
Department of Computer Science and Technology

Email: zhuwh{at}smail.nju.edu.cn
[Twitter] [GitHub] [Google Scholar]

About me

I am a fifth-year Ph.D. student in the Department of Computer Science and Technology at Nanjing University and a member of the NJUNLP Group. Before that, I received my B.E. degree from the School of Management & Engineering at Nanjing University in June 2019.

My research interests lie in multilingual large language models (LLMs) and machine translation (MT).

Experience

Pre-prints

  • Question Translation Training for Better Multilingual Reasoning

    Wenhao Zhu, Shujian Huang, Fei Yuan, Shuaijie She, Jiajun Chen, Alexandra Birch

    arXiv 2401.07817

    [Paper] [Code] [Slides] [Blog]

  • MAPO: Advancing Multilingual Reasoning through Multilingual Alignment-as-Preference Optimization

    Shuaijie She, Wei Zou, Shujian Huang, Wenhao Zhu, Xiang Liu, Xiang Geng, Jiajun Chen

    arXiv 2401.06838

    [Paper] [Code]

  • Extrapolating Large Language Models to Non-English by Aligning Languages

    Wenhao Zhu, Yunzhe Lv, Qingxiu Dong, Fei Yuan, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li

    arXiv 2308.04948

    [Paper] [Code] [Blog]

Published Papers

  • Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

    Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li

    In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL) - Findings, 2024

    [Paper] [Code] [Slides] [Video] [Blog]

  • kNN-BOX: A Unified Framework for Nearest Neighbor Generation

    Wenhao Zhu, Qianfeng Zhao, Yunzhe Lv, Shujian Huang, Siheng Zhao, Sizhe Liu, Jiajun Chen

    In the 18th Conference of the European Chapter of the Association for Computational Linguistics (EACL) - Demo Track, 2024

    [Paper] [Slides] [Code] [Video]

  • Research Development of Machine Translation and Large Language Model

    Wenhao Zhu, Hao Zhou, Changjiang Gao, Sizhe Liu, Shujian Huang

    In the 21st China National Conference on Computational Linguistics (CCL), 2023

    [Paper]

  • Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model

    Kanzhi Cheng, Wenpo Song, Zheng Ma, Wenhao Zhu, Zixuan Zhu, Jianbing Zhang

    In Proceedings of the 31st ACM International Conference on Multimedia (ACM-MM), 2023

    [Paper] [Code] [Blog]

  • INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation

    Wenhao Zhu, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen

    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL), 2023

    [Paper] [Code] [Slides] [Video] [Blog]

  • Lego-MT: Towards Detachable Models in Massively Multilingual Machine Translation

    Fei Yuan, Yinquan Lu, Wenhao Zhu, Lingpeng Kong, Lei Li, Jingjing Xu

    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023

    [Paper] [Code]

  • What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation

    Wenhao Zhu, Shujian Huang, Yunzhe Lv, Xin Zheng, Jiajun Chen

    In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL) - Findings, 2023

    [Paper] [Code] [Slides] [Video] [Blog]

  • FGraDA: A Dataset and Benchmark for Fine-Grained Domain Adaptation in Machine Translation

    Wenhao Zhu, Shujian Huang, Tong Pu, Pingxuan Huang, Xu Zhang, Jian Yu, Wei Chen, Yanfeng Wang, Jiajun Chen

    In Proceedings of the 13th Language Resources and Evaluation Conference (LREC), 2022

    [Paper] [Dataset] [Slides] [Video] [Blog]

  • Improving Bilingual Lexicon Induction on Distant Language Pairs

    Wenhao Zhu, Zhihao Zhou, Shujian Huang, Zhenya Lin, Xiangsheng Zhou, Yaofeng Tu, Jiajun Chen

    In the 15th China Conference on Machine Translation (CCMT), 2019. Best English Paper Award.

    [Paper] [Slides] [Blog]

Talks

  • Research Topic Selection in the Age of Large Language Models, invited talk, at MLNLP'2024

    Wenhao Zhu

    [Slides] [Video]

  • Research Topic Selection in the Age of Large Language Models, invited talk, at CCMT'2023

    Wenhao Zhu

    [Slides] [Video]

  • INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation &

    What Knowledge Is Needed? Towards Explainable Memory for kNN-MT Domain Adaptation, invited talk, at AIS'2023

    Wenhao Zhu

    [Slides]

  • Analyzing Multilingual Machine Translation Ability of Large Language Models, invited talk, at HIT

    Shujian Huang, Wenhao Zhu

    [Slides] [Video]

  • K-Nearest-Neighbor Machine Translation, invited talk, at MLNLP'2022

    Shujian Huang, Wenhao Zhu

    [Slides] [Video]

  • K-Nearest-Neighbor Machine Translation, invited tutorial, at NLPCC'2022

    Shujian Huang, Wenhao Zhu

    [Slides] [Video]

Services

  • Program Committee / Conference Reviewer

    ACL Rolling Review (ARR), 2023-2024

    Annual Meeting of the Association for Computational Linguistics (ACL), 2023-2024

    Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

    Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2024

    China National Conference on Computational Linguistics (CCL), 2022-2024

  • Teaching Assistant / Teaching Support

    Natural Language Understanding, Generation, and Machine Translation, UoE, Spring 2024

    Machine Translation and Natural Language Generation, NJU, Spring 2023

    Programming for Artificial Intelligence, NJU, Spring 2021

    Advanced Programming, NJU, Autumn 2020

Correspondence

1024, Computer Science and Technology Building

Xianlin Campus of Nanjing University

163 Xianlin Avenue, Nanjing 210023, China


Last Update: Mar 2024