Yumo Xu

I am a scientist at AWS AI Labs researching and building intelligent language services for enterprises.

My current interests lie in the following areas:

  • Effective Retrieval-Augmented Generation (RAG) systems,
  • Accurate faithfulness evaluation and source attribution, and
  • Reliable autonomous agents with generalizable tool-use capabilities.

Prior to Amazon, I received my PhD in NLP at the University of Edinburgh, advised by Prof. Mirella Lapata. My PhD research was on text summarization, the process of condensing a source text into a shorter version while preserving its salient information.


News

May 15, 2025 CiteEval was accepted by ACL’25. Check out our code here.
Sep 23, 2024 Two long papers accepted by EMNLP’24 main conference.
Jul 1, 2024 I will serve as an Area Chair for NLP at Amazon Machine Learning Conference (AMLC).
Sep 7, 2023 I will serve as an Area Chair for NLG at LREC-COLING’24.
May 1, 2023 Generative modeling for eXtractive summarization (GenX) was accepted by ACL’23.

(Recent) Selected Publications

  1. Trustworthy AI
    CiteEval: Principle-Driven Citation Evaluation for Source Attribution
    Yumo Xu, Peng Qi*, Jifan Chen*, Kunlun Liu, Rujun Han, Lan Liu, Bonan Min, Vittorio Castelli, Arshit Gupta, and Zhiguo Wang
    In ACL 2025
  2. LLM - SFT
    Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models
    Zhengxuan Wu, Yuhao Zhang*, Peng Qi*, Yumo Xu*, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, and Zhiheng Huang
    In EMNLP 2024
  3. LLM - RAG
    RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering
    Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, and Vittorio Castelli
    In EMNLP 2024
  4. Summarization
    QTSumm: Query-Focused Summarization over Tabular Data
    Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Xiangru Tang, Yumo Xu, Arman Cohan, and Dragomir Radev
    In EMNLP 2023
  5. Summarization
    Text Summarization with Oracle Expectation
    Yumo Xu, and Mirella Lapata
    In ICLR 2023
  6. Summarization
    Document Summarization with Latent Queries
    Yumo Xu, and Mirella Lapata
    TACL 2022
  7. Summarization
    Coarse-to-Fine Query Focused Multi-Document Summarization
    Yumo Xu, and Mirella Lapata
    In EMNLP 2020