I am now a search algorithm engineer at Xiaohongshu (Beijing). Before that, I received my Ph.D. degree from THUIR, Department of Computer Science and Technology in Tsinghua University, under the supervision of Prof. Yiqun Liu. My major research interests are about interactive search, pre-trained language models, and etc. I have also served as a PC member for top-tier IR/NLP conferences/journals such as SIGIR, ACL, WSDM, TOIS, EMNLP, COLING and so on.
Education
Year | Education |
---|
08.2018-06.2023 | Ph.D., Department of Computer Science and Technology, Tsinghua University, China. (Thesis) |
09.2014-06.2018 | B.E., School of Computer Science, Beijing University of Posts and Telecommunications, China. |
Experience
Year | Experience |
---|
07.2023-present | Xiaohongshu (Beijing), Community Search Division, Algorithm Engineer. |
07.2023-12.2023 | NTCIR-17 Session Search 2 (SS-2) Track, Track Chair. |
06.2022-09.2022 | Alibaba Group (Hangzhou), Big Taobao Search Division, Internship. |
04.2021-06.2022 | Main organizer of the NTCIR-16 Session Search (SS) task. |
06.2020-06.2021 | Chairman of Graduates Union, DCST, Tsinghua University, China. |
07.2019-09.2019 | Visiting student, NExT++ Research Centre, National University of Singapore, Singapore. |
Selected Publications
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models. |
---|
Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Zhijing Wu, Yiqun Liu. |
AAAI 2025 (Full, Acceptance Rate: 23.4%). Preprint Version. |
DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment. |
---|
Haitao Li, Qingyao Ai, Xinyan Han, Jia Chen, Qian Dong, Yiqun Liu. |
AAAI 2025 (Full, Acceptance Rate: 23.4%). Preprint Version. |
Scaling Laws For Dense Retrieval. |
---|
Yan Fang, Jingtao Zhan, Qingyao Ai, Jiaxin Mao, Weihang Su, Jia Chen, Yiqun Liu. |
SIGIR 2024 (Full, Acceptance Rate: 20.1%, Best Paper Award). Preprint Version. |
Capability-aware Prompt Reformulation Learning for Text-to-Image Generation. |
---|
Jingtao Zhan, Qingyao Ai, Yiqun Liu, Jia Chen, Shaoping Ma. |
SIGIR 2024 (Full, Acceptance Rate: 20.1%). Preprint Version. |
Wikiformer: Pre-training with structured information of wikipedia for ad-hoc retrieval. |
---|
Weihang Su, Qingyao Ai, Xiangsheng Li, Jia Chen, Yiqun Liu, Xiaolong Wu, Shengluan Hou. |
AAAI 2024 (Full, Acceptance Rate: 23.8%). Preprint Version. |
SAILER: Structure-aware Pre-trained Language Model for Legal Case Retrieval. |
---|
Haitao Li, Qingyao Ai, Jia Chen, Qian Dong, Yueyue Wu, Yiqun Liu, Chong Chen, Qi Tian. |
SIGIR 2023 (Full, Acceptance Rate: 20.1%). Preprint Version. |
THUIR at WSDM Cup 2023 Task 1: Unbiased Learning to Rank. |
---|
Jia Chen, Haitao Li, Weihang Su, Qingyao Ai, Yiqun Liu. |
WSDM Cup 2023 Task 1 (2/187 Teams). Preprint Version. Certificate. |
Axiomatically Regularized Pre-training for Ad hoc Search. |
---|
Jia Chen, Yiqun Liu, Yan Fang, Jiaxin Mao, Hui Fang, Shenghao Yang, Xiaohui Xie, Min Zhang, Shaoping Ma. |
SIGIR 2022 (Full, Acceptance Rate: 20.3%). Preprint Version |
Pre-training Methods in Information Retrieval. |
---|
Yixing Fan, Xiaohui Xie, Yinqiong Cai, Jia Chen, Xinyu Ma, Xiangsheng Li, Ruqing Zhang, Jiafeng Guo. |
FnTIR. Preprint Version |
Incorporating Query Reformulating Behavior into Web Search Evaluation. |
---|
Jia Chen, Yiqun Liu, Jiaxin Mao, Fan Zhang, Tetsuya Sakai, Weizhi Ma, Min Zhang, Shaoping Ma. |
CIKM 2021 (Full, Acceptance Rate: 21.7%). Preprint Version |
A Hybrid Framework for Session Context Modeling. |
---|
Jia Chen, Jiaxin Mao, Yiqun Liu, Ziyi Ye, Weizhi Ma, Chao Wang, Min Zhang, and Shaoping Ma. |
TOIS (Volume 39, Issue 3). Preprint Version. |
Towards a Better Understanding of Query Reformulation Behavior in Web Search. |
---|
Jia Chen, Jiaxin Mao, Yiqun Liu, Fan Zhang, Min Zhang, and Shaoping Ma. |
WWW 2021 (Full, Acceptance Rate: 20.6%). Preprint Version |
A Context-Aware Click Model for Web Search. |
---|
Jia Chen, Jiaxin Mao, Yiqun Liu, Min Zhang, and Shaoping Ma. |
WSDM 2020 (Full, Acceptance Rate: 14.8%). Preprint Version |
TianGong-ST: A New Dataset with Large-scale Refined Real-world Web Search Sessions. |
---|
Jia Chen, Jiaxin Mao, Yiqun Liu, Min Zhang, and Shaoping Ma. |
CIKM 2019 (Short, Acceptance Rate: 21.3%). Preprint Version |
Investigating Query Reformulation Behavior of Search Users. |
---|
Jia Chen, Jiaxin Mao, Yiqun Liu, Min Zhang, and Shaoping Ma. |
CCIR 2019 (Full). Preprint Version |
Improving Session Search Performance with a Multi-MDP Model. |
---|
Jia Chen, Yiqun Liu, Cheng Luo, Jiaxin Mao, Min Zhang, and Shaoping Ma. |
AIRS 2018 (Full). Preprint Version |
Honor and Awards
Year | Honor & Awards |
---|
2024 | Outstanding Doctoral Dissertation of Wu Wenjun Artificial Intelligence Science and Technology Award. link |
2023 | Outstanding Graduate of Department of Computer Science and Technology, Tsinghua University. |
2023 | WSDM Cup 2023: Pre-training for Web Search & Unbiased Learning to Rank, Global Runner-up🥈. link |
2020, 2021, 2022 | Overall Excellence Scholarship (First Prize), Tsinghua University. |
2022 | Longfor Academic Rising Star Scholarship, Tsinghua University. |
2021 | Social Work Excellence Scholarship, Tsinghua University. |
2015, 2016, 2017 | National Scholarship (2015 No.01214, 2016 No.01225, 2017 No.01225). (Top 2%) |
2017 | CCF Outstanding Undergraduate Award (OUA). (About 100 places per year in the Nation wide) link. |
2017 | Meritorious Winner (First Author), MCM/ICM 2017. |
2015 | Merit Student, Beijing. |
More
During 2022-2023, I have received special offers from Kwai (快Star), Baidu (AIDU), Tencent (技术大咖), Xiaohongshu (RedStar), and etc. As a result, I will be working at Xiaohongshu (Shuxing Information Technology Beijing Co, Ltd) from July 2023!
My hobbies mainly focus on popular music, comic and animation.
No music, no life! 🤠🎵🎶
Everybody loves Misaka Mikoto! ⚡️