Biography

I’m Danrui Qi, a 5th-year Ph.D. candidate at Simon Fraser University under the supervision of Prof. Jiannan Wang. I also work closely with Prof. Zhengjie Miao. My research focuses on good data for AI and AI for good data, including:

  • Automated data preparation, with a specific focus on self-evolving data science agents and automatic feature augmentation for complex relational tables.
  • Large Language Model reasoning on table tasks, e.g., Text-to-SQL and TableQA.
  • Automated Machine Learning (AutoML), especially automating the feature preprocessing part in the AutoML workflow.
  • The application of Bayesian Optimization and Reinforcement Learning techniques in the realm of Data Preparation.

Recently, I have also become interested in Business Intelligence powered by Large Language Models (LLMs).

Collaboration & Mentoring

I welcome discussions on Automated Data Science & AI agents and am open to collaborations with researchers and industry professionals. I also enjoy mentoring students at various stages of their academic journey.

If you’re interested in exploring potential collaborations or discussing recent developments in the field, feel free to schedule a conversation here.

🎓 Education

  • 2020.09 - 2025.09 (expected), Ph.D. Candidate, Computer Science, Simon Fraser University, Burnaby, BC, Canada, under the supervision of Prof. Jiannan Wang.
  • 2017.09 - 2020.07, Master of Engineering, School of Software, Tsinghua University, Beijing, China, under the supervision of Prof. Shaoxu Song.
  • 2013.09 - 2017.07, Bachelor, School of Software, Tsinghua University, Beijing, China

📝 Publications

  • Fan Zhou, Siqiao Xue, Danrui Qi, Wenhui Shi, Wang Zhao, Ganglin Wei, Hongyang Zhang et al. “DB-GPT-Hub: Towards Open Benchmarking Text-to-SQL Empowered by Large Language Models.” arXiv preprint arXiv:2406.11434 (2024). [paper]

  • Siqiao Xue, Danrui Qi, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang et al. “Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models.” VLDB 2024 (Demo Track). [paper]

  • Danrui Qi, Zhengjie Miao and Jiannan Wang. “CleanAgent: Automating Data Standardization with LLM-based Agents.” DATAI@VLDB 2025. [paper] [code]

  • Danrui Qi, Weiling Zheng, and Jiannan Wang. “FeatAug: Automatic Feature Augmentation From One-to-Many Relationship Tables.” ICDE 2024. [paper] [code]

  • Danrui Qi, Jinglin Peng, Yongjun He, and Jiannan Wang. “Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data.” In International Conference on Extending Database Technology (EDBT), 2024. [paper] [code] [talk]

  • Siqiao Xue, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang, Zhiping Zhang, Jianshan He, Hongyang Zhang, Ganglin Wei, Wang Zhao, Fan Zhou, Danrui Qi, Hong Yi, Shaodong Liu, Faqiang Chen. “DB-GPT: Empowering Database Interactions with Private Large Language Models.” arXiv preprint arXiv:2312.17449, 2023. [paper] [code] [demo]

  • Jinglin Peng, Weiyuan Wu, Jing Nathan Yan, Danrui Qi, Jeffrey M. Rzeszotarski, Jiannan Wang. “User Interfaces for Exploratory Data Analysis: A Survey of Open-Source and Commercial Tools.” In IEEE Data Eng. Bull. 45(3): 116-128, 2022. [paper]

  • Danrui Qi. “On concise explanations of non-answers over big data.” In Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD Student Research Competition), pp. 10-12. 2017. [paper]

🏛️ Experiences

  • 2024.05 - 2025.05, Research Intern at Microsoft Research, working with Dr. Yeye He

💻 Open-Source Projects

  • 2020.07 - Present, Main Contributor of Dataprep, which has 1.9k stars
  • 2023.09 - Present, Main Contributor of DB-GPT-Hub, which has 1.1k stars
  • 2023.09 - Present, Main Contributor of DB-GPT, which has 11.3k stars

🏛️ Services

  • Shadow PC of VLDB 2026
  • Program Committee of CIKM 2025, IJCAI 2025 Survey Track, CIKM 2024, ICDE 2022
  • Reviewer of SRW@ACL 2025, KDD 2025 Research & ADS Track, WACV 2025, ICLR 2025, DeLTa@ICLR 2025, WMARK@ICLR 2025, VerifAI@ICLR 2025, FM-Wild@ICLR 2025
  • External Reviewer of DASFAA 2024, ICDE 2022, ICDE 2024, CIKM 2023

🏅 Awards

  • 2024.01, Westak International Sales Inc. Scholarship at Simon Fraser University
  • 2023.09, 2024.01, PhD Research Scholarship at Simon Fraser University
  • 2020.09, Graduate Dean’s Entrance Scholarship (GDES) at Simon Fraser University
  • 2017.06, Outstanding Graduate Thesis Award at Tsinghua University
  • 2016.10, National Inspirational Scholarship at Tsinghua University