Biography

I’m Danrui Qi, a fourth-year Ph.D. candidate at Simon Fraser University, supervised by Prof. Jiannan Wang. My research focuses on good data for AI and AI for good data, including:

  • Automated data preparation, with a specific focus on the automatic augmentation of features for complex relational tables.
  • Text2SQL methodologies utilizing Large Language Models (LLMs).
  • Automated Machine Learning (AutoML), especially automating the feature preprocessing part in the AutoML workflow.
  • The application of Bayesian Optimization and Reinforcement Learning techniques in the realm of Data Preparation.

Recently, I have also become very interested in Business Intelligence powered by LLMs.

🎓 Education

  • 2020.09 - 2025.09 (expected), Ph.D. Candidate, Computer Science, Simon Fraser University, Burnaby, BC, Canada, under the supervision of Prof. Jiannan Wang.
  • 2017.09 - 2020.07, Master of Engineering, School of Software, Tsinghua University, Beijing, China, under the supervision of Prof. Shaoxu Song.
  • 2013.09 - 2017.07, Bachelor, School of Software, Tsinghua University, Beijing, China.

📝 Publications

  • Fan Zhou, Siqiao Xue, Danrui Qi, Wenhui Shi, Wang Zhao, Ganglin Wei, Hongyang Zhang et al. “DB-GPT-Hub: Towards Open Benchmarking Text-to-SQL Empowered by Large Language Models.” arXiv preprint arXiv:2406.11434 (2024).

  • Siqiao Xue, Danrui Qi, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang et al. “Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models.” VLDB 2024 (Demo Track). [paper]

  • Danrui Qi and Jiannan Wang. “CleanAgent: Automating Data Standardization with LLM-based Agents.” arXiv preprint arXiv:2403.08291 (2024). [paper] [code]

  • Danrui Qi, Weiling Zheng, and Jiannan Wang. “FeatAug: Automatic Feature Augmentation From One-to-Many Relationship Tables.” ICDE 2024. [paper] [code]

  • Danrui Qi, Jinglin Peng, Yongjun He, and Jiannan Wang. “Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data.” In International Conference on Extending Database Technology (EDBT), 2024. [paper] [code] [talk]

  • Siqiao Xue, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang, Zhiping Zhang, Jianshan He, Hongyang Zhang, Ganglin Wei, Wang Zhao, Fan Zhou, Danrui Qi, Hong Yi, Shaodong Liu, Faqiang Chen. “DB-GPT: Empowering Database Interactions with Private Large Language Models.” arXiv preprint arXiv:2312.17449, 2023. [paper] [code] [demo]

  • Jinglin Peng, Weiyuan Wu, Jing Nathan Yan, Danrui Qi, Jeffrey M. Rzeszotarski, Jiannan Wang. “User Interfaces for Exploratory Data Analysis: A Survey of Open-Source and Commercial Tools.” In IEEE Data Eng. Bull. 45(3): 116-128, 2022. [paper]

  • Danrui Qi. “On concise explanations of non-answers over big data.” In Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD Student Research Competition), pp. 10-12. 2017. [paper]

🏛️ Experiences

  • 2024.05 - 2024.08, Research Intern at Microsoft Research, working with Dr. Yeye He

💻 Open-Source Projects

  • 2020.07 - Present, Main Contributor to Dataprep, which has 1.9k stars
  • 2023.09 - Present, Main Contributor to DB-GPT-Hub, which has 1.1k stars
  • 2023.09 - Present, Main Contributor to DB-GPT, which has 11.3k stars

🏛️ Services

  • Reviewer for ICLR 2025
  • Program Committee member for CIKM 2024
  • External Reviewer for DASFAA 2024
  • External Reviewer for ICDE 2022, 2024
  • External Reviewer for CIKM 2023
  • Program Committee member for ICDE 2022

🏅 Awards

  • 2024.01, Westak International Sales Inc. Scholarship at Simon Fraser University
  • 2023.09, 2024.01, PhD Research Scholarship at Simon Fraser University
  • 2020.09, Graduate Dean’s Entrance Scholarship (GDES) at Simon Fraser University
  • 2017.06, Outstanding Graduate Thesis Award at Tsinghua University
  • 2016.10, National Inspirational Scholarship at Tsinghua University