Biography
I’m Danrui Qi, a 4th-year Ph.D. candidate at Simon Fraser University under the supervision of Prof. Jiannan Wang. My research interests mainly focus on good data for AI
and AI for good data
, which include:
- Automated data preparation, with a specific focus on the automatic augmentation of features for complex relational tables.
- Text2SQL methodologies utilizing Large Language Models (LLMs).
- Automated Machine Learning (AutoML), especially automating the feature preprocessing part in the AutoML workflow.
- The application of Bayesian Optimization and Reinforcement Learning techniques in the realm of Data Preparation.
Recently, I’m also very interested in Business Intelligence powered by Large Language Models (LLMs).
🎓 Education
- 2020.09 - 2025.09 (expected), Ph.D. Candidate, Computer Science, Simon Fraser University, Burnaby, BC, Canada, under the supervision of Prof. Jiannan Wang.
- 2017.09 - 2020.07, Master of Engineering, School of Software, Tsinghua University, Beijing, China, under the supervision of Prof. Shaoxu Song.
- 2013.09 - 2017.07, Bachelor, School of Software, Tsinghua University, Beijing, China
📝 Publications
-
Fan Zhou, Siqiao Xue,
Danrui Qi
, Wenhui Shi, Wang Zhao, Ganglin Wei, Hongyang Zhang et al. “DB-GPT-Hub: Towards Open Benchmarking Text-to-SQL Empowered by Large Language Models.” arXiv preprint arXiv:2406.11434 (2024). -
Siqiao Xue,
Danrui Qi
, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang et al. “Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models.” VLDB 2024 (Demo Track). [paper] -
Danrui Qi
, and Jiannan Wang. “CleanAgent: Automating Data Standardization with LLM-based Agents.” arXiv preprint arXiv:2403.08291 (2024). [paper] [code] -
Danrui Qi
, Weiling Zheng, and Jiannan Wang. “FeatAug: Automatic Feature Augmentation From One-to-Many Relationship Tables.” ICDE 2024. [paper] [code] -
Danrui Qi
, Jinglin Peng, Yongjun He, and Jiannan Wang. “Auto-FP: An Experimental Study of Automated Feature Preprocessing for Tabular Data.” In International Conference on Extending Database Technology (EDBT), 2024. [paper] [code] [talk] -
Siqiao Xue, Caigao Jiang, Wenhui Shi, Fangyin Cheng, Keting Chen, Hongjun Yang, Zhiping Zhang, Jianshan He, Hongyang Zhang, Ganglin Wei, Wang Zhao, Fan Zhou,
Danrui Qi
, Hong Yi, Shaodong Liu, Faqiang Chen. “DB-GPT: Empowering Database Interactions with Private Large Language Models.” arXiv preprint arXiv:2312.17449, 2023. [paper] [code] [demo] -
Jinglin Peng, Weiyuan Wu, Jing Nathan Yan,
Danrui Qi
, Jeffrey M. Rzeszotarski, Jiannan Wang. “User Interfaces for Exploratory Data Analysis: A Survey of Open-Source and Commercial Tools.” In IEEE Data Eng. Bull. 45(3): 116-128, 2022. [paper] -
Danrui Qi
. “On concise explanations of non-answers over big data.” In Proceedings of the 2017 ACM International Conference on Management of Data (SIGMOD Student Research Competition), pp. 10-12. 2017. [paper]
🏛️ Experiences
- 2024.05 - 2024.08, Research Intern at Microsoft Research, worked with Dr. Yeye He
💻 Open-Source Projects
- 2020.7 - Present, Main Contributor of Dataprep, which has
1.9k stars
- 2023.09 - Present, Main Contributor of DB-GPT-Hub, which has
1.1k stars
- 2023.09 - Present, Main Contributor of DB-GPT, which has
11.3k stars
🏛️ Services
- Reviewer of ICLR 2025
- Program Committee of CIKM 2024
- External Reviewer of DASFAA 2024
- External Reviewer of ICDE 2022, 2024
- External Reviewer of CIKM 2023
- Program Committee of ICDE 2022
🏅 Awards
- 2024.01, Westak International Sales Inc. Scholarship at Simon Fraser University
- 2023.09, 2024.01, PhD Research Scholarship at Simon Fraser University
- 2020.09, Graduate Dean’s Entrance (GDES) at Simon Fraser University
- 2017.06, Outstanding Graduate Thesis Award at Tsinghua University
- 2016.10, National Inspirational Scholarship at Tsinghua University