About me
Hi there! I am Shicheng Liu, fourth-year CS Ph.D. at Stanford advised by Prof. Monica S. Lam at Stanford Open Virtual Assistant Lab (OVAL) and Stanford NLP Group. I focus on real-life, practical NLP problems, often drawing perspectives from computer systems and programming languages. My recent research focuses on knowledge agents with LLMs, aiming to enable domain-independent approaches that effectively retrieve and navigate different sources of knowledge, including structured, unstructured, and hybrid (combination of structured and unstructured data) sources.
During summer 2025, I was an AI research intern at Meta, where I learned from many wonderful mentors, including Kai Sun, Scott Yih, and Luna Dong.
Education
Ph.D in Computer Science
Stanford University, 2022 - 2026 (Expected)B.S. (w. Honors) in Computer Science & in Mathematics, Minor in Physics
The University of Chicago, 2022- Quarter-long exchange (Autumn 2021), California Institute of Technology
Selected Recent Publications
- LLMs with Structured and Unstructured data
SCRIBES: Web-Scale Script-Based Semi-Structured Data Extraction with Reinforcement Learning
Shicheng Liu, Kai Sun, Lisheng Fu, Xilun Chen, Xinyuan Zhang, Zhaojiang Lin, Rulin Shao, Yue Liu, Anuj Kumar, Wen-tau Yih, Xin Luna Dong
Pre-print, under reviewSUQL: Conversational Search over Structured and Unstructured Data with Large Language Models
(10-min video presentation)
Shicheng Liu, Jialiang Xu, Wesley Tjangnaka, Sina J. Semnani, Chen Jie Yu, Monica S. Lam
Findings of the North American Chapter of Association for Computational Linguistics (NAACL 2024)
- LLMs with Knowledge Graph
SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions
(10-min video presentation)
Shicheng Liu</sup>, Sina J. Semnani*, Harold Triedman, Jialiang Xu, Isaac Dan Zhao, Monica S. Lam
*Findings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024)Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata
(6-min video presentation)
Silei Xu</sup>, Shicheng Liu*, Theo Culhane, Elizaveta Pertseva, Meng-Hsi Wu, Sina Semnani, Monica S. Lam
*The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)
- Programmable Task and Knowledge Agents with LLMs
- Coding Reliable LLM-based Integrated Task and Knowledge Agents with GenieWorksheets
Harshit Joshi, Shicheng Liu, James Chen, Robert Weigle, Monica S. Lam
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
- Coding Reliable LLM-based Integrated Task and Knowledge Agents with GenieWorksheets