I am a Research Lead at Salesforce AI Research, where I lead a team building reliable and trustworthy enterprise AI agents. My research is fundamentally driven by the pursuit of reliability and reasoning in LLMs, specifically focusing on building robust agents for long-horizon systems, including deep research, uncertainty quantification, and advancing multi-step reasoning via post-training/RL and test-time scaling. I am also deeply interested in pushing the boundaries of AI self-improvement (e.g., on-policy self-distillation).
I believe the next frontier of AI capability lies at the intersection of calibrated reasoning and self-improving agents — systems that know what they don’t know and can autonomously improve through principled exploration.
Previously, I was a Senior Staff Research Scientist and a founding member building the AI Research team from 0 to 1 at Intuit. I architected and deployed hallucination detection frameworks (SAC3), automatic prompt optimization libraries (PhaseEvo), reliable RAG systems (Ski), and post-training alignment pipelines (IMFL) for enterprise financial LLM models, chatbots, and agents.
My technical roots lie in extreme-scale computing. As a Staff Research Scientist at Oak Ridge National Laboratory (ORNL), I built distributed deep learning systems scaling to 20,000+ GPUs on world-class supercomputers (Summit, Frontier). As PI/co-PI, I led 7 DOE projects (6.4 million in total) pioneering Generative AI for Science across Physics, Chemistry, and Material Science, publishing in top-tier journals (Nature series, IF 40+). I am a recipient of the Promising Early-Career Researcher Award from the US Department of Energy. Before ORNL, I earned my Ph.D. from Johns Hopkins University.
Beyond research, I am an active open-source contributor, maintaining several projects with 3,000+ GitHub stars focused on LLM RAG, Prompt Optimization, and Reliability. I also serve as an Area Chair at NeurIPS, ICLR, ACL, EMNLP, and NAACL. Always feel free to reach out for discussion!
News & Updates
| 04/2026 | Presenting our NuRL paper (code) at ICLR 2026 in Brazil! 🇧🇷 |
|---|---|
| 04/2026 | We release CaOPD — calibration-aware on-policy distillation! Read the paper, check out the code and Hugging Face. |
| 04/2026 | 2 papers accepted by ACL 2026: From Passive Metric to Active Signal and Don’t Stop Early: Scalable Enterprise Deep Research. |
| 02/2026 | 2 invited talks on Building Reliable Long-horizon Agents at UCSD EnCORE Workshop and EPFL. |
| 01/2026 | 1 paper accepted by ICLR 2026: Nudging the Boundaries of LLM Reasoning. |
| 12/2025 | Attending NeurIPS 2025 in San Diego! 🇺🇸 |
| 10/2025 | We published a blog on Towards Trustworthy Enterprise Deep Research at Salesforce. |
| 08/2025 | 3 papers accepted by EMNLP 2025: R2I-Bench (Oral, Outstanding Paper Nomination), Statistical Factuality Guarantee, and Confidence-Aware Reasoning. |
| 04/2025 | 2 papers accepted by ACL 2025: SEE and Automatic Prompt Optimization Survey. |
| 03/2025 | 1 paper accepted by NAACL 2025: Gradient-guided Attention Map Editing. |
Recent Talks
→ all talks
- Feb 27, 2026Invited TalkReliable Long-horizon Agents
- Feb 17, 2026Invited TalkReliable Long-horizon AgentsEPFL
Selected Publications
(Full publication list can be found on Google Scholar and on the Publications page.)
- arXiv 20262026
- arXiv 2026
- arXiv 2026
- ACL 2026In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics, 2026
- ACL 2026Don’t Stop Early: Scalable Enterprise Deep Research with Controlled Information Flow and Evidence-Aware TerminationIn Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics, 2026AgentsReasoning
- ICLR 2026In International Conference on Learning Representations, 2026
- arXiv 20252025
- arXiv 20252025
- EMNLP 2025OralIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
- EMNLP 2025In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
- EMNLP 2025In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, 2025
- ACL 2025In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
- ACL 2025In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
- ICLR 2025In International Conference on Learning Representations, 2025
- NAACL 2025In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics, 2025
- EMNLP 2024OralIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track, 2024
- EMNLP 2024OralIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
- EMNLP 2024In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
- EMNLP 2024In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing - Industry Track, 2024
- AISTATS 2024In International Conference on Artificial Intelligence and Statistics, 2024
- WACV 2024In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024
- WACV 2024In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024
- AAAI 2023Proceedings of the AAAI Conference on Artificial Intelligence, 2023
- AAAI 2023OralProceedings of the AAAI Conference on Artificial Intelligence, 2023
- CVPR 2022In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
- AAAI 2022In Proceedings of the AAAI Conference on Artificial Intelligence, 2022
- NeurIPS 2021Advances in Neural Information Processing Systems, 2021
- UAI 2021In Uncertainty in Artificial Intelligence, 2021
- AISTATS 2021In International Conference on Artificial Intelligence and Statistics, 2021
- NeurIPS 2019Advances in Neural Information Processing Systems, 2019
Awards & Honors
- • Intuit CTO Award (Top 1% Performance), Intuit 2024
- • Intuit A2D Innovation Award (Top 1%, Team Lead), Intuit 2024, 2025
- • Promising Early-Career Researcher Award, Oak Ridge National Laboratory, US Department of Energy 2020
- • Chinese Outstanding Students Abroad Award, Ministry of Education of the P.R. China 2019
- • Acheson J. Duncan Graduate Research Award, Johns Hopkins University 2018
- • Dean's Fellowship, Johns Hopkins University 2014
- • China National Scholarship, Ministry of Education of the P.R. China 2009, 2012
Professional Services
- Area Chair: NeurIPS, ICLR, ACL, EMNLP, NAACL 2024–now
- Reviewer: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, TMLR, JMLR, CVPR, ICCV, ECCV, AAAI, AISTATS, KDD 2020–now
Conference Travel
- • Apr 2026, ICLR 2026 @ Rio de Janeiro 🇧🇷
- • Dec 2025, NeurIPS 2025 @ San Diego 🇺🇸
- • Dec 2024, NeurIPS 2024 @ Vancouver 🇨🇦
- • Nov 2024, EMNLP 2024 @ Miami 🇺🇸
- • Jul 2024, ICML 2024 @ Vienna 🇦🇹
- • May 2024, AISTATS 2024 @ Valencia 🇪🇸
- • Jan 2024, WACV 2024 @ Hawaii 🇺🇸
- • Dec 2023, NeurIPS 2023 @ New Orleans 🇺🇸
- • Dec 2023, EMNLP 2023 @ Singapore 🇸🇬
- • Feb 2023, AAAI 2023 @ Washington, DC 🇺🇸
- • Dec 2022, NeurIPS 2022 @ New Orleans 🇺🇸
- • Jul 2022, ICML 2022 @ Baltimore 🇺🇸
- • Jun 2022, CVPR 2022 @ New Orleans 🇺🇸
- • Dec 2021, NeurIPS 2021 @ Online 🌐
- • Dec 2020, NeurIPS 2020 @ Online 🌐
- • Dec 2019, NeurIPS 2019 @ Vancouver 🇨🇦