I am a Senior Staff Research Scientist (Research Lead) at Salesforce AI Research, where I lead a team building reliable, calibrated LLM models and self-evolving long-horizon AI agents. My research turns uncertainty, confidence, and consistency into first-class training signals for post-training/RL, scalable evaluation, agent oversight, and self-evolving.
I believe the next frontier of AI capability lies at the intersection of calibrated reasoning and self-improving agents — systems that know what they don’t know and can autonomously improve through principled exploration.
My current research focuses on:
- Agentic Reinforcement Learning — calibration-aware post-training, on-policy distillation, and self-evolving training environments (CaOPD, NuRL).
- Alignment, Calibration & Honesty — turning uncertainty and consistency into active training signals for honest, scalable LLM oversight (Passive→Active survey, Agentic Confidence Calibration).
- Long-horizon Agents & Evaluation — trajectory-level oversight and enterprise-scale agent benchmarks (Agentic Uncertainty Quantification, Trustworthy Deep Research).
Previously, I was a Senior Staff Research Scientist and founding research lead at Intuit AI Research, for building reliable LLM systems, spanning LLM post-training, alignment, evaluation, and production deployment. I architected and deployed hallucination detection (SAC3, used by 1,600+ internal users) and prompt optimization pipelines (PhaseEvo, used by 2,000+ developers) for enterprise financial LLMs — recognized with the Intuit CTO Award (top 1%). Earlier, as Staff Research Scientist at Oak Ridge National Laboratory, I architected distributed deep learning at 20,000+ GPUs on world-class supercomputers (Summit, Frontier) and led 7 DOE projects ($6.4M total) on Generative AI for Science, recognized with the DOE Promising Early-Career Researcher Award. I earned my Ph.D. at Johns Hopkins University.
News & Updates
| 05/2026 | 2 papers accepted by ICML 2026: Agentic Confidence Calibration and LaTtE-Flow. |
|---|---|
| 04/2026 | Presenting our NuRL paper (code) at ICLR 2026 in Brazil! 🇧🇷 |
| 04/2026 | We release CaOPD — calibration-aware on-policy distillation! Read the paper, check out the code and Hugging Face. |
| 04/2026 | 2 papers accepted by ACL 2026: From Passive Metric to Active Signal and Don’t Stop Early: Scalable Enterprise Deep Research. |
| 02/2026 | 2 invited talks on Building Reliable Long-horizon Agents at UCSD EnCORE Workshop and EPFL. |
| 01/2026 | 1 paper accepted by ICLR 2026: Nudging the Boundaries of LLM Reasoning. |
| 12/2025 | Attending NeurIPS 2025 in San Diego! 🇺🇸 |
| 10/2025 | We published a blog on Towards Trustworthy Enterprise Deep Research at Salesforce. |
| 08/2025 | 3 papers accepted by EMNLP 2025: R2I-Bench (Oral, Outstanding Paper Nomination), Statistical Factuality Guarantee, and Confidence-Aware Reasoning. |
| 04/2025 | 2 papers accepted by ACL 2025: SEE and Automatic Prompt Optimization Survey. |
Recent Talks
→ all talks
- Feb 27, 2026Invited TalkReliable Long-horizon Agents
- Feb 17, 2026Invited TalkReliable Long-horizon AgentsEPFL
Selected Publications
(Full publication list can be found on Google Scholar and on the Publications page.)
- arXiv 20262026
- ICML 2026In International Conference on Machine Learning, 2026
- arXiv 20262026Under review at COLM 2026
- ACL 2026In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics, 2026
- ACL 2026In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics: Industry Track, 2026
- ICLR 2026In International Conference on Learning Representations, 2026
- arXiv 20252025
- ICML 2026In International Conference on Machine Learning, 2026
- EMNLP 2025OralIn Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
- EMNLP 2025In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, 2025
- EMNLP 2025In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, 2025
- ACL 2025In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
- ACL 2025In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, 2025
- ICLR 2025In International Conference on Learning Representations, 2025
- NAACL 2025In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics, 2025
- EMNLP 2024OralIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track, 2024
- EMNLP 2024OralIn Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
- EMNLP 2024In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, 2024
- EMNLP 2024In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing - Industry Track, 2024
- AISTATS 2024In International Conference on Artificial Intelligence and Statistics, 2024
- WACV 2024In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024
- WACV 2024In IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024
- AAAI 2023Proceedings of the AAAI Conference on Artificial Intelligence, 2023
- AAAI 2023OralProceedings of the AAAI Conference on Artificial Intelligence, 2023
- CVPR 2022In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022
- AAAI 2022In Proceedings of the AAAI Conference on Artificial Intelligence, 2022
- NeurIPS 2021Advances in Neural Information Processing Systems, 2021
- UAI 2021In Uncertainty in Artificial Intelligence, 2021
- AISTATS 2021In International Conference on Artificial Intelligence and Statistics, 2021
- NeurIPS 2019Advances in Neural Information Processing Systems, 2019
Awards & Honors
- • Intuit CTO Award (Top 1% Performance), Intuit 2024
- • Intuit A2D Innovation Award (Top 1%, Team Lead), Intuit 2024, 2025
- • Promising Early-Career Researcher Award, Oak Ridge National Laboratory, US Department of Energy 2020
- • Chinese Outstanding Students Abroad Award, Ministry of Education of the P.R. China 2019
- • Acheson J. Duncan Graduate Research Award, Johns Hopkins University 2018
- • Dean's Fellowship, Johns Hopkins University 2014
- • China National Scholarship, Ministry of Education of the P.R. China 2009, 2012
Professional Services
- Area Chair: NeurIPS, ICLR, ACL, EMNLP, NAACL 2024–now
- Reviewer: NeurIPS, ICML, ICLR, ACL, EMNLP, NAACL, TMLR, JMLR, CVPR, ICCV, ECCV, AAAI, AISTATS, KDD 2020–now
Conference Travel
- • Jul 2026, ICML 2026 @ Seoul 🇰🇷
- • Apr 2026, ICLR 2026 @ Rio de Janeiro 🇧🇷
- • Dec 2025, NeurIPS 2025 @ San Diego 🇺🇸
- • Dec 2024, NeurIPS 2024 @ Vancouver 🇨🇦
- • Nov 2024, EMNLP 2024 @ Miami 🇺🇸
- • Jul 2024, ICML 2024 @ Vienna 🇦🇹
- • May 2024, AISTATS 2024 @ Valencia 🇪🇸
- • Jan 2024, WACV 2024 @ Hawaii 🇺🇸
- • Dec 2023, NeurIPS 2023 @ New Orleans 🇺🇸
- • Dec 2023, EMNLP 2023 @ Singapore 🇸🇬
- • Feb 2023, AAAI 2023 @ Washington, DC 🇺🇸
- • Dec 2022, NeurIPS 2022 @ New Orleans 🇺🇸
- • Jul 2022, ICML 2022 @ Baltimore 🇺🇸
- • Jun 2022, CVPR 2022 @ New Orleans 🇺🇸
- • Dec 2021, NeurIPS 2021 @ Online 🌐
- • Dec 2020, NeurIPS 2020 @ Online 🌐
- • Dec 2019, NeurIPS 2019 @ Vancouver 🇨🇦