👋 Welcome to Jiaxin's Blog

I document my notes and writings on AI research, LLMs, and engineering here. A mix of long-form posts hosted on this site and selected external articles.

Environment Scaling for Agentic RL

A pedagogical tour of how the LLM-agent community turns environments into scalable, verifiable RL training signal — the recurring pipeline, the design axes, and the open challenges.

June 10, 2026 · 40 min read · rl agents environment-scaling llm
Environment Scaling for Agentic RL (中文版)

一篇关于 LLM-agent 社区如何把环境变成可扩展、可验证的 RL 训练信号的教学式导览——反复出现的流水线、设计轴与开放挑战。

June 10, 2026 · 14 min read · rl agents environment-scaling llm
Towards Trustworthy Enterprise Deep Research

Deep Research is about understanding, reasoning, and synthesis—combining adaptive planning, retrieval, analysis, and context engineering to produce long-form, well-cited research outputs. This article explores how Enterprise Deep Research bridges internal knowledge and external insights to serve strategic business goals.

October 24, 2025 · 9 min read · Salesforce Blog · llm agents deep-research
Enhancing LLMs with Synthetic Knowledge Ingestion

A novel approach to enhancing Large Language Models through synthetic knowledge ingestion, presented at EMNLP 2024 from Intuit AI Research.

November 8, 2024 · 5 min read · Medium · Intuit AI Research · llm fine-tuning knowledge
End-to-End Document Enhancement using Diffusion

Our work on document enhancement using diffusion models, presented at WACV 2024 from Intuit AI Research.

January 4, 2024 · 5 min read · Medium · Intuit AI Research · diffusion documents vision
Cost-Effective Fine-Tuning of Language Models

An interactive framework for cost-effective fine-tuning of language models with sparse human supervision, presented at NeurIPS 2023.

December 14, 2023 · 4 min read · Medium · Intuit AI Research · llm fine-tuning human-feedback
SAC³: Reliable Hallucination Detection in Black-Box LLMs

Reliable hallucination detection in black-box language models via semantic-aware cross-check consistency (SAC³), accepted by EMNLP 2023.

December 13, 2023 · 4 min read · Medium · Intuit AI Research · llm hallucination reliability

👋 Welcome to Jiaxin's Blog

Environment Scaling for Agentic RL

Environment Scaling for Agentic RL (中文版)

Towards Trustworthy Enterprise Deep Research

Enhancing LLMs with Synthetic Knowledge Ingestion

End-to-End Document Enhancement using Diffusion

Cost-Effective Fine-Tuning of Language Models

SAC³: Reliable Hallucination Detection in Black-Box LLMs