Home • Justin Huang

About

Coding Agents · AI Infra · AI-native Builder

I have built end-to-end agent infrastructure and AI-native products, and now focus on coding agents, LLM post-training, long-horizon task synthesis, and executable evaluation. I care about the engineering system behind an agent: runtime state, tool protocols, sandboxes, memory, observability, prompt evaluation, billing, and the failure modes that only appear after a demo becomes a product.

Blog

June 5, 2026

Rebuilding My Personal Site Around a Clearer Thread

Curated

Repo Anthropic

Model Context Protocol
A practical protocol surface for thinking about tool serving, capability boundaries, and agent infrastructure.

Report SWE-bench team

SWE-bench
SWE-bench is a useful external reference whenever I think about coding-agent evaluation, task realism, and the gap between fixing a real repository and passing a toy coding task.

Experience

DP Technology

Agent Infra / AI-native Products

Worked on agent runtime, MCP tool serving, sandboxed execution, OpenAPI Gateway, memory service, observability, and prompt evaluation infrastructure.

Coding Agent / LLM Post-training

Current research direction

Focusing on Terminal-Bench, long-horizon task synthesis, sandbox construction, verifiers, SFT/RL data quality, and credit assignment.

Beihang University

Computer Science / AI Systems

Turning production Agent Infra experience into research questions and long-form writing.

Selected Works

Coding Agent / Terminal-Bench

Long-horizon coding agents, terminal environments, data synthesis, verifiers, SFT/RL data quality, and credit assignment.

BohrClaw: Agentic Research Assistant

A research assistant product around paper reading, experiment execution, cloud workspaces, and reusable scientific workflows.

SciMaster Stateless Agent Runtime

Moving agent sessions, sandboxes, and tool-call state away from single-process memory into a distributed runtime path.

OpenAPI Gateway for Agent Products

Authentication, billing, rate limiting, tool routing, tracing, fallback, and prompt evaluation for internal agent products.

Education

Beihang University

B.S. — Computer Science and Technology

2021 - 2025

Beihang University

Graduate Student — Computer Science

2025 - Present

Skills

Languages

Frontend

Backend

AI & Agent

Tools

Site stats

2 Posts · 3 Curated · 8 Projects · 0 Visitors