Home • Justin Huang

About

Coding Agents · AI Infra · AI-native Builder

北京航空航天大学在读，目前在做 Coding Agent 的研究，关注长程任务数据合成与可执行评测。之前做过 Agent 与 AI Infra 产品的端到端建设，这些经历让我会去思考 Agent 背后的工程系统设计：sandbox runtime、tool call、memory、observability、prompt evaluation 等等。我还希望能够借助可解释性的一些发现，去打开 LLM 的“黑盒”。

More about me

Blog

More blogs

Curated

Repo Anthropic

Model Context Protocol
A practical protocol surface for thinking about tool serving, capability boundaries, and agent infrastructure.
报告 SWE-bench team

SWE-bench
SWE-bench is a useful external reference whenever I think about coding-agent evaluation, task realism, and the gap between fixing a real repository and passing a toy coding task.

More curated

Experience

深势科技 DP Technology

Agent Infra / AI-native Products

参与 Agent Runtime、MCP Tool、沙箱执行、OpenAPI Gateway、Memory Service 与 Prompt Evaluation 等基础设施建设。

Coding Agent / LLM Post-training

当前研究方向

关注 Terminal-Bench、长程任务数据合成、sandbox 环境构造、Verifier、SFT/RL 数据质量与 credit assignment。

北京航空航天大学

计算机科学与技术 / AI Systems

把真实 Agent 产品中的系统经验，整理成更有研究价值的问题和长期写作线索。

Selected Works

Coding Agent / Terminal-Bench

长程 Coding Agent、terminal 环境、数据合成、Verifier、SFT/RL 数据质量和 credit assignment。

BohrClaw: Agentic Research Assistant

一个面向科研场景的智能助手产品，围绕读论文、跑实验、云端工作区和科研技能沉淀来设计。

SciMaster Stateless Agent Runtime

把依赖进程内存的 Agent session、sandbox 和工具调用状态，迁移到更适合水平扩容和恢复的分布式链路中。

OpenAPI Gateway for Agent Products

面向内部 Agent 产品的鉴权、计费、限流、工具路由、trace、fallback 和 Prompt 批量评测。

Education

北京航空航天大学

学士 - 计算机科学与技术

2021 - 2025

北京航空航天大学

研究生 - 计算机方向

2025 - Present

Skills

Languages

Frontend

Backend

AI & Agent

Tools

站点统计

2

文章

3

精选

8

项目

0

访客