Justin Huang

Back

Curated

我筛过的外部原文链接,每条只保留一句推荐。

  • Repo Anthropic
    Model Context Protocol

    A practical protocol surface for thinking about tool serving, capability boundaries, and agent infrastructure.

  • 报告 SWE-bench team
    SWE-bench

    A core benchmark for grounding coding-agent claims in real software maintenance tasks.

  • 报告 Terminal-Bench team
    Terminal-Bench

    A useful anchor for thinking about environment realism, task horizons, and executable grading.