Justin Huang

The site's interface is available in English, and some content translation is still in progress.

Back

Curated

Original links I would recommend, each with a one-line reason.

  • Repo Anthropic
    Model Context Protocol

    A practical protocol surface for thinking about tool serving, capability boundaries, and agent infrastructure.

  • Report SWE-bench team
    SWE-bench

    A core benchmark for grounding coding-agent claims in real software maintenance tasks.

  • Report Terminal-Bench team
    Terminal-Bench

    A useful anchor for thinking about environment realism, task horizons, and executable grading.