Warp Blog | RSS Feed

rsshub://warp/blog

Warp is an AI agent platform that lets you run multiple agents in parallel to complete any development task. - Powered by RSSHub

followers

entry/week

Warp Blog | RSS Feed·

The Coding Mandate: How Warp uses Warp to build Warp

Earlier this year, as we began building Warp 2.0, I introduced a simple mandate to our engineering team: use Warp to build Warp. Warp 2.0 aimed to reshape software development workflows for an agentic future: agent-assisted, prompt-driven, high-context coding. Our coding mandate…

Warp Blog | RSS Feed·

How we scored #1 on Terminal-Bench (52%)

To see how we achieved 71% (top 5) on SWE-bench Verified, see this post. Terminal-Bench is an open-source benchmark for evaluating how well AI agents perform on complex tasks that are rooted in the terminal. The tests range from resolving mangled Python dependencies, removing all…

Warp Blog | RSS Feed·

Introducing Warp 2.0: Reimagining coding with the Agentic Development Environment

Today, we’re excited to launch Warp 2.0, the first Agentic Development Environment. In Warp 2.0 you get: The top overall coding agent: #1 on Terminal-Bench (52%) and top-4 on SWE-bench Verified (71%). It features a fundamentally new and superior user interface compared to IDE…

Warp Blog | RSS Feed·

Warp scores 71% on SWE-bench Verified

SWE-bench is the primary benchmark for evaluating LLMs and AI agents on coding tasks. It assesses a system’s ability to fix problems pulled from real-world GitHub issues on large, complex open-source codebases. Using these realistic coding tasks lets SWE-bench evaluate several…