Warp Blog | RSS Feed
rsshub://warp/blog
Warp is an AI agent platform that lets you run multiple agents in parallel to complete any development task. - Powered by RSSHub
17
followers
1
entry/week
Warp Blog | RSS Feed·
The Coding Mandate: How Warp uses Warp to build Warp
Earlier this year, as we began building Warp 2.0, I introduced a simple mandate to our engineering team: use Warp to build Warp. Warp 2.0 aimed to reshape software development workflows for an agentic future: agent-assisted, prompt-driven, high-context coding. Our coding mandate…
Warp Blog | RSS Feed·
How we scored #1 on Terminal-Bench (52%)
To see how we achieved 71% (top 5) on SWE-bench Verified, see this post. Terminal-Bench is an open-source benchmark for evaluating how well AI agents perform on complex tasks that are rooted in the terminal. The tests range from resolving mangled Python dependencies, removing all…
Warp Blog | RSS Feed·
Introducing Warp 2.0: Reimagining coding with the Agentic Development Environment
Today, we’re excited to launch Warp 2.0, the first Agentic Development Environment. In Warp 2.0 you get:
The top overall coding agent: #1 on Terminal-Bench (52%) and top-4 on SWE-bench Verified (71%). It features a fundamentally new and superior user interface compared to IDE…
Warp Blog | RSS Feed·
Warp scores 71% on SWE-bench Verified
SWE-bench is the primary benchmark for evaluating LLMs and AI agents on coding tasks. It assesses a system’s ability to fix problems pulled from real-world GitHub issues on large, complex open-source codebases. Using these realistic coding tasks lets SWE-bench evaluate several…