Open-Source AI Agent Refact.ai Achieves Stunning 69.8% on SWE-bench Verified

2025-05-22
Open-Source AI Agent Refact.ai Achieves Stunning 69.8% on SWE-bench Verified

Refact.ai, a leading open-source AI programming agent, achieved a remarkable 69.8% score on the SWE-bench Verified benchmark, autonomously solving 349 out of 500 real-world GitHub issues. This success is attributed to its robust architecture: the Claude-3.7 model at its core, supported by a debug_script() sub-agent for debugging and code modification, and a strategic_planning() tool for optimized problem-solving. The entire Refact.ai pipeline is open-source, and its real-world application demonstrates significant productivity gains for developers.

Read more
AI