• Inside Devin’s Workflow: Tool Use, Planning, and Autonomy
    Apr 26 2026

    Read the full article: Inside Devin’s Workflow: Tool Use, Planning, and Autonomy

    Discover more at AI Builds It: Easy Coding Tools

    Excerpt:

    Introduction Devin (from Cognition AI) is a new autonomous AI software engineer that can plan software development tasks and carry them out largely on its own. It works end-to-end on code projects, using tools like a code editor, a command-line shell, and a web browser to research, write, test, and deploy code. In demos and press, Devin has been shown scanning a codebase, generating a plan, editing files, running tests, and making pull requests with surprisingly little human input (medium.com) (www.linkedin.com). Cognition claims Devin can handle “complex engineering tasks requiring thousands of decisions,” recalling context at each step and even learning from mistakes (medium.com) (www.linkedin.com). We therefore explore the public details of Devin’s design and workflow. This includes how Devin breaks down tasks (its planning process), how it literally works in a developer environment (editor, terminal, browser), how it keeps memory or context across a coding session, how it self-corrects and iterates, and what guardrails or safety measures it uses. We also note what is not revealed – for example the exact model internals are undisclosed, so some community discussion relies on educated guesswork.

    Task Planning and Decomposition When a developer gives Devin a new assignment, the first step is planning what files to change and in what order. Cognition’s notes explain that Devin uses a “planning mode” sub-agent whose job is to figure out which files in the repository are relevant to the task (medium.com) (docs.devin.ai). In practice, Devin “investigates” the repo and proposes a plan before writing any code (docs.devin.ai) (docs.devin.ai). For complex tasks, developers see this plan and can approve or adjust it; if the Agency mode is enabled, Devin will automatically proceed with its plan without waiting for approval (docs.devin.ai) (docs.devin.ai).

    ... Continue reading

    Show More Show Less
    18 mins
  • Cursor IDE Agent: Repository-Scale Edits and Developer Reports
    Apr 23 2026

    Read the full article: Cursor IDE Agent: Repository-Scale Edits and Developer Reports

    Discover more at AI Builds It: Easy Coding Tools

    Excerpt:

    Cursor IDE Agent: Repository-Scale Edits and Developer Reports

    Cursor is an AI-native code editor (a VS Code fork) designed to manage entire codebases with built-in artificial intelligence. Unlike basic autocomplete tools, Cursor’s Agent Mode lets the AI act “in the driver’s seat,” reading, editing and creating code across multiple files at once (federicocalo.dev) (www.datacamp.com). In this mode, the AI can search your code, update imports, change function definitions everywhere they appear, run build or test commands, and fix errors in a loop – much like a senior developer working in parallel (federicocalo.dev) (www.datacamp.com). It truly works at repository scale: for example, one guide describes telling the AI “Add JWT authentication to this Angular app” and watching it create services, update components, run tests, and repair errors without manual edits (federicocalo.dev). These agentic features are powered by a “tool use” architecture: the AI can call functions like read_file, edit_file, search_files, or even run_terminal_command to inspect and modify your project (federicocalo.dev). In practice, Cursor’s agent can autonomously carry out large refactors and feature builds by combining language understanding with direct code manipulation.

    ... Continue reading

    Show More Show Less
    18 mins