Software Development Is Becoming an Agent Workflow

Software development has always been a workflow. As AI agents take on the work, the SDLC maps precisely to the agent architecture it helped inspire.

The architecture of AI-native systems has a defining pattern. An agent reasons about what to do. It calls a tool. It evaluates the result. It decides whether to proceed, branch, or escalate to a human. It holds state across those decisions, checkpointing progress so that failures don't require starting over. Humans participate at defined gates, reviewing and approving before execution continues.

That pattern has a name software engineers already know. It is a pull request.

The accidental template

Software development workflows were not designed as an analogy for agent systems. They evolved from decades of engineering practice: test-driven development, version control, code review, continuous integration. But when you lay the SDLC against modern agent architecture, the structural parallels are precise enough to be instructive.

Agent workflows involve tool calls, durable state, and human-in-the-loop approval steps. Software development has the same components. The build system, linters, static analysis tools, and test suites are tool calls. The git commit history, branch, and pull request are durable state. The code review is the human-in-the-loop checkpoint. The merge gate is the approval step that resumes execution.

The software development lifecycle is, structurally, an agent workflow that engineers figured out how to run reliably before there were agents to run it.

Tool calls that engineers ran manually

In a production agent workflow, a tool call is a structured invocation: the agent specifies what it wants executed, the runtime executes it, and the result comes back for evaluation. The agent doesn't know in advance what the result will be. It interprets the output and decides what to do next.

That is what happens when a developer pushes a commit to a branch. The CI pipeline runs. Tests pass or fail. A linter surfaces style violations. A type checker flags an inconsistency. Each of these is a tool call the developer has to interpret and respond to. In most development workflows, this cycle runs multiple times before the code is ready for review.

The tooling in software development has always been designed around this pattern. Test frameworks produce structured output that can be parsed and acted on. Linters return specific error codes with file locations. Build systems produce deterministic outputs and clear failure signals. This precision exists because developers need to automate and script against these outputs. It turns out it is exactly the precision that agents need too.

Branches and PRs as durable execution state

Orchestration runtimes for agent workflows provide checkpointing: they save the state of a workflow at each step so that failures don't require restarting from scratch. Agent runs that span hours or involve human review steps need this capability because they cannot hold all context in memory across interruptions.

Git branches and pull requests serve this function in software development. A branch captures the state of the work at a given point. A commit is a checkpoint. The pull request description holds context: what is being changed, why, what was tried, what decisions were made along the way. That context persists across the time between opening and merging, allowing reviewers to pick up where the developer left off without requiring a live handoff.

This is not a coincidence. The requirements are the same. Long-running work involving multiple steps and human checkpoints needs durable state that outlasts any single session. Software engineers built that into their tooling because they needed it. Agent runtimes are building the equivalent infrastructure because agents need it for the same reasons.

Code review as a human-in-the-loop gate

The most structurally significant parallel is code review. Human-in-the-loop patterns in agent systems describe workflows that pause at defined points, present a human with the relevant context, wait for review and approval, and resume only when the human has cleared the gate. The human is not doing the work. The human is verifying that the work was done correctly and that it is safe to proceed.

Code review is that gate. The developer submits a pull request. Execution pauses. A reviewer receives the proposed changes, the context attached to them, and the results of automated checks. The reviewer inspects the work, leaves comments, requests changes if needed, and eventually approves. Execution resumes at merge.

The design requirements for this gate in software development are identical to the design requirements for human-in-the-loop checkpoints in agent systems. The reviewer needs to understand what changed and why. The proposed changes need to be scoped and reviewable. The automated checks need to have already run so the reviewer isn't doing work a machine could do. Approval needs to be explicit, not inferred.

Software engineering communities spent years refining code review practice. The patterns they landed on, small PRs, clear descriptions, CI-first review, required approvals, are directly applicable to designing agent supervision workflows.

Agents are now running the loop

The observation that software development and agent workflows share a structure would be interesting but abstract if the two were still separate domains. They are not.

Amazon Q Developer can pick up a GitHub issue, write code, run the test suite, and iterate on CI failures until the pipeline passes. Amazon describes the pattern explicitly: the agent can be triggered by a PR label and run unattended overnight, submitting fixes and responding to failures without human involvement until it is ready for review. Claude Code, which launched in February 2025, has had users create and merge over one million pull requests through agent-driven development since then.

GitHub Copilot Workspace, Devin, and SWE-agent take the same approach from different directions. These systems don't just write code. They open pull requests, respond to review comments, rerun tests when builds fail, and hold the full context of the task across the lifecycle of the PR. The software development workflow, which engineers designed for human participants, turns out to accommodate AI agents with minimal modification. The tool interfaces are the right shape. The state model is the right shape. The approval gates are the right shape.

Code review agent adoption reflects this fit. Among engineering teams tracking these tools, adoption went from 14.8% in January 2025 to 51.4% in October 2025. Gartner estimates that 40% of enterprise applications will feature task-specific AI agents by end of 2026, up from less than 5% in 2025. Software development tooling is a primary site of that growth because the tooling was already designed as an agent-compatible workflow.

The developer toolchain is an orchestration runtime

The implication for developer tooling companies is not subtle. GitHub, GitLab, and their equivalents are not peripheral to AI-native software development. They are the orchestration layer. The pull request is the primary execution surface. CI/CD pipelines are the tool runtime. Branch protection and required reviews are the approval gate configuration.

Microsoft's blueprint for an AI-led SDLC formalises this: spec-driven development places a structured specification at the center of the workflow, with workflow engines enforcing phase transitions from requirements through architecture review to implementation, testing, and deployment. The orchestration is not happening in a new tool built for agents. It is happening in the existing software development workflow, with agents plugged in as participants.

This has a direct consequence for what machine-consumable interfaces look like in the development domain. The GitHub REST and GraphQL APIs, the webhook event model, the Actions workflow syntax: these are the tool interfaces that coding agents use. They were designed for developer automation and scripting. They work for agents because good automation APIs and good agent APIs have the same requirements: precise inputs, structured outputs, predictable semantics.

What this means for the teams building dev tools

The companies building developer tooling in 2026 are, whether or not they have framed it this way, building agent orchestration infrastructure. The decisions that determine whether their platforms are good at human-operated software development also determine whether they are good at agent-operated software development. Structured output from CI. Queryable state for branches and PRs. Webhook events that are reliable enough to trigger on. Review workflows that can surface context programmatically.

Teams building agents that operate in the software development domain don't need a separate interface for agents. They need the existing interface to be complete, reliable, and structurally precise. When it is, the agent slots into the workflow that engineers already built and trusts.

The software development lifecycle wasn't designed as an agent workflow. But it turns out that is what it always was: a structured, tool-driven, state-preserving, human-gated execution system for complex tasks that take time and require judgment. The agents arriving now did not require the workflow to change. They just needed to be invited in.

If your team is mapping out how AI agents change your development process and tooling strategy, our AI Product Strategy playbook covers the architectural decisions that determine whether you are building for both human and machine operators from the start.

Want to learn more?

We write about AI, product strategy, and the future of building. Get in touch to continue the conversation.

Start a conversation