Microsoft Delivery Copilots

Overview

Microsoft Delivery Copilots is engineering work on Microsoft's AI-powered software delivery platform — helping teams ship at the pace of innovation while keeping delivery grounded in specs, traceable decisions, and production-ready quality.

The public expression of this work is DevSquad Copilot — an open-source delivery framework for GitHub Copilot that treats delivery as a loop, not a pipeline. A conductor agent orchestrates 13 specialized sub-agents, guiding teams from business envisioning through specification, planning, implementation, review, and refinement.

"Ship at the pace of innovation with agents that keep delivery grounded in specs, traceable decisions, and production-ready quality." — DevSquad Copilot

Public resources:

Role & Responsibilities

Senior Frontend Engineer / UI Architect at Microsoft

Led UI architecture for delivery copilot experiences and agent workflow interfaces
Built Fluent UI-based dashboards and interaction patterns for multi-agent orchestration
Designed interfaces for spec review, ADR traceability, task decomposition, and implementation tracking
Collaborated with AI platform, engineering systems, and GitHub Copilot integration teams
Ensured accessibility, performance, and consistency across internal delivery tooling

Problem Statement

AI-assisted coding accelerates implementation, but without delivery discipline it creates hidden risks: untraceable decisions, spec drift, missing security review, and context loss across handoffs. Teams need more than code generation — they need an orchestration layer that keeps the why, what, and how aligned from business intent to merged code.

Solution

A conductor-led delivery loop where persistent artifacts — envisioning docs, feature specs, ADRs, plans, and tasks — form shared memory across developers and agents. Specs are sliced thin and revised as teams learn; implementation that reveals new understanding triggers scoped amendments rather than silent deviation.

Four Delivery Layers

Layer	Artifact	Purpose
Envisioning	`docs/envisioning/README.md`	Strategic vision — why the work matters
Specification	`docs/features/*/spec.md`	Scope, acceptance criteria, constraints for the current slice
Architecture	ADRs + `plan.md`	Technical decisions evaluated against ranked priorities
Execution	`tasks.md` + work items	Granular tasks flowing to GitHub Issues or Azure DevOps

The Delivery Loop

Envision — Surface the why: business goals and success criteria
Specify — Write thin vertical slices, not full-system specs upfront
Plan — Produce technical design and ADRs for significant decisions
Decompose — Break slices into single-session tasks with acceptance criteria
Implement — TDD discipline, impact classification, conventional commits
Review — Independent validation against spec, ADRs, and plan
Refine — Amend specs when reality shifts; scan backlog health between sprints

Agent Architecture (13 Specialists)

The devsquad conductor delegates to specialist agents by phase using a Mediated Coordinator-Worker pattern:

Agent	Purpose	Produces
`devsquad`	Conductor — intent detection and delegation	Phase routing
`devsquad.init`	Initialize project	Framework files and templates
`devsquad.envision`	Capture strategic vision	Envisioning README
`devsquad.kickoff`	Structure project hierarchy	Board structure + structure.md
`devsquad.specify`	Write feature specs	Feature spec.md files
`devsquad.plan`	Technical planning	ADRs + plan.md
`devsquad.decompose`	Decompose to tasks	tasks.md + work items
`devsquad.implement`	Execute code (TDD)	Source code + PR
`devsquad.review`	Validate implementation	Review log with findings
`devsquad.security`	Security assessment	Security report
`devsquad.sprint`	Sprint planning	sprint-N.md + scope options
`devsquad.refine`	Backlog health + spec amendment	Analysis report or scoped updates
`devsquad.extend`	Framework extension	Custom components

Coordinator agents (plan, implement, review, refine) delegate to hidden worker sub-agents with isolated context for parallel execution.

Key Features

Conductor pattern — Single entry point routes to specialists; direct invocation also supported
Persistent artifacts — Disk-based specs, ADRs, and plans as source of truth (git-auditable)
Impact-scaled rigor — Low-impact changes move fast; high-impact triggers ADRs and explicit approval
Spec amendment protocol — Mid-flight updates when implementation reveals model shifts
TDD embedded — Test-first implementation regardless of language or AI model
Security by phase — Architectural assessment during design; code scans during implementation
MCP integrations — GitHub / Azure DevOps, Microsoft Learn, Azure services
Extensibility — Custom instructions, skills, agents, hooks, and tool extensions

Four Delivery Guardrails

What separates delivery copilots from "describe everything, then let AI code it" tools:

Thin vertical slices — Specs written at slice scope and amended when reality shifts
ADRs for non-trivial choices — Reviewers and future agents read reasoning, not just code
Impact classification — Ceremony matches risk (low / medium / high)
Comprehension checkpoints — Optimizes for developer understanding, not just throughput

Tech Stack

React.js · TypeScript · Fluent UI · Azure OpenAI
GitHub Copilot · Agent orchestration · MCP servers
Azure DevOps · GitHub Issues integration
Component architecture · Design systems · REST APIs

Challenges

Designing UI that surfaces complex multi-agent workflows without overwhelming developers
Visualizing artifact chains (envisioning → spec → ADR → task → code) across long-running sessions
Representing impact classification and approval gates clearly in the interface
Keeping agent picker and conductor routing intuitive when 13+ specialists are available
Balancing speed (low-impact fast path) with rigor (high-impact review flows) in a single UX

Learnings

Delivery copilots succeed when artifacts are first-class — specs and ADRs are not bureaucracy, they are shared memory
The loop model beats linear pipelines for AI-assisted work because learning mid-implementation is expected, not exceptional
Context isolation between worker sub-agents prevents assumption contamination across phases
Extensibility (custom skills, hooks, coding guidelines) is essential — teams won't adopt a framework that fights their stack

Business & User Impact

Enables Microsoft engineering teams to adopt agentic delivery with production-ready guardrails
Open-sourced as DevSquad Copilot for broader community use
Reduces spec drift and untraceable AI-generated changes across multi-developer projects
Integrates with GitHub Issues and Azure DevOps for real work-item tracking, not markdown-only checklists

Future Improvements

Richer visualization of agent delegation and parallel worker execution
Deeper Copilot Studio integration for enterprise-wide agent deployment
Enhanced sprint and backlog health dashboards via the refine agent
Expanded MCP connectors for additional Azure and Microsoft 365 services