Why did the author abandon Copilot Studio for this project?

Copilot Studio's branching logic couldn't cleanly support the multi-turn human-in-the-loop approval flow required for escalation emails, where stakeholders had to approve, edit, or revise a draft before it could be sent.

What did the author build instead of the Copilot Studio agent?

A custom system using a LangGraph coordinator pattern with a RAG sub-agent and an email sub-agent, FastAPI as the backend, Azure AI Search for retrieval, Azure Logic Apps for email dispatch, and a React/TypeScript Teams tab frontend.

What is the main lesson from this experience?

Always prototype the hardest, most structurally demanding workflow first rather than the most visible feature, and write down platform assumptions before committing engineering time to them.

When to abandon the low-code path

Q: Did Copilot Studio later add the missing capability?

Yes. About six weeks after the decision, Microsoft shipped a preview feature called Request for Information, a native human-in-the-loop action for Copilot Studio agent flows, close to what the author had built manually.

Why I rebuilt my procurement agent from scratch in week three of a three-month internship.

My first week at SSA Marine, in June 2025, I had a clear technical direction. The procurement team needed an AI agent that could answer common policy questions for stakeholders across the company, and when it couldn't, draft an escalation email to the procurement team on the stakeholder's behalf so the question got routed cleanly instead of disappearing into someone's inbox. The company was Microsoft-native. The documents lived in SharePoint. Microsoft Copilot Studio gave me Teams integration and SSO essentially for free.

The path was obvious. I prototyped in Copilot Studio and the RAG functionality worked beautifully. SharePoint documents indexed in minutes, and the answers were grounded and useful. By the end of week two, I was confident the project would ship on time.

Then I tried to build the email approval workflow.

The wall

The requirement was simple to describe and hard to implement. When the RAG layer couldn't ground an answer to a stakeholder's question, the agent had to draft an escalation email to the procurement team on the stakeholder's behalf, but it could never dispatch one autonomously. The stakeholder had to approve, edit, or revise the draft before anything actually went to procurement. This was non-negotiable for governance reasons.

Copilot Studio could not do this cleanly. The conversational state required to support multi-turn revisions (RAG comes up empty, the agent drafts an escalation email, the user edits it, the agent regenerates with the edits, the user approves, the agent dispatches) was beyond what the platform's branching logic could express. I spent two days trying to make it work. The closest I got involved encoding state in the conversation history and parsing it on every turn, which was fragile and would have failed silently in production.

This was the moment I had to decide.

The decision

I had two options. The first was to descope the email approval workflow to something simpler (a one-shot draft with no revision) and ship on the original timeline. The second was to abandon Copilot Studio entirely and build a custom agent with LangGraph and FastAPI, which would mean rebuilding the RAG pipeline, configuring SSO manually, and shipping a custom Teams tab application instead of a native one.

I chose the second option, knowing it would cost me at least two weeks of work I'd already done.

The lesson I took from this isn't about Copilot Studio specifically. It's about prototyping order: always prototype the hardest workflow first, not the easiest one.

I had inverted that. RAG was the visible feature, the one stakeholders would see in the demo, so I built it first. The HITL workflow was the structurally harder requirement, but I'd treated it as a follow-on. By the time I discovered the platform couldn't support it, I'd invested two weeks in the wrong foundation.

What I built instead

The custom architecture was a coordinator agentic pattern in LangGraph. A top-level coordinator agent routed each question to one of two specialized sub-agents: a RAG sub-agent for policy Q&A grounded in Azure AI Search, and an email sub-agent that took over when the RAG layer came up empty, drafting an escalation email and managing the multi-turn approval state. FastAPI served as the backend. Azure Logic Apps handled the actual email dispatch, gated strictly by the agent's tool calls. The Teams tab application was a React/TypeScript frontend, which gave me an admin dashboard with health checks and node traces I never could have built in Copilot Studio.

Getting back to where the prototype had been took about two weeks. Building out from there, the HITL approval loop, the two sub-agents, the audit trail, and packaging it all as a Teams tab, took the rest of the summer. The HITL workflow worked correctly, and the system shipped on time, with a week of buffer.

What I'd do differently

The decision was right. The order was wrong. If I were building this again, I would have spent the first two days of the project specifically on the HITL flow, not the RAG pipeline, not the indexing strategy, not the agent prompts. The hardest workflow first.

The other thing I'd change: I should have written down the architectural assumption I was making. Copilot Studio can handle multi-turn approval flows was an assumption I'd inherited from the platform's marketing materials, and I never tested it before committing. Writing assumptions down has a way of making them visible enough to interrogate.

A coda, because it cuts against my own story. I made this call in week three, at the end of June 2025. About six weeks later, in August, while I was still on the same internship, Microsoft shipped Request for Information: a native human-in-the-loop action for Copilot Studio's agent flows that pauses a workflow to collect input from a human reviewer, close to what I'd been trying to build by hand. It's still in preview as I write this. That doesn't make the decision wrong; in week three of a three-month project I needed something that worked, not a feature that didn't exist yet. But the timing is the sharper lesson underneath the prototyping one: the exact gap I hit closed within weeks of my hitting it.

This is, broadly, what I think about when I'm building production AI systems now. The capability layer is impressive and improves quickly. The constraints (governance, audit, approval, validation, the parts that look unglamorous) are usually where the actual difficulty lives. Prototype them first.

When to abandon the low-code path

The wall

The decision

What I built instead

What I'd do differently

Frequently asked

Why did the author abandon Copilot Studio for this project?

What did the author build instead of the Copilot Studio agent?

Did Copilot Studio later add the missing capability?

What is the main lesson from this experience?

Discussion

When to abandon the low-code path

The wall

The decision

What I built instead

What I'd do differently

Frequently asked

Why did the author abandon Copilot Studio for this project?

What did the author build instead of the Copilot Studio agent?

Did Copilot Studio later add the missing capability?

What is the main lesson from this experience?

Discussion

New writing on AI, data, and the work of building it.