This workshop covers building a coding agent with tool calls, a skills system, and two framework implementations. Several important topics were intentionally left out.

Evaluation

We did not measure how well the agent performs.

A real coding agent needs evaluations with:

a task set
an automated runner
metrics that tell you whether a change to the prompt or tools made things better or worse

Without evaluation, every prompt tweak is a guess.

Multi-agent patterns

The agent in this workshop is a single loop with one set of tools. More complex setups use multiple agents that specialize and collaborate. One agent plans, another writes code, and a third reviews it. That pattern adds power but also adds coordination complexity.

Deployment

We ran the agent in a notebook. Deploying it as a production service is a separate problem. You need an API, authentication, rate limiting, and persistent history. See the end-to-end agent deployment workshop for one approach.

MCP integration

The Model Context Protocol (MCP) is a standard for exposing tools to agents. The skills system we built is a simple version of this idea. MCP provides a more general protocol with discovery, schema negotiation, and tool composition. Integrating MCP would let the agent use tools from any MCP-compatible server.

Other frameworks

We used ToyAIKit and PydanticAI. The same patterns apply to other agent frameworks such as LangGraph, CrewAI, AutoGen, and others. The important part is understanding the tool-call loop - the framework is packaging around that core idea.

Where to go from here

Evaluation

Multi-agent patterns

Deployment

MCP integration

Other frameworks

Questions & Answers

Where to go from here

Evaluation

Multi-agent patterns

Deployment

MCP integration

Other frameworks

Questions & Answers (0)

Questions & Answers