Back to Events
Past Main or above

Cloudflare Workers Vectorize Agent

June 17, 2026, 02:00 Europe/Berlin

Continue with the workshop writeup

Open the canonical pages, recording, materials, and code repo.

View workshop writeup

We take the FAQ agent from the earlier deployment workshops and move the whole online path to Cloudflare. The Worker serves the UI and API, Workers AI embeds the query and generates the answer, and Vectorize stores the FAQ vectors.

We test whether Cloudflare can replace the usual stack for a small FAQ agent:

  • FastAPI for the API server
  • a separate vector database
  • a paid always-on host

Cloudflare can run the full online path, with one caveat. Ingestion still runs locally as an operator task instead of becoming a public endpoint.

flowchart LR subgraph ingest["local ingest"] FAQ["FAQ JSON"] --> INGEST["ingestion script"] INGEST -->|embed batches| WAI1["Workers AI REST"] INGEST -->|upsert vectors| VEC[("Vectorize")] end subgraph serve["Cloudflare runtime"] USER["browser"] --> WORKER["Worker UI + API"] WORKER -->|embed query| WAI2["Workers AI binding"] WORKER -->|semantic search| VEC WORKER -->|stream answer| WAI3["Workers AI binding"] end

We start with the Cloudflare architecture decision: the deployed app stays on Workers, while ingestion runs locally as an operator command.

Then we build the Cloudflare-first path. We create credentials, create a Vectorize index, and ingest the FAQ. After that, we run the Worker locally against real Cloudflare services, deploy it, and clean it up.

In the session, we also try a Python Worker rewrite. That version is useful if you prefer Python, but Cloudflare Python Workers are still beta. The TypeScript Worker stays the stable path for the main build.

Links

Resources used here:

Hosted by

Alexey Grigorev

Alexey Grigorev

Chief Agent Officer at AI Shipping Labs

Software engineer and machine learning practitioner with 15+ years of experience building production ML systems. I focus on practical, production-grade ML and AI systems, from early prototypes to reliable systems in production.

I'm the founder of DataTalks.Club, a free community that connects tens of thousands of practitioners worldwide, and the creator of the Zoomcamp series, free, code-first programs that have reached 100,000+ learners globally.

At AI Shipping Labs, I'm building the kind of environment that would have accelerated my own career growth. After years of teaching at scale, I wanted something more focused: a space for action-oriented builders who want to turn AI ideas into real projects. The community gives members the structure, accountability, and peer support to ship practical AI products consistently, even alongside their main jobs.

alexey@aishippinglabs.com

Feedback