Staging environment

Agent Engineering Bootcamp: Developers Edition

Course

Agent Engineering Bootcamp: Developers Edition

4.8 (107)

Featured in

Lenny’s List

Hamza Farooq

Founder | Ex-Google | Prof UCLA & UMN

Zain Hasan

Staff AI/ML Engineer (DX) @ Together AI

Build Production-Ready Agentic RAG, Voice & Multi-Agent Systems

Master the full agent stack from first principles to production with RAG, voice, multi-agent systems, & inference

6 live sessions · 6 office hours · 1 demo day · 20+ hours

What You’ll Build

⚡ Agent Harness Design: Build a observable perceive–think–act loop in Python with structured tracing, evolving from simple LLM calls to full agent systems with tools, reflection, and planning

⚡ Agentic RAG Systems: Create advanced RAG pipelines with routing, multi-hop reasoning, Knowledge Graph, and semantic caching to improve accuracy and efficiency

⚡ Real-Time Voice Agents: Engineer a streaming STT → LLM → TTS pipeline with turn-taking, interruptions, and sub-second latency across modern voice platforms

⚡ Multi-Agent Orchestration: Design coordinated agent systems using MCP and A2A with patterns like hierarchical, debate, and orchestrator-worker workflows

⚡ Guardrails & Evaluation: Build reliable systems with safety guardrails and trajectory-based evaluation using LLM-as-judge and validated benchmark tasks

⚡ Production Deployment: Ship multi-agent systems with Google ADK, MCP, A2A, Llama Guard, and GCP monitoring

💻 Prerequisites: Python

👉 This bootcamp is for engineers who ship real agent systems

What you’ll learn

Master Advanced Techniques for Building and Optimizing Agentic RAG Systems and Multi-Agent Workflows — Designed for Builders

Outcome: Engineer ReAct agents with structured tracing, decision auditing, memory, and termination control.
Framework: Implement perceive-think-act loops, planning agents, reflection, and tool-use architectures.
Hands-on: Building a production-grade agent harness and unpacking what makes Claude Code powerful beyond the model.

Outcome: Serve production LLMs with predictable latency, lower cost, and benchmarks you can defend.
Deep dive: Master GPTQ, GGUF, QLoRA, AWQ, prefill, decode, KV caching, and speculative decoding.
Infrastructure: Measure TTFT, inter-token latency, throughput, and the economics of each deployment choice.

Outcome: Build retrieval systems that route, reason, adapt, and move beyond naive top-k search.
Framework: Implement retrieval-as-a-tool, query planning, multi-hop reasoning, and semantic chunking.
Advanced: Add Graph RAG and semantic caching to improve reasoning quality, cost, and latency.

Outcome: Engineer real-time voice agents with sub-second responses and natural conversational flow.
Architecture: Build streaming STT, LLM, and TTS loops with turn-taking, end detection, and barge-in.
Tools: Compare Deepgram, ElevenLabs, OpenAI Realtime, Vapi, and Retell across latency, quality, and cost.

Outcome: Design multi-agent systems with clear topology, communication protocols, and operational judgment.
Patterns: Implement orchestrator-worker, hierarchical, debate, and handoff workflows for real use cases.
Protocols: Connect agents with MCP and A2A, and evaluate when single-agent systems are the better choice.

Outcome: Ship agent systems with measurable quality, layered safety, and production-ready eval loops.
Guardrails: Implement Llama Guard, NeMo Guardrails, validation, and prompt-injection defenses.
Evals: Build trajectory-based evals with golden tasks, partial-credit grading, and LLM-as-judge scoring.

Learn directly from Hamza & Zain

Hamza Farooq

Contact

Founder | Ex-Google | Adjunct UCLA & UMN, SCU | Venture Partner

Worked at:

Zain Hasan

Contact

Staff AI/ML Engineer (DX) @ Together AI | AI Educator & Researcher | Founder

Worked at:

See all products from Hamza

Who this course is for

Machine Learning Engineer exploring different techniques to scale LLM solutions
Researcher, who would like to delve in to various aspects of open-source LLMs
Software Engineer, looking to learn how to integrate AI into their products

What's included

Live sessions

Learn directly from Hamza Farooq & Zain Hasan in a real-time, interactive format.

Lifetime access

Go back to course content and recordings whenever you need to.

Office Hours

Get your questions answered during live office hours.

Community of peers

Stay accountable and share insights with like-minded professionals.

Certificate of completion

Share your new skills with your employer or on LinkedIn.

Maven Guarantee

Your purchase is backed by the Maven Guarantee.

Course syllabus

11 live sessions • 11 lessons

Week 1

May 28—May 31

Welcome to the Course

1 item

Everything You Need to Get Started

1 item

Claude Code: Introduction & Setup

6 items

Module 1 : The Agent Loop - ReAct, Architectures, and the Claude Code Harness

2 items

May

Session 1: The Agent Loop - ReAct, Architectures, and the Claude Code Harness

Sat 5/304:00 PM—6:00 PM (UTC)

Project 1 : Run all Code Files

1 item

Week 2

Jun 1—Jun 7

Jun

Optional: Office Hours

Thu 6/44:00 PM—4:45 PM (UTC)

Optional

Jun

Session 2: LLM Quantization and KV Caching

Sat 6/64:00 PM—6:00 PM (UTC)

Free resources

Schedule

Live sessions

2-3 hrs / week

Sat, May 30
4:00 PM—6:00 PM (UTC)
Thu, Jun 4
4:00 PM—4:45 PM (UTC)
Sat, Jun 6
4:00 PM—6:00 PM (UTC)

Projects

1-3 hrs / week

Async content

1-3 hrs / week

Frequently asked questions

Maven for Teams

Reimbursement

Get your company to pay

Everything L&D needs: email template, receipts, and certificate of completion.

Get reimbursed

Private cohort

Run a cohort for your org

A dedicated cohort with a custom schedule and curriculum, tailored to your team.

Book a private cohort

Get course updates