4.8 (107)
Featured in
Lenny’s List
Founder | Ex-Google | Prof UCLA & UMN
Staff AI/ML Engineer (DX) @ Together AI


3 people enrolled last week.
6 live sessions · 6 office hours · 1 demo day · 20+ hours
What You’ll Build⚡ Agent Harness Design: Build a observable perceive–think–act loop in Python with structured tracing, evolving from simple LLM calls to full agent systems with tools, reflection, and planning
⚡ Agentic RAG Systems: Create advanced RAG pipelines with routing, multi-hop reasoning, Knowledge Graph, and semantic caching to improve accuracy and efficiency
⚡ Real-Time Voice Agents: Engineer a streaming STT → LLM → TTS pipeline with turn-taking, interruptions, and sub-second latency across modern voice platforms
⚡ Multi-Agent Orchestration: Design coordinated agent systems using MCP and A2A with patterns like hierarchical, debate, and orchestrator-worker workflows
⚡ Guardrails & Evaluation: Build reliable systems with safety guardrails and trajectory-based evaluation using LLM-as-judge and validated benchmark tasks
⚡ Production Deployment: Ship multi-agent systems with Google ADK, MCP, A2A, Llama Guard, and GCP monitoring
💻 Prerequisites: Python👉 This bootcamp is for engineers who ship real agent systems
Master Advanced Techniques for Building and Optimizing Agentic RAG Systems and Multi-Agent Workflows — Designed for Builders
Outcome: Engineer ReAct agents with structured tracing, decision auditing, memory, and termination control.
Framework: Implement perceive-think-act loops, planning agents, reflection, and tool-use architectures.
Hands-on: Building a production-grade agent harness and unpacking what makes Claude Code powerful beyond the model.
Outcome: Serve production LLMs with predictable latency, lower cost, and benchmarks you can defend.
Deep dive: Master GPTQ, GGUF, QLoRA, AWQ, prefill, decode, KV caching, and speculative decoding.
Infrastructure: Measure TTFT, inter-token latency, throughput, and the economics of each deployment choice.
Outcome: Build retrieval systems that route, reason, adapt, and move beyond naive top-k search.
Framework: Implement retrieval-as-a-tool, query planning, multi-hop reasoning, and semantic chunking.
Advanced: Add Graph RAG and semantic caching to improve reasoning quality, cost, and latency.
Outcome: Engineer real-time voice agents with sub-second responses and natural conversational flow.
Architecture: Build streaming STT, LLM, and TTS loops with turn-taking, end detection, and barge-in.
Tools: Compare Deepgram, ElevenLabs, OpenAI Realtime, Vapi, and Retell across latency, quality, and cost.
Outcome: Design multi-agent systems with clear topology, communication protocols, and operational judgment.
Patterns: Implement orchestrator-worker, hierarchical, debate, and handoff workflows for real use cases.
Protocols: Connect agents with MCP and A2A, and evaluate when single-agent systems are the better choice.
Outcome: Ship agent systems with measurable quality, layered safety, and production-ready eval loops.
Guardrails: Implement Llama Guard, NeMo Guardrails, validation, and prompt-injection defenses.
Evals: Build trajectory-based evals with golden tasks, partial-credit grading, and LLM-as-judge scoring.

Founder | Ex-Google | Adjunct UCLA & UMN, SCU | Venture Partner




Staff AI/ML Engineer (DX) @ Together AI | AI Educator & Researcher | Founder


Machine Learning Engineer exploring different techniques to scale LLM solutions
Researcher, who would like to delve in to various aspects of open-source LLMs
Software Engineer, looking to learn how to integrate AI into their products
Live sessions
Learn directly from Hamza Farooq & Zain Hasan in a real-time, interactive format.
Lifetime access
Go back to course content and recordings whenever you need to.
Office Hours
Get your questions answered during live office hours.
Community of peers
Stay accountable and share insights with like-minded professionals.
Certificate of completion
Share your new skills with your employer or on LinkedIn.
Maven Guarantee
Your purchase is backed by the Maven Guarantee.
May
30
Jun
4
Jun
6
Live sessions
2-3 hrs / week
Sat, May 30
4:00 PM—6:00 PM (UTC)
Thu, Jun 4
4:00 PM—4:45 PM (UTC)
Sat, Jun 6
4:00 PM—6:00 PM (UTC)
Projects
1-3 hrs / week
Async content
1-3 hrs / week
Maven for Teams
Reimbursement
Get your company to pay
Everything L&D needs: email template, receipts, and certificate of completion.
Get reimbursedTeam discount
Learn with your teammates
Save 20%+ when 2 or more teammates enroll in the same cohort.
Save 20%+ with a teamPrivate cohort
Run a cohort for your org
A dedicated cohort with a custom schedule and curriculum, tailored to your team.
Book a private cohort$2,500
USD