Staging environment

Build your own vector database

Doug Turnbull

Led Search + ML at Reddit and Shopify

Make embedding retrieval an asset, not a headache

In modern search, embeddings answer questions. They go beyond simple keyword matching to find semantically similar answers. Increasingly: RAG, recommendations, and traditional search need zero-in on what's relevant. In this class we'll live-code core data structures that live behind search systems like Pinecone, Weaviate, Turbopuffer, QDrant, Vespa, Elasticsearch etc etc etc.

By building your own vector database, you'll be better equiped to work with production vector search systems. You'll have first-hand experience with the knobs to turn to improve performance and develop a robust hybrid + vector search system

What you’ll learn

    Workshop agenda

    • Benchmarking vector search

      How we think through vector database stats: recall, latency, throughput

    • Building HNSW algorithm from scratch

      Hands on developing the core graph-based algorithm behind search engines: HNSW

    • Filtering vector search

      How to enhance vector algorithms to filter based on metadata

    • Optimizing with quantization

      How to use standard quantization techniques to reduce memory, improve speed, without sacrificing recall

    • Layering in hybrid search

      Lexical retrieval isn't obsolete - we'll talk about approaches to combining lexical and vector search into a single solution

    Learn directly from Doug

    Doug Turnbull

    Doug Turnbull

    Search at Reddit, Shopify, Wikipedia

    Coached teams at
    Reddit
    Shopify.com
    Apple
    Amazon Web Services
    Wikipedia
    See all products from Doug

    Who this workshop is for

    • Infrastructure teams - anyone tasked to squeeze the most performance out of a high-scale retrieval system

    • Search developers - anyone that needs to build relevant, fast vector search for RAG or agentic search applications

    • AI engineers - need to find that relevant context? Anyone who needs to find context to answer questions from AI

    What's included

    Doug Turnbull

    Live sessions

    Learn directly from Doug Turnbull in a real-time, interactive format.

    Lifetime access

    Go back to course content and recordings whenever you need to.

    Community of peers

    Stay accountable and share insights with like-minded professionals.

    Certificate of completion

    Share your new skills with your employer or on LinkedIn.

    Maven Guarantee

    Your purchase is backed by the Maven Guarantee.

    Frequently asked questions

    Maven for Teams

    Reimbursement

    Get your company to pay

    Everything L&D needs: email template, receipts, and certificate of completion.

    Get reimbursed

    Team discount

    Learn with your teammates

    Save 20%+ when 2 or more teammates enroll in the same cohort.

    Save 20%+ with a team

    Private cohort

    Run a cohort for your org

    A dedicated cohort with a custom schedule and curriculum, tailored to your team.

    Book a private cohort

    $600

    USD

    Aug 11
    Enroll