AI Engineering Mastery for Teams That Ship

🔥 Free Tutorial: Build a "Chat with My Data" app with LangChain, Pinecone, OpenAI and the Vercel AI SDK.

Join 1,700+ engineers learning to build what actually works

Trusted by industry leaders

Cloudflare

Gruntwork

Pinecone


Your AI Engineering Blueprint

Transform your team and business with battle-tested architectures, RAG pipelines, and production-grade AI systems — honed at Cloudflare, Pinecone, and Fortune 500s, and distilled into code-first workshops.

Foundations

How do LLMs See Text?

Explore how large language models process and break down text into tokens for understanding.

Key Outcomes

Understand how language models process and interpret text at the token level

Embeddings & Vector Mathematics

Learn how vectors represent data in AI systems and the mathematical principles that power them.

Key Outcomes

Create powerful semantic search capabilities with vector embeddings

Data Science Fundamentals

Learn core data science tools and concepts for effective machine learning and analysis workflows.

Key Outcomes

Accelerate development time and ship AI applications with production-ready architectures

Intermediate & Advanced Topics

AI Implementation Strategies

Discover proven patterns and approaches for designing and implementing robust AI systems.

Key Outcomes

Accelerate development time and ship AI applications with production-ready architectures

Practical Machine Learning

Build real-world machine learning projects through hands-on tutorials and code examples.

Key Outcomes

Accelerate development time and ship AI applications with production-ready architectures

Retrieval Augmented Generation

Learn how to enhance LLMs with external knowledge sources to reduce hallucinations and improve accuracy.

Key Outcomes

Build AI systems that can access and reason with external knowledge sources

Specializations

Fine-tuning LLMs

Master techniques for customizing pre-trained language models to perform specific tasks in your domain.

Key Outcomes

Customize LLMs for specific tasks with higher accuracy and lower costs

Scaling Vector Infrastructure

Design vector-database-backed applications that efficiently handle millions of queries.

Key Outcomes

Design vector databases that scale to handle millions of queries efficiently

Enterprise Security for AI Systems

Implement fine-grained security controls for RAG systems and document management on AWS.

Key Outcomes

Ensure your AI systems enforce proper access controls to sensitive information

Featured Premium Project

💎 Premium Project

$49

Build a Chatbot That Actually Knows Your Shit

Learn to create a chatbot that answers questions about your content.

No hallucinations – Answers grounded in your docs

State-of-the-art tech – Vercel AI SDK, embeddings, vector retrieval

Battle-tested – I've built production RAG pipelines for years

5.0 rating

“Thanks for publishing the tutorial, very helpful.”

Scott McCallum • Full Stack Developer at Intermine

Get the $49 Tutorial →

Try the live demo →

Interactive AI Laboratory

Learn by doing! Explore how AI systems work under the hood through hands-on interactive experiences.

January 1, 2024

How do you give an AI system your specific knowledge?

Try a live AI chatbot powered by RAG (Retrieval Augmented Generation) that knows my content. See how AI systems can be grounded with specific data to avoid hallucinations.
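As a rough illustration of what "grounding" means here, the sketch below retrieves the most relevant snippets for a question and places them in the prompt. It is a toy under stated assumptions, not how the live demo is implemented: `score`, `retrieve`, and `build_prompt` are hypothetical helpers, and plain word overlap stands in for the embedding-based vector search a real RAG pipeline would use.

```python
def score(query, doc):
    """Count the words shared by the query and a document (toy relevance score)."""
    query_words = set(query.lower().split())
    doc_words = set(doc.lower().split())
    return len(query_words & doc_words)


def retrieve(query, docs, k=2):
    """Return the k documents with the highest word-overlap score."""
    return sorted(docs, key=lambda d: score(query, d), reverse=True)[:k]


def build_prompt(query, docs, k=2):
    """Assemble a prompt that asks the model to answer only from retrieved context."""
    context = "\n".join(f"- {d}" for d in retrieve(query, docs, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


# Invented example snippets, for illustration only.
docs = [
    "vector databases store embeddings for semantic search",
    "shell scripts can automate git operations",
    "fine-tuning adapts a pre-trained model to a task",
]

print(build_prompt("how do vector databases work", docs, k=1))
```

The resulting string would then be sent to a language model. Because the model is told to answer from the retrieved context, its output is tied to your content rather than to whatever it memorized during training.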

January 1, 2024

How do LLMs understand meaning in text?

Visualize how text transforms into vectors that capture semantic meaning. See why similar concepts cluster together and understand the foundation of RAG systems.
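One common way to measure how close two vectors are is cosine similarity. The sketch below assumes tiny three-dimensional vectors invented purely for illustration. Real embeddings come from a model and have hundreds or thousands of dimensions, and `most_similar` is a hypothetical helper.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Hypothetical "embeddings", made up for this example.
embeddings = {
    "cat": [0.9, 0.8, 0.1],
    "kitten": [0.85, 0.75, 0.2],
    "invoice": [0.1, 0.2, 0.95],
}


def most_similar(word):
    """Return the other word whose vector points in the most similar direction."""
    others = [w for w in embeddings if w != word]
    return max(
        others,
        key=lambda w: cosine_similarity(embeddings[word], embeddings[w]),
    )


print(most_similar("cat"))
```

With these made-up numbers, "cat" lands closest to "kitten" and far from "invoice", which is the clustering behavior the visualization shows.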

January 1, 2024

Why do LLMs have context limits and token costs?

See how language models break text into tokens. Understand why context windows exist and how pricing works, and learn to optimize your prompts for better performance.
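The arithmetic behind token costs and context limits can be sketched in a few lines. Everything here is an assumption for illustration: `rough_token_count` splits on whitespace as a stand-in for a real subword tokenizer (which usually produces more tokens than words), and the price and context window values are hypothetical, not any provider's actual numbers.

```python
def rough_token_count(text):
    """Approximate token count by splitting on whitespace (not a real tokenizer)."""
    return len(text.split())


def estimate_cost(text, price_per_1k_tokens):
    """Estimated cost of sending `text`, given a price per 1,000 tokens."""
    return rough_token_count(text) / 1000 * price_per_1k_tokens


def fits_context(text, context_window):
    """True if the approximate token count fits within the context window."""
    return rough_token_count(text) <= context_window


prompt = "Summarize the following document in three bullet points"
print(rough_token_count(prompt))
print(estimate_cost(prompt, price_per_1k_tokens=0.5))  # hypothetical price
print(fits_context(prompt, context_window=8))          # hypothetical limit
```

Shorter prompts mean fewer tokens, which lowers cost and leaves more of the context window for retrieved documents and the model's answer.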

View All Interactive Demos →

Knowledge Collections

Explore my comprehensive library of AI engineering resources and implementation guides.

All Projects →

All Publications →

Client Success Stories →

📚 Deep and Machine Learning Tutorials

September 22, 2024

Cloud GPU Services for Deep Learning and fine-tuning with Jupyter Notebooks Reviewed: Colab, Paperspace Gradient, Lightning.ai, and more

I tried a handful of services when I last needed to fine-tune an LLM, and I was mostly disappointed...

September 22, 2024

How to create a custom Alpaca instruction dataset for fine-tuning LLMs

A step-by-step tutorial with a companion notebook.

September 22, 2024

How to Fine-tune Llama 3.1 on Lightning.ai with Torchtune

One of the better Jupyter Notebooks to GPU-backed environment experiences I've had...

⚙️ Open-source AI / ML / Pipelines Projects

July 28, 2024

Building a Hand-Drawn Digit Recognizer with PyTorch and MNIST

I trained a neural net to recognize hand-drawn digits, then built a Next.js UI for it

May 10, 2024

Build a RAG pipeline for your blog with LangChain, OpenAI and Pinecone

You can chat with my writing and ask me questions I've already answered even when I'm not around

May 7, 2024

Vector Databases in Production for Busy Engineers: RAG Evaluation

I wrote the RAG evaluation chapter for Pinecone's latest book

🤖 AI-assisted Development

August 29, 2024

Autocomplete is not all you need: Why Cursor and Zed are going to dominate

AI-assisted developer tooling is not created equal...

April 24, 2024

Updated Codeium analysis and review

It's been about a year since I last looked at Codeium - what has changed?

May 24, 2023

Automations - shell scripts leveraging OpenAI to make your developer workflow buttery smooth and way more fun

I have open sourced my automations project, which is a collection of shell scripts that automatically handle git operations, provide local code reviews, pull requests, and more!

💼 Career Advice

October 25, 2023

Wash three walls with one bucket

Design your side projects, blog posts and even your fun experiments to triangulate multiple learning paths simultaneously. Then, use them to build out your portfolio.

October 18, 2023

Run your own tech blog

Control your own destiny, build your personal brand, and master web technologies by running your own tech blog.

October 14, 2023

You get to keep the neural connections

Going the extra mile only to be unrewarded by your company feels like a personal slight and a waste of your time. It is not.

🎥 Videos

May 29, 2024

How to build chat with your data using Pinecone, LangChain and OpenAI

I show step by step how to build a RAG chatbot that talks to your data in this easy-to-follow tutorial for beginners.

May 29, 2024

How to use ChatGPT in your terminal

Did you know you can use ChatGPT in your terminal? No more copying and pasting...

October 19, 2023

What is a vector database?

Vector databases explained, featuring clowns, embeddings, neural networks, feature extraction, semantic search and Retrieval Augmented Generation (RAG).

See all videos →

🏗️ Reference Architectures and Demos

January 23, 2024

Testing Pinecone Serverless at Scale with the AWS Reference Architecture

A step-by-step walkthrough on how to generate arbitrary system load and flex Pinecone Serverless

November 27, 2023

Announcing the Pinecone AWS Reference Architecture in Pulumi

Deploy production-ready systems using Pinecone in minutes with the AWS Reference Architecture

November 27, 2023

Pinecone AWS Reference Architecture Technical Walkthrough

An examination of the Reference Architecture components and functionality