AI ENGINEERING

LLM Engineering

Building with large language models — prompting, decoding, inference serving and cost — from the handbooks down to the softmax that powers every token.

10 pieces · 5 formats

Handbooks 2

Handbook

The Prompting Handbook

A friendly, hands-on field guide for everyday humans — learn the CRISP framework, spot bad prompts, practice with real recipes, play a drag-and-drop game, and test yourself with a quiz. No code required.

Handbook

The Senior AI Engineer Interview Handbook

60 questions across architecture, production incidents, agentic systems, RAG, evals, cost, safety, and leadership — what staff-level AI interviewers actually probe for.

AI · Engineering · Career

Roadmaps 1

Roadmap

Prompt Engineering Roadmap

A visual transit-map roadmap from tokens and CRISP through chain-of-thought, RAG, and agent prompting to production monitoring. 18 stations across 3 tracks — interactive, free, your own pace.

AI System Designs 2

AI System Design

Design a Conversational AI

Build a production conversational AI system (think ChatGPT) step by step. See how the request path splits an inference gateway from the model servers, how the context window is assembled and token-budgeted, how conversation memory is stored and recalled, how tokens stream back over a persistent connection, and how guardrails gate every prompt and response — through an interactive diagram that grows with each concept.

LLM · Inference · Streaming

AI System Design

Design an LLM Inference Server

Build an LLM inference serving system step by step. See how a request queue absorbs spiky traffic, how the prefill/decode split and continuous batching keep GPUs full, how the KV cache and paged attention make each token cheap, how tensor sharding fits a giant model, and how autoscaling rides demand — all balancing latency against throughput, through an interactive diagram that grows with each concept.

Inference · GPU · Scalability

Coding Challenges 1

Challenge

Softmax

The function at the output of most classifiers and language models: it turns raw scores (logits) into a probability distribution. Implement the numerically stable version so that large logits do not overflow when exponentiated. Solve it in Python or TypeScript.

AI Engineering · LLM · Decoding
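
For a sense of what the Softmax challenge asks for, here is a minimal sketch in plain Python. It is not the challenge's reference solution; the function name `softmax` and the list-of-floats input are assumptions made for illustration. It uses the usual stability trick of subtracting the largest logit before exponentiating, which leaves the result unchanged mathematically but keeps every exponent at or below zero.

```python
import math


def softmax(logits):
    """Turn a list of raw scores into probabilities that sum to 1."""
    if not logits:
        return []
    # Shift by the maximum so the largest exponent is exp(0) = 1.
    # Without the shift, math.exp(1000.0) raises OverflowError.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    print(softmax([1.0, 2.0, 3.0]))
    print(softmax([1000.0, 1000.0]))  # would overflow without the shift
```

Very negative shifted values simply underflow to 0.0, which is harmless here because the maximum term always contributes exactly 1 to the denominator.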

Interactive Tools 4

Tool

RAG Chunking Playground

Drop in any text and compare chunking strategies — fixed-size, recursive, by-sentence, by-paragraph — with overlap highlighted and an estimated token count per chunk. Stop guessing your chunk size; see exactly how your RAG pipeline will split a document.

AI · RAG · LLM
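
The simplest of the strategies named above, fixed-size chunking with overlap, can be sketched as follows. This is an illustrative assumption about how such a splitter might work, not the playground's actual code: the name `chunk_fixed` is hypothetical, and sizes are counted in characters rather than tokens to keep the example dependency-free.

```python
def chunk_fixed(text, size, overlap):
    """Split text into windows of `size` characters that share
    `overlap` characters with the previous window."""
    if size <= 0:
        raise ValueError("size must be positive")
    if not 0 <= overlap < size:
        raise ValueError("overlap must satisfy 0 <= overlap < size")
    step = size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + size])
        if start + size >= len(text):
            break  # this window already reached the end of the text
    return chunks


if __name__ == "__main__":
    for chunk in chunk_fixed("abcdefghij", size=4, overlap=1):
        print(repr(chunk))
```

A larger overlap repeats more context across chunk boundaries at the cost of producing more chunks for the same document.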

Tool

Context Budget & Cost Planner

Add a system prompt, tool definitions, conversation history and retrieved context, then see your context window fill up and the cost per call — plus the bill at 1k and 1M requests — across model price tiers. An architecture planner, not a toy token counter.

AI · LLM · Cost
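
The arithmetic behind a planner like this is short enough to sketch. Everything below is an assumption for illustration: the function name `plan_call`, the token counts, the context window size, and the per-million-token prices are hypothetical placeholders, not figures from the tool or from any real model's price list.

```python
def plan_call(components, output_tokens, context_window,
              price_in_per_mtok, price_out_per_mtok):
    """Estimate context usage and cost for one request.

    components: dict mapping a part of the prompt to its token count.
    Prices are in currency units per one million tokens.
    """
    input_tokens = sum(components.values())
    cost = (input_tokens * price_in_per_mtok
            + output_tokens * price_out_per_mtok) / 1_000_000
    return {
        "input_tokens": input_tokens,
        # Input and output both have to fit inside the window.
        "window_used": (input_tokens + output_tokens) / context_window,
        "cost_per_call": cost,
    }


if __name__ == "__main__":
    # Hypothetical token counts and prices, chosen only for the example.
    plan = plan_call(
        {"system": 500, "tools": 1000, "history": 1500, "retrieved": 1000},
        output_tokens=500,
        context_window=8000,
        price_in_per_mtok=3.0,
        price_out_per_mtok=15.0,
    )
    print(plan)
    for requests in (1_000, 1_000_000):
        print(requests, "requests:", round(plan["cost_per_call"] * requests, 2))
```

Because the per-call cost is linear in token counts, trimming history or retrieved context scales the bill at 1k and 1M requests by the same proportion.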

Tool

LLM-as-Judge Rubric Builder

Define your evaluation criteria and a scoring scale, then generate a clean, copy-pasteable LLM-as-judge prompt you can drop into your eval pipeline — with the common pitfalls (position bias, verbosity bias, ties) called out. Turns eval theory into a prompt you can ship.

AI · Evals · LLM

Tool

Tool-Schema Designer

Compose a tool/function definition field by field — name, description, parameters, required flags — and export valid tool-use JSON for the Claude and OpenAI formats, with the JSON Schema generated for you. Stop hand-writing function-calling schemas and fighting silent validation errors.

AI · Agents · LLM

More in AI Engineering

RAG & Retrieval · AI Agents & Tools · AI Evaluation · ML Foundations