vLLM

Layer: Model serving
Status: 🔴 Not started

What it is

How it works

Core mechanism

Gotchas & production behavior

Why it matters

Key terms

Term | Meaning

Code / demo

My notes

Resources