Topics
Topics are the durable knowledge base. Each topic lives at topics/<category>/<slug>/index.md and can have subtopics underneath it.
AI
- AI, category overview
- Prompt Engineering, crafting LLM inputs to reliably elicit a target output
- AI Harness Development, the scaffolding around an LLM that turns it into something useful
- RAG (Retrieval-Augmented Generation), grounding LLM responses in external knowledge
- AI Skill Development, packaged, invocable capabilities loaded on demand
- AI Coding Tool Blindspots, predictable failure modes of modern AI coding assistants
- DESIGN.md, Google Labs’ open-source format for giving agents a persistent design-system understanding
- LLM reasoning benchmarks and metrics, 8-part series on what the numbers on the leaderboards actually mean
- Knowledge and reasoning, MMLU, MMLU-Pro, GPQA Diamond, HLE, ARC-AGI
- Math benchmarks, GSM8K, MATH, AIME, FrontierMath
- Coding benchmarks, HumanEval, LiveCodeBench, SWE-bench and variants
- Agent benchmarks, Terminal-Bench, TAU-bench, OSWorld, WebArena, GAIA
- Long-context benchmarks, NIAH, RULER, LongBench, U-NIAH
- Multimodal benchmarks, MMMU, MathVista, ChartQA, DocVQA
- Evaluation methodology and metrics, LMArena, LLM-as-judge, pass@k, contamination
Computer Science
- CS, category overview
- Data Structures, the top 10 interview data structures with Python code
- LeetCode 150 (NeetCode), 150 problems across 18 categories, each with brute-force, improved, and optimal Python approaches
- Haversine Distance, great-circle distance with Python/SQL/JavaScript, and when to upgrade to Vincenty or Karney
- Vehicle Routing Problem, CVRP, VRPTW, pickup-and-delivery, and what Google OR-Tools actually does
- Functional Core, Imperative Shell, Gary Bernhardt’s architecture pattern
- Flight Itinerary with Transfers, time-respecting paths in a temporal graph: DFS, BFS, Dijkstra, and the time-expanded graph framework
Web
- Web, category overview
- Django, 10-part series from beginner to expert
Testing
- Testing, category overview
- Test-Driven Development (TDD), red-green-refactor as a discipline
- Unit tests, fast, isolated, hundreds of them
- Component tests, the middle tier; UI rendering, DRF viewsets
- Integration tests, real databases, real message brokers, real wiring
- Smoke tests, the minimum-viable “is it alive?” checks
- Fuzz tests, property-based and coverage-guided fuzzing
- End-to-end tests (E2E), real browser, real backend, real user flow
Operations
- Ops, category overview
- Kubernetes, the container orchestrator: architecture, workloads, networking, storage, RBAC, autoscaling, footguns
- Helm, the package manager for Kubernetes: charts, values, templates, releases
- Terraform, infrastructure as code: providers, resources, state, modules, multi-environment patterns
- GitOps, Git as the source of truth for infrastructure, with a reconciliation agent doing the work
- ArgoCD, the reference GitOps controller for Kubernetes
- Gitflow, Vincent Driessen’s 2010 branching model and its modern alternatives
- MLOps, the engineering discipline around getting ML models into production
Conventions
- Topic, a folder with
index.md. The overview/hub page. - Subtopic (flat),
<subtopic>.mdinside the topic folder. Use for lightweight subtopics. - Subtopic (folder),
<subtopic>/index.mdinside the topic folder. Use when the subtopic has images, code samples in files, or its own children. - Depth, keep nesting to max 3 levels (category → topic → subtopic) for navigability.