All articles

LLM Engineering

106 articles

LLM Engineering

Gorilla: Teaching LLMs to Actually Call APIs Without Hallucinating

By Rob Ragan ★ 12.8k Python Mar 22, 2026
LLM Engineering

Dalai: The NPX One-Liner That Brought LLaMA to Your Laptop

By Rob Ragan ★ 13.0k CSS Mar 22, 2026
LLM Engineering

LitGPT: Lightning AI's No-Abstraction Approach to Production LLM Training

By Rob Ragan ★ 13.3k Python Mar 22, 2026
LLM Engineering

Onyx: Building a Self-Hosted AI Platform That Doesn't Lock You Into Vendor Hell

By Rob Ragan ★ 18.0k Python Mar 22, 2026
LLM Engineering

Building Better LLM Benchmarks: Inside OpenAI's Evals Framework

By Rob Ragan ★ 18.1k Python Mar 22, 2026
LLM Engineering

verl: Building Production-Grade RLHF Pipelines with Hybrid-Controller Architecture

By Rob Ragan ★ 20.1k Python Mar 22, 2026
LLM Engineering

Guidance: Programming Language Models Like They're Python Objects

By Rob Ragan ★ 21.4k Jupyter Notebook Mar 22, 2026
LLM Engineering

Microsoft's UniLM: The Foundation Model Laboratory That Birthed 1-Bit Transformers

By Rob Ragan ★ 22.1k Python Mar 22, 2026
LLM Engineering

Repomix: The CLI Tool That Turns Your Entire Codebase Into a Single LLM-Ready File

By Rob Ragan ★ 22.6k TypeScript Mar 22, 2026
LLM Engineering

Haystack: Why Explicit Pipelines Beat Magic Abstractions in Production RAG

By Rob Ragan ★ 24.6k MDX Mar 22, 2026
LLM Engineering

SGLang: How RadixAttention and Prefix Caching Are Reshaping LLM Serving at Scale

By Rob Ragan ★ 24.9k Python Mar 22, 2026
LLM Engineering

Stanford Alpaca: How $500 and Synthetic Data Sparked the Open LLM Revolution

By Rob Ragan ★ 30.3k Python Mar 22, 2026
LLM Engineering

BitNet.cpp: Running 100B Parameter Models on Your Laptop at Human Reading Speed

By Rob Ragan ★ 36.3k Python Mar 22, 2026
LLM Engineering

Open-Assistant: How LAION Crowdsourced the First Open RLHF Dataset for Conversational AI

By Rob Ragan ★ 37.4k Python Mar 22, 2026
LLM Engineering

GPT4All: Running Production LLMs on a 2012 Laptop

By Rob Ragan ★ 77.2k C++ Mar 22, 2026
LLM Engineering

Inside awesome-llm-apps: A Living Catalog of 40+ Production-Ready LLM Patterns

By Rob Ragan ★ 103.1k Python Mar 22, 2026
LLM Engineering

Inside Hyperspace AGI: Building a Peer-to-Peer Network Where AI Agents Autonomously Research Themselves

By Rob Ragan ★ 1.1k Unknown Mar 21, 2026
LLM Engineering

Running a 397B Parameter Model on a Laptop: How Flash-MoE Streams Experts from SSD

By Rob Ragan ★ 146 Objective-C Mar 19, 2026
LLM Engineering

Parameter Golf: Training Language Models Under Extreme Memory Constraints

By Rob Ragan ★ 402 Python Mar 19, 2026
LLM Engineering

PlanAI: Type-Safe Workflow Orchestration for Hybrid LLM and Traditional Compute

By Rob Ragan ★ 42 Python Mar 12, 2026
LLM Engineering

RuVector: The Vector Database That Learns From Your Queries

By Rob Ragan ★ 3.1k Rust Mar 10, 2026
LLM Engineering

PyReason: Temporal Logic Reasoning Over Knowledge Graphs with Explainability Built In

By Rob Ragan ★ 332 Python Mar 5, 2026
LLM Engineering

NullClaw: A 678KB AI Assistant That Boots in 8 Milliseconds

By Rob Ragan ★ 2.1k Zig Feb 25, 2026
LLM Engineering

Berry: Hallucination Detection as a First-Class Development Primitive

By Rob Ragan ★ 1.6k Python Feb 22, 2026