LLM Engineering

140 articles

HunyuanVideo: Tencent's Diffusion Transformer Architecture for 720p Video Generation

By Rob Ragan ★ 12.1k Unknown May 9, 2026

Building Statistically Robust LLM Rankings with Pairwise Comparisons

By Rob Ragan ★ 170 Unknown May 9, 2026

BitNet: Running 100B Parameter Models on Your Laptop at Human Reading Speed

By Rob Ragan ★ 38.9k Unknown May 9, 2026

LLM-Check: Detecting Hallucinations by Reading Your Model's Mind

By Rob Ragan ★ 40 Unknown May 9, 2026

Mapping LLM Safety as a Landscape: How Weight Perturbations Reveal the Fragility of Alignment

By Rob Ragan ★ 40 Unknown May 9, 2026

VERL: The Hybrid-Controller Framework Reshaping How We Train LLMs with Reinforcement Learning

By Rob Ragan ★ 21.2k Unknown May 9, 2026

IB4LLMs: Using Information Bottleneck Theory to Build Jailbreak-Resistant Language Models

By Rob Ragan ★ 27 Unknown May 9, 2026

SRMT: Teaching Robots to Share Their Thoughts Through Memory

By Rob Ragan ★ 36 Unknown May 9, 2026

Building Resumable LLM Evaluations: A Template for Rate-Limited API Testing

By Rob Ragan ★ 7 Unknown May 9, 2026

Burpference: Adding LLM Intelligence to Your Security Proxy Workflow

By Rob Ragan ★ 210 Unknown May 9, 2026

LLMmap: Fingerprinting Large Language Models Through Behavioral Analysis

By Rob Ragan ★ 292 Unknown May 8, 2026

Inside Transformer Debugger: OpenAI's Circuit Tracing Tool for Mechanistic Interpretability

By Rob Ragan ★ 4.1k Unknown May 8, 2026

Dora-VAE: How Inference-Time Scalability Solves the 3D Diffusion Training Bottleneck

By Rob Ragan ★ 580 Unknown May 8, 2026

Repomix: Why Packing Your Entire Codebase Into One File Is the Future of LLM Workflows

By Rob Ragan ★ 24.5k Unknown May 8, 2026

Reasoning Gym: How Procedural Generation Solved RL Training's Ground Truth Problem

By Rob Ragan ★ 1.4k Unknown May 8, 2026

Swark: Auto-Generating Architecture Diagrams by Feeding Your Codebase to GitHub Copilot

By Rob Ragan ★ 1.7k Unknown May 8, 2026

Open Interface: Teaching GPT-4 Vision to Drive Your Desktop with Screenshots and PyAutoGUI

By Rob Ragan ★ 2.7k Unknown May 8, 2026

Chainlit: The Python Framework That Turns LLM Scripts Into Production UIs

By Rob Ragan ★ 12.1k Unknown May 8, 2026

MoBA: How Moonshot AI Serves 1M-Token Contexts in Production with Learned Sparse Attention

By Rob Ragan ★ 2.1k Unknown May 8, 2026

Inside the LLM Post-Training Knowledge Base That 2,400+ Researchers Are Using

By Rob Ragan ★ 2.4k Unknown May 8, 2026

Search-R1: Training Language Models to Think and Search Without Supervision

By Rob Ragan ★ 4.6k Unknown May 8, 2026

AgentDojo: The Security Benchmark That Exposes LLM Agents' Achilles Heel

By Rob Ragan ★ 558 Unknown May 8, 2026

SEAL: Teaching Language Models to Write Their Own Training Data

By Rob Ragan ★ 1.8k Unknown May 8, 2026

Onyx: Building ChatGPT-Grade AI Search with Hybrid RAG and MCP Agents

By Rob Ragan ★ 29.2k Unknown May 8, 2026