All articles

LLM Engineering

106 articles

LLM Engineering

MemPalace: The Local-First AI Memory System That Remembers Everything

By Rob Ragan ★ 1.7k Python Apr 7, 2026
LLM Engineering

JARVIS: The LLM Orchestrator That Sparked the AI Agent Revolution

By Rob Ragan ★ 24.6k Python Apr 4, 2026
LLM Engineering

Mesh LLM: Distributed Inference Without the Latency Tax

By Rob Ragan ★ 326 Rust Apr 3, 2026
LLM Engineering

Medusa: How Multiple Prediction Heads Eliminate the Draft Model Problem in LLM Inference

By Rob Ragan ★ 2.7k Jupyter Notebook Apr 3, 2026
LLM Engineering

Building RAG Systems From First Principles: A Workshop Teardown

By Rob Ragan ★ 4 Python Apr 3, 2026
LLM Engineering

Building a Privacy-First File Organizer with On-Device AI Models

By Rob Ragan ★ 3.2k Python Apr 3, 2026
LLM Engineering

MLX-VLM: Running Vision Language Models Locally on Apple Silicon Without the CUDA Tax

By Rob Ragan ★ 2.6k Python Apr 3, 2026
LLM Engineering

Building Computer-Use AI Agents with E2B Desktop Sandbox: A Virtual Desktop for LLMs

By Rob Ragan ★ 1.3k Python Apr 3, 2026
LLM Engineering

Inside NYU's LLM CTF Leaderboard: Git as a Decentralized Benchmark Database

By Rob Ragan ★ 5 Shell Apr 3, 2026
LLM Engineering

SRMT: When Multi-Agent Pathfinding Meets Shared Memory Transformers

By Rob Ragan ★ 34 Python Apr 2, 2026
LLM Engineering

Building an LLM Evaluation Framework That Won't Burn Your API Budget

By Rob Ragan ★ 7 Python Apr 2, 2026
LLM Engineering

LLM Council: Building Consensus Through Multi-Agent Deliberation

By Rob Ragan ★ 16.3k Python Apr 1, 2026
LLM Engineering

Running Mixtral-8x7B on Consumer Hardware: Expert Offloading with LRU Caching

By Rob Ragan ★ 2.3k Python Mar 31, 2026
LLM Engineering

Inside PALLMS: A Field Guide to Breaking Large Language Models

By Rob Ragan ★ 130 Unknown Mar 31, 2026
LLM Engineering

Terminal-Bench: Testing AI Agents Where Synthetic Benchmarks Fear to Tread

By Rob Ragan ★ 1.8k Python Mar 31, 2026
LLM Engineering

IBProtector: Defending LLMs from Jailbreaks Using Information Bottleneck Theory

By Rob Ragan ★ 27 Python Mar 25, 2026
LLM Engineering

LLM-Check: Detecting Hallucinations by Reading Your Model's Mind

By Rob Ragan ★ 38 Jupyter Notebook Mar 25, 2026
LLM Engineering

Mapping the Safety Basin: How LLM-Landscape Reveals Where Model Alignment Actually Breaks

By Rob Ragan ★ 39 Python Mar 25, 2026
LLM Engineering

Open Asset Model: Building a Graph-Based Specification for Attack Surface Management

By Rob Ragan ★ 58 Go Mar 25, 2026
LLM Engineering

Inside the Foundation Model Transparency Index: How Researchers Score AI Giants on 100 Disclosure Metrics

By Rob Ragan ★ 86 Unknown Mar 25, 2026
LLM Engineering

MiniHF: Turning Prompts Into Models Through Iterative Fine-Tuning

By Rob Ragan ★ 184 Python Mar 25, 2026
LLM Engineering

Building a Privacy Attack Lab: Inside the Model Inversion Attack ToolBox

By Rob Ragan ★ 192 Python Mar 25, 2026
LLM Engineering

Burpference: Teaching Burp Suite to Think with Local and Cloud LLMs

By Rob Ragan ★ 208 Python Mar 25, 2026
LLM Engineering

LLMmap: Fingerprinting Black-Box Language Models with Minimal Queries

By Rob Ragan ★ 229 Python Mar 25, 2026