All articles

LLM Engineering

139 articles

LLM Engineering

GPT4All: Running Production LLMs on a 2015 MacBook

By Rob Ragan ★ 77.4k Unknown May 11, 2026
LLM Engineering

Unlimiformer: Turn Any Transformer Into a Long-Context Model with Retrieval-Augmented Attention

By Rob Ragan ★ 1.1k Unknown May 11, 2026
LLM Engineering

StructGPT: Teaching ChatGPT to Query Databases Without Fine-Tuning

By Rob Ragan ★ 417 Unknown May 11, 2026
LLM Engineering

Prompt Optimizer: Cutting LLM API Costs by Compressing Tokens Without Model Access

By Rob Ragan ★ 304 Unknown May 11, 2026
LLM Engineering

AIx: ProjectDiscovery's Minimalist CLI for Piping Unix Philosophy into GPT

By Rob Ragan ★ 314 Unknown May 11, 2026
LLM Engineering

Gorilla: Building LLMs That Actually Call APIs Without Hallucinating

By Rob Ragan ★ 12.9k Unknown May 11, 2026
LLM Engineering

Inside Microsoft's UniLM: A Research Catalog of Foundation Models from BitNet to Kosmos

By Rob Ragan ★ 22.1k Unknown May 11, 2026
LLM Engineering

Marsha: The Programming Language That Compiles English Into Tested Python Using LLMs

By Rob Ragan ★ 469 Unknown May 11, 2026
LLM Engineering

OWASP's Open Asset Model: Treating Your Attack Surface Like a Knowledge Graph

By Rob Ragan ★ 60 Unknown May 11, 2026
LLM Engineering

DeepEval: Testing LLM Applications Like You Test Your Code

By Rob Ragan ★ 15.3k Unknown May 11, 2026
LLM Engineering

Training Custom LLMs with Synthetic Data: How GPT-4 Can Build Your Dataset

By Rob Ragan ★ 4.2k Unknown May 11, 2026
LLM Engineering

Axflow: A Zero-Dependency TypeScript Framework That Treats AI Like First-Class Infrastructure

By Rob Ragan ★ 1.1k Unknown May 11, 2026
LLM Engineering

LLM Guard: Building a Security Middleware Layer for Large Language Models

By Rob Ragan ★ 2.9k Unknown May 10, 2026
LLM Engineering

Tree of Thoughts: Teaching Language Models to Think by Exploring, Not Just Reasoning

By Rob Ragan ★ 5.9k Unknown May 10, 2026
LLM Engineering

CircuitStream: A Lightweight LLM Proxy for Teams Sharing Rate Limits

By Rob Ragan ★ 5 Unknown May 10, 2026
LLM Engineering

Building a Slack Doppelgänger: Fine-Tuning LLMs on Your Message History with Modal

By Rob Ragan ★ 6 Unknown May 10, 2026
LLM Engineering

Inside the Foundation Model Transparency Index: How Stanford Scores AI Giants on Disclosure

By Rob Ragan ★ 87 Unknown May 10, 2026
LLM Engineering

SimplyRetrieve: Building Privacy-First RAG Systems That Treat LLMs as Context Interpreters, Not Oracles

By Rob Ragan ★ 218 Unknown May 10, 2026
LLM Engineering

Deconstructing Sparse Matrix Performance: A Case Study in Rust Optimization

By Rob Ragan ★ 48 Unknown May 10, 2026
LLM Engineering

LLM-CLI: A Crystal-Fast Command Generator That Learns What You Mean

By Rob Ragan ★ 14 Unknown May 10, 2026
LLM Engineering

Instructor: How Pydantic Models Turned LLM JSON into Type-Safe Python Objects

By Rob Ragan ★ 12.9k Unknown May 10, 2026
LLM Engineering

Inside Microsoft's LMOps: Research Prototypes That Reveal How LLMs Actually Work

By Rob Ragan ★ 4.4k Unknown May 10, 2026
LLM Engineering

Paxml: Google's JAX Framework for Training Models at Trillion-Parameter Scale

By Rob Ragan ★ 550 Unknown May 10, 2026
LLM Engineering

BishopFox/llm-testing-findings: The Missing Standard for Documenting AI Security Vulnerabilities

By Rob Ragan ★ 74 Unknown May 10, 2026