LLM Engineering
140 articles
LLM Engineering
HunyuanVideo: Tencent's Diffusion Transformer Architecture for 720p Video Generation
LLM Engineering
Building Statistically Robust LLM Rankings with Pairwise Comparisons
LLM Engineering
BitNet: Running 100B Parameter Models on Your Laptop at Human Reading Speed
LLM Engineering
LLM-Check: Detecting Hallucinations by Reading Your Model's Mind
LLM Engineering
Mapping LLM Safety as a Landscape: How Weight Perturbations Reveal the Fragility of Alignment
LLM Engineering
VERL: The Hybrid-Controller Framework Reshaping How We Train LLMs with Reinforcement Learning
LLM Engineering
IB4LLMs: Using Information Bottleneck Theory to Build Jailbreak-Resistant Language Models
LLM Engineering
SRMT: Teaching Robots to Share Their Thoughts Through Memory
LLM Engineering
Building Resumable LLM Evaluations: A Template for Rate-Limited API Testing
LLM Engineering
Burpference: Adding LLM Intelligence to Your Security Proxy Workflow
LLM Engineering
LLMmap: Fingerprinting Large Language Models Through Behavioral Analysis
LLM Engineering
Inside Transformer Debugger: OpenAI's Circuit Tracing Tool for Mechanistic Interpretability
LLM Engineering
Dora-VAE: How Inference-Time Scalability Solves the 3D Diffusion Training Bottleneck
LLM Engineering
Repomix: Why Packing Your Entire Codebase Into One File Is the Future of LLM Workflows
LLM Engineering
Reasoning Gym: How Procedural Generation Solved RL Training's Ground Truth Problem
LLM Engineering
Swark: Auto-Generating Architecture Diagrams by Feeding Your Codebase to GitHub Copilot
LLM Engineering
Open Interface: Teaching GPT-4 Vision to Drive Your Desktop with Screenshots and PyAutoGUI
LLM Engineering
Chainlit: The Python Framework That Turns LLM Scripts Into Production UIs
LLM Engineering
MoBA: How Moonshot AI Serves 1M-Token Contexts in Production with Learned Sparse Attention
LLM Engineering
Inside the LLM Post-Training Knowledge Base That 2,400+ Researchers Are Using
LLM Engineering
Search-R1: Training Language Models to Think and Search Without Supervision
LLM Engineering
AgentDojo: The Security Benchmark That Exposes LLM Agents' Achilles Heel
LLM Engineering
SEAL: Teaching Language Models to Write Their Own Training Data
LLM Engineering