AI Agents

232 articles

AI Agents

Amazon Nova Act: AI Agents That Actually Navigate the Web Without Breaking

By Rob Ragan ★ 911 Unknown May 8, 2026

AI Agents

Strands Agents Tools: Building Multi-Agent Systems Without Reinventing the Wheel

By Rob Ragan ★ 1.1k Unknown May 8, 2026

AI Agents

Tongyi DeepResearch: The First Open-Source MoE Agent Built for Multi-Step Research

By Rob Ragan ★ 19.1k Unknown May 8, 2026

AI Agents

Inside the Framework Measuring How Good AI Agents Are at Hacking

By Rob Ragan ★ 8 Unknown May 8, 2026

AI Agents

WASP: The Security Benchmark That Catches What Your Web Agent Misses

By Rob Ragan ★ 87 Unknown May 8, 2026

AI Agents

Web-Shepherd: Training Web Agents with Process Rewards Instead of Binary Success

By Rob Ragan ★ 56 Unknown May 8, 2026

AI Agents

GUARDIAN: Detecting When Your AI Agents Start Lying to Each Other

By Rob Ragan ★ 8 Unknown May 8, 2026

AI Agents

Process-Supervised RL for Agentic RAG: How ReasonRAG Achieves 18x Data Efficiency

By Rob Ragan ★ 14 Unknown May 8, 2026

AI Agents

Building a Unified AI Gateway: How IBM's ContextForge Federates MCP, REST, and Agent Protocols

By Rob Ragan ★ 3.8k Unknown May 8, 2026

AI Agents

AgentAuditor: The Invisible Research Project That Might Transform AI Agent Verification

By Rob Ragan ★ 4 Unknown May 8, 2026

AI Agents

SPORT: Teaching Multimodal Agents to Self-Improve Without Human Labels

By Rob Ragan ★ 20 Unknown May 8, 2026

AI Agents

RF-Agent: Teaching Language Models to Design Reward Functions Through Tree Search

By Rob Ragan ★ 11 Unknown May 8, 2026

AI Agents

Superpowers: Teaching AI Agents to Stop Coding Like Caffeinated Interns

By Rob Ragan ★ 214.0k Unknown May 8, 2026

AI Agents

ARTEMIS: Stanford's Multi-Agent Red Teaming System That Orchestrates LLMs to Hunt Vulnerabilities

By Rob Ragan ★ 516 Unknown May 8, 2026

AI Agents

LatentMAS: How Multi-Agent Systems Learned to Think Without Speaking

By Rob Ragan ★ 966 Unknown May 8, 2026

AI Agents

Maestro: Orchestrating Multiple AI Coding Agents with Git Worktrees and Batch Automation

By Rob Ragan ★ 3.0k Unknown May 8, 2026

AI Agents

AG-UI: The Missing Protocol Between AI Agents and Real-Time User Interfaces

By Rob Ragan ★ 13.4k Unknown May 8, 2026

AI Agents

Membrane: Building a Transparent Sandbox for AI Agents with eBPF and Nested Containers

By Rob Ragan ★ 53 Unknown May 8, 2026

AI Agents

Leash: Runtime Guardrails for AI Coding Agents Using eBPF and Cedar Policies

By Rob Ragan ★ 555 Unknown May 8, 2026

AI Agents

Amazon Nova Act: AI Agents That Actually Navigate the Web Without Breaking

Strands Agents Tools: Building Multi-Agent Systems Without Reinventing the Wheel

PenGym: Training Reinforcement Learning Agents Against Real Vulnerable Systems

CodeMachine: Turning AI Coding Assistants Into Orchestrated Workflows

GPTSwarm: Building Self-Optimizing Agent Networks with Reinforcement Learning

Inside HGM: A Self-Rewriting AI That Improves Its Own Code

AutoAgents: Dynamic Multi-Agent Generation for GPT-4 Orchestration

Tongyi DeepResearch: The First Open-Source MoE Agent Built for Multi-Step Research

Inside the Framework Measuring How Good AI Agents Are at Hacking

WASP: The Security Benchmark That Catches What Your Web Agent Misses

Web-Shepherd: Training Web Agents with Process Rewards Instead of Binary Success

GUARDIAN: Detecting When Your AI Agents Start Lying to Each Other

Process-Supervised RL for Agentic RAG: How ReasonRAG Achieves 18x Data Efficiency

Building a Unified AI Gateway: How IBM's ContextForge Federates MCP, REST, and Agent Protocols

AgentAuditor: The Invisible Research Project That Might Transform AI Agent Verification

SPORT: Teaching Multimodal Agents to Self-Improve Without Human Labels

RF-Agent: Teaching Language Models to Design Reward Functions Through Tree Search

Superpowers: Teaching AI Agents to Stop Coding Like Caffeinated Interns

ARTEMIS: Stanford's Multi-Agent Red Teaming System That Orchestrates LLMs to Hunt Vulnerabilities

LatentMAS: How Multi-Agent Systems Learned to Think Without Speaking

Maestro: Orchestrating Multiple AI Coding Agents with Git Worktrees and Batch Automation

AG-UI: The Missing Protocol Between AI Agents and Real-Time User Interfaces

Membrane: Building a Transparent Sandbox for AI Agents with eBPF and Nested Containers

Leash: Runtime Guardrails for AI Coding Agents Using eBPF and Cedar Policies