AI Agents
232 articles
AI Agents
Amazon Nova Act: AI Agents That Actually Navigate the Web Without Breaking
AI Agents
Strands Agents Tools: Building Multi-Agent Systems Without Reinventing the Wheel
AI Agents
PenGym: Training Reinforcement Learning Agents Against Real Vulnerable Systems
AI Agents
CodeMachine: Turning AI Coding Assistants Into Orchestrated Workflows
AI Agents
GPTSwarm: Building Self-Optimizing Agent Networks with Reinforcement Learning
AI Agents
Inside HGM: A Self-Rewriting AI That Improves Its Own Code
AI Agents
AutoAgents: Dynamic Multi-Agent Generation for GPT-4 Orchestration
AI Agents
Tongyi DeepResearch: The First Open-Source MoE Agent Built for Multi-Step Research
AI Agents
Inside the Framework Measuring How Good AI Agents Are at Hacking
AI Agents
WASP: The Security Benchmark That Catches What Your Web Agent Misses
AI Agents
Web-Shepherd: Training Web Agents with Process Rewards Instead of Binary Success
AI Agents
GUARDIAN: Detecting When Your AI Agents Start Lying to Each Other
AI Agents
Process-Supervised RL for Agentic RAG: How ReasonRAG Achieves 18x Data Efficiency
AI Agents
Building a Unified AI Gateway: How IBM's ContextForge Federates MCP, REST, and Agent Protocols
AI Agents
AgentAuditor: The Invisible Research Project That Might Transform AI Agent Verification
AI Agents
SPORT: Teaching Multimodal Agents to Self-Improve Without Human Labels
AI Agents
RF-Agent: Teaching Language Models to Design Reward Functions Through Tree Search
AI Agents
Superpowers: Teaching AI Agents to Stop Coding Like Caffeinated Interns
AI Agents
ARTEMIS: Stanford's Multi-Agent Red Teaming System That Orchestrates LLMs to Hunt Vulnerabilities
AI Agents
LatentMAS: How Multi-Agent Systems Learned to Think Without Speaking
AI Agents
Maestro: Orchestrating Multiple AI Coding Agents with Git Worktrees and Batch Automation
AI Agents
AG-UI: The Missing Protocol Between AI Agents and Real-Time User Interfaces
AI Agents
Membrane: Building a Transparent Sandbox for AI Agents with eBPF and Nested Containers
AI Agents