Starlog — Page 62
// LATEST
AI Agents
Building a Unified AI Gateway: How IBM's ContextForge Federates MCP, REST, and Agent Protocols
AI Agents
AgentAuditor: The Invisible Research Project That Might Transform AI Agent Verification
Cybersecurity
RAPTOR: Building an Autonomous Security Agent from Claude Code and Adversarial Thinking
AI Dev Tools
Happy: Monitoring AI Coding Agents From Your Phone Without Leaking Your Code
LLM Engineering
TheAgentCompany: The First Real-World Benchmark That Makes AI Agents Look Bad
Developer Tools
Training Web Agents Through Test-Time Interaction: Inside TTI's Filtered BC Approach
Developer Tools
AGI SDK: Building Browser Agents Against Production-Quality Web Replicas
AI Agents
SPORT: Teaching Multimodal Agents to Self-Improve Without Human Labels
AI Agents
RF-Agent: Teaching Language Models to Design Reward Functions Through Tree Search
LLM Engineering
SEC-bench: A NeurIPS Framework for Benchmarking LLM Agents Against Real Security Vulnerabilities
AI Agents
Superpowers: Teaching AI Agents to Stop Coding Like Caffeinated Interns
Automation
Stagehand: The Browser Automation SDK That Caches AI Actions Like Code
Automation
Steel Browser: The Open-Source Browser API That Lets AI Agents See the Web
Cybersecurity
HackingBuddyGPT: Teaching LLMs to Think Like Penetration Testers
AI Agents
ARTEMIS: Stanford's Multi-Agent Red Teaming System That Orchestrates LLMs to Hunt Vulnerabilities
AI Agents
LatentMAS: How Multi-Agent Systems Learned to Think Without Speaking
AI Dev Tools
HumanLayer: The Context Engineering Framework That's Mostly Vapor
AI Agents
Maestro: Orchestrating Multiple AI Coding Agents with Git Worktrees and Batch Automation
Cybersecurity
Teaching Machines to Hack: Inside AutoPentest-DRL's Reinforcement Learning Approach
AI Agents
AG-UI: The Missing Protocol Between AI Agents and Real-Time User Interfaces
AI Agents
Membrane: Building a Transparent Sandbox for AI Agents with eBPF and Nested Containers
LLM Engineering
Heretic: Automatic Abliteration for Uncensoring Language Models
AI Agents
Leash: Runtime Guardrails for AI Coding Agents Using eBPF and Cedar Policies
AI Dev Tools