Offsec & AI Agent Tool Intelligence

// LATEST

AI Agents

OpenManus-RL: Teaching LLM Agents to Think Better Through Reinforcement Learning

★ 4.0k May 8, 2026

AI Agents

DeerFlow: ByteDance's Production-Grade Framework for Hour-Long Autonomous AI Agents

★ 66.1k May 8, 2026

AI Dev Tools

Inside Microsoft's AI Red Teaming Playground: Training Security Professionals to Break LLMs

★ 1.9k May 8, 2026

AI Agents

Robin: Building a Multi-Agent System That Generates Drug Discovery Hypotheses

★ 301 May 8, 2026

AI Agents

WebVoyager: Teaching GPT-4V to Navigate the Web Like a Human

★ 1.1k May 8, 2026

Cybersecurity

HackBench: Measuring What Happens When LLMs Learn to Exploit Vulnerabilities

★ 70 May 8, 2026

AI Agents

Inside the Daily Knowledge Engine Tracking 2,000+ Autonomous Agent Papers

★ 1.3k May 8, 2026

AI Agents

Autono: Why Dynamic ReAct Beats Static Planning for Failure-Prone Agent Tasks

★ 210 May 8, 2026

Data & Knowledge

Persona-Hub: How Tencent's Billion-Scale Perspective Engine Reimagines Synthetic Data

★ 1.6k May 8, 2026

AI Agents

ToolHive: Bringing Kubernetes-Grade Security to Model Context Protocol Servers

★ 1.8k May 8, 2026

LLM Engineering

AgentDojo: The Security Benchmark That Exposes LLM Agents' Achilles Heel

★ 558 May 8, 2026

AI Agents

Ruflo: Building Self-Learning Agent Swarms for Claude with Federation

★ 46.6k May 8, 2026

AI Dev Tools

Strudel: How TidalCycles' Pattern Algebra Was Reimagined for the Web

★ 2.9k May 8, 2026

AI Agents

Magentic-UI: Microsoft's Plan-Then-Execute Web Agent That Shows Its Work

★ 9.8k May 8, 2026

AI Agents

Agenspy: Protocol-First Architecture Brings Modern Agent Communication to DSPy

★ 77 May 8, 2026

Cybersecurity

CyberGym: Building an AI Agent Benchmark on 10TB of Real Vulnerabilities

★ 292 May 8, 2026

Data & Knowledge

Building Interactive Knowledge Graphs from Text: A Three-Phase LLM Pipeline

★ 2.3k May 8, 2026

AI Agents

Super Agent Party: Building Self-Evolving AI Companions with Desktop Vision and Multi-Platform Reach

★ 2.2k May 8, 2026

AI Agents

PentAGI: Multi-Agent AI Architecture for Autonomous Penetration Testing

★ 16.6k May 8, 2026

AI Agents

Building a Multi-Agent Penetration Testing System with AutoGen: A Deep Dive into AI-Powered Security Workflows

★ 30 May 8, 2026

Cybersecurity

MASAPT: When Academic Multi-Agent Systems Meet Penetration Testing Reality

★ 30 May 8, 2026

Starlog — Page 59

// LATEST

VulnBot: When Multi-Agent LLMs Take Over Penetration Testing

smolagents: Why Hugging Face Built an Agent Framework in Just 1,000 Lines

CAI: The Uncensored AI Framework Rewriting the Rules of Offensive Security

OpenManus-RL: Teaching LLM Agents to Think Better Through Reinforcement Learning

DeerFlow: ByteDance's Production-Grade Framework for Hour-Long Autonomous AI Agents

Inside Microsoft's AI Red Teaming Playground: Training Security Professionals to Break LLMs

Robin: Building a Multi-Agent System That Generates Drug Discovery Hypotheses

WebVoyager: Teaching GPT-4V to Navigate the Web Like a Human

HackBench: Measuring What Happens When LLMs Learn to Exploit Vulnerabilities

Inside the Daily Knowledge Engine Tracking 2,000+ Autonomous Agent Papers

Autono: Why Dynamic ReAct Beats Static Planning for Failure-Prone Agent Tasks

Persona-Hub: How Tencent's Billion-Scale Perspective Engine Reimagines Synthetic Data

ToolHive: Bringing Kubernetes-Grade Security to Model Context Protocol Servers

AgentDojo: The Security Benchmark That Exposes LLM Agents' Achilles Heel

Ruflo: Building Self-Learning Agent Swarms for Claude with Federation

Strudel: How TidalCycles' Pattern Algebra Was Reimagined for the Web

Magentic-UI: Microsoft's Plan-Then-Execute Web Agent That Shows Its Work

Agenspy: Protocol-First Architecture Brings Modern Agent Communication to DSPy

CyberGym: Building an AI Agent Benchmark on 10TB of Real Vulnerabilities

Building Interactive Knowledge Graphs from Text: A Three-Phase LLM Pipeline

Super Agent Party: Building Self-Evolving AI Companions with Desktop Vision and Multi-Platform Reach

PentAGI: Multi-Agent AI Architecture for Autonomous Penetration Testing

Building a Multi-Agent Penetration Testing System with AutoGen: A Deep Dive into AI-Powered Security Workflows

MASAPT: When Academic Multi-Agent Systems Meet Penetration Testing Reality