Blog
Announcing our $7.4M seed fundraising to build the AI Operating System for security teams

Evaluating AI Agents in Security Operations (December 2025)

The Security Infrastructure Powering EliseAI's Rapid Scale
How EliseAI uses Cotool to automate investigations, expand detection coverage, and defend customer data across housing and healthcare

Engineering Blogs
Research Blogs

Evaluating AI Agents in Security Operations (December 2025)
Which frontier model should you use for SecOps automation? We added the latest cohort of frontier models to our benchmark to find out.

Evaluating AI Agents in Security Operations
We benchmarked frontier AI models on realistic security operations (SecOps) tasks using Cotool’s agent harness and the Splunk BOTSv3 dataset. GPT-5 achieved the highest accuracy (63%), while Claude Haiku 4.5 completed tasks the fastest and still delivered strong accuracy. GPT-5 variants dominated the performance-cost frontier. These results offer practical guidance for model selection in enterprise SecOps automation.
Company Blogs
Announcing our $7.4M seed fundraising to build the AI Operating System for security teams
Cotool announces a $7.4M seed round led by Andreessen Horowitz to build the AI Operating System for security teams
March 5, 2026
Security without the Spectacle
Why would anyone want to start an AI security company?
May 12, 2025
Attackers are scaling with tokens
Cotool helps defenders operate at machine speed. See how security teams are scaling Detection & Response.
All Reads


Context Management for Agentic Security
How we are solving the LLM Security Data problem
