Product
EnterprisePricingCompanyBlogCommunityDocsD

Evals

5 articles

January 30, 2026 Update

January 30, 2026 Update

OpenCode plugin with AI agent teams, complete queue system, sandbox snapshots, TTL support across services, eval lifecycle hooks, and 12 releases worth of improvements.

Agentuity v1 Reaches Beta

January 7, 2026 by Agentuity

Agentuity v1 Reaches Beta

Agentuity v1 reaches beta with sandbox infrastructure, SSH support, type-safe RPC, built-in auth, and a first-class evaluations system.

Summer Contributions - LLM as a Judge

June 10, 2025 by Jeff Haynie

Summer Contributions - LLM as a Judge

Joel, a student at University of Florida, takes LLM as a Judge and runs with it with this great pattern example built on Agentuity.

Summer Contributions - Evals

June 9, 2025 by Jeff Haynie

Summer Contributions - Evals

This evaluation system uses multiple specialized agents to create a robust, scalable framework for testing AI models against ground truth datasets. Each agent has a discrete task and can pass information to others through the Agentuity key-value store.

Collider: Our AI Gateway Testing With Intelligent Automation

May 29, 2025 by Bobby Christopher

Collider: Our AI Gateway Testing With Intelligent Automation

How Agentuity built Collider — an AI-powered testing framework that validates AI gateway integrations across models and runtimes, then auto-triages failures.

The full-stack platform
for AI agents

Copyright © 2026 Agentuity, Inc.

  • Contact
  • Privacy
  • Terms
  • Features
  • AI Gateway
  • APIs
  • Custom Domains
  • Evals
  • Instant I/O
  •  
  • React Frontend
  • Sandboxes
  • Storage
  • Workbench
  • Company
  • Enterprise
  • Pricing
  • Blog
  • About Us
  • Careers
  • FAQ
  • Links
  • App
  • Docs
  • Discord
XLinkedInYouTubeGitHubDiscord

Copyright © 2026 Agentuity, Inc.

  • Contact
  • Privacy
  • Terms

Thought Leadership, Developer Ready (TLDR)

AI Agent InfrastructureAI Agent DeploymentAI Agent ObservabilityAI Agent RuntimeMulti-Agent Orchestration