跳转到主要内容
HuntAI
首页排行榜分类
提交
HuntAI

通过 SimilarWeb 真实流量数据,发现最佳 AI 工具。

探索

首页排行榜分类

资源

提交工具

数据

流量数据由 SimilarWeb 提供,每日更新。

© 2026 HuntAI. 保留所有权利。

Privacy PolicyTerms of Service

真实流量数据 · 每日更新

  1. 首页
  2. /
  3. API
  4. /
  5. APIEval-20
APIEval-20

APIEval-20

An open benchmark for AI agents that test APIs

APIArtificial IntelligenceDeveloper Tools
访问网站

No traffic data available yet

Data is sourced from SimilarWeb

关于 APIEval-20

APIEval-20 is a black-box benchmark for API testing agents. Each agent gets only a JSON schema and one sample payload, then generates a test suite. We run those tests against live reference APIs with planted bugs and score bug detection, API coverage, and efficiency. Unlike LLM-as-judge evals, scoring is fully objective: a bug is either caught or it isn’t. Tasks span auth, errors, pagination, schemas, and multi-step flows. Open on Hugging Face.

常见问题

APIEval-20 是什么?

APIEval-20 是 APIEval-20 is a black-box benchmark for API testing agents. Each agent gets only a JSON schema and one sample payload, then generates a test suite. We run those tests against live reference APIs with planted bugs and score bug detection, API coverage, and efficiency. Unlike LLM-as-judge evals, scoring is fully objective: a bug is either caught or it isn’t. Tasks span auth, errors, pagination, schemas, and multi-step flows. Open on Hugging Face.

APIEval-20 免费吗?

是的,APIEval-20 免费使用。

APIEval-20 有哪些替代品?

APIEval-20 的热门替代品包括 Airtop Auth, OpenAI o3-mini, Anything API, Fish Audio S1。可在上方比较它们的月流量和功能。

Best API Alternatives to APIEval-20

Airtop Auth

Airtop Auth

54.8K/mo

OpenAI o3-mini

OpenAI o3-mini

191.2M/mo

Anything API

Anything API

Fish Audio S1

Fish Audio S1

2.6M/mo

ChatGPT Images

ChatGPT Images

191.2M/mo

Gemini Deep Research Agent

Gemini Deep Research Agent

8.5M/mo