New Benchmark for Evaluating API-Using Models
A fresh evaluation method for large language models using nested API calls.
Artificial Intelligence
2025-06-17T11:46:18+00:00 · 5 min read