Benchmark for API ModelBenchmark for API ModelTestingsequences.Assessing models on complex API callArtificial IntelligenceNew Benchmark for Evaluating API-Using ModelsA fresh evaluation method for large language models using nested API calls.Jun 17, 2025 ― 5 min read