API-Bank: 工具增强型 LLMs 的基准测试（英文版）

发布者：wx****33

2023-04-22

495 KB 12 页

人工智能（AI）

文件列表：

API-Bank: 工具增强型 LLMs 的基准测试【英文版】.pdf

下载文档

资源简介

英文标题：API-Bank: A Benchmark for Tool-Augmented LLMs中文摘要：本文介绍了 API-Bank，它是第一个为工具增强的 LLMs 定制的基准测试，旨在全面评估 LLMs 规划逐步 API 调用、检索相关 API 和正确执行 API 调用以满足人类需求的能力，实验结果表明，GPT-3.5 在使用工具方面比 GPT3 有更好的性能，虽然 GPT-4 在规划性能方面更强，但仍有继续改进的空间，此外，详细的错误分析和案例研究证明了工具增强 LLMs 的可行性以及未来需要解决的主要挑战。英文摘要：Recent research has shown that Large Language Models (LLMs) can utilizeexternal tools to improve their contextual processing abilities, moving awayfrom the pure language modeling paradigm and paving the way for ArtificialGeneral Int

加载中...

已阅读到文档的结尾了

下载文档