OpenAI: 2024 OpenAI o1 Large Model Technical Report (English Edition)

Uploaded by: wx****53

2024-10-11

2 MB, 43 pages

Artificial Intelligence (AI)

File list:

OpenAI ：2024年OpenAI o1大模型技术报告（英文版）.pdf

Resource summary

The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. These advanced reasoning capabilities provide new avenues for improving the safety and robustness of our models. In particular, our models can reason about our safety policies in context when responding to potentially unsafe prompts. This leads to state-of-the-art performance on certain benchmarks for risks such as generating illicit advice, choosing stereotyped responses, and succumbing t
