OpenAI 称最新的 o1 模型处于“新水平”，可以“先思考后回答”

OpenAI 称最新的 o1 模型处于“新水平”，可以“先思考后回答”
OpenAI Says Latest o1 Model On "New Level", Can "Think Before It Answers"

原始链接: https://www.zerohedge.com/technology/openai-says-latest-o1-model-new-level-can-think-it-answers

人工智能公司 OpenAI 推出了名为 OpenAI o1 的新模型。该模型旨在“先思考后回答”，以解决更复杂的问题，尤其是在 STEM 科目和编程方面。与 GPT-4o 等之前的版本相比，OpenAI o1 在生物、化学和物理等科学领域表现更好，甚至在某些基准测试中优于 GPT-4o。虽然 OpenAI o1 在非 STEM 事务上可能没有那么广泛的了解，但他们计划在未来的更新中解决这个问题。值得注意的是，OpenAI o1 能够从头开始编写视频游戏并在演示过程中解决复杂的谜题。 ChatGPT Plus 用户目前可以访问该模型的“预览”版本，预计很快就会进行增强迭代。有专家预测OpenAI将在9月份推出以推理为中心的AI模型，名为Strawberry；然而，目前还没有明确确认这是否与 OpenAI o1 或其他正在开发的项目有关。

原文

Authored by Brayden Lindrea via CoinTelegraph.com,

OpenAI has released several new artificial intelligence models under a revised naming scheme — starting with its latest OpenAI o1 model it says can “think before it answers.”

“For complex reasoning tasks, this is a significant advancement and represents a new level of AI capability,” OpenAI said in a Sept. 12 blog post.

“Given this, we are resetting the counter back to one and naming this series OpenAI o1.”

The new models can take their time to think and use “chain-of-thought” reasoning to solve complex tasks — particularly in STEM (science, technology, engineering and math) and coding-related tasks, OpenAI said.

Source: OpenAI

The AI firm shared videos of OpenAI o1 coding a video game from a prompt and solving a complex logical puzzle, among other things.

The OpenAI o1 “preview” and “mini” models were made available to ChatGPT Plus subscribers with the firm planning to release improved versions in the coming months.

OpenAI shared data suggesting OpenAI o1 defeats GPT-4o in several benchmarks, including PhD-level science topics in Biology, Chemistry and Physics and some United States high school exams.

OpenAI o1 improvement model compared with GPT-4o on several benchmarks. Source: OpenAI

OpenAI o1 mini’s focus on STEM reasoning capabilities means it isn’t as knowledgeable in other areas outside of its narrow focus, OpenAI said.

“[Its] factual knowledge on non-STEM topics such as dates, biographies, and trivia is comparable to small LLMs such as GPT-4o mini.”

“We will improve these limitations in future versions, as well as experiment with extending the model to other modalities and specialties outside of STEM,” it added.

Industry pundits anticipated OpenAI would release a reasoning-focused AI model in September under the codename Strawberry.

However, OpenAI doesn’t disclose distinctions between different models under development.

OpenAI 称最新的 o1 模型处于“新水平”，可以“先思考后回答” OpenAI Says Latest o1 Model On "New Level", Can "Think Before It Answers"

OpenAI 称最新的 o1 模型处于“新水平”，可以“先思考后回答”
OpenAI Says Latest o1 Model On "New Level", Can "Think Before It Answers"