我们的下一代型号：Gemini 1.5

我们的下一代型号：Gemini 1.5
Our next-generation model: Gemini 1.5

原始链接: https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/

本文讨论了 Google 推出的 Gemini，特别是 Gemini 1.5 Pro，它在准确性、相关性、理解性、情境化以及长上下文处理等新颖功能方面展示了显着进步。 Gemini 1.5 Pro 将上下文扩展到 100 万个标记，通过识别较大文本中的小细节来提高解决问题的能力。它目前可供全球选定地区的开发人员和企业客户使用，但到年底将向所有用户开放。与前几代产品一样，Gemini 1.5 Pro 正在接受严格的安全评估。还提到，在全面上市之前，将在 AI Studio 和 Vertex AI 等平台上提供有限预览。安全考虑包括行为和代表性危害评估以及红队演习。

然而，为了避免丢失重要的历史细节，研究人员开发了在模型结构内紧凑地存储历史事件的方法，例如使用权重矩阵或采用压缩算法。这些方法使模型能够在高效运行的同时长时间保留关键信息。此外，研究人员利用缓存机制和渐进式训练策略来降低计算成本，从而实现更长的保留持续时间。最终，这些技术使模型能够在资源有限的环境中有效运行，并针对大量历史数据提供有意义的见解和发现。

原文

By Demis Hassabis, CEO of Google DeepMind, on behalf of the Gemini team

This is an exciting time for AI. New advances in the field have the potential to make AI more helpful for billions of people over the coming years. Since introducing Gemini 1.0, we’ve been testing, refining and enhancing its capabilities.

Today, we’re announcing our next-generation model: Gemini 1.5.

Gemini 1.5 delivers dramatically enhanced performance. It represents a step change in our approach, building upon research and engineering innovations across nearly every part of our foundation model development and infrastructure. This includes making Gemini 1.5 more efficient to train and serve, with a new Mixture-of-Experts (MoE) architecture.

The first Gemini 1.5 model we’re releasing for early testing is Gemini 1.5 Pro. It’s a mid-size multimodal model, optimized for scaling across a wide-range of tasks, and performs at a similar level to 1.0 Ultra, our largest model to date. It also introduces a breakthrough experimental feature in long-context understanding.

Gemini 1.5 Pro comes with a standard 128,000 token context window. But starting today, a limited group of developers and enterprise customers can try it with a context window of up to 1 million tokens via AI Studio and Vertex AI in private preview.

As we roll out the full 1 million token context window, we’re actively working on optimizations to improve latency, reduce computational requirements and enhance the user experience. We’re excited for people to try this breakthrough capability, and we share more details on future availability below.

These continued advances in our next-generation models will open up new possibilities for people, developers and enterprises to create, discover and build using AI.

我们的下一代型号：Gemini 1.5 Our next-generation model: Gemini 1.5

我们的下一代型号：Gemini 1.5
Our next-generation model: Gemini 1.5