Metaflow:构建、管理和部署人工智能/机器学习系统
Metaflow: Build, Manage and Deploy AI/ML Systems

原始链接: https://github.com/Netflix/metaflow

Metaflow 是一个以人为本的 Python 框架,用于构建和管理真实的 AI 和 ML 系统,它简化了从原型设计到生产的整个开发生命周期。 Metaflow 最初由 Netflix 开发,现在由 Outerbounds 支持,它统一了代码、数据和计算,从而实现无缝的端到端管理。 Metaflow 提高了研究和工程效率,支持从基础统计到高级深度学习的项目。 它提供了一个简单的 API,涵盖了实验跟踪、版本控制和可视化,从而促进了快速原型设计和在具有 CPU 和 GPU 的云基础设施上的扩展。 它可以轻松地进行依赖管理,并一键部署到生产协调器。 目前,Metaflow 为亚马逊、DoorDash 和高盛等不同公司的数千种 AI 和 ML 体验提供支持。 在 Netflix,它支持超过 3000 个项目,处理数 PB 的数据。 Metaflow 易于通过 pip 或 conda 安装,并通过一个教程指导用户完成他们的第一个流程。 丰富的文档和 Slack 社区提供支持。 欢迎为 Metaflow 做出贡献。

## Metaflow:基于 Python 的 AI/ML 系统编排器 Metaflow 是 Netflix 开源的一个项目,旨在简化 AI/ML 系统的构建、管理和部署。用户称赞它易于使用,数据科学家可以利用熟悉的 Python API 定义有向无环图 (DAG) 来描述工作流程。它擅长在 AWS Batch 和 Kubernetes 等平台上扩展并行作业,并提供用户友好的界面。 虽然主要设计用于 ML/AI,但 Metaflow 可以通过系统调用和 Docker 镜像处理非 Python 任务。它与 Airflow 等数据工程工具的区别在于,它专注于计算密集型的 ML 任务,并提供更好的实验支持。 最近的更新包括使用自定义装饰器和可配置流水线来组合流程的功能。Metaflow 与 Weights & Biases 等工具集成,并提供工件跟踪功能,可能减少了对单独的实验跟踪工具(如 MLflow)的需求。尽管 Netflix 的重点有所转变,但 Metaflow 仍然在积极开发中,并被 CloudKitchens 和 Flexport 等公司使用。
相关文章

原文

Metaflow_Logo_Horizontal_FullColor_Ribbon_Dark_RGB

Metaflow is a human-centric framework designed to help scientists and engineers build and manage real-life AI and ML systems. Serving teams of all sizes and scale, Metaflow streamlines the entire development lifecycle—from rapid prototyping in notebooks to reliable, maintainable production deployments—enabling teams to iterate quickly and deliver robust systems efficiently.

Originally developed at Netflix and now supported by Outerbounds, Metaflow is designed to boost the productivity for research and engineering teams working on a wide variety of projects, from classical statistics to state-of-the-art deep learning and foundation models. By unifying code, data, and compute at every stage, Metaflow ensures seamless, end-to-end management of real-world AI and ML systems.

Today, Metaflow powers thousands of AI and ML experiences across a diverse array of companies, large and small, including Amazon, Doordash, Dyson, Goldman Sachs, Ramp, and many others. At Netflix alone, Metaflow supports over 3000 AI and ML projects, executes hundreds of millions of data-intensive high-performance compute jobs processing petabytes of data and manages tens of petabytes of models and artifacts for hundreds of users across its AI, ML, data science, and engineering teams.

From prototype to production (and back)

Metaflow provides a simple and friendly pythonic API that covers foundational needs of AI and ML systems:

  1. Rapid local prototyping, support for notebooks, and built-in support for experiment tracking, versioning and visualization.
  2. Effortlessly scale horizontally and vertically in your cloud, utilizing both CPUs and GPUs, with fast data access for running massive embarrassingly parallel as well as gang-scheduled compute workloads reliably and efficiently.
  3. Easily manage dependencies and deploy with one-click to highly available production orchestrators with built in support for reactive orchestration.

For full documentation, check out our API Reference or see our Release Notes for the latest features and improvements.

Getting up and running is easy. If you don't know where to start, Metaflow sandbox will have you running and exploring in seconds.

To install Metaflow in your Python environment from PyPI:

Alternatively, using conda-forge:

conda install -c conda-forge metaflow

Once installed, a great way to get started is by following our tutorial. It walks you through creating and running your first Metaflow flow step by step.

For more details on Metaflow’s features and best practices, check out:

If you need help, don’t hesitate to reach out on our Slack community!

Deploying infrastructure for Metaflow in your cloud

While you can get started with Metaflow easily on your laptop, the main benefits of Metaflow lie in its ability to scale out to external compute clusters and to deploy to production-grade workflow orchestrators. To benefit from these features, follow this guide to configure Metaflow and the infrastructure behind it appropriately.

We'd love to hear from you. Join our community Slack workspace!

We welcome contributions to Metaflow. Please see our contribution guide for more details.

联系我们 contact @ memedata.com