Show HN: Pipelex – Declarative language for repeatable AI workflows

Original link: https://github.com/Pipelex/pipelex

## Pipelex: Build Repeatable AI Workflows

Pipelex is an open-source language for creating and running reliable AI workflows. Instead of one complex prompt, it breaks a task into focused, validated steps called "pipes" and uses "Concepts" for clear data typing.

**Getting started:**

1. **Install:** `pip install pipelex`, then initialize with `pipelex init`.
2. **API key:** get a free API key from the Pipelex Discord, or bring your own keys for OpenAI, Anthropic, Google, Mistral, or local AI models (Ollama, vLLM, etc.).
3. **Build a workflow:** use `pipelex build` to generate a `.plx` file that defines your workflow, for example a CV/job-offer matching pipeline.

**Key features:**

* **Readable:** `.plx` files are designed to be easy to understand.
* **Modular:** pipelines are assembled from reusable pipes.
* **Extensible:** supports a range of AI providers, plus OCR and image generation.
* **Iterative development:** AI assistants (Cursor, Copilot, etc.) help refine pipelines in natural language.
* **Telemetry:** optional, anonymous usage data collection for product improvement (configurable).

**Running and contributing:** run pipelines via the CLI or Python, explore examples in the Cookbook repository, and contribute through GitHub. Join the Discord community for support and collaboration.
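
In shell form, the quick start boils down to the following commands (taken verbatim from the tutorial below; the build prompt and file paths are just the CV-matching example used there):

pip install pipelex
pipelex init
pipelex build pipe "Take a CV and Job offer in PDF, analyze if they match and generate 5 questions for the interview" --output results/cv_match.plx
pipelex run results/cv_match.plx --inputs inputs.json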

## Pipelex: Declarative Workflows for Repeatable AI

Pipelex is a new DSL (domain-specific language) and Python runtime for building repeatable, auditable AI workflows, positioned as "Dockerfiles or SQL for LLM pipelines." Created by Robin, Louis, and Thomas, it focuses on *declaring* what an AI workflow should do and lets the runtime handle the execution details. Key features include an "agent-first" approach, where natural-language context is embedded in every step so LLMs can understand and optimize it, and an MIT-licensed open standard with tooling such as a Python library, an API server, a VS Code extension, and an n8n integration. Pipelex aims to address the limitations of existing AI workflow builders by offering determinism, reproducibility, and a structured syntax that both humans and AI can read. The creators emphasize composability, letting users build and share workflows. Still under active development, Pipelex is seeking community feedback on workflow building, agent/MCP usability, and potential integrations. Despite some current limitations (connector availability, visualization, a hosted API), the project offers a promising approach to reliable, collaborative AI workflow development. [https://github.com/Pipelex/pipelex](https://github.com/Pipelex/pipelex)

Pipelex Tutorial

1. Install Pipelex

pip install pipelex
pipelex init

2. Get Your API Key (Free)

To use AI models, you need an API key:

  • Free Pipelex API Key: Join our Discord community and request your free API key (no credit card required) in the 🔑・free-api-key channel.
  • Bring your own API keys: OpenAI, Anthropic, Google, Mistral
  • Local AI: Ollama, vLLM, LM Studio, llama.cpp... any OpenAI-compatible endpoint, or even non-standard APIs, since you can plug in your own.

See Configure AI Providers for details.
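
Provider keys are usually supplied through environment variables; the exact variable names are listed in the Configure AI Providers documentation. As an assumption based on the providers' standard conventions, bringing your own OpenAI key would look like:

export OPENAI_API_KEY="sk-..."   # assumed variable name; check Configure AI Providers for the exact one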

3. Generate Your First Workflow

Create a complete AI workflow with a single command:

pipelex build pipe "Take a CV and Job offer in PDF, analyze if they match and generate 5 questions for the interview" --output results/cv_match.plx

This command generates a production-ready .plx file with domain definitions, concepts, and multiple processing steps that analyze CV-job fit and prepare interview questions.

cv_match.plx

domain = "cv_match"
description = "Matching CVs with job offers and generating interview questions"
main_pipe = "analyze_cv_job_match_and_generate_questions"

[concept.MatchAnalysis]
description = """
Analysis of alignment between a candidate and a position, including strengths, gaps, and areas requiring further exploration.
"""

[concept.MatchAnalysis.structure]
strengths = { type = "text", description = "Areas where the candidate's profile aligns well with the requirements", required = true }
gaps = { type = "text", description = "Areas where the candidate's profile does not meet the requirements or lacks evidence", required = true }
areas_to_probe = { type = "text", description = "Topics or competencies that need clarification or deeper assessment during the interview", required = true }

[concept.Question]
description = "A single interview question designed to assess a candidate."
refines = "Text"

[pipe.analyze_cv_job_match_and_generate_questions]
type = "PipeSequence"
description = """
Main pipeline that orchestrates the complete CV-job matching and interview question generation workflow. Takes a candidate's CV and a job offer as PDF documents, extracts their content, performs a comprehensive match analysis identifying strengths, gaps, and areas to probe, and generates exactly 5 targeted interview questions based on the analysis results.
"""
inputs = { cv_pdf = "PDF", job_offer_pdf = "PDF" }
output = "Question[5]"
steps = [
    { pipe = "extract_documents_parallel", result = "extracted_documents" },
    { pipe = "analyze_match", result = "match_analysis" },
    { pipe = "generate_interview_questions", result = "interview_questions" },
]
Supporting pipes implementation:
[pipe.extract_documents_parallel]
type = "PipeParallel"
description = """
Executes parallel extraction of text content from both the CV PDF and job offer PDF simultaneously to optimize processing time.
"""
inputs = { cv_pdf = "PDF", job_offer_pdf = "PDF" }
output = "Dynamic"
parallels = [
    { pipe = "extract_cv_text", result = "cv_pages" },
    { pipe = "extract_job_offer_text", result = "job_offer_pages" },
]
add_each_output = true

[pipe.extract_cv_text]
type = "PipeExtract"
description = """
Extracts text content from the candidate's CV PDF document using OCR technology, converting all pages into machine-readable text format for subsequent analysis.
"""
inputs = { cv_pdf = "PDF" }
output = "Page[]"
model = "extract_text_from_pdf"

[pipe.extract_job_offer_text]
type = "PipeExtract"
description = """
Extracts text content from the job offer PDF document using OCR technology, converting all pages into machine-readable text format for subsequent analysis.
"""
inputs = { job_offer_pdf = "PDF" }
output = "Page[]"
model = "extract_text_from_pdf"

[pipe.analyze_match]
type = "PipeLLM"
description = """
Performs comprehensive analysis comparing the candidate's CV against the job offer requirements. Identifies and structures: (1) strengths where the candidate's profile aligns well with requirements, (2) gaps where the profile lacks evidence or doesn't meet requirements, and (3) specific areas requiring deeper exploration or clarification during the interview process.
"""
inputs = { cv_pages = "Page[]", job_offer_pages = "Page[]" }
output = "MatchAnalysis"
model = "llm_to_answer_hard_questions"
system_prompt = """
You are an expert HR analyst and recruiter specializing in candidate-job fit assessment. Your task is to generate a structured MatchAnalysis comparing a candidate's CV against job requirements.
"""
prompt = """
Analyze the match between the candidate's CV and the job offer requirements.

Candidate CV:
@cv_pages

Job Offer:
@job_offer_pages

Perform a comprehensive comparison and provide a structured analysis.
"""

[pipe.generate_interview_questions]
type = "PipeLLM"
description = """
Generates exactly 5 targeted, relevant interview questions based on the match analysis results. Questions are designed to probe identified gaps, clarify areas of uncertainty, validate strengths, and assess competencies that require deeper evaluation to determine candidate-position fit.
"""
inputs = { match_analysis = "MatchAnalysis" }
output = "Question[5]"
model = "llm_to_write_questions"
system_prompt = """
You are an expert HR interviewer and talent assessment specialist. Your task is to generate structured interview questions based on candidate-position match analysis.
"""
prompt = """
Based on the following match analysis between a candidate and a position, generate exactly 5 targeted interview questions.

@match_analysis

The questions should:
- Probe the identified gaps to assess if they are deal-breakers or can be mitigated
- Clarify areas that require deeper exploration
- Validate the candidate's strengths with concrete examples
- Be open-ended and behavioral when appropriate
- Help determine overall candidate-position fit

Generate exactly 5 interview questions.
"""

View the pipeline flowchart:

flowchart TD
 subgraph PAR["extract_documents_parallel (PipeParallel)"]
    direction LR
        EXTRACT_CV["extract_cv_text (PipeExtract)"]
        EXTRACT_JOB["extract_job_offer_text (PipeExtract)"]
  end
 subgraph MAIN["analyze_cv_job_match_and_generate_questions (PipeSequence)"]
    direction TB
        PAR
        CV_PAGES[["cv_pages: Page"]]
        JOB_PAGES[["job_offer_pages: Page"]]
        ANALYZE["analyze_match (PipeLLM)"]
        MATCH[["MatchAnalysis"]]
        GENERATE["generate_interview_questions (PipeLLM)"]
        OUT[["Question"]]
  end
    CV_IN[["cv_pdf: PDF"]] --> EXTRACT_CV
    JOB_IN[["job_offer_pdf: PDF"]] --> EXTRACT_JOB
    EXTRACT_CV --> CV_PAGES
    EXTRACT_JOB --> JOB_PAGES
    CV_PAGES --> ANALYZE
    JOB_PAGES --> ANALYZE
    ANALYZE --> MATCH
    MATCH --> GENERATE
    GENERATE --> OUT
    classDef default stroke:#1976D2,stroke-width:2px,fill:#E3F2FD,color:#0D47A1
    style EXTRACT_CV stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
    style EXTRACT_JOB stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
    style PAR fill:#FFF9C4,stroke:#F57C00,stroke-width:2px
    style CV_PAGES stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
    style JOB_PAGES stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
    style ANALYZE stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
    style MATCH stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
    style GENERATE stroke:#1565C0,fill:#BBDEFB,color:#0D47A1
    style OUT stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
    style CV_IN stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
    style JOB_IN stroke:#2E7D32,fill:#C8E6C9,color:#1B5E20
    style MAIN fill:#F3E5F5,stroke:#7B1FA2,stroke-width:2px

4. Run Your Workflow

Via CLI:

# Run with input file
pipelex run results/cv_match.plx --inputs inputs.json

Create an inputs.json file with your PDF URLs:

{
  "cv_pdf": {
    "concept": "PDF",
    "content": {
      "url": "https://pipelex-web.s3.amazonaws.com/demo/John-Doe-CV.pdf"
    }
  },
  "job_offer_pdf": {
    "concept": "PDF",
    "content": {
      "url": "https://pipelex-web.s3.amazonaws.com/demo/Job-Offer.pdf"
    }
  }
}

Via Python:

import asyncio
import json
from pipelex.pipeline.execute import execute_pipeline
from pipelex.pipelex import Pipelex

async def run_pipeline():
    with open("inputs.json", encoding="utf-8") as f:
        inputs = json.load(f)

    pipe_output = await execute_pipeline(
        pipe_code="cv_match",
        inputs=inputs
    )
    print(pipe_output.main_stuff_as_str)

Pipelex.make()
asyncio.run(run_pipeline())
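
Equivalently, the inputs can be built directly in Python instead of loading inputs.json; this is just the example above with the same dict structure inlined (assuming execute_pipeline accepts the same inputs mapping):

import asyncio
from pipelex.pipeline.execute import execute_pipeline
from pipelex.pipelex import Pipelex

# Same structure as inputs.json: each input declares its concept and content.
inputs = {
    "cv_pdf": {
        "concept": "PDF",
        "content": {"url": "https://pipelex-web.s3.amazonaws.com/demo/John-Doe-CV.pdf"},
    },
    "job_offer_pdf": {
        "concept": "PDF",
        "content": {"url": "https://pipelex-web.s3.amazonaws.com/demo/Job-Offer.pdf"},
    },
}

async def run_pipeline():
    pipe_output = await execute_pipeline(pipe_code="cv_match", inputs=inputs)
    print(pipe_output.main_stuff_as_str)

Pipelex.make()
asyncio.run(run_pipeline())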

5. Iterate with AI Assistance

Install AI assistant rules to easily modify your pipelines:

This installs rules for Cursor, Claude, OpenAI Codex, GitHub Copilot, Windsurf, and Blackbox AI. Now you can refine pipelines with natural language:

  • "Include confidence scores between 0 and 100 in the match analysis"
  • "Write a recap email at the end"

🚀 See Pipelex in Action

Pipelex is an open-source language that enables you to build and run repeatable AI workflows. Instead of cramming everything into one complex prompt, you break tasks into focused steps, each pipe handling one clear transformation.

Each pipe processes information using Concepts (typing with meaning) to ensure your pipelines make sense. The Pipelex language (.plx files) is simple and human-readable, even for non-technical users. Each step can be structured and validated, giving you the reliability of software with the intelligence of AI.

Learn More:

We highly recommend installing our extension for .plx files into your IDE. You can find it in the Open VSX Registry; it's coming soon to the VS Code Marketplace too. If you're using Cursor, Windsurf, or another VS Code fork, you can search for it directly in your extensions tab.

Explore real-world examples in our Cookbook repository:

GitHub

Clone it, fork it, and experiment with production-ready pipelines for various use cases.

The package supports the following additional features:

  • anthropic: Anthropic/Claude support for text generation
  • google: Google models (Vertex) support for text generation
  • mistralai: Mistral AI support for text generation and OCR
  • bedrock: Amazon Bedrock support for text generation
  • fal: Image generation with Black Forest Labs "FAL" service

Install all extras:

Using pip:

pip install "pipelex[anthropic,google,google-genai,mistralai,bedrock,fal]"

Pipelex collects optional, anonymous usage data to help improve the product. On first run, you'll be prompted to choose your telemetry preference:

  • Off: No telemetry data collected
  • Anonymous: Anonymous usage data only (command usage, performance metrics, feature usage)
  • Identified: Usage data with user identification (helps us provide better support)

Your prompts, LLM responses, file paths, and URLs are automatically redacted and never transmitted. You can change your preference at any time or disable telemetry completely by setting the DO_NOT_TRACK environment variable.
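
For example, to disable telemetry for a run (assuming the usual DO_NOT_TRACK convention, where any non-empty value turns tracking off):

DO_NOT_TRACK=1 pipelex run results/cv_match.plx --inputs inputs.json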

For more details, see the Telemetry Documentation or read our Privacy Policy.

We welcome contributions! Please see our Contributing Guidelines for details on how to get started, including development setup and testing information.

Join our vibrant Discord community to connect with other developers, share your experiences, and get help with your Pipelex projects!

Discord

  • GitHub Issues: For bug reports and feature requests
  • Discussions: For questions and community discussions
  • Documentation

If you find Pipelex helpful, please consider giving us a star! It helps us reach more developers and continue improving the tool.

This project is licensed under the MIT license. Runtime dependencies are distributed under their own licenses via PyPI.


"Pipelex" is a trademark of Evotis S.A.S.

© 2025 Evotis S.A.S.
