展示HN：Transcribee：构建知识库的YouTube转录工具

展示HN：Transcribee：构建知识库的YouTube转录工具
Show HN: Transcribee: YouTube transcriber that builds a knowledge base

原始链接: https://github.com/itsfabioroma/transcribee

## Transcribee：您的自我组织知识库 Transcribee 是一款 macOS 开源工具，可自动转录来自 YouTube、Instagram Reels、TikTok 和本地文件的音频和视频。它的目标是以最少的努力构建一个可搜索的个人知识库。它的工作原理是：Transcribee 下载媒体，使用 ElevenLabs（带有说话人识别）进行转录，然后利用 Claude 将转录内容自动分类到 `~/Documents/transcripts/` 目录中的结构化文件夹系统中。转录文本带有说话人标签，可与 ChatGPT 和 Claude 等 LLM 一起使用。高级用户可以访问带有时间戳的原始转录数据。它可以通过 ClawdHub 或从 GitHub 手动克隆安装，需要依赖项，如 `yt-dlp` 和 `ffmpeg`，以及 ElevenLabs 和 Anthropic API 密钥。本质上，Transcribee 允许您“一次转录，永久保存知识”，将消费的内容转化为易于访问、有组织的资源。

Hacker News 新闻 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录 Show HN: Transcribee: YouTube 视频转录工具，构建知识库 (github.com/itsfabioroma) 10 分，by ofabioroma 2 小时前 | 隐藏 | 过去 | 收藏 | 1 条评论 ofabioroma 1 小时前 [–] 大家好，我刚刚更新了，现在也支持 ig reels。这是我个人常用的转录工具，顺便说一下，它在 Clawdbot/Moltbot 中也能很好地工作。告诉我你们是否喜欢它，以及有什么改进建议。回复指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系搜索：

原文

Open source macOS transcriber for YouTube, Instagram Reels, TikTok, and local media — evolves a self-organizing knowledge base.

transcribee "https://youtube.com/watch?v=..."
transcribee "https://instagram.com/reel/..."
transcribee "https://vt.tiktok.com/..."
transcribee ~/Downloads/podcast.mp3

Over time, your ~/Documents/transcripts/ folder naturally evolves into a personal library:

transcripts/
├── AI-Research/
│   ├── ilya-sutskever-agi-2024/
│   └── anthropic-constitutional-ai/
├── Startups/
│   ├── ycombinator-how-to-get-users/
│   └── pmarca-founder-mode/
└── Health/
    └── huberman-sleep-optimization/

Each transcript is speaker-labeled and ready to paste into ChatGPT, Claude, or any LLM.

I consume a lot of video content — YouTube, Instagram, TikTok, podcasts, interviews. I wanted to:

Ask questions about videos in LLMs
Have all that knowledge searchable and organized
Not do any manual work to maintain it

transcribee does exactly that. Transcribe once, knowledge stays forever.

Transcribes YouTube, Instagram Reels, TikTok, and local audio/video files
Speaker diarization — identifies different speakers
Auto-categorizes transcripts using Claude based on content
Builds a knowledge library that organizes itself over time

transcribee is available as a Clawdbot skill. Just ask your agent to transcribe any YouTube video:

"Transcribe this video: https://youtube.com/watch?v=..."

# Install from ClawdHub (recommended)
clawdhub install transcribee

# Or clone manually
git clone https://github.com/itsfabioroma/transcribee.git ~/.clawdbot/skills/transcribee

Make sure you have the dependencies installed (brew install yt-dlp ffmpeg) and API keys configured.

# Install dependencies (macOS)
brew install yt-dlp ffmpeg
pnpm install

# Configure API keys
cp .env.example .env
# Add your ElevenLabs + Anthropic API keys to .env

# Transcribe anything
transcribee "https://youtube.com/watch?v=..."
transcribee "https://instagram.com/reel/..."
transcribee "https://vt.tiktok.com/..."
transcribee ~/Downloads/podcast.mp3
transcribee ~/Videos/interview.mp4

Shell alias (recommended)

Add to ~/.zshrc:

alias transcribee="noglob /path/to/transcribee/transcribe.sh"

Each transcript saves to ~/Documents/transcripts/{category}/{title}/:

File	What it's for
`transcript.txt`	Speaker-labeled transcript — paste this into your LLM
`metadata.json`	Video info, language, auto-detected theme

For power users who need word-level timestamps and confidence scores:

transcribee --raw "https://youtube.com/watch?v=..."

This adds transcript-raw.json with the full ElevenLabs response.

Downloads audio from YouTube (yt-dlp) or extracts from local video (ffmpeg)
Transcribes with ElevenLabs (scribe_v1_experimental with speaker diarization)
Claude analyzes content and existing library structure
Auto-categorizes into the right folder
Saves transcript files with metadata

Type	Formats
Audio	mp3, m4a, wav, ogg, flac
Video	mp4, mkv, webm, mov, avi
URLs	youtube.com, youtu.be, instagram.com/reel, tiktok.com

bzz bzz 🐝

展示HN：Transcribee：构建知识库的YouTube转录工具 Show HN: Transcribee: YouTube transcriber that builds a knowledge base

Shell alias (recommended)

展示HN：Transcribee：构建知识库的YouTube转录工具
Show HN: Transcribee: YouTube transcriber that builds a knowledge base