Show HN: Transcribee: YouTube transcriber that builds a knowledge base

Original link: https://github.com/itsfabioroma/transcribee

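## Transcribee: Your Self-Organizing Knowledge Base

Transcribee is an open-source macOS tool that automatically transcribes audio and video from YouTube, Instagram Reels, TikTok, and local files. Its goal is to build a searchable personal knowledge base with minimal effort.

How it works: Transcribee downloads the media, transcribes it with ElevenLabs (with speaker identification), and then uses Claude to sort the transcript automatically into a structured folder system under `~/Documents/transcripts/`.

Transcripts are speaker-labeled and can be used with LLMs such as ChatGPT and Claude. Power users can access the raw transcription data with timestamps. The tool can be installed via ClawdHub or by cloning the GitHub repository manually, and it requires dependencies such as `yt-dlp` and `ffmpeg` plus ElevenLabs and Anthropic API keys.

In essence, Transcribee lets you "transcribe once, keep the knowledge forever," turning the content you consume into an accessible, organized resource.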

Show HN: Transcribee: YouTube transcriber that builds a knowledge base (github.com/itsfabioroma)
10 points by ofabioroma 2 hours ago | 1 comment

ofabioroma 1 hour ago: Hi everyone, I just updated it and it now supports IG Reels too. This is the transcriber I personally use all the time, and by the way, it also works well in Clawdbot/Moltbot. Let me know whether you like it and what could be improved.

Original text

Open source macOS transcriber for YouTube, Instagram Reels, TikTok, and local media that grows into a self-organizing knowledge base.

transcribee "https://youtube.com/watch?v=..."
transcribee "https://instagram.com/reel/..."
transcribee "https://vt.tiktok.com/..."
transcribee ~/Downloads/podcast.mp3

Over time, your ~/Documents/transcripts/ folder naturally evolves into a personal library:

transcripts/
├── AI-Research/
│   ├── ilya-sutskever-agi-2024/
│   └── anthropic-constitutional-ai/
├── Startups/
│   ├── ycombinator-how-to-get-users/
│   └── pmarca-founder-mode/
└── Health/
    └── huberman-sleep-optimization/

Each transcript is speaker-labeled and ready to paste into ChatGPT, Claude, or any LLM.
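
Because the library is just folders on disk, it can be browsed or searched with a few lines of scripting. The sketch below is not part of transcribee. It assumes the `{category}/{title}/transcript.txt` layout described in this README, and the helper names `list_library` and `search_library` are hypothetical.

```python
from pathlib import Path


def list_library(root: Path) -> dict[str, list[str]]:
    """Map each category folder to the sorted transcript folders inside it."""
    library: dict[str, list[str]] = {}
    if not root.is_dir():
        return library
    for category in sorted(p for p in root.iterdir() if p.is_dir()):
        library[category.name] = sorted(
            t.name for t in category.iterdir() if t.is_dir()
        )
    return library


def search_library(root: Path, needle: str) -> list[Path]:
    """Return transcript.txt files whose text contains `needle` (case-insensitive)."""
    needle = needle.lower()
    hits = []
    for transcript in sorted(root.glob("*/*/transcript.txt")):
        text = transcript.read_text(encoding="utf-8", errors="replace")
        if needle in text.lower():
            hits.append(transcript)
    return hits


if __name__ == "__main__":
    root = Path.home() / "Documents" / "transcripts"
    for category, titles in list_library(root).items():
        print(category)
        for title in titles:
            print("  " + title)
```

A plain substring search like this is only a starting point; it is enough to find which transcript mentioned a phrase before pasting that file into an LLM.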

I consume a lot of video content — YouTube, Instagram, TikTok, podcasts, interviews. I wanted to:

  • Ask questions about videos in LLMs
  • Have all that knowledge searchable and organized
  • Not do any manual work to maintain it

transcribee does exactly that. Transcribe once, knowledge stays forever.

  • Transcribes YouTube, Instagram Reels, TikTok, and local audio/video files
  • Speaker diarization — identifies different speakers
  • Auto-categorizes transcripts using Claude based on content
  • Builds a knowledge library that organizes itself over time

transcribee is available as a Clawdbot skill. Just ask your agent to transcribe any YouTube video:

"Transcribe this video: https://youtube.com/watch?v=..."

# Install from ClawdHub (recommended)
clawdhub install transcribee

# Or clone manually
git clone https://github.com/itsfabioroma/transcribee.git ~/.clawdbot/skills/transcribee

Make sure you have the dependencies installed (brew install yt-dlp ffmpeg) and API keys configured.

# Install dependencies (macOS)
brew install yt-dlp ffmpeg
pnpm install

# Configure API keys
cp .env.example .env
# Add your ElevenLabs + Anthropic API keys to .env

# Transcribe anything
transcribee "https://youtube.com/watch?v=..."
transcribee "https://instagram.com/reel/..."
transcribee "https://vt.tiktok.com/..."
transcribee ~/Downloads/podcast.mp3
transcribee ~/Videos/interview.mp4
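
Before the first run, it can help to confirm that both API keys actually made it into `.env`. The following is a minimal sketch, not part of transcribee. The variable names `ELEVENLABS_API_KEY` and `ANTHROPIC_API_KEY` are assumptions; the authoritative names are whatever `.env.example` in the repository uses.

```python
from pathlib import Path

# Assumed variable names; check the repository's .env.example for the real ones.
REQUIRED_KEYS = ("ELEVENLABS_API_KEY", "ANTHROPIC_API_KEY")


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blanks and # comments."""
    values: dict[str, str] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding quotes, which .env files often include.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def missing_keys(text: str, required=REQUIRED_KEYS) -> list[str]:
    """Return the required keys that are absent or empty."""
    values = parse_env(text)
    return [k for k in required if not values.get(k)]


if __name__ == "__main__":
    env_file = Path(".env")
    text = env_file.read_text(encoding="utf-8") if env_file.exists() else ""
    print("missing:", missing_keys(text))
```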

Shell alias (recommended)

Add to ~/.zshrc:

alias transcribee="noglob /path/to/transcribee/transcribe.sh"

Each transcript saves to ~/Documents/transcripts/{category}/{title}/:

| File | What it's for |
| --- | --- |
| transcript.txt | Speaker-labeled transcript (paste this into your LLM) |
| metadata.json | Video info, language, auto-detected theme |
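
To use a saved transcript programmatically, the two files can be read together. This is a sketch under stated assumptions: `load_transcript` and `build_prompt` are hypothetical helpers, and because the exact fields in `metadata.json` are not documented here, the metadata is passed through as-is rather than accessed by key.

```python
import json
from pathlib import Path


def load_transcript(folder: Path) -> tuple[str, dict]:
    """Read transcript.txt and metadata.json from one transcript folder."""
    text = (folder / "transcript.txt").read_text(encoding="utf-8")
    metadata = json.loads((folder / "metadata.json").read_text(encoding="utf-8"))
    return text, metadata


def build_prompt(folder: Path, question: str) -> str:
    """Combine a question with the transcript so it can be pasted into an LLM."""
    text, metadata = load_transcript(folder)
    header = json.dumps(metadata, ensure_ascii=False, indent=2)
    return f"Metadata:\n{header}\n\nTranscript:\n{text}\n\nQuestion: {question}"
```

Calling `build_prompt(folder, "What are the main claims?")` returns a single string that can be pasted into ChatGPT, Claude, or any other LLM.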

For power users who need word-level timestamps and confidence scores:

transcribee --raw "https://youtube.com/watch?v=..."

This adds transcript-raw.json with the full ElevenLabs response.
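
One use for the raw file is rebuilding timed speaker turns from the word-level data. The sketch below assumes the JSON has a top-level `words` list whose entries carry `text`, `start`, `end`, and `speaker_id`, and that whitespace arrives as its own entries so texts can be concatenated directly. That schema is an assumption about the ElevenLabs response, not something this README specifies, so inspect your own `transcript-raw.json` before relying on it. `speaker_turns` is a hypothetical helper.

```python
import json
from pathlib import Path


def speaker_turns(raw: dict) -> list[dict]:
    """Merge consecutive words from the same speaker into timed turns."""
    turns: list[dict] = []
    for word in raw.get("words", []):
        speaker = word.get("speaker_id")
        if turns and turns[-1]["speaker"] == speaker:
            # Same speaker: extend the current turn.
            turns[-1]["text"] += word.get("text", "")
            turns[-1]["end"] = word.get("end", turns[-1]["end"])
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append({
                "speaker": speaker,
                "start": word.get("start"),
                "end": word.get("end"),
                "text": word.get("text", ""),
            })
    return turns


def load_turns(path: Path) -> list[dict]:
    """Load transcript-raw.json and return its speaker turns."""
    return speaker_turns(json.loads(path.read_text(encoding="utf-8")))
```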

  1. Downloads audio from YouTube (yt-dlp) or extracts from local video (ffmpeg)
  2. Transcribes with ElevenLabs (scribe_v1_experimental with speaker diarization)
  3. Claude analyzes content and existing library structure
  4. Auto-categorizes into the right folder
  5. Saves transcript files with metadata
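
Step 1 above can be sketched as two command builders. This is an illustration, not the project's actual code: the exact flags transcribee passes are not stated here, so the ones below are simply standard `yt-dlp` and `ffmpeg` options for getting an mp3. The function names and the output naming scheme are hypothetical.

```python
from pathlib import Path


def download_audio_cmd(url: str, out_dir: Path) -> list[str]:
    """Build a yt-dlp invocation that keeps only the audio track as mp3."""
    return [
        "yt-dlp",
        "--extract-audio",            # discard the video stream
        "--audio-format", "mp3",
        "--output", str(out_dir / "%(id)s.%(ext)s"),  # yt-dlp output template
        url,
    ]


def extract_audio_cmd(video: Path, out_dir: Path) -> list[str]:
    """Build an ffmpeg invocation that drops video (-vn) and writes an mp3."""
    return [
        "ffmpeg",
        "-i", str(video),
        "-vn",
        str(out_dir / (video.stem + ".mp3")),
    ]
```

Either list can be handed to `subprocess.run(cmd, check=True)`. Building the command as a list rather than a shell string avoids quoting problems with URLs that contain `?` or `&`.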

| Type | Formats |
| --- | --- |
| Audio | mp3, m4a, wav, ogg, flac |
| Video | mp4, mkv, webm, mov, avi |
| URLs | youtube.com, youtu.be, instagram.com/reel, tiktok.com |
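
The supported formats lend themselves to a small input classifier. The sketch below is one possible way to decide whether an argument is a URL, a local audio file, or a local video file. `classify_source` is a hypothetical function, and it checks only the host name, so it does not enforce the `/reel` path for Instagram links.

```python
from pathlib import Path
from urllib.parse import urlparse

AUDIO_EXTS = {".mp3", ".m4a", ".wav", ".ogg", ".flac"}
VIDEO_EXTS = {".mp4", ".mkv", ".webm", ".mov", ".avi"}
URL_HOSTS = ("youtube.com", "youtu.be", "instagram.com", "tiktok.com")


def classify_source(source: str) -> str:
    """Return 'url', 'audio', or 'video'; raise ValueError if unsupported."""
    parsed = urlparse(source)
    if parsed.scheme in ("http", "https"):
        host = (parsed.hostname or "").lower()
        # Accept the bare domain and any subdomain (www., vt., m., ...).
        if any(host == h or host.endswith("." + h) for h in URL_HOSTS):
            return "url"
        raise ValueError(f"unsupported host: {host}")
    ext = Path(source).suffix.lower()
    if ext in AUDIO_EXTS:
        return "audio"
    if ext in VIDEO_EXTS:
        return "video"
    raise ValueError(f"unsupported file type: {ext or source}")
```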

bzz bzz 🐝
