🎤 Stem separation
Vocals are isolated from instrumentals using the UVR Karaoke model or Demucs. Guide vocal volume is adjustable.
📝 Word-level lyrics
WhisperX transcribes and aligns every word to the audio. Existing lyrics from LRCLIB are used when available.
🎯 Pitch scoring
Sing into your mic and get scored in real-time. Star ratings and per-song scoreboards track your progress.
👤 Player profiles
Multiple profiles with separate score histories. Switch between singers without losing anyone's records.
🎬 Video file support
Drop .mp4 or .mkv files into your library. Vocals are separated and the original video plays as the background.
🌌 Dynamic backgrounds
GPU shader effects (plasma, aurora, nebula...), Pixabay video loops, or the source video for video files.
🎮 Gamepad
Navigate menus, pick songs, and control playback entirely with a controller. D-pad, sticks, face buttons.
📦 Single binary
ffmpeg, Python, PyTorch, and the ML models are all bootstrapped on first launch. Nothing to install.