Drax：具有离散流匹配的语音识别

Drax：具有离散流匹配的语音识别
Drax: Speech Recognition with Discrete Flow Matching

原始链接: https://huggingface.co/papers/2510.04162

我们提出Drax，一种使用离散流匹配的非自回归ASR模型，它包含一个音频条件化的中间分布，以更好地匹配推理动态。Drax实现了与最先进的自回归模型相当的准确性，同时在准确率-效率权衡方面提供更好的控制。

黑客新闻新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交登录 Drax: 使用离散流匹配的语音识别 (huggingface.co) 7点由 cliffly 38分钟前 | 隐藏 | 过去 | 收藏 | 讨论考虑申请YC冬季2026批次！申请截止至11月10日指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请YC | 联系搜索：

We propose Drax, a non-autoregressive ASR model using discrete flow matching that includes an audio-conditioned intermediate distribution to better match inference dynamics.
Drax achieves accuracy comparable to state-of-the-art autoregressive models while offering better control over the accuracy-efficiency trade-off point.

Drax：具有离散流匹配的语音识别 Drax: Speech Recognition with Discrete Flow Matching

Drax：具有离散流匹配的语音识别
Drax: Speech Recognition with Discrete Flow Matching