常数Q变换:可视化指南
Constant Q Transform – A Visual Guide

原始链接: https://brendanjameslynskey.github.io/ConstantQ-Transform/

第一章 “恒定 Q” 意味着每个频率分区的中心频率与其带宽之比是恒定的。低音使用较宽的分析窗口(频率分辨率高);高音使用较窄的分析窗口(时间分辨率高)。这反映了音乐音高的工作原理。在钢琴上,从 C3 到 C4 的距离在感知上与从 C5 到 C6 的区间相同,尽管后者涵盖的赫兹范围是前者的四倍。CQT 将其分区按对数排列以匹配这一特性,通常每八度音程设置 12、24、36 或更多分区。Q 因子:Q = fk / Δfk。在标准的 FFT 中,Q 是变化的——低频分区较窄(高 Q 值),高频分区较宽(低 Q 值)。而在 CQT 中,每个分区的 Q 值相同。这就是为什么在 CQT 频谱图中,音符看起来是等间距分布的原因。

抱歉。
相关文章

原文

Chapter 1

"Constant Q" means the ratio of each frequency bin's centre frequency to its bandwidth is constant. Low notes get wide analysis windows (good frequency resolution); high notes get narrow windows (good time resolution).

This mirrors how musical pitch works. On a piano, the distance from C3 to C4 spans the same perceptual interval as C5 to C6, even though the latter covers four times the Hz range. The CQT spaces its bins logarithmically to match, typically placing 12, 24, 36, or more bins per octave.

Q factor: Q = fk / Δfk. In a standard FFT, Q varies — low-frequency bins are narrow (high Q), high-frequency bins are wide (low Q). In the CQT, Q is the same for every bin. This is why musical notes look equally spaced in a CQT spectrogram.

联系我们 contact @ memedata.com