Rendered at 11:43:05 GMT+0000 (Coordinated Universal Time) with Cloudflare Workers.
RossBencina 9 hours ago [-]
Nice. To the folks saying "nothing to see here" this appears to be a variation of filter-bank spectral analysis where each band varies in frequency to track "the" in-band sinusoid. Somewhat like a bank of PLLs each with its own tracking bandpass filter. By using IIR filters rather than FFTs you avoid the latency of buffering up a full frame of data before you can run the FFT analysis. I am curious how this handles input containing broadband transients. It might be interesting to use CIC filters rather than an IIR lowpass to get better time selectivity, but maybe that's already been addressed, I didn't read the papers.
j-river 10 hours ago [-]
A useful demo for this kind of tool would be a side-by-side on difficult inputs: fast pitch bends, dense chords, and low-SNR recordings. Latency is easy to appreciate visually, but robustness under messy audio is what usually decides whether spectral tools become part of a workflow. Even a small set of repeatable test clips would make the tradeoff much clearer.
_def 8 hours ago [-]
Slowed down higher pitch example was nice to hear, as this is where often conventional methods are heavily artifacted
jmpman 12 hours ago [-]
There are some piano tuners I've found who are a bit on the spectrum, who believe they can tune a piano in a way that no digital device can replicate. I'm skeptical, and would like to see how this method holds up against one of these savants.
Spectral analysis has indeed been around as a concept for centuries and there have been apps based on the FFT for decades, so definitely nothing new there.
What I have implemented however, while based in known concepts and techniques, allows to achieve real-time, low latency and high resolution (both in time and frequency dimensions) performance that I believe are out of reach of established (published) methods.
The apps you link are most likely making use of the FFT, which has become widely supported with efficient hardware acceleration and easy to use libraries, because of its central role in ubiquitous DSP applications, e.g. compression.
I would be interested in any publications or at least technical descriptions of algorithms/systems that achieve similar performance!
rfgplk 4 days ago [-]
Is it the same algorithm or a similar domain? Overlap can exist
bialamusic 3 days ago [-]
It is more complex than the one described here. The idea is the same but for a working solution many different coefficients are needed and adjusted properly. Resonances are adjusted to have some match to the human perception.
It is all time domain as there are no real frequencies in sound.
It is good to see the idea investigated by more people but the man should not try to claim it as his own. We are doing such tings for years and I want this knowledge stays to people so no one should claim it
arjf 3 days ago [-]
Sounds really interesting! Could you share some description of the algorithm used for chord detection? What model of tonality are you using for pitch/chord naming?
bialamusic 1 days ago [-]
My email is alex@mlazev.com I will write some details when I have time.
Also very old stuff :)
https://apps.apple.com/us/app/chord-detector/id1495811175
It is good to see the idea investigated by more people but the man should not try to claim it as his own. We are doing such tings for years and I want this knowledge stays to people so no one should claim it