MFCCs combine aspects of human hearing (logarithmic frequency perception, the mel scale) with the physics of musical instruments (these systems often have well-defined harmonic overtones). They are commonly used for speech detection and concatenative speech synthesis.
Check the pitch detection methods Python notebook for cepstrum analysis. Review: cepstrum analysis captures the spectral envelope. How can we link this to the way the ear perceives sound? (Prahallad)
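As a reminder of how the cepstrum gets at the spectral envelope, here is a minimal sketch using NumPy. It is not taken from the notebook or from Prahallad; the function names and the `n_keep` cutoff are illustrative assumptions. The idea is to take the log magnitude spectrum, transform it back to the quefrency domain, keep only the low-quefrency coefficients, and transform forward again to get a smoothed log spectrum.

```python
import numpy as np

def real_cepstrum(frame):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    spectrum = np.fft.fft(frame)
    log_mag = np.log(np.abs(spectrum) + 1e-12)  # small offset avoids log(0)
    return np.fft.ifft(log_mag).real

def spectral_envelope(frame, n_keep=30):
    """Smoothed log-magnitude spectrum from the low-quefrency cepstrum.

    n_keep is a hypothetical cutoff: how many low-quefrency
    coefficients to retain (plus their mirrored counterparts).
    """
    c = real_cepstrum(frame)
    lifter = np.zeros_like(c)
    lifter[:n_keep] = 1.0
    if n_keep > 1:
        lifter[-(n_keep - 1):] = 1.0  # keep the symmetric half as well
    return np.fft.fft(c * lifter).real
```

A smaller `n_keep` gives a smoother envelope; keeping every coefficient simply reproduces the original log magnitude spectrum.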
Mel-Frequency Analysis
Lindasalwa: 7 steps to computing MFCC
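For the mel-scale part, a small sketch of the Hz-to-mel mapping may help. It assumes the widely used formula m = 2595 log10(1 + f/700); other variants of the mel scale exist, and the helper names below are illustrative rather than taken from Lindasalwa's paper.

```python
import math

def hz_to_mel(f_hz):
    """Hz to mels, assuming the common 2595 * log10(1 + f/700) formula."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def mel_to_hz(m):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_spaced_frequencies(f_low, f_high, n_points):
    """Return n_points frequencies (in Hz) equally spaced on the mel scale.

    Useful for placing the centre frequencies of a mel filter bank.
    Requires n_points >= 2.
    """
    m_low, m_high = hz_to_mel(f_low), hz_to_mel(f_high)
    step = (m_high - m_low) / (n_points - 1)
    return [mel_to_hz(m_low + i * step) for i in range(n_points)]
```

Points that are evenly spaced in mels get further apart in Hz as frequency rises, which is the logarithmic frequency perception the mel scale is meant to model.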
When two sine waves are very close in frequency, their combination sounds pleasant to the ear: a pure tone with some beating. As the frequency difference between the two waves grows, the sense of roughness or dissonance grows, until the two are far enough apart that the ear perceives them as two separate pitches.
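The beating follows from a trigonometric identity: sin(2π f1 t) + sin(2π f2 t) = 2 cos(π (f1 - f2) t) sin(π (f1 + f2) t). The sum behaves like a tone at the average frequency whose amplitude swells and fades |f1 - f2| times per second. The sketch below checks this numerically for two equal-amplitude sines; the function names are illustrative.

```python
import math

def two_tone(f1, f2, t):
    """Sum of two unit-amplitude sine waves at time t (seconds)."""
    return math.sin(2 * math.pi * f1 * t) + math.sin(2 * math.pi * f2 * t)

def beat_form(f1, f2, t):
    """The same signal written as a slow envelope times a fast carrier."""
    envelope = 2 * math.cos(math.pi * (f1 - f2) * t)
    carrier = math.sin(math.pi * (f1 + f2) * t)
    return envelope * carrier

def beat_envelope(f1, f2, t):
    """Magnitude of the slow amplitude envelope at time t."""
    return abs(2 * math.cos(math.pi * (f1 - f2) * t))

def beat_rate(f1, f2):
    """Number of loudness swells per second."""
    return abs(f1 - f2)
```

For example, `beat_rate(440, 444)` gives 4 beats per second, which is heard as a single pitch with slow beating.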
Idea: measure dissonance by counting the beats. Experimental results show that beating at 20 to 30 Hz is perceived as roughest.
"LPF is a Remez filter with cutoff at 100 Hz and BPF (which influences the detailed shape of the output signal) was a second-order Butterworth filter with passband between 15 and 35 Hz." (Sethares, 49)
More information is in the appendices, which are not available online.
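The filter chain in the Sethares quote can be roughed out with SciPy. This is a sketch under several assumptions, not a reconstruction of his implementation: the half-wave rectifier in front of the LPF, the Remez tap count, the 100 to 200 Hz transition band, and the reading of "second-order" as SciPy's `butter(1, ...)` band-pass (which has overall order 2) are all guesses. Only the 100 Hz cutoff and the 15 to 35 Hz passband come from the quote.

```python
import numpy as np
from scipy import signal

def beat_energy(x, fs, numtaps=101):
    """Rectify, low-pass, band-pass, then return mean squared output.

    x  : 1-D signal
    fs : sample rate in Hz (must exceed 400 for the bands used here)
    """
    # Assumption: half-wave rectification exposes the amplitude envelope.
    rectified = np.maximum(x, 0.0)

    # Remez (Parks-McClellan) FIR low-pass with passband edge at 100 Hz.
    # The stopband edge (200 Hz) and numtaps are illustrative choices.
    lpf = signal.remez(numtaps, [0, 100, 200, 0.5 * fs], [1, 0], fs=fs)
    envelope = signal.lfilter(lpf, 1.0, rectified)

    # Butterworth band-pass, 15 to 35 Hz. With N=1 the band-pass has
    # overall order 2; whether the source means this is an assumption.
    sos = signal.butter(1, [15, 35], btype="bandpass", fs=fs, output="sos")
    beats = signal.sosfilt(sos, envelope)

    return float(np.mean(beats ** 2))
```

Feeding in two sines 25 Hz apart should give a much larger value than two sines 1 Hz apart or 200 Hz apart, which matches the observation that beating around 20 to 30 Hz is roughest.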