Chair of Multimedia Telecommunications and Microelectronics - Audio Research Group

AM/FM Modeling of Harmonic Sounds

Demonstration: trumpet

Download all the audio files from this page in a single ZIP archive (3MB).

This demo shows, how the technique deals with a raspy and slightly breathy trumpet sound. This sound doesn't seem to be very challenging, since the partials are quite stable and the amount of noise is moderate. Nevertheless, the pure heterodyne analysis/synthesis method does not offer a transparent quality, while the AMFM scaling delivers a very accurate representation of the original sound.

Original sound (WAV file, 44.1kHz, 16bit, 370kB)

Reconstructed sound (WAV file, 44.1kHz, 16bit, 370kB) obtained from synthesis based on F0 + Harmonic Envelope subsampled 1:1000
(A phase incoherent, baseline heterodyne analysis, acting as a mock-up of a perfect harmonic sinusoidal model, without residual noise)

Reconstructed sound (WAV file, 44.1kHz, 16bit, 370kB) obtained from synthesis based on instantaneous F0 + Harmonic Envelope subsampled 1:1000 + prototype signal. NOTE: no residual noise is modeled in this example.

Reconstructed sound (WAV file, 44.1kHz, 16bit, 370kB) obtained from synthesis based on F0 + Harmonic Envelope subsampled 1:1000 + a noise residual modeled by 10-th order WLPC.

Reconstructed sound (WAV file, 44.1kHz, 16bit, 370kB) obtained from synthesis based on instantaneous F0 + Harmonic Envelope subsampled 1:1000 + prototype signal. As above, the noise residual is modeled by 10-th order WLPC.


Reconstructed sound (WAV file, 44.1kHz, 16bit, 395kB) obtained from the SMS technique with frame rate 44Hz (hop = 1000 samples) and no residual noise. There is a noticeable artifact in the middle of the sound similar to a squeal.


Reconstructed sound (WAV file, 44.1kHz, 16bit, 370kB) obtained from the SMS technique (as above) + background noise modeled using 10-order WLPC. We used our own noise model in this example, since the traditional LPC-based model in the SMS software produced too much artifacts.

Reconstructed sound (WAV file, 44.1kHz, 16bit, 370kB) obtained from the LORIS technique bandwidth association region width of 100Hz and partials constrained to harmonic. Note that LORIS is quite successful in re-synthesis of a sound very similar to the original one, which is not so apparent from the spectrogram.