Aures Tonality

Average Tonality Aures (tu) and Temporal Tonality Aures are based on Aures' model for tonality.

Aures modeled tonality is based on the subjective evaluation of pure tones and bandpass-filtered noises by using 4 weighting functions. These 4 weightings account respectively for the effects of bandwidth, center frequency, prominence of the tone, and loudness of the tone as compared to total loudness. Most of these weightings were proposed by Terhardt et al. (Terhardt E., Stoll G., Seewann M., Algorithm for extraction of pitch and pitch salience from complex tonal signals, J Acous. Soc. Am 71(3),1982).

Temporal Tonality Aures is an output resulting from Average Tonality Aures computation that displays the following three curves in a window:

the Aures Tonality vs. Time, in tonality units (tu)
the relative loudness weighting W_Gr vs. Time
the tonal component weighting W_T vs. Time

The Aures' model details are presented below:

Aures proposed an algorithm for separating the tonal and noise parts in the sound and estimating the tonality through the calculation of the different weightings. The separation procedure is based on a 400-point spectrum for the frequency range between 0 and 5 kHz (corresponding to a line spacing ∆ƒ of 12.5 Hz). A tone is identified when the level of a particular spectral bin is (1) higher than that of the just preceding bin and higher than or equal to that of the just following bin, and (2) at least seven dB higher than those of the two further preceding and the two further following bins. The group of five successive spectral bins centered on that with the maximum level is therefore identified as a tone. Its exact frequency ƒ_ciis calculated with the following formula:

with ƒ_i the frequency in Hz of the maximum level, L_i+1 the level in dB SPL of the upper bin, and L_i-1 that of the lower bin.

For each i^th tone at the frequency ƒ_ci (in Hz) with a level L_i (in dB) and a bandwidth Δz_i expressed on the Bark scale (namely as a fraction of the critical bandwidth), these functions are defined as follows:

The model uses the bandwidth weighting function from Terhardt and others:
The frequency weighting function is:
The prominence weighting function is:

where ΔL_i is the excess level in dB defined by:

and A_Ek is the secondary excitation at ƒ_i due to the k_th component, E_Gr is the masking intensity of the noise, and E_HS is the intensity at the threshold of hearing.

Aures combines these into the tonal component weighting:

where

The relative loudness weighting accounts for the relative contribution of the tonal loudness to the overall loudness:

where N_Gr and N are the loudnesses of respectively the noise and the whole sound (in sones).

Finally, Aures’ Tonality T in tu (tonality unit) is:

where c = 1.09 is a constant chosen such that a one-kHz pure tone with a level of 60 dB SPL would have a tonality of 1 tu.

Sound: Analysis and Specification outputs the following data:
- A tonality array over time,
- The mean tonality value over time.
  Given the rather high line spacing (∆ƒ = 12.5 Hz), the bandwidth weighting w₁ is disabled by default (namely w₁= 1 whatever the input). This weighting can be enabled in the parameter window of Aures’ tonality by ticking in the Bandwidth weighting w1 box. In that case, the bandwidth Δz_i in Bark rate is defined as the 3 dB bandwidth around the tone frequency (the default 3 dB value can be modified in the Bandwidth threshold field in the parameter window). Then, on each side of the tone frequency (with level L_max), the limit of this band is computed by linear interpolation between the first two spectral bins with levels closest to L_max - 3 dB.