Aures Tonality

Average Tonality Aures (tu) and Temporal Tonality Aures are based on Aures' model for tonality.

Aures modeled tonality is based on subjective evaluation of pure tones and bandpass-filtered noise by using 4 weighting functions for the effects of bandwidth, center frequency, prominence of the tone, and the loudness of the tone as compared to the total loudness, most of which were proposed by Terhardt et al. (Terhardt E., Stoll G., Seewann M., Algorithm for extraction of pitch and pitch salience from complex tonal signals, J Acous. Soc. Am 71(3),1982).

Temporal Tonality Aures is an output resulting from Average Tonality Aures computation that displays the three fluctuating curves in a window:

the fluctuating tone Aures Tonality vs. Time,
the fluctuating tone WGr vs. Time,
and the fluctuating tone WT vs. Time.

The Aures' model details is presented below:

Aures proposed an algorithm for separating the tonal and noise parts in the sound and estimated the tonality through the calculation of the different weightings. The separation procedure is based on a 400-point spectrum for the frequency range between zero and five kHz (namely a line spacing ∆ƒ of 12.5 Hz). A tone is identified when the level of a particular spectral bin is (1) higher than that of the just preceding bin and higher than or equal to that of the just following bin, and (2) at least seven dB higher than those of the two further preceding and the two further following bins. The group of five successive spectral bins centered on that with the maximum level is therefore identified as a tone. Its exact frequency ƒ_ciis calculated with the following formula:

with ƒ_i the frequency in Hz of the maximum level, L_i+1 the level in dB SPL of the upper bin, and L_i-1 that of the lower bin.

For each i^th tone at the frequency ƒ_ci (in Hz) with a level L_i (in dB) and a bandwidth ∆Zi expressed on the Bark scale (namely as a fraction of the critical bandwidth), these functions are defined as follows:

The model uses the bandwidth weighting function from Terhardt et al.:
The frequency weighting function is:
The prominence weighting function is:
where A_Ek is the secondary excitation at ƒ_i due to the k_th component, E_Gr is the masking intensity of the noise, and E_HS is the intensity at the threshold of hearing.
Aures combines these into :
where
The loudness weighting accounts for the relative contribution of the tonal loudness to the overall loudness:
where N_Gr and N are the loudnesses of respectively the noise and the whole sound (in sones).
Finally, Aures’ Tonalness T in tu (tonality unit) is:
where c = 1.09 is a constant chosen such that a one-kHz pure tone with a level of 60 dB SPL would have a tonalness of one.
Sound: Analysis and Specification outputs the following data:
- A tonality array over time,
- The mean tonality value over time.
  Given the rather high line spacing (∆ƒ = 12.5 Hz), the bandwidth weighting w₁ is disabled by default (namely w₁= 1 in any case). This weighting can be enabled in the parameter window of Aures’ tonality by ticking in the Bandwidth weighting w1 box. In that case, the bandwidth ∆_zi in Bark rate is defined as the three-dB bandwidth around the tone frequency (the default three-dB value can be modified in the Bandwidth threshold field in the parameter window). On each side of the tone frequency (with a L_max maximum level), the limit of this band is computed by linear interpolation between the two first spectral bins with levels closest to L_max - three dB.