What is a sound wave?
     
    A sound wave is the result of compressed air pressure propagating away from its source of compression. It can be displayed as amplitude (as the size of vibration) against time:
     
     



    Amplitude (measured in dB, vertical axis) as a function of time (horizontal)

    (adapted from: http://sprg.ssl.berkeley.edu/marsmic/HTML/lr/spec1.html#spectrogram)



     

Which other properties does a sound wave have? Frequency is measured in the unit of decibel (dB). 1 dB corresponds to 10log10 ( Ia / Ib ). (Ia is the measured intensity, Ib is the reference intensity.)
 
 
 

fig. 2: Amplitude (vertical) as a function of time (horizontal)

the wavelength shows the length of a phase

(source: http://sprg.ssl.berkeley.edu/marsmic/HTML/lr/spec1.html#spectrogram)


 



What are simple sound waves?

     
    Simple signals consist of one sine wave only and they can be calculated by the formula:


    f
     
     

What are complex sound waves?
     
    According to Fourier's Theorem complex signals can be represented as the point by point sum of a possibly infinite number of sinusoid signals:
     
     

    xn = c + m1 ( sin (t1 + Q1) + c + mn ( sin (tn + j n))
     

What is the Nyquist theorem (sampling theorem)?
     
    The sampling frequency determines the limit of audio frequencies that can be reproduced digitally. One of the most important rules of sampling is called the Nyquist Theorem (or sampling theorem), which states that the highest frequency which can be accurately represented is one-half of the sampling rate. The Nyquist frequency for a given signal is twice the frequency of the highest frequency component in the signal.

    In other words, there must be at least 2 measurements within one phase. If not, an aliasing effect occurs, which leads to the distortion of the frequencies in the signal, by introducing spurious low frequency artefacts corresponding to the difference between the sampling frequency and the signal frequency. In the visual domain this corresponds to the moiré effect with silk, some types of patterning on television screens, wheels turning backwards in films.
     
     

What is a spectrum?
     
    A spectrum is a function of frequency, often expressed in Pa/Hz (if from a sound). In a spectrum amplitude (loudness) is displayed as a function of frequency (cf. fig. 3). A spectrum is also called 2D-spectrogram. From the point of view of time a spectrum is a snapshot of amplitudes relating to the frequencies they occur at.
     
What is a spectrum?
     
    A spectrum is a function of frequency, often expressed in Pa/Hz (if from a sound). In a spectrum amplitude (loudness) is displayed as a function of frequency:
     
     


    fig. 3: a spectrum

    (source: http://sprg.ssl.berkeley.edu/marsmic/HTML/lr/spec1.html#spectrogram)


What is a 3D-spectrogram?
     
    A 3D-spectrogram is also called sonagramme and it displays the features amplitude, frequency and time in one diagram. Frequency (Hz) is displayed as a function of time (ms). Amplitude comes into play by the use of colour (therefore these types of spectrograms are also called colour spectrograms).
     
     

    fig. 4: a colour spectrogram


     


What is pitch?
     
    Pitch is the perceptual correlate of the frequency of vibration of the vocal folds. The frequency is determined by sub-glottal pressure and by laryngeal adjustments governing the length, tension and mass of the vocal of the folds themselves. Pitch lowering is associated with reduced tension in the vocal folds. Pitch is largely determined by the fundamental frequency of the sound, but the relationship between pitch and fundamental frequency is non-linear and varies with the frequency involved. Pitch is measured in cycles per second (cps). Pitch and frequency have a subtle difference and are often used interchangeable. However, they describe changes in air pressure. Scientists tend to use the word "frequency", musicians rather say "pitch".
     
What is fundamental frequency?
     
    Fundamental frequency (F0) is defined as the lowest frequency of the sine waves composing a complex sine wave. The two higher frequency components are the second and third harmonics, the fundamental frequency is also called first harmonic.
     
What is signal annotation (labelling)?

Signal annotation describes the process of assigning properties like orthographic letters, phonemes or break indices to sound signals.