This article was originally posted on WomenLearnThai.com.
What is “Praat”?…
Praat is Dutch for “talk”. It’s a program used for voice analysis. It’s very powerful and has a lot of very advanced functions of which I will I will only discuss the most basic function: obtaining the pitch from an audio fragment.
What is the “Pitch”?…
First I would like to talk about “pure tones”. A “pure tone” is a sound wave that is consist of found of one single frequency. It’s the kind of sound you hear from a tuning fork. If you would display the wave as a graphs in time and in place, both graphs would be a sine wave.
Our voice is not a pure tone. If you would analyse our voice you would see it consists of several sine waves, with different frequencies (tone heights). Each frequency has a different amplitude (strength) and phase (starting point). All these waves are produced by our voice at the same time.
The “pitch” in phonetics is the frequency (or tone height) of the lowest frequency tone wave in our voice. It’s like the basic “hum” of our voice. It is the pitch what we would define as the “tone” in Thai.
The purpose of this document is to show you how you can visualize the pitch. It can help you to analyze and improve your own pronunciation or it can help you to recognize the tone in case you wouldn’t recognize it by listening.
The program we’re going to use can display the pitch of the audio (in time). The result will look like this. The blue line represents the pitch.
How do the tones in Thai look like?…
Basically they look like this:
- The mid tone is constant (there might be a slight drop on the end).
- The low tone start low and might even go a little bit lower.
- The falling tone starts high and drops significantly.
- The tone start high and goes even higher.
- The rising to starts low and rises significantly.
You can download and install praat from Praat: doing phonetics by computer. It’s available for Linux 32/64bit, for MAC OSX, for FreeBSD and for Windows.
Once you start Praat you get two windows: the Objects window and Picture window. You’ll only need the Objects window. The pictures window is a window that allows you to draw on and manipulate the pictures Praat generates.
From the Objects windows menu choose “Open – Open long sound file …” and select the audio fragment you want to analyse. This can be any file, just a recording you want to analyse or a recording of your own voice. If possible save your audio fragment always as “.wav” file and not as “.mp3” because a “.mp3” file can cause a tiny time offset between the graphs and the actual audio.
In the Objects window, select your audio fragment (1. LongSound tones in this case) and click on “View”.
Now a new window will appear.
When you click at the play buttons directly under the spectrogram/pitch part you can play the audio left or right of the cursor. When you make a selection the audio will be split into 3 parts : one part before the beginning of your selection, then your selection, and finally a part after your selection and there will be 3 play buttons.
You can use the “in”-button (zooms in), “out”-button (zooms out), “sel”-button (zoom to selection) and “all”-button (zooms to all) below the spectrogram , together with the play buttons and the scroll-bar below the spectrogram to go to any part that might interest you.
Take into account that the pitch and spectrogram will only be displayed when the audio fragment that is visible is less than 10 seconds.
By clicking on the frequency number of the right side of the spectrogram you can zoom-in and zoom-out the frequency scale.
The first part of the picture above looks like a mid-tone. After that we see a low tone, a falling tone, a high tone and a rising tone.
The yellow curve in the diagram represents the intensity.
The high tone might look a bit strange to you. That’s because the big jump at the end has a very low intensity or volume and can be ignored. To show the intensity choose “Intensity-Show Intensity” from the menu. The yellow curve represents the intensity.
How to see the difference between aspirated and unaspirated sounds?…
The difference between aspirated sounds such as พ in พา and an unaspirated sound like the ป in ปา is the voice onset time (VOT). That is the time between the start of the syllable and the first occurrence of the voiced vowel. For aspirated sounds the VOT is much bigger. Usually the start of the blue pitch line indicates the start of the voicing, while the rising part of the yellow intensity line indicates the beginning of the syllable. Voicing is a vibration of the vocal cords. It’s much easier to recognize a pitch in those sounds than in sounds that are made with the mouth. That’s why the blue pitch line starts at the voiced vowel า. The next picture shows the voice onset time in the word ปา. It’s only about 18ms.
This picture shows the voice onset time in the word พา. พ is aspirated consonant. The voice onset time here is 78 ms, which is significantly more than that of the unaspirated consonant. You should play and listen to the selections to make they don’t include any part of the vowel.
PS. Take into account that the time scales of both pictures are not the same.