
References

HTKBook
VoxForge
http://www.voxforge.org/home/dev/acousticmodels/linux/create/htkjulius/how-to
http://www.voxforge.org/home/dev/acousticmodels/linux/create/htkjulius/tutorial
Kyle Gorman
http://www.ling.upenn.edu/~kgorman/papers/segmentation/.speechseg.html
Script collection by Kyle Gorman: seg.tar.gz (the training process here is based on this model)
Praat
Audacity
Training Flow

Training Procedure

Initialization from the Prototype


Data Preparation
Voice Recordings

Prepare the words to be recorded


Recording Specification
Audio format            : lossless WAV
Sampling frequency      : 18 kHz
Bit depth and channels  : 16 bit, mono
Start and end           : 500 ms of silence
WAV file layout         : one file per speaker, per sentence

Room Specification (minimum)

Soundproofing parameters
Sound insulation level  : 30 dB
Background noise level  : 22 dB
Reverberation time      : 0.15 second
Record each word 10 times (3 speakers)
Name the files using the format <word>XXX_MYYY.wav (judging by the examples, XXX is the take number and YYY the speaker number)
Example:
maju001_M001.wav
maju002_M001.wav
maju001_M002.wav
etc.
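The naming scheme can also be generated programmatically, which helps when checking that no recording is missing. The following is a minimal sketch, assuming XXX is the take number and YYY the speaker number (inferred from the examples); the function name recording_names is hypothetical.

```python
def recording_names(words, takes=10, speakers=3):
    """Return the expected .wav file names for every word, speaker, and take."""
    names = []
    for word in words:
        for speaker in range(1, speakers + 1):
            for take in range(1, takes + 1):
                # Zero-pad both counters to three digits, e.g. maju001_M001.wav
                names.append(f"{word}{take:03d}_M{speaker:03d}.wav")
    return names


if __name__ == "__main__":
    # Print the first few expected names for two example words
    for name in recording_names(["maju", "kanan"])[:3]:
        print(name)
```

With the defaults (10 takes, 3 speakers), each word yields 30 file names.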
Create a label file containing the transcription of each .wav file
The label file has the same name as the .wav file, with the extension changed to .lab (for example, the label file for maju001_M001.wav is maju001_M001.lab)
Example:
maju001_M001.lab contains the text maju
NOTE: maju is followed by a space after the last letter
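Writing the label files by hand is error-prone for 30 recordings per word, so they can be derived from the file names. This is a sketch only, assuming every recording follows the naming scheme above and that words consist of lowercase letters; write_label_files is a hypothetical helper.

```python
import re
from pathlib import Path


def write_label_files(wav_dir):
    """Create a .lab file next to each .wav file, holding the spoken word."""
    written = []
    for wav in sorted(Path(wav_dir).glob("*.wav")):
        # Strip the take and speaker suffix, e.g. maju001_M001 -> maju
        match = re.fullmatch(r"([a-z]+)\d{3}_M\d{3}", wav.stem)
        if match is None:
            continue  # skip files that do not follow the naming scheme
        lab = wav.with_suffix(".lab")
        # The notes ask for a trailing space after the last letter
        lab.write_text(match.group(1) + " ", encoding="ascii")
        written.append(lab)
    return written
```

Calling write_label_files("data") would create data/maju001_M001.lab containing the word and a trailing space.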
Create the lexicon (pronunciation dictionary)
Example:
maju m a j u
kanan k a n a n
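In both examples each letter maps to one phone, so lexicon entries of this kind can be produced mechanically. The sketch below relies on that one-letter-per-phone assumption, which would not hold for words containing digraphs such as "ng"; lexicon_lines is a hypothetical name.

```python
def lexicon_lines(words):
    """Return one 'word phone phone ...' entry per unique word, in sorted order."""
    # Assumption: every letter is a separate phone, as in the examples above.
    return [f"{word} {' '.join(word)}" for word in sorted(set(words))]


if __name__ == "__main__":
    for line in lexicon_lines(["maju", "kanan"]):
        print(line)
```

Sorting and de-duplicating keeps the output stable when the word list changes.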
Prepare the configuration files
copy.conf
Contains:
SOURCEFORMAT = WAV
TARGETKIND = MFCC_0_D
TARGETRATE = 100000.0

SAVECOMPRESSED = T
SAVEWITHCRC = T
WINDOWSIZE = 250000.0
USEHAMMING = T
PREEMCOEF = 0.97
NUMCHANS = 24
CEPLIFTER = 22
NUMCEPS = 12
RAWENERGY = F
ENORMALISE = F
ZMEANSOURCE = F
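HTK expresses TARGETRATE and WINDOWSIZE in units of 100 ns, so the values above correspond to a 10 ms frame shift and a 25 ms analysis window. The sketch below converts them to milliseconds and to sample counts at the 18 kHz sampling frequency given in the recording specification; the helper names are hypothetical.

```python
UNITS_PER_MS = 10_000          # 1 ms = 10,000 units of 100 ns
UNITS_PER_SECOND = 10_000_000  # 1 s = 10,000,000 units of 100 ns


def htk_to_ms(value):
    """Convert an HTK time value (100 ns units) to milliseconds."""
    return value / UNITS_PER_MS


def htk_to_samples(value, sample_rate_hz):
    """Convert an HTK time value to a number of samples at the given rate."""
    return round(value * sample_rate_hz / UNITS_PER_SECOND)


if __name__ == "__main__":
    print(htk_to_ms(100000.0), "ms frame shift")
    print(htk_to_ms(250000.0), "ms window")
    print(htk_to_samples(250000.0, 18000), "samples per window")
```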
train.conf
Contains:
TARGETKIND = MFCC_0_D_N_Z
TARGETRATE = 100000.0
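The target kind in train.conf determines the vector size used in the prototype model (25). With NUMCEPS = 12, the _0 qualifier adds C0, _D doubles the vector with delta coefficients, _N removes the absolute energy term, and _Z does not change the size. The following sketch reproduces that count for MFCC kinds only; vector_size is a hypothetical helper and does not cover every HTK qualifier.

```python
def vector_size(num_ceps, target_kind):
    """Count feature components for an MFCC target kind such as MFCC_0_D_N_Z."""
    base, *qualifiers = target_kind.split("_")
    if base != "MFCC":
        raise ValueError("only MFCC kinds are handled in this sketch")
    # Static part: cepstra plus optional C0 (_0) and log energy (_E)
    static = num_ceps + ("0" in qualifiers) + ("E" in qualifiers)
    size = static
    if "D" in qualifiers:
        size += static  # delta coefficients
    if "A" in qualifiers:
        size += static  # acceleration coefficients
    if "N" in qualifiers:
        size -= 1       # absolute energy term suppressed
    return size


if __name__ == "__main__":
    print(vector_size(12, "MFCC_0_D_N_Z"))
```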
mkPhones0.led
Contains:
EX
IS sil sil
DE sp
Create the list of .wav and .mfc pairs (assuming the .wav files are in the ./data directory and the .mfc files in the ./mfc directory)
Example:
data/maju001_M001.wav mfc/maju001_M001.mfc
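A pair list like this, with one source and one target per line, is typically what HTK's HCopy reads through its -S option. The sketch below builds the lines from the contents of the recording directory; the directory names follow the assumption stated above, and wav_mfc_pairs is a hypothetical helper.

```python
from pathlib import Path


def wav_mfc_pairs(wav_dir="data", mfc_dir="mfc"):
    """Return 'source target' lines mapping each .wav file to its .mfc file."""
    lines = []
    for wav in sorted(Path(wav_dir).glob("*.wav")):
        # Keep the base name and swap the directory and extension
        mfc = Path(mfc_dir) / (wav.stem + ".mfc")
        lines.append(f"{wav.as_posix()} {mfc.as_posix()}")
    return lines


if __name__ == "__main__":
    print("\n".join(wav_mfc_pairs()))
```

The second column of the output, taken alone, is the list of feature files that goes into train.scp.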

Create the file train.scp


Lists all the data to be used for training
Example:
mfc/maju001_M001.mfc
mfc/maju002_M001.mfc
...
Create the file proto containing the prototype of the training model (matching train.conf)
~o <VecSize> 25 <MFCC_0_D_N_Z>
~h "proto"
<BeginHMM>
<NumStates> 5
<State> 2
<Mean> 25
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
<Variance> 25
1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<State> 3
<Mean> 25
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
<Variance> 25
1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<State> 4
<Mean> 25
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
<Variance> 25
1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<TransP> 5
0.0 1.0 0.0 0.0 0.0
0.0 0.6 0.4 0.0 0.0
0.0 0.0 0.6 0.4 0.0
0.0 0.0 0.0 0.7 0.3
0.0 0.0 0.0 0.0 0.0
<EndHMM>
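Because the prototype repeats the same mean and variance rows for each emitting state, it can be generated rather than typed. The sketch below reproduces the file above for a given vector size; make_proto is a hypothetical helper, and the transition values are copied from the prototype shown here.

```python
def make_proto(vec_size=25, kind="MFCC_0_D_N_Z", name="proto"):
    """Return the text of a 5-state prototype HMM with 3 emitting states."""
    zeros = " ".join(["0.0"] * vec_size)
    ones = " ".join(["1.0"] * vec_size)
    lines = [
        f"~o <VecSize> {vec_size} <{kind}>",
        f'~h "{name}"',
        "<BeginHMM>",
        "<NumStates> 5",
    ]
    # States 2 to 4 emit; states 1 and 5 are the entry and exit states
    for state in (2, 3, 4):
        lines += [
            f"<State> {state}",
            f"<Mean> {vec_size}", zeros,
            f"<Variance> {vec_size}", ones,
        ]
    lines += [
        "<TransP> 5",
        "0.0 1.0 0.0 0.0 0.0",
        "0.0 0.6 0.4 0.0 0.0",
        "0.0 0.0 0.6 0.4 0.0",
        "0.0 0.0 0.0 0.7 0.3",
        "0.0 0.0 0.0 0.0 0.0",
        "<EndHMM>",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(make_proto(), end="")
```

If the target kind in train.conf changes, vec_size and kind need to change together so that the header and the row lengths stay consistent.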
