PKL Unsri

Referensi
HTKBook
VoxForge
http://www.voxforge.org/home/dev/acousticmodels/linux/create/htkjulius/how-to
http://www.voxforge.org/home/dev/acousticmodels/linux/create/htkjulius/tutorial
Kyle Gorman
http://www.ling.upenn.edu/~kgorman/papers/segmentation/.speechseg.html
Kumpulan script dari Kyle Gorman seg.tar.gz Training process based on this model
Praat
Audacity
Alur Training
Prosedur Training
Proses Inisialisasi dari Prototype

Persiapan data
Rekaman Suara
Siapkan kata-kata yang akan direkam

Spesifikasi Rekaman
Format Audio
Sampling Freq
bit rate
awal dan akhir
Format file WAV
: lossless WAV
: 18 kHz
: 16 bit, mono
: silent 500 ms
: per-speaker, per-kalimat
Spesifikasi Ruangan (Sekurang-kurangnya)

Soundproof parameter
Sound insulation level
: 30 dB
Background noise level : 22 dB
Reverberation time
: 0,15 second
Rekam masing-masing kata 10 x (3 orang)
Beri nama dengan format kataXXX_MYYY.wav
Contoh :
maju001_M001.wav
maju002_M001.wav
maju001_M002.wav
dst...
Buat label file yang berisi konten file wav
Penamaannya sama dengan .wav, hanya diganti extensionnya menjadi .lab; contoh maju001_M001.wav
file labelnya adalah maju001_M001.lab)
Contoh:
maju001_M001.lab berisi teks maju
CATATAN: maju dengan spasi di belakang huruf terakhir
Membuat lexicon (pronounciation dictionary)
Contoh:
maju m a j u
kanan k a n a n
Menyiapkan file configurasi
copy.conf
Berisi informasi
SOURCEFORMAT = WAV
TARGETKIND = MFCC_0_D
TARGETRATE = 100000.0
SAVECOMPRESSED = T
SAVEWITHCRC = T
WINDOWSIZE = 250000.0
USEHAMMING = T
PREEMCOEF = 0.97
NUMCHANS = 24
CEPLIFTER = 22
NUMCEPS = 12
RAWENERGY = F
ENORMALISE = F
ZMEANSOURCE = F
train.conf
berisi informasi
TARGETKIND = MFCC_0_D_N_Z
TARGETRATE = 100000.0
mkPhones0.led
berisi informasi
EX
IS sil sil
DE sp
Membuat pair antara .wav dan .mfc (dengan asumsi file .wav di direktori ./data dan .mfc di direktori
./mfc)
Contoh :
data/maju001_M001.wav
mfc/maju001_M001.mfc
Membuat file train.scp

Berisi semua data yang akan ditrain
Contoh:
...
Membuat file proto berisi prototipe model training (sesuai dengan train.conf)
~o <VecSize> 25 <MFCC_0_D_N_Z>
~h "proto"
<BeginHMM>
<NumStates> 5
<State> 2
<Mean> 25
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
<Variance> 25
1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<State> 3
<Mean> 25
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
<Variance> 25
1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<State> 4
<Mean> 25
0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0
<Variance> 25
1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0
<TransP> 5
0.0 1.0 0.0 0.0 0.0
0.0 0.6 0.4 0.0 0.0
0.0 0.0 0.6 0.4 0.0
0.0 0.0 0.0 0.7 0.3
0.0 0.0 0.0 0.0 0.0
<EndHMM>

PKL Unsri

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

PKL Unsri

Uploaded by

Copyright:

Available Formats

Referensi

Proses Inisialisasi dari Prototype

Siapkan kata-kata yang akan direkam

Spesifikasi Ruangan (Sekurang-kurangnya)

Membuat file train.scp

You might also like