Simulasi dan Analisis Speaker Recognition Menggunakan Metode Mel Frequency Cepstrum Coefficient (MFCC) dan Gaussian Mixture Model (GMM)

DOANDA KHABI PUTRA

Informasi Dasar

83 kali
17.04.1476
006.454
Karya Ilmiah - Skripsi (S1) - Reference

This essay discusses about speaker recognition, a system that recognize subject identity by their voice. First, subject’s voice features are extracted using MFCC (Mel-Frequency Cepstrum Coefficients) method. Steps in the MFCC are pre-emphasis, framing, windowing, FFT (Fast Fourier Transform), mel scaling and DCT (Discrete Cosine Transform), which produce feature vector called cepstrums. These cepstrums are then modelled using GMM (Gaussian Mixture Model). Steps in th GMM are Expectation-step and Maximization-step, which produce gaussian distribution along with its parameters, mean (µ) and variance (?^2) which are different for every subjects. Classification step is done by comparing between training data parameters and testing data parameters. If the comparation gets high score, it means two datas are match, vice versa. Previous research done by student group at Preston University and Jinnah Women University, Pakistan, with title “Speaker Identification Using GMM with MFCC” gets accuration score 87,5% using feature extraction method MFCC, clustering method K-Means, modelling method GMM and classification by log probability. In this essay, we will pass the clustering step and classify by doing comparation between gaussian distribution using parameter mean (µ) and variance (?^2), which is the fastest and easiest way. In this essay, we do the best thing to get as close as possible with previous research accuration score knowing that the classification step can be called the ‘rough’ one in GMM usage so we are not expecting high, even though there are so many factors that can influence the simulation accuracy. Keywords : MFCC, GMM, Speaker Recognition

Subjek

SPEECH RECOGNITION
 

Katalog

Simulasi dan Analisis Speaker Recognition Menggunakan Metode Mel Frequency Cepstrum Coefficient (MFCC) dan Gaussian Mixture Model (GMM)
 
 
Indonesia

Sirkulasi

Rp. 0
Rp. 0
Tidak

Pengarang

DOANDA KHABI PUTRA
Perorangan
IWAN IWUT, RATRI DWI ATMAJA
 

Penerbit

Universitas Telkom, S1 Teknik Telekomunikasi
Bandung
2017

Koleksi

Kompetensi

  • TT3113 - PENGOLAHAN SINYAL DIGITAL
  • TTG4K3 - PENGOLAHAN SINYAL BICARA
  • TTH4O3 - SPEECH PROCESSING

Download / Flippingbook

 

Ulasan

Belum ada ulasan yang diberikan
anda harus sign-in untuk memberikan ulasan ke katalog ini