Blind audio source separation using independent component analysis and independent vector analysis methods

Alrwstım, Alyaa Abdulhussein Mahdi

Yıldız Teknik Üniversitesi Açık Arşivi
→
Tezler
→
Fen Bilimleri Enstitüsü
→
Fen Bilimleri Enstitüsü Yüksek Lisans Tezleri
→
Bilgisayar Mühendisliği
→
Öğe Göster

dc.contributor.author	Alrwstım, Alyaa Abdulhussein Mahdi
dc.date.accessioned	2022-08-09T10:59:11Z
dc.date.available	2022-08-09T10:59:11Z
dc.date.issued	2017
dc.identifier.uri	http://dspace.yildiz.edu.tr/xmlui/handle/1/12945
dc.description	Tez (Yüksek Lisans) - Yıldız Teknik Üniversitesi, Fen Bilimleri Enstitüsü, 2017	en_US
dc.description.abstract	Blind Source Separation (BSS) is one of the most challenging problems in the field of audio and speech processing. Many different methods have been proposed to solve BSS problem in the literature. In addition, speaker recognition systems have gained considerable interest from researchers for decades due to the breadth of their field of application. In this study, we have compared the performance of three popular BSS methods implementations: Fast-ICA, Kernel-ICA and Fast-IVA which are based on Independent Component analysis (ICA) and Independent Vector Analysis (IVA) respectively. Initially, classical performance comparison metrics such as Source-to-Artifact Ratio, Source-to- Distortion Ratio, Source-to-Noise Ratio, are implemented for comparison. For further investigation, speaker recognition system has been developed to examine the effect of speech separation on the performance of these recognition systems. In our experiments, we used two data set the first one is in Arabic languge and contains voice records frome 13 speaker: 3 female , 10 male.the second data set is the ELSDSR data which in English languge and contains voice records from 22 speakers: 10 female, 12 male. The performance of BSS methods is measured under four scenarios. The first three is composed to see the effect of noise. Therefore, we used the mixture of clean source signals, the mixture of source signals with additive Gaussian noise, adding Gaussian noise to clean source mixture. In the fourth scenario, we applied speaker recognition system based on Gaussian mixture models (GMMs) and I-vectors, the performance of the speaker recognition system is measured by Equal Error Ratio (EER), which is, the most reliable measurement in this field. Experimental results show that the Fast-IVA has better performance than the Fast-ICA method according to performance metrics used in this study. In terms of EER, I-vector gives the better result than GMM for separated signals by IVA and ICA.	en_US
dc.language.iso	en	en_US
dc.subject	Blind source separation	en_US
dc.subject	Independent component analysis	en_US
dc.subject	Independent vector analysis	en_US
dc.title	Blind audio source separation using independent component analysis and independent vector analysis methods	en_US
dc.type	Thesis	en_US