Collection and Analysis of a Parkinson Speech Dataset With Multiple Types of Sound Recordings

Sakar, Betul Erdogdu; Isenkul, M. Erdem; Sakar, C. Okan; Sertbas, Ahmet; Gurgen, Fikret; Delil, Sakir; Apaydin, Hulya; Kursun, Olcay

doi:10.1109/JBHI.2013.2245674

Published January 1, 2013 | Version v1

Journal article, Open access

1. Bahcesehir Univ, Dept Comp Programming, TR-34381 Istanbul, Turkey
2. Istanbul Univ, Dept Comp Engn, TR-34320 Istanbul, Turkey
3. Bahcesehir Univ, Dept Comp Engn, TR-34353 Istanbul, Turkey
4. Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
5. Istanbul Univ, Cerrahpasa Fac Med, Dept Neurol, TR-34098 Fatih, Turkey

Interest has grown in applying speech pattern analysis to Parkinsonism in order to build predictive telediagnosis and telemonitoring models. For this purpose, we have collected a wide variety of voice samples, including sustained vowels, words, and sentences, compiled from a set of speaking exercises for people with Parkinson's disease (PD). Learning from such a dataset, which contains multiple speech recordings per subject, raises two main questions: 1) How predictive are the various types of voice samples, e.g., sustained vowels versus words, in PD diagnosis? 2) How well do central tendency and dispersion metrics represent all sample recordings of a subject? Investigating our Parkinson dataset with well-known machine learning tools, we find that sustained vowels carry more PD-discriminative information, as also reported in the literature. We also find that, rather than treating each voice recording of each subject as an independent data sample, representing the samples of a subject with central tendency and dispersion metrics improves the generalization of the predictive model.
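The per-subject representation described in the abstract can be illustrated with a minimal sketch. It assumes each recording has already been reduced to a numeric feature vector; the function name `summarize_by_subject`, the toy values, and the choice of mean, median, and standard deviation are illustrative assumptions, since the abstract does not state which central tendency and dispersion metrics were used.

```python
from collections import defaultdict
from statistics import mean, median, stdev


def summarize_by_subject(recordings):
    """Collapse many recordings per subject into one feature vector.

    `recordings` is a list of (subject_id, feature_vector) pairs, where
    every feature_vector has the same length. For each feature, the summary
    holds the mean and median (central tendency) and the sample standard
    deviation (dispersion) over that subject's recordings.
    """
    grouped = defaultdict(list)
    for subject_id, features in recordings:
        grouped[subject_id].append(features)

    summaries = {}
    for subject_id, rows in grouped.items():
        summary = []
        for column in zip(*rows):  # iterate feature by feature
            # stdev needs at least two values; use 0.0 for a single recording
            spread = stdev(column) if len(column) > 1 else 0.0
            summary.extend([mean(column), median(column), spread])
        summaries[subject_id] = summary
    return summaries


# Hypothetical toy data: two subjects, two made-up features per recording.
toy = [
    ("s1", [1.0, 10.0]),
    ("s1", [3.0, 14.0]),
    ("s2", [2.0, 20.0]),
]
summaries = summarize_by_subject(toy)
```

With this layout a classifier sees one row per subject instead of one row per recording, so recordings from the same person cannot end up on both sides of a train/test split.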

Files (265 Bytes)

Name	Size
bib-617d4e12-e437-4572-8da9-8fa9dfcb223e.txt (md5:a073a7a5b0936e7898c9fe8eac0a3f4d)	265 Bytes

	All versions	This version
Views	60	60
Downloads	17	17
Data volume	4.8 kB	4.8 kB


TÜBİTAK ULAKBİM
