Konferans bildirisi Açık Erişim

Short Utterance Speaker Recognition Using Time-Delay Neural Network

Toruk, Muhammet Mesut; Gokay, Ramazan


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">user-tubitak-adresli-yayinlar</subfield>
    <subfield code="o">oai:zenodo.org:100081</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">In recent years, important studies on speaker recognition have been implemented. Some solutions have been proposed and high achievements have been achieved. But one of the major issues faced by speaker recognition researchers is the short utterance speaker recognition. In short utterances, the recognition performance decreases. Within the scope of this study, it is aimed to increase the speaker recognition performance in short-term utterances by using Time-Delay Neural Networks (TDNN). I-vector-based systems have been developed using conventional GMM-UBM and TDNN-UBM-based methods. In this study, error rate changes of the audio files at various durations are compared using GMM and TDNN methods.</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
  </datafield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="a">2019 16TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS &amp; DEVICES (SSD)</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="a">Creative Commons Attribution</subfield>
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.81043/aperta.100080</subfield>
    <subfield code="n">doi</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Toruk, Muhammet Mesut</subfield>
    <subfield code="u">TUBITAK BILGEM, Speech &amp; Language Technol Lab, Kocaeli, Turkey</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="z">md5:47f6c5c8600f5eeafbdeccc12ab99f19</subfield>
    <subfield code="s">175</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.trrecord/100081/files/bib-3f76bf1e-da24-467d-be67-caf550c07118.txt</subfield>
  </datafield>
  <controlfield tag="005">20210316142134.0</controlfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2019-01-01</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.81043/aperta.100081</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Short Utterance Speaker Recognition Using Time-Delay Neural Network</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="a">Gokay, Ramazan</subfield>
    <subfield code="u">TUBITAK BILGEM, Speech &amp; Language Technol Lab, Kocaeli, Turkey</subfield>
  </datafield>
  <controlfield tag="001">100081</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-adresli-yayinlar</subfield>
  </datafield>
</record>
33
4
görüntülenme
indirilme
Görüntülenme 33
İndirme 4
Veri hacmi 700 Bytes
Tekil görüntülenme 30
Tekil indirme 4

Alıntı yap