Konferans bildirisi Açık Erişim

SPEECH DETECTION ON BROADCAST AUDIO

Zubari, Unal; Ozan, Ezgi Can; Acar, Banu Oskay; Ciloglu, Tolga; Esen, Ersin; Ates, Tugrul K.; Onur, Duygu Oskay


DataCite XML

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="URL">https://aperta.ulakbim.gov.tr/record/92905</identifier>
  <creators>
    <creator>
      <creatorName>Zubari, Unal</creatorName>
      <givenName>Unal</givenName>
      <familyName>Zubari</familyName>
      <affiliation>TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Ozan, Ezgi Can</creatorName>
      <givenName>Ezgi Can</givenName>
      <familyName>Ozan</familyName>
    </creator>
    <creator>
      <creatorName>Acar, Banu Oskay</creatorName>
      <givenName>Banu Oskay</givenName>
      <familyName>Acar</familyName>
      <affiliation>TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Ciloglu, Tolga</creatorName>
      <givenName>Tolga</givenName>
      <familyName>Ciloglu</familyName>
      <affiliation>METU, Dept Elect &amp; Elect Engn, TR-06531 Ankara, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Esen, Ersin</creatorName>
      <givenName>Ersin</givenName>
      <familyName>Esen</familyName>
      <affiliation>TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Ates, Tugrul K.</creatorName>
      <givenName>Tugrul K.</givenName>
      <familyName>Ates</familyName>
    </creator>
    <creator>
      <creatorName>Onur, Duygu Oskay</creatorName>
      <givenName>Duygu Oskay</givenName>
      <familyName>Onur</familyName>
      <affiliation>TUBITAK UZAY, Video &amp; Audio Proc Grp, TR-06531 Ankara, Turkey</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Speech Detection On Broadcast Audio</title>
  </titles>
  <publisher>Aperta</publisher>
  <publicationYear>2010</publicationYear>
  <dates>
    <date dateType="Issued">2010-01-01</date>
  </dates>
  <resourceType resourceTypeGeneral="Text">Conference paper</resourceType>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://aperta.ulakbim.gov.tr/record/92905</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.81043/aperta.92904</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.81043/aperta.92905</relatedIdentifier>
  </relatedIdentifiers>
  <rightsList>
    <rights rightsURI="http://www.opendefinition.org/licenses/cc-by">Creative Commons Attribution</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">Speech boundary detection contributes to performance of speech based applications such as speech recognition and speaker recognition. Speech boundary detector implemented in this study works on broadcast audio as a pre-processor module of a keyword spotter. Speech boundary detection is handled in 3 steps. At first step, audio data is segmented into homogeneous regions in an unsupervised manner. After an ACTIVITY/NON-ACTIVITY decision is made for each region, ACTIVITY regions are classified as Speech/Non-speech via Gaussian Mixture Model (GMM) based classification. GMM's are trained using a novel feature, Spectral Flow Direction (SFD), and an improved multi-band harmonicity feature in addition to widely used Mel Frequency Cepstral Coefficients (MFCC's).</description>
  </descriptions>
</resource>
29
10
görüntülenme
indirilme
Görüntülenme 29
İndirme 10
Veri hacmi 1.8 kB
Tekil görüntülenme 28
Tekil indirme 10

Alıntı yap