Konferans bildirisi Açık Erişim

Alternative PPM Model for Quality Score Compression

Akgun, Mete; Sagiroglu, Mahmut Samil


DataCite XML

<?xml version='1.0' encoding='utf-8'?>
<resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd">
  <identifier identifierType="URL">https://aperta.ulakbim.gov.tr/record/91661</identifier>
  <creators>
    <creator>
      <creatorName>Akgun, Mete</creatorName>
      <givenName>Mete</givenName>
      <familyName>Akgun</familyName>
      <affiliation>Tubitak BILGEM, TR-41470 Gebze, Kocaeli, Turkey</affiliation>
    </creator>
    <creator>
      <creatorName>Sagiroglu, Mahmut Samil</creatorName>
      <givenName>Mahmut Samil</givenName>
      <familyName>Sagiroglu</familyName>
      <affiliation>Tubitak BILGEM, TR-41470 Gebze, Kocaeli, Turkey</affiliation>
    </creator>
  </creators>
  <titles>
    <title>Alternative Ppm Model For Quality Score Compression</title>
  </titles>
  <publisher>Aperta</publisher>
  <publicationYear>2013</publicationYear>
  <dates>
    <date dateType="Issued">2013-01-01</date>
  </dates>
  <resourceType resourceTypeGeneral="Text">Conference paper</resourceType>
  <alternateIdentifiers>
    <alternateIdentifier alternateIdentifierType="url">https://aperta.ulakbim.gov.tr/record/91661</alternateIdentifier>
  </alternateIdentifiers>
  <relatedIdentifiers>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.81043/aperta.91660</relatedIdentifier>
    <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.81043/aperta.91661</relatedIdentifier>
  </relatedIdentifiers>
  <rightsList>
    <rights rightsURI="http://www.opendefinition.org/licenses/cc-by">Creative Commons Attribution</rights>
    <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights>
  </rightsList>
  <descriptions>
    <description descriptionType="Abstract">Next Generation Sequencing (NGS) platforms generate header data and quality information for each nucleotide sequence. These platforms may produce gigabyte-scale datasets. The storage of these datasets is one of the major bottlenecks of NGS technology. Information produced by NGS are stored in FASTQ format. In this paper, we propose an algorithm to compress quality score information stored in a FASTQ file. We try to find a model that gives the lowest entropy on quality score data. We combine our powerful statistical model with arithmetic coding to compress the quality score data the smallest. We compare its performance to text compression utilities such as bzip2, gzip and ppmd and existing compression algorithms for quality scores. We show that the performance of our compression algorithm is superior to that of both systems.</description>
  </descriptions>
</resource>
26
5
görüntülenme
indirilme
Görüntülenme 26
İndirme 5
Veri hacmi 1.0 kB
Tekil görüntülenme 22
Tekil indirme 5

Alıntı yap