Konferans bildirisi Açık Erişim
Akgun, Mete; Sagiroglu, Mahmut Samil
<?xml version='1.0' encoding='utf-8'?> <resource xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://datacite.org/schema/kernel-4" xsi:schemaLocation="http://datacite.org/schema/kernel-4 http://schema.datacite.org/meta/kernel-4.1/metadata.xsd"> <identifier identifierType="URL">https://aperta.ulakbim.gov.tr/record/91661</identifier> <creators> <creator> <creatorName>Akgun, Mete</creatorName> <givenName>Mete</givenName> <familyName>Akgun</familyName> <affiliation>Tubitak BILGEM, TR-41470 Gebze, Kocaeli, Turkey</affiliation> </creator> <creator> <creatorName>Sagiroglu, Mahmut Samil</creatorName> <givenName>Mahmut Samil</givenName> <familyName>Sagiroglu</familyName> <affiliation>Tubitak BILGEM, TR-41470 Gebze, Kocaeli, Turkey</affiliation> </creator> </creators> <titles> <title>Alternative Ppm Model For Quality Score Compression</title> </titles> <publisher>Aperta</publisher> <publicationYear>2013</publicationYear> <dates> <date dateType="Issued">2013-01-01</date> </dates> <resourceType resourceTypeGeneral="Text">Conference paper</resourceType> <alternateIdentifiers> <alternateIdentifier alternateIdentifierType="url">https://aperta.ulakbim.gov.tr/record/91661</alternateIdentifier> </alternateIdentifiers> <relatedIdentifiers> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsVersionOf">10.81043/aperta.91660</relatedIdentifier> <relatedIdentifier relatedIdentifierType="DOI" relationType="IsIdenticalTo">10.81043/aperta.91661</relatedIdentifier> </relatedIdentifiers> <rightsList> <rights rightsURI="http://www.opendefinition.org/licenses/cc-by">Creative Commons Attribution</rights> <rights rightsURI="info:eu-repo/semantics/openAccess">Open Access</rights> </rightsList> <descriptions> <description descriptionType="Abstract">Next Generation Sequencing (NGS) platforms generate header data and quality information for each nucleotide sequence. These platforms may produce gigabyte-scale datasets. The storage of these datasets is one of the major bottlenecks of NGS technology. Information produced by NGS are stored in FASTQ format. In this paper, we propose an algorithm to compress quality score information stored in a FASTQ file. We try to find a model that gives the lowest entropy on quality score data. We combine our powerful statistical model with arithmetic coding to compress the quality score data the smallest. We compare its performance to text compression utilities such as bzip2, gzip and ppmd and existing compression algorithms for quality scores. We show that the performance of our compression algorithm is superior to that of both systems.</description> </descriptions> </resource>
Görüntülenme | 26 |
İndirme | 5 |
Veri hacmi | 1.0 kB |
Tekil görüntülenme | 22 |
Tekil indirme | 5 |