Veri seti Açık Erişim

PHACTboost predictions

Nurdan Kuru; Onur Dereli; Emrah Akkoyun; Aylin Bircan; Oznur Tastan; Ogün Adebali


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/</subfield>
    <subfield code="a">Creative Commons Attribution-NonCommercial-ShareAlike</subfield>
  </datafield>
  <controlfield tag="001">263791</controlfield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:aperta.ulakbim.gov.tr:263791</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="2">opendefinition.org</subfield>
    <subfield code="a">cc-by</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Sabancı Üniversitesi</subfield>
    <subfield code="a">Onur Dereli</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Sabancı Üniversitesi</subfield>
    <subfield code="a">Emrah Akkoyun</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Sabancı Üniversitesi</subfield>
    <subfield code="a">Aylin Bircan</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Sabancı Üniversitesi</subfield>
    <subfield code="a">Oznur Tastan</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Sabancı Üniversitesi</subfield>
    <subfield code="a">Ogün Adebali</subfield>
    <subfield code="0">(orcid)0000-0001-9213-4070</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Most algorithms that are used to predict the effects of variants rely on evolutionary conservation. However, a majority of such techniques compute evolutionary conservation by solely using the alignment of multiple sequences while overlooking the evolutionary context of substitution events. We had introduced PHACT, a scoring-based pathogenicity predictor for missense mutations that can leverage phylogenetic trees, in our previous study. By building on this foundation, we now propose PHACTboost, a gradient boosting tree-based classifier that combines PHACT scores with information from multiple sequence alignments, phylogenetic trees, and ancestral reconstruction. The results of comprehensive experiments on carefully constructed sets of variants demonstrated that PHACTboost can outperform 40 prevalent pathogenicity predictors reported in the dbNSFP, including conventional tools, meta-predictors, and deep learning-based approaches as well as state-of-the-art tools, AlphaMissense, EVE, and CPT-1. The superiority of PHACTboost over these methods was particularly evident in case of hard variants for which different pathogenicity predictors offered conflicting results. We provide predictions of 215 million amino acid alterations over 20,191 proteins. PHACTboost can improve our understanding of genetic diseases and facilitate more accurate diagnoses.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="2">doi</subfield>
    <subfield code="a">10.48623/aperta.263791</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">5821818571</subfield>
    <subfield code="z">md5:5a75bc704cec9d68a87486e0cfc95e16</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.trrecord/263791/files/Results_PHACTboost.zip</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2024-03-24</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="i">isVersionOf</subfield>
    <subfield code="n">doi</subfield>
    <subfield code="a">10.48623/aperta.263790</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">PHACTboost predictions</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Sabancı Üniversitesi</subfield>
    <subfield code="a">Nurdan Kuru</subfield>
  </datafield>
  <controlfield tag="005">20240324192812.0</controlfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">missense mutations</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">phylogenetics</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">PHACT</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">PHCTboost</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">mutation effect prediction</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">protein sequences</subfield>
  </datafield>
</record>
69
7
görüntülenme
indirilme
Tüm sürümler Bu sürüm
Görüntülenme 6969
İndirme 77
Veri hacmi 40.8 GB40.8 GB
Tekil görüntülenme 5656
Tekil indirme 55

Alıntı yap