Journal article Open Access

LncMachine: a machine learning algorithm for long noncoding RNA annotation in plants

Cagirici, H. Busra; Galvez, S.; Sen, Taner Z.; Budak, Hikmet


MARC21 XML

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">LncMachine: a machine learning algorithm for long noncoding RNA annotation in plants</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="4">
    <subfield code="p">FUNCTIONAL &amp; INTEGRATIVE GENOMICS</subfield>
    <subfield code="v">21</subfield>
    <subfield code="n">2</subfield>
    <subfield code="c">195-204</subfield>
  </datafield>
  <controlfield tag="001">229882</controlfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-tubitak-destekli-proje-yayinlari</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">Following the elucidation of the critical roles they play in numerous important biological processes, long noncoding RNAs (lncRNAs) have gained vast attention in recent years. Manual annotation of lncRNAs is restricted by known gene annotations and is prone to false prediction due to the incompleteness of available data. However, with the advent of high-throughput sequencing technologies, a magnitude of high-quality data has become available for annotation, especially for plant species such as wheat. Here, we compared prediction accuracies of several machine learning algorithms using a 10-fold cross-validation. This study includes a comprehensive feature selection step to refine irrelevant and repeated features. We present a crop-specific, alignment-free coding potential prediction tool, LncMachine, that performs at higher prediction accuracies than the currently available popular tools (CPC2, CPAT, and CNIT) when used with the Random Forest algorithm. Further, LncMachine with Random Forest performed well on human and mouse data, with an average accuracy of 92.67%. LncMachine only requires either a FASTA file or a TAB separated CSV file containing features as input files. LncMachine can deploy several user-provided algorithms in real time and therefore be effortlessly applied to a wide range of studies.</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="2">opendefinition.org</subfield>
    <subfield code="a">cc-by</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Univ Malaga, ETSI Informat, Andalucia Tech, E-29071 Malaga, Spain</subfield>
    <subfield code="a">Galvez, S.</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">USDA ARS, Crop Improvement Genet Res Unit, Western Reg Res Ctr, 800 Buchanan St, Albany, CA 94710 USA</subfield>
    <subfield code="a">Sen, Taner Z.</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Montana BioAgr Inc, Missoula, MT 59801 USA</subfield>
    <subfield code="a">Budak, Hikmet</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="b">article</subfield>
    <subfield code="a">publication</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="a">Cagirici, H. Busra</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2021-01-01</subfield>
  </datafield>
  <controlfield tag="005">20221007074553.0</controlfield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="o">oai:aperta.ulakbim.gov.tr:229882</subfield>
    <subfield code="p">user-tubitak-destekli-proje-yayinlari</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="z">md5:7a1345fa1cda184d214069d47a6ff3af</subfield>
    <subfield code="s">190</subfield>
    <subfield code="u">https://aperta.ulakbim.gov.tr/record/229882/files/bib-39757d2a-e2f3-4be3-930e-cc89f6a04b52.txt</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://www.opendefinition.org/licenses/cc-by</subfield>
    <subfield code="a">Creative Commons Attribution</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.1007/s10142-021-00769-w</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
</record>