Journal article Open Access

A Methodology for Comparing the Reliability of GPU-Based and CPU-Based HPCs

Cini, Nevin; Yalcin, Gülay


JSON

{
  "conceptrecid": "273916", 
  "created": "2024-09-12T10:32:43.795843+00:00", 
  "doi": "10.1145/3372790", 
  "files": [
    {
      "bucket": "29c8a228-9bee-4db0-872f-d7998430aa3d", 
      "checksum": "md5:ba5273044fb4ade31a70b24dda0199ee", 
      "key": "Makale1.pdf", 
      "links": {
        "self": "https://aperta.ulakbim.gov.tr/api/files/29c8a228-9bee-4db0-872f-d7998430aa3d/Makale1.pdf"
      }, 
      "size": 516140, 
      "type": "pdf"
    }
  ], 
  "id": 273917, 
  "links": {
    "badge": "https://aperta.ulakbim.gov.tr/badge/doi/10.1145/3372790.svg", 
    "bucket": "https://aperta.ulakbim.gov.tr/api/files/29c8a228-9bee-4db0-872f-d7998430aa3d", 
    "doi": "https://doi.org/10.1145/3372790", 
    "html": "https://aperta.ulakbim.gov.tr/record/273917", 
    "latest": "https://aperta.ulakbim.gov.tr/api/records/273917", 
    "latest_html": "https://aperta.ulakbim.gov.tr/record/273917"
  }, 
  "metadata": {
    "access_right": "open", 
    "access_right_category": "success", 
    "creators": [
      {
        "name": "Cini, Nevin", 
        "orcid": "0000-0001-5348-4043"
      }, 
      {
        "name": "Yalcin, G\u00fclay"
      }
    ], 
    "description": "<p>Today, GPUs are widely used as coprocessors/accelerators in High-Performance Heterogeneous Computing<br>\ndue to their many advantages. However, many researches emphasize that GPUs are not as reliable as desired<br>\nyet. Despite the fact that GPUs are more vulnerable to hardware errors than CPUs, the use of GPUs in HPCs<br>\nis increasing more and more. Moreover, due to native reliability problems of GPUs, combining a great number<br>\nof GPUs with CPUs can significantly increase HPCs&rsquo; failure rates. For this reason, analyzing the reliability<br>\ncharacteristics of GPU-based HPCs has become a very important issue. Therefore, in this study we evaluate<br>\nthe reliability of GPU-based HPCs. For this purpose, we first examined field data analysis studies for GPU-<br>\nbased and CPU-based HPCs and identified factors that could increase systems failure/error rates. We then<br>\ncompared GPU-based HPCs with CPU-based HPCs in terms of reliability with the help of these factors in<br>\norder to point out reliability challenges of GPU-based HPCs. Our primary goal is to present a study that can<br>\nguide the researchers in this field by indicating the current state of GPU-based heterogeneous HPCs and<br>\nrequirements for the future, in terms of reliability. Our second goal is to offer a methodology to compare the<br>\nreliability of GPU-based HPCs and CPU-based HPCs. To the best of our knowledge, this is the first survey<br>\nstudy to compare the reliability of GPU-based and CPU-based HPCs in a systematic manner.</p>", 
    "doi": "10.1145/3372790", 
    "has_grant": false, 
    "journal": {
      "issue": "No. 1", 
      "pages": "Article 22", 
      "title": "ACM Computing Surveys", 
      "volume": "Vol. 53"
    }, 
    "keywords": [
      "Computer systems organization", 
      "Reliability", 
      "System failure", 
      "log file analysis", 
      "checkpoint/recovery", 
      "Graphics Processing Unit", 
      "Y\u00fcksek ba\u015far\u0131ml\u0131 hesaplama", 
      "High Performance Computing", 
      "Dependable and fault-tolerant systems and networks", 
      "Hardware", 
      "Hardware test", 
      "Robustness", 
      "Computer systems organization", 
      "Cross-computing tools and techniques", 
      "Software and its engineering", 
      "Software organization and properties", 
      "Extra-functional properties", 
      "failure prediction"
    ], 
    "language": "eng", 
    "license": {
      "id": "cc-by-nc-4.0"
    }, 
    "publication_date": "2020-02-06", 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "273917"
          }, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "273916"
          }
        }
      ]
    }, 
    "resource_type": {
      "subtype": "article", 
      "title": "Dergi makalesi", 
      "type": "publication"
    }, 
    "science_branches": [
      "Teknik Bilimler > Bilgisayar Bilimleri", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Bilgi G\u00fcvenli\u011fi ve G\u00fcvenilirli\u011fi", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Bilgi G\u00fcvenli\u011fi ve G\u00fcvenilirli\u011fi > Donan\u0131m G\u00fcvenli\u011fi", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Bilgi G\u00fcvenli\u011fi ve G\u00fcvenilirli\u011fi > Yaz\u0131l\u0131m G\u00fcvenli\u011fi", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Donan\u0131m", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Donan\u0131m > \u0130\u015flemci Mimarisi", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Algoritmalar > Ba\u015far\u0131m Modellemesi ve De\u011ferlendirmesi", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Algoritmalar > Paralel Algoritmalar", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Donan\u0131m > Mant\u0131ksal Tasar\u0131m", 
      "Teknik Bilimler > Bilgisayar Bilimleri > Donan\u0131m > Uygulama Tabanl\u0131 Mimari"
    ], 
    "title": "A Methodology for Comparing the Reliability of GPU-Based and CPU-Based HPCs"
  }, 
  "owners": [
    2349
  ], 
  "revision": 1, 
  "stats": {
    "downloads": 93.0, 
    "unique_downloads": 88.0, 
    "unique_views": 220.0, 
    "version_downloads": 93.0, 
    "version_unique_downloads": 88.0, 
    "version_unique_views": 220.0, 
    "version_views": 249.0, 
    "version_volume": 48001020.0, 
    "views": 249.0, 
    "volume": 48001020.0
  }, 
  "updated": "2024-09-12T10:32:43.830845+00:00"
}
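
The record above can be read with any JSON parser. The following is a minimal sketch in Python, assuming a trimmed copy of the record that keeps only the fields it uses; the function name `summarize` and the key `bytes_per_download` are hypothetical names chosen for illustration. It splits the `checksum` field into its algorithm prefix and digest, and divides `volume` by `downloads`. For this record the quotient equals the file `size`, which suggests, though the export does not state it, that `volume` counts total bytes served.

```python
import json

# Trimmed copy of the record above; only the fields used here are kept.
RECORD_JSON = """
{
  "doi": "10.1145/3372790",
  "files": [
    {
      "checksum": "md5:ba5273044fb4ade31a70b24dda0199ee",
      "key": "Makale1.pdf",
      "size": 516140
    }
  ],
  "stats": {"downloads": 93.0, "volume": 48001020.0}
}
"""


def summarize(record_json):
    """Return a few derived values from a record (hypothetical helper)."""
    record = json.loads(record_json)
    file_entry = record["files"][0]
    # The checksum is stored as "<algorithm>:<hex digest>".
    algorithm, _, digest = file_entry["checksum"].partition(":")
    stats = record["stats"]
    return {
        "doi": record["doi"],
        "file": file_entry["key"],
        "algorithm": algorithm,
        "digest": digest,
        "file_size": file_entry["size"],
        # Assumption: volume is the total number of bytes downloaded.
        "bytes_per_download": stats["volume"] / stats["downloads"],
    }


summary = summarize(RECORD_JSON)
print(summary)
```

To check a downloaded copy of `Makale1.pdf` against the record, the `digest` value could be compared with the MD5 hash of the file computed locally, for example with Python's `hashlib.md5`.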
Views 249
Downloads 93
Data volume 48.0 MB
Unique views 220
Unique downloads 88
