Published January 1, 2011
| Version v1
Journal article
Open
Particle simulation on the Cell BE architecture
- 1. Bogazici Univ, Dept Comp Engn, TR-34342 Istanbul, Turkey
- 2. Marmara Univ, Dept Comp Engn, TR-34722 Istanbul, Turkey
- 3. Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
Description
This paper presents two parallel formulations for the Barnes-Hut algorithm on the Cell architecture, which differ in tree distribution and construction phases of the algorithm. In the initial parallelization, the domains are dynamically partitioned and assigned to the synergistic processing elements (SPEs), and SPEs construct local trees of the sub-domains in parallel. The enhanced parallelization scheme provides better clustering of the particles by sequentially constructing the global tree of the entire work space in the power processing element (PPE) and by partitioning the tree into sub-trees that can fit in the Local Store. SPEs operate on the sub-tree data and construct local trees in parallel. Our experimental evaluation indicates that this application performs much faster on the Cell BE compared to the Intel Xeon based system. Specifically, our first and second methods on the Cell BE outperform Intel Xeon by a factor of 5.8 and 7.1 for 8192 particles, respectively.
Files
bib-c2b441d6-bcf8-48ab-b84e-c3510967874c.txt
Files
(200 Bytes)
| Name | Size | Download all |
|---|---|---|
|
md5:32b25a9a77a493c6113b1f167d5dd817
|
200 Bytes | Preview Download |