Managing Dynamic Reconfiguration for Fault-tolerance on a Manycore Architecture
2012 (English)In: Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2012, New York, USA: IEEE Computer Society, 2012, 312-319 p.Conference paper (Refereed)
With the advent of manycore architectures comprising hundreds of processing elements, fault management has become a major challenge. We present an approach that uses the occam-pi language to manage the fault recovery mechanism on a new manycore architecture, the Platform 2012 (P2012). The approach is made possible by extending our previously developed compiler framework to compile occam-pi implementations to the P2012 architecture. We describe the techniques used to translate the salient features of the occam-pi language to the native programming model of the P2012 architecture. We demonstrate the applicability of the approach by an experimental case study, in which the DCT algorithm is implemented on a set of four processing elements. During run-time, some of the tasks are then relocated from assumed faulty processing elements to the faultless ones by means of dynamic reconfiguration of the hardware. The working of the demonstrator and the simulation results illustrate not only the feasibility of the approach but also how the use of higher-level abstractions simplifies the fault handling. © 2012 IEEE.
Place, publisher, year, edition, pages
New York, USA: IEEE Computer Society, 2012. 312-319 p.
IdentifiersURN: urn:nbn:se:hh:diva-17336DOI: 10.1109/IPDPSW.2012.38ISI: 000309409400035ScopusID: 2-s2.0-84867429212OAI: oai:DiVA.org:hh-17336DiVA: diva2:506920
26th IEEE International Parallel & Distributed Processing Symposium, May 21-25, Regal Shanghai East Asia Hotel Shanghai, China, 2012
The research leading to these results has received funding from the ARTEMIS Joint Undertaking under grant agreement number 100230 and from the national programmes / funding authorities.2012-04-122012-03-012015-08-21Bibliographically approved