Programming Real-time Autofocus on a Massively Parallel Reconfigurable Architecture using Occam-pi
Halmstad University, School of Information Science, Computer and Electrical Engineering (IDE), Halmstad Embedded and Intelligent Systems Research (EIS), Centre for Research on Embedded Systems (CERES). ORCID iD: 0000-0002-4932-4036
Business Area Electronic Defence Systems, Saab AB, Gothenburg, Sweden.
Halmstad University, School of Information Science, Computer and Electrical Engineering (IDE), Halmstad Embedded and Intelligent Systems Research (EIS), Centre for Research on Embedded Systems (CERES). ORCID iD: 0000-0001-6625-6533
2011 (English). In: Proceedings of the 19th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM'2011), Los Alamitos, Calif.: IEEE Computer Society, 2011, 194-201 p. Conference paper (Refereed)
Abstract [en]

Recently we proposed occam-pi as a high-level language for programming massively parallel reconfigurable architectures. The design of occam-pi incorporates ideas from CSP and pi-calculus to facilitate expressing parallelism and reconfigurability. The feasibility of this approach was illustrated by building three occam-pi implementations of DCT executing on an Ambric. However, because DCT is a simple and well-studied algorithm, it remained uncertain whether occam-pi would also be effective for programming novel, more complex algorithms.

In this paper, we demonstrate the applicability of occam-pi for expressing various degrees of parallelism by implementing a substantial case study of focus criterion calculation in an autofocus algorithm on the Ambric architecture. Autofocus is a key component of synthetic aperture radar systems. Two implementations of focus criterion calculation were developed and evaluated on the basis of performance. Comparing the performance results with a single-threaded software implementation of the same algorithm shows that the two implementations achieve 11x and 23x higher throughput than the sequential implementation, despite a 9x lower clock frequency. The two designs are, respectively, 29x and 40x more energy efficient.
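As a rough illustration of what the quoted figures imply, the sketch below normalizes the throughput gains to equal clock frequency. It assumes the "9x" clock ratio is exact and uses the hypothetical labels design_1 and design_2 for the two implementations; the abstract gives neither raw measurements nor design names.

```python
# Speedup figures quoted in the abstract. The labels "design_1" and
# "design_2" are hypothetical; the abstract does not name the designs.
THROUGHPUT_SPEEDUPS = {"design_1": 11, "design_2": 23}

# Assumption: the sequential processor's clock is exactly 9 times faster
# than the Ambric clock (the abstract only says "(9x)").
CLOCK_RATIO = 9


def per_cycle_speedup(throughput_speedup, clock_ratio):
    """Throughput gain normalized to equal clock frequency.

    (T_ambric / f_ambric) / (T_seq / f_seq)
        = (T_ambric / T_seq) * (f_seq / f_ambric)
    """
    return throughput_speedup * clock_ratio


per_cycle = {
    name: per_cycle_speedup(speedup, CLOCK_RATIO)
    for name, speedup in THROUGHPUT_SPEEDUPS.items()
}
print(per_cycle)  # {'design_1': 99, 'design_2': 207}
```

Under these assumptions, the Ambric array as a whole completes roughly 99 and 207 times as much work per clock cycle as the sequential implementation. This describes the aggregate of all processors working in parallel, not the speed of an individual processor.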

Place, publisher, year, edition, pages
Los Alamitos, Calif.: IEEE Computer Society, 2011. 194-201 p.
Series
Annual IEEE Symposium on Field-Programmable Custom Computing Machines
National Category
Computer Engineering
Identifiers
URN: urn:nbn:se:hh:diva-14616
DOI: 10.1109/FCCM.2011.20
ISI: 000298664800034
ScopusID: 2-s2.0-79958730167
ISBN: 978-1-61284-277-6
ISBN: 978-0-7695-4301-7
OAI: oai:DiVA.org:hh-14616
DiVA: diva2:405548
Conference
IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM'2011), Campus Univ Utah, Salt Lake City, UT, MAY 01-03, 2011
Projects
SMECY
Funder
EU, FP7, Seventh Framework Programme, 100230
Note

©2011 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Available from: 2011-03-22 Created: 2011-03-22 Last updated: 2015-08-21 Bibliographically approved
In thesis
1. Programming of Coarse-Grained Reconfigurable Architectures
2011 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Coarse-grained reconfigurable architectures, which offer massive parallelism coupled with the capability of undergoing run-time reconfiguration, are gaining attention because they can meet not only the increased computational demands of high-performance embedded systems but also the need for adaptability to the functional requirements of the application. This thesis focuses on the programming aspects of such coarse-grained reconfigurable computing devices, including the relevant computation models that are capable of exposing different kinds of parallelism inherent in the application and the ability of these models to capture the adaptability requirements of the application. The thesis suggests the occam-pi language as an intermediate language for programming a broad class of coarse-grained reconfigurable architectures; we call it intermediate because we believe that application programming is best done in a high-level domain-specific language. The salient properties of the occam-pi language are explicit concurrency with built-in mechanisms for interprocessor communication, provision for expressing dynamic parallelism, support for the expression of dynamic reconfigurations, and placement attributes. To evaluate the programming approach, a compiler framework was extended to support the language extensions in the occam-pi language, and backends were developed to target two different coarse-grained reconfigurable architectures: XPP and Ambric. The results on XPP reveal that the occam-pi based implementations produce throughput comparable to that of NML programs, while programming at a much higher level of abstraction than that of NML. Similarly, the two occam-pi implementations of autofocus criterion calculation targeted to the Ambric platform outperform the CPU implementation by factors of 11-23.
Thus, the results of the implemented case-studies suggest that the occam-pi language based approach simplifies the development of applications employing run-time reconfigurable devices without compromising the performance benefits.

Place, publisher, year, edition, pages
Örebro: Örebro universitet, 2011
Series
Örebro Studies in Technology, ISSN 1650-8580 ; 48
National Category
Computer Science
Identifiers
urn:nbn:se:hh:diva-15050 (URN)
Public defence
2011-05-26, Wigforssalen, Högskolan i Halmstad, Halmstad, 10:15 (English)
Available from: 2011-05-09 Created: 2011-04-28 Last updated: 2015-12-16 Bibliographically approved

Open Access in DiVA

fulltext (576 kB), 345 downloads
File information
File name: FULLTEXT01.pdf
File size: 576 kB
Checksum (SHA-512):
bbc23c99631a41ea3af2efa9d2e82db9cac43508020788f4f8e49ca1a84204ee3751c5f4e7bc22526e861632b7e854fe81bc3b4d50df6caf3da907349b46aaec
Type: fulltext
Mimetype: application/pdf

Authors
Zain-ul-Abdin; Åhlander, Anders; Svensson, Bertil