Performance analysis and optimization of cluster-based mesh FPGA architectures: design methodology and CAD tool support
Performance analysis and optimization of cluster-based mesh FPGA architectures: design methodology and CAD tool support
Field programmable gate arrays (FPGAs) have become an attractive implementation medium for digital circuits. FPGA design s big challenge is to find a good trade-off between flexibility and performance in terms of power dissipation, area density, and delay. This paper presents a new cluster-based FPGA architecture combining mesh and hierarchical interconnect topologies. Based on experimental method and benchmarks circuit implementation, this work provides a detailed exploration and analyses of the effect of cluster functionality on the proposed cluster-based FPGA in terms of power dissipation, area density, and delay. The exploration results showed that architecture with high cluster size provides high speed performance and low power dissipation. We noted also that architecture with small cluster size is more efficient in terms of area. Look-up-table (LUT) exploration showed that using architecture with 4-input LUT offers the best trade-off between power dissipation, area density, and delay.
___
- [1] Kuon I. Measuring and navigating the gap between FPGAs and ASICs. PhD, University of Toronto, Toronto, ON, Canada, 2008.
- [2] Gaillardon P, Tang X, Kim G, De Micheli G. A novel FPGA architecture based on ultrafine grain reconfigurable logic cells. IEEE T VLSI Syst 2015; 23: 2187-2197.
- [3] Chen D, Cong J, Fan Y. Low-power high-level synthesis for FPGA architectures. In: The International Symposium on Low Power Electronics and Design; 2527 August 2003; Seoul, South Korea. New York, NY, USA: IEEE. pp 134-139.
- [4] Betz V, Rose J, Marquardt A. Architecture and CAD for Deep-Submicron FPGAs. Norwell, MA, USA: Kluwer Academic Publishers, 1999.
- [5] Singh A, Marek-Sadowska M. Efficient circuit clustering for area and power reduction in FPGAs. ACM T Des Automat El 2002; 7: 643-663.
- [6] Touil L, Kechiche L, Ouni B. Design of low power system on programmable chip for video zoom-in processing. Turk J Elec Eng & Comp Sci 2015; 24: 3405-3418.
- [7] Ahmed E, Rose J. The effect of LUT and cluster size on deep-submicron FPGA performance and density. IEEE T VLSI Syst 2004; 12: 1063-8210.
- [8] Li F, Chen D, He L, Cong J. Architecture evaluation for power-efficient FPGAs. In: The International Symposium on Field Programmable Gate Arrays; 2325 February 2003; Monterey, CA, USA. New York, NY, USA: ACM. pp. 12-20.
- [9] Lemieux G, Lewis D. Directional and Single-Driver Wires in FPGA Interconnect. In: International Conference on Field Programmable Technology; 68 December 2004; Brisbane, Australia. New York, NY, USA: IEEE. pp 41-48.
- [10] Lemieux G, Lewis D. Design of Interconnection Networks for Programmable Logic. Norwell, MA, USA: Kluwer Academic Publishers, 2004.
- [11] Feng W, Kaptanoglu S. Designing efficient input interconnect blocks for LUT clusters using counting and entropy. In: The International Symposium on Field Programmable Gate Arrays; 1820 February 2007; Monterey, CA, USA. New York, NY, USA: ACM. pp. 23-32.
- [12] Lamoureux J, Wilton S. Activity estimation for field programmable gate arrays. In: International Conference on Field Programmable Logic and Applications; 2830 August 2006; Madrid, Spain. New York, NY, USA: IEEE. pp 1-8.
- [13] Hung E, Wilton S, Yu H, Chau T, Leong P. A Detailed Delay Path Model for FPGAs. In: International Conference on Field Programmable Technology; 911 December 2009; Sydney, NSW, Australia. New York, NY, USA: IEEE. pp 96-103.
- [14] Weste N, Harris D. CMOS VLSI Design: a Circuits and Systems Perspective. New York, NY, USA: Addison Wesley, 2010.
- [15] Goeders J, Wilton S. VersaPower: power estimation for diverse FPGA architectures. In: International Conference on Field Programmable Technology; 1012 December 2012; Seoul, Korea. New York, NY, USA: IEEE. pp. 229-234.
- [16] Yang S. Logic Synthesis and Optimization Benchmarks User Guide. Raleigh, NC, USA: Microelectronics Center of North Carolina (MCNC), 1991.
- [17] Marquart A, Betz V, Rose J. Using cluster-based logic block and timing-driven packing to improve FPGA speed and density. In: The International Symposium on Field Programmable Gate Arrays; 2123 February 1999; Monterey, CA, USA. New York, NY, USA: ACM. pp. 37-46.
- [18] McMurchie L, Ebeling C. Pathfinder: a negotiation-based performance-driven router for FPGAs. In: The International Symposium on Field Programmable Gate Arrays; 1214 February 1995; Monterey, CA, USA. New York, NY, USA: ACM. pp. 111-117.