G. K. SHARMA, Bharat GARG

A process-tolerant low-power adder architecture for image processing applications

Aggressive CMOS technology scaling in the sub-100-nm regime makes VLSI design highly challenging due to the presence of unreliable components. Delay failures in arithmetic units are increasing rapidly due to the increased effect of process variation (PV) in scaled technology. This paper introduces a novel process-tolerant low-power adder (Prot-LA) architecture for error-tolerant applications. The proposed Prot-LA architecture segments the operands into two parts and computes the addition of the upper parts in a carry-propagate manner, whereas it computes the lower parts in a carry-free manner. In the Prot-LA, the number of bits in the carry-propagate and carry-free additions can be reconfigured based on the amount of PV. An on-chip PV detector is embedded to determine the PV severity. Because of this reconfigurability, the proposed adder completes the carry propagation with minimum error even under severe process variation. The simulation results show that the proposed Prot-LA consumes 19.9% less power than the state-of-the-art approximate adder. The efficacy of the proposed adder is demonstrated in a real application by designing an image scaling processor (ISP). The simulation results show that the Prot-LA embedded ISP consumes 7.75% less energy with 2.43 dB higher PSNR than the existing approximate adder embedded ISP.
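The operand segmentation described above can be illustrated with a small behavioural sketch. The abstract does not say which logic the carry-free lower part uses, so the bitwise OR below is an assumption chosen for illustration, and the names `approx_add`, `width`, and `lower_bits` are hypothetical. This is not the Prot-LA circuit itself, only a model of the split between carry-propagate and carry-free bits.

```python
def approx_add(a: int, b: int, width: int = 16, lower_bits: int = 4) -> int:
    """Approximate unsigned addition with a carry-free lower segment.

    The upper (width - lower_bits) bits are added exactly with carry
    propagation; the lower `lower_bits` bits are combined without carries.
    """
    if not 0 <= lower_bits <= width:
        raise ValueError("lower_bits must be between 0 and width")

    mask = (1 << width) - 1
    a &= mask
    b &= mask
    low_mask = (1 << lower_bits) - 1

    # Assumption: bitwise OR stands in for the carry-free lower part.
    # The actual Prot-LA logic is not given in the abstract.
    low = (a | b) & low_mask

    # Upper part: exact carry-propagate addition. No carry enters from
    # the lower segment, which is what shortens the carry chain.
    high = ((a >> lower_bits) + (b >> lower_bits)) << lower_bits

    return high | low


if __name__ == "__main__":
    x, y = 0b10100101, 0b01010011  # 165 and 83
    for k in (0, 2, 4):
        print(k, approx_add(x, y, width=8, lower_bits=k), x + y)
```

Raising `lower_bits` shortens the carry-propagate chain at the cost of a larger possible error in the low-order bits, which is the trade-off the reconfiguration under process variation relies on. With `lower_bits=0` the sketch reduces to exact addition.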
