• 検索結果がありません。

FPGAの消費電力を削減するHDLコーディング手法の検討

N/A
N/A
Protected

Academic year: 2021

シェア "FPGAの消費電力を削減するHDLコーディング手法の検討"

Copied!
2
0
0

読み込み中.... (全文を見る)

全文

(1)情報処理学会第 76 回全国大会. 2A-7. Examination of HDL coding styles to reduce power consumption for FPGAs †. Ryohei Kobayashi † Kenji Kise † Graduate School of Information Science and Engineering Tokyo Institute of Technology. 1 Introduction   The advantages of using Field Programmable Gate Arrays (FPGAs) are to easily change design by users and to reduce development time. Although users can get these benefits, the FPGA has a disadvantage of high power consumption. FPGAs with high power consumption incur high packaging costs, shortens chip life-times, expensive cooling systems, and decreases of system reliability. Therefore, it is truly important to reduce power consumption of FPGAs. In this paper, we examine the hardware description language (HDL) coding style, which has already been proposed to reduce power consumption of FPGAs, and discuss other effective ways. 2 Prior Work   We describe a prior work [1] about HDL coding styles to reduce power consumption for FPGAs. Minimizing the number of Flip-Flop updates is one of the ways to reduce power consumption, such as not overwriting a register with the value that exactly match the value that is already stored at the register. Figure 1 shows a circuit of normal coding style. Value from a user logic circuit is stored into D FlipFlop (DFF). Since the clock input of DFF is connected to the clock signal, the DFF is clocked even when it is not necessary. For example, when the D input has the same value as the Q output (D = Q), the DFF does not need to be clocked. This unnecessary Flip-Flop update wastes power. To address this problem, a new coding style is proposed in the prior work. Figure 2 shows a circuit employing the coding style. The feature of the circuit is same as that of the circuit of normal coding style. The T Flip-Flop (TFF) is clocked only when the Q output and input from the user logic circuit are different value. Thus, since the TFF is clocked only when it is needed to update storage data, it is possible to reduce power consumption. In addition to this benefit, circuits employing the new coding style can run faster than generated circuits from normal coding style. In generated circuits from normal coding style, the DFF can be clocked properly if its D input is stable at least before its setup time. In circuits employing the new coding style, since the T input of the TFF is always connected to logic ’1’, the TFF is always ready to be clocked. As a result, circuits employing the new coding style can run faster than generated circuits from normal coding style. In the prior work, the Microelectronic Center of North Carolina (MCNC) benchmark circuits [2] are used to evaluate the new coding style. Each MCNC benchmark circuit is converted into VHDL files to represent the new coding style and normal coding style.. D Q D Flip-Flop. User Logic Circuit CLK Figure 1: A circuit of normal coding style.. ʻ1ʼ. T Q T Flip-Flop. XOR User Logic Circuit. CLK. AND. Figure 2: A circuit employing the new coding style in the prior work. The circuits are implemented on an FPGA (Altera Stratix EP1S10F484C5) using an FPGA tool (Altera Quartus II 6.0). The new coding style reduces total power consumption by 13-90% and runs 2-20% faster compared to normal coding style. 3. Examination of Prior Work.   In order to confirm benefits of the prior work, we examine it using test circuits shown in Figure 3. Figure 3a shows a normal n-bit counter circuit, and Figure 3b show an n-bit counter circuit employing the new coding style of the prior work. In this examination, we write 1000-bit (n=1000) counter circuits with and without the new coding style in Verilog HDL, and we measure power consumption of two 1000-bit counters. As shown in Figure 3b, note that the TFF consists of one XOR and one DFF because TFF is not provided as FPGA primitives in the used FPGA for this examination. We use Xilinx ISE 14.7 for logic synthesis for written circuits, DIGILENT Atlys Board with FPGA Xilinx Spartan-6 XCS6SLX45 for implementation of these circuits. We set FPGA frequency as 120MHz. To measure power consumption of these circuits, we use Adept2.13.1 software system (32/64-bit Windows) shown in Figure 4. This tool can measure power consumption of 3.3V, 2.5V, 1.8V, and 1.2V supplies. Since power consumption of the test circuits correspond to that of 1.2V, we measure power consumption of the test circuits driven by 1.2V supply. Figure 5 shows power consumption of the test circuits. ”Normal” represents power consumption of the normal two 1000-bit counters, and ”Employing prior. 1-25. Copyright 2014 Information Processing Society of Japan. All Rights Reserved..

(2) 情報処理学会第 76 回全国大会. D Q D Flip-Flop. XOR. cnt[0] XOR. CLK. D Q D Flip-Flop. cnt[1]. XOR. CLK. D Q D Flip-Flop. 600 cnt[n-1]. Power Consumption[mW]. ʻ1ʼ. 500. CLK. 400. a. n-bit counter ʻ1ʼ ʻ1ʼ. XOR. D Q D Flip-Flop. CLK. T Q T Flip-Flop. cnt[0] ʻ1ʼ. T Q T Flip-Flop. cnt[1]. ʻ1ʼ. T Q T Flip-Flop. cnt[n-1]. 200 100. Q. CLK. 165%. 300. AND. 0. AND. Normal. Employing prioir work. b. n-bit counter employing prior work. Figure 5: Power consumption of the normal two 1000Figure 3: Test circuits with and without the new cod- bit counters and the two 1000-bit counters employing ing style in the prior work. prior work. !""#$%!"#$%&"!&!'(#)!'!(#*!'!&+$,-.! !()!*"!/'+,"-.$-!0&1-!/-$#%! !!!!#0!'"#$%&"-!,+23!1&!,$3$.! -%.!. DATA IN1 IN2 LOAD CLK. D Q D Flip-Flop CE. ENABLE. Recommended coding style. DOUT. !""#$%!4$3"0&1!&!'(#)!'!(#*!'!0&1-.! !()!*"!/'+,"-.$-!4$3"0&1-!/-$#%! !!!!#0!'&+$,-!,+23!1&!,$3$.! -%.!. DATA LOAD IN1 IN2 CLK. D Q D Flip-Flop CE. DOUT. ENABLE Not recommended coding style. Figure 6: Recommended and not recommended coding styles. ferent clock per FF. Figure 4: Measurement of power consumption by It is not necessarily good to gate the clock port, Adept2.13.1 software system. because gated clocks can increase clock delay, clock skew, and cause glitches and other undesirable effects. work” represents power consumption of the two 1000- From these reasons, Xilinx recommends using the dedbit counters with the new coding style. The normal cir- icated clock-enable (CE) port instead of gating the cuit consumes 213mW, the circuit employing the prior clock port [4]. Figure 6 shows recommended and not work consumes 564mW and 165% higher than Normal. recommended coding styles by Xilinx. Using the clock In the next section, we discuss the reason why the new enable can improve timing characteristics and analysis coding style does not achieve lower power consumption. of the design. Therefore, it is promising to use clock enable. 4 Discussion and Summary In the future work, it is crucial to seek better HDL   Implemented circuits on Xilinx FPGA Spartan-6 coding styles based on obtained findings from this exconsist of configurable logic blocks (CLBs) and switch amination. matrixes connecting between CLBs. A CLB has logic cells, called ”Slices”, and each slice has Lookup Ta- References bles (LUTs), Flip-Flops (FFs), and single clock input [1] Thomas Marconi, Dimitris Theodoropoulos, Koen Bertels, and Georgi Gaydadjiev. A Novel HDL Coding Style port [3]. LUTs are used to realize combinational logic to Reduce Power Consumption for Reconfigurable Decircuits, and FFs are used to realize sequential logic vices. In Jinian Bian, Qiang Zhou, Peter Athanas, Yacircuits. jun Ha, and Kang Zhao, editors, FPT, pp. 295–299. In our test circuit with the new coding style, differIEEE, 2010. ent clock per FF is fed the clock input. Since each slice has single clock input port, the Xilinx logic synthesis [2] Saeyang Yang. Logic Synthesis and Optimization tool cannot place the FFs fed with such different clocks Benchmarks User Guide Version 3.0, 1991. in the same slice. In fact, the occupied slice ratio of [3] 7 Series FPGAs Configurable Logic Block User Guide. Employ prior work is 29 % while that of Normal is http://www.xilinx.com/support/documentation/ 11%. This causes the increase of the number of the user guides/ug474 7Series CLB.pdf. occupied slices. The more the number of switch ma[4] HDL Coding Practices to Accelerate Design Perfortrixes increases, the higher the power consumption of mance. the FPGA is. Therefore, it is better to use coarse-grain http://www.eng.utah.edu/∼cs3710/xilinx-docs/ approach like feeding different clock per 32-bit register wp231.pdf. rather than to use fine-grain approach like feeding dif-. 1-26. Copyright 2014 Information Processing Society of Japan. All Rights Reserved..

(3)

Figure 1: A circuit of normal coding style.
Figure 3: Test circuits with and without the new cod- cod-ing style in the prior work.

参照

関連したドキュメント

The author, with the aid of an equivalent integral equation, proved the existence and uniqueness of the classical solution for a mixed problem with an integral condition for

Now we are going to construct the Leech lattice and one of the Niemeier lattices by using a higher power residue code of length 8 over Z 4 [ω].. We are going to use the same action

Greenberg and G.Stevens, p-adic L-functions and p-adic periods of modular forms, Invent.. Greenberg and G.Stevens, On the conjecture of Mazur, Tate and

Then it follows immediately from a suitable version of “Hensel’s Lemma” [cf., e.g., the argument of [4], Lemma 2.1] that S may be obtained, as the notation suggests, as the m A

The proof uses a set up of Seiberg Witten theory that replaces generic metrics by the construction of a localised Euler class of an infinite dimensional bundle with a Fredholm

Using the batch Markovian arrival process, the formulas for the average number of losses in a finite time interval and the stationary loss ratio are shown.. In addition,

[Mag3] , Painlev´ e-type differential equations for the recurrence coefficients of semi- classical orthogonal polynomials, J. Zaslavsky , Asymptotic expansions of ratios of

We link together three themes which had remained separated so far: the Hilbert space properties of the Riemann ze- ros, the “dual Poisson formula” of Duffin-Weinberger (also named by