implementation of turbo decoder using max-log-map ... - ijater

International Journal of Advanced Technology & Engineering Research (IJATER) 

IMPLEMENTATION OF TURBO DECODER 

USING MAX-LOG-MAP ALGORITHM FOR 

WIRELESS APPLICATION 

Mr. RAGHUKRISHNA.S 1, Dr. K. N. HARI BHATT 2 

1,2 Nagarjuna College of Engineering and Technology, Bangalore, Karnataka, India. 

Email: raghukrishnas85@gmail.com, knhari.bhat@gmail.com 

Abstract 

Turbo code is one of the most significant achievements 

in coding theory. Turbo decoding architectures have 

greater error correcting capability than any other known 

code. The successive decoding procedures carried out in 

the conventional Max-Log-MAP algorithm are performed 

in parallel, and well formulated into a set of simple and 

regular matrix operations, which can therefore considerably 

speed up the decoding operations and reduce the computational 

complexity. The Max-Log-MAP algorithms also 

maintain the advantage of the general logarithmic MAP 

like algorithms in avoiding complex numerical representation 

problems. The Max-Log-Map algorithm is the least 

complex of the four algorithms (MAP, LOG-MAP, SOVA 

and Improved SOVA). It has twice the complexity of the 

Viterbi algorithm for each half-iteration but offers the 

worst BER performance. The Max-Log-Map algorithm 

has the additional benefit of being tolerant of imperfect 

noise variance estimates when operating on an AWGN 

channel. 

In 1948 paper, Shannon defined the limits of a communication 

System. The gap between the Shannon limit 

and practice was still 2dB until 1993[1]. Berrou et al in 

1993introduced a major advancement in the channel coding 

area was the advent of turbo codes [1][8]. Their decoding 

complexity was very high for them to be efficiently 

implemented in hardware when compared with a decoder 

for convolutional codes like a Viterbi decoder [3]. Demand 

for turbo codes in wireless communication systems 

has been increasing since their appearance in the early 

1990s, due to their outstanding performance in terms of bit 

error rate (BER) [2]. For this reason, they have been 

adopted by various wireless systems such as DVB-RCS, 

3GPP UMTS, IEEE 802.16, and CCSDS [2]. Various turbo 

decoders have been developed to improve their performance 

at algorithm and architecture levels. A Dual 

mode decoder for convolutional and turbo codes have also 

been introduced for multi-standard wireless Communication 

systems. Since there are different methods and approaches 

for decoding the received signals at the receiver, 

but in general trellis-based algorithms are commonly used. 

The figure1 shown below represents the decoding algorithms 

available: VITERBI, SOVA, MAP, MAX-LOG- 

MAP, LOG MAP, and IMPROVED SOVA. 

Many researchers have studied the MAP algorithm to 

simplify its hardware implementation as well as to improve 

its performance. Log MAP and ML-MAP algorithms 

[3] have reduced the implementation complexity. 

Since the Log MAP and ML-MAP algorithms have been 

proposed, they are widely used to implement the turbo decoders. 

Furthermore, the sliding window (SW) method [4] 

provides efficient area usage by dividing a block of input 

symbols into a number of sub-blocks. 

Index Terms—Turbo code, a posteriori probability (APP), 

complexity, convolutional code, dual code, fixed point, 

high rate, implementation, maximum a posteriori probability(MAP) 

decoder, Log MAP and ML-MAP algorithms. 

I. Introduction 

Fig. 1 Trellis based estimation algorithms 

Two-step Soft output Viterbi algorithm (SOVA) decoder 

[2] offers a low complexity solution to the turbo decoder. 

The SOVA turbo decoder has less BER performance 

than the MAP turbo Decoder for the same SNR. 

ISSN No: 2250-3536 Volume 2, Issue 4, July 2012 246

The MAP algorithm provides the best performance in 

BER while its complexity is higher than the two-step SO- 

VA. 

II. Turbo Encoder Structure 


(A) 

B. ML-MAP SISO Decoder Architecture 

Fig. 2 Turbo encoder structure 

A generic structure for turbo encoding based on parallel 

concatenation of two Recursive Systematic Convolutional 

(RSC) encoders is given in Fig 2. Two identical 

RSC encoders produce the redundant data as parity bits. 

The input data stream and parity bits are combined in series 

to form the turbo coded word. The size of the input 

data word may vary from 40 bits to 6144 bits for UMTS 

[1] and take specified values such as 378, 570, and 20730 

for CDMA2000 [6] turbo coding which are the two main 

standards of 3GPP and 3GPP2 respectively. 

III. Turbo Decoder 

A. Turbo Decoder Structure 

Figure 4 shows the SISO decoder architecture which 

consists of the forward and backward state metric, LLR 

computation, and memory (LIFO and FIFO) blocks. The 

FIFO 1 and 2 are used to Buffer the input data symbols 

and the LIFO 3 and 4 are to store the forward state metric 

and the LLR values, respectively. The SISO decoder has 

been built with two backward state metric units, β1 and 

β2. α and γ denote the forward state and branch metric 

units. 

Fig. 4 SISO Architecture 

The ML-MAP algorithm can be used for reduced 

complexity decoder implementation [4].The decoding 

process in MAP algorithm performs calculations of the 

forward and backward state metric values to obtain the log 

likelihood ratio (LLR) values, which have the decoded bit 

information and reliability values. The LLR values are 

represented by the Following equation [6]: 

Fig. 3 Iterative Turbo- Decoder 

The turbo decoder structure represents two soft-input 

soft-output (SISO) decoders and one interleaver/deinterleaver 

between them. Decoding process in a turbo 

decoder is performed iteratively through the two SISO 

decoders via the interleaver and the deinterleaver. As 

shown in Figure 3. When the Maximum A Posteriori 

(MAP) algorithm is applied to each SISO decoder, the 

Log-Likelihood Ratio (LLR) for each double-binary pair 

can be expressed as equation (A) [7]: 

k-1. The equation of γ, α, and β can be represented to logarithm 

form as shown below [3],[6]: 



The main kernel of the Turbo-Viterbi algorithm is 

ADD-COMPARE-SELECT (ACS) operation which is 

performed by Forward recursion, Reverse recursion and 

dummy reverse recursion blocks. There are 8 parallel ACS 

blocks and hence 8 states can be processed in parallel. Fully 

parallel architectures assign one ACS for each state to 

meet the performance constraints on speed and latency [5]. 

Where the branch metric (γ) is calculated by the a priori 

information (Le), channel reliability value (Lc), input 

Symbols (x and y1), the systematic bit (uk s) and the parity 

bit (uk p). As described in previous section, a priori information 

is obtained from the LLR value computed in 

previous decoding process after subtracting the input symbol 

data and a priori values from the LLR value. The MAP 

algorithm, which uses the above equations, is not suitable 

for hardware implementation due to the logarithm function. 

This problem can be addressed using a well know 

approximation, called Jacobi logarithm, which is given below. 

1. Branch And State Metric Unit: 

The branch and state metric units (BMU and SMU) 

are implemented using ML-MAP algorithm. The conventional 

BMU and SMU consists of branch metric calculation, 

add, compare, select, and normalization processes 

which is shown in fig 5. The general SMU in turbo SISO 

decoder must include the normalization process to avoid 

overflow of the state metric values [3]. 

This approximation is used to implement the state metric 

unit (SMU) and LLR computation unit (LCU) in Log 

MAP and ML-MAP SISO decoder. The 2nd term of the 

right hand side in equation (4) is a correction term which 

can be implemented through a simple look-up table [3]. 

However, the correction term is ignored in this paper as 

we have implemented ML-MAP SISO decoder. 

C. Specifications 

Complexity studies showed that Max-Log-MAP is the 

Best compromise between performance and complexity 

when compared with MAP, Log-MAP and SOVA. Therefore, 

our implementation will be based on the Max-Log- 

MAP algorithm. Regarding Turbo decoder implementations, 

several interesting implementations were recently 

proposed, most of which are based on fixed-point arithmetic 

[6]. Since fixed-point operations require multiplications 

and divisions for normalization, computational complexity 

is still high. In this paper, we propose an integer 

based Turbo decoder. The turbo decoder considered in this 

paper has the following specifications: 

I. Code rate R = 1/3. 

II. Puncturing pattern: even/odd parity. 

III. Block size: N = 40 to 6144 bits. 

IV. Interleaver: S-random interleaver with S = 8. 

Fig. 5 conventional structures of the branch and state metric units 

The branch metric values are obtained from input data 

symbols. The new state metric values are calculated in 

single clock cycle recursively using add, compare (C), select, 

and normalization (N) processes from branch metric 

and state metric values [3],[6]. 

The critical path delay is determined by the above 

processes. In the proposed architecture which is shown in 

fig 6 the normalization is done in the branch metric values 

itself. This normalization method leads to a simplified 

SMU (shown in fig.7) [3], but more complex in BMU. 

The novel architecture reduces the critical path delay significantly 

by eliminating the state metric normalization 

process used in the conventional SMU [6]. 

D. Implementation 



Fig. 6 Branch Metric Unit. 

Fig. 9 Conventional LCU Units 

Fig. 7 SMU Unit 

2. Normalization / Saturation: 

To avoid overflow metrics, Normalization is usually 

employed as shown in figure 5 and 6. We have adopted a 

very efficient normalization scheme where at each time 

instant we check if any of the state metrics is greater than 

2,[5] then a fixed value 2 is subtracted from all 

state metrics. This is shown by normalization (N) block 

shown in figure 5 and 6[3]. The block comprises of a subtractor 

that subtracts a fixed value (2) from state metrics 

and a multiplexer that selects the subtracted value if 

the normalization has to be employed. The multiplexer select 

signal is provided by each ACS block and in case of 

state serial architecture mappings (states >8) the select 

signal is provided after all the states are processed [5]. 

The LLR values (L1 or L0) are calculated using forward 

(α0-7) and backward (β0-7) states and branch metric 

(γ0-1) values of all states. The LLR computation unit 

(LCU) is similar to the SMU which consists of 3-stage 

compare and select process results long critical path delay. 

In order to reduce the critical path delay LCU is pipelined 

[6]. 

E. LLR Output Iterations 

The output L (dˆ) of the decoder in Figure 10 is made 

up of the LLR from the detector, L′ (dˆ), and the extrinsic 

LLR output, Le (dˆ), representing knowledge gleaned 

from the decoding process. As illustrated in Figure 10, for 

iterative decoding, the extrinsic likelihood is fed back to 

the decoder input, to serve as a refinement of the a priori 

probability of the data for the next iteration [9]. 

3. LLR COMPUTATION UNIT: 

Fig. 10 LLR iteration output values 



The log-MAP algorithm is the most complex of the 

four algorithms when implemented in software, but as will 

be shown later, generally offers the best bit error rate 

(BER) performance. The max-log-MAP algorithm is the 

least complex of the four algorithms (it has twice the 

complexity of the Viterbi algorithm for each half-iteration) 

but offers the worst BER performance [9]. The max-log- 

MAP algorithm has the additional benefit of being tolerant 

of imperfect noise variance estimates when operating on 

an AWGN channel. 

F. SIMULATION RESULTS 

pipelining architecture introducing for performing LLR 

value computations, has been shown that decoder 

achieved a slight reduction in power consumption and a 

slight increase in area usage, it has achieved 58% speedup 

compared to non-pipelined conventional decoder. 

Therefore, by adopting this kind of techniques, the turbo 

decoder can be applied to wireless communication systems 

requiring high data rates and low power consumption. 

Further implementation of improved technologies 

like “sliding window” can improve the performance with 

slight increase in resources. 

V. Acknowledgment 

The authors would like to thank Dr. A. T. Kalghatgi, 

Chief Scientist, Mr. Manoj Jain, Member (Senior Research 

Staff), Mrs. A.Thirija Sharmila, Member (Senior 

Research Staff) and Mr. Chaitanya Umbare Member (Research 

Staff), Central Research Laboratory, Bangalore, for 

their constant encouragement and support to carry out this 

work. 

VI. References 

Fig. 7 Branch metric result using MODELSIM 

The ML-MAP turbo SISO decoder presented in this 

paper was initially simulated at high level to verify its 

functionality. In our simulation a 1024 size block type interleaver 

has been used. 

Fig. 8 State metric result using MODELSIM 

The figure 7 above represents the branch metric {γ} 

and fig 8 represents the state metric {α} for the value of k 

= 1024 and system and parity bits of length 1024. 

IV. Conclusion 

Normalize operation was applied to branch metric 

values instead of to state metric values. In addition, with 

[1] Mustafa Taskaldiran, Richard C.S. Morling, and 

Izzet Kale. “The Modified Max-Log-MAP Turbo 

Decoding Algorithm by Extrinsic Information Scal 

ing for Wireless Applications”. 

[2]. J. H. Han, A. T. Erdogan, and T. Arslan. “A Power 

Efficient Reconfigurable Max-Log-MAP Turbo De 

coder for Wireless Communication Systems”. 

[3]. J. H. Han, A. T. Erdogan, T. Arslan “High Speed 

Max-Log-MAP Turbo SISO DecoderImplementtion 

Using Branch Metric Normalization”. 

[4]. Hirohisa GAMBE, Yoshinori TANAKA, Kazuhisa 

OHBUCHI, Teruo ISHIHARA, and Jifeng LI. “An 

Improved Sliding Window Algorithm for Max-Log- 

MAP Turbo Decoder and Its Programmable LSI Im 

plementation”. 

[5]. Shivani verma and kumar.s “An FPGA realizationof 

simplified turbo decoder architecture” International 

Journal of the Physical Sciences.Vol. 6(10), pp. 

2338-2347, 18 May, 2011 

[6]. J.M.Mathana, Dr.P.Rangarajan “FPGA Implementa 

tion of High Speed Architecture for Max Log Map 

Turbo SISO Decoder.” International Journal of Re 

cent Trends in Engineering, Vol 2, No. 6, Novem 

ber 2009. 

[7]. Huili Guo, Juntao Zhao, Jianwen Chen, Xiang Chen, 

Jing Wang “High Performance Turbo Decoder on 

CELL BE for WiMAX System”. 

[8]. Hye-Mi Choi, Ji-Hoon Kim, and In-Cheol Park 

“Low- Power Hybrid Turbo Decoding Based on Re 


verse Calculation”. 

[9]. Bernard Sklar “Digital Communications: Fundamen 

tals and Applications”, Second Edition (Prentice 

Hall,2001, ISBN 0-13-084788-7). 

Biographies 


RAGHUKRISHNA.S received B.E. degree in Electronics 

and Communication engineering in 2008 and M.Tech degree 

in VLSI Design and Embedded systems in 2012 from 

Visveswaraya Technological University, Karnataka. His 

main area of interest includes Error Control Coding and 

VLSI Design. 

Email: raghukrishnas85@gmail.com 

K.N. HARI BHATT received the B.E. degree with honours 

from Mysore University in 1966. M.Tech and Ph.D. 

degrees in Electronics and Communication Engineering 

from Indian Institute of Technology, Kanpur in 1973 and 

1986 respectively. He is currently working as Dean Academic 

and Head, Department of Electronics and Communication 

Engineering (P.G.) at Nagarjuna College of Engineering 

and Technology, Bangalore, India. He was with 

Karnataka Regional Engineering College, Suratkal (now 

known as National Institute of Technology, Karnataka) for 

more than 30 years up to 2001. He has coauthored three 

books in the area of Communication. His areas of interest 

are Analog and Digital communication and Cryptography. 

Email: knhari.bhat@gmail.com

implementation of turbo decoder using max-log-map ... - ijater

Create successful ePaper yourself

Delete template?

Save as template?