并行交织器的内存映射策略：避免多读写冲突的非二进制LDPC码解码

LDPC

需积分: 9 40 浏览量更新于2024-09-11 收藏 158KB PDF 举报

身份认证购VIP最低享 7 折!

30元优惠券

"这篇文档主要讨论的是非二进制LDPC码在硬件设计及译码算法中的应用，特别是在高吞吐量应用中的平行交织器设计。文档提到了避免内存访问冲突的问题，这对于实现高效的并行架构至关重要，无论是LDPC码还是Turbo码。文中提出了一种方法，可以确保变量在内存银行中的映射无碰撞，并优化由此产生的交织架构。该研究得到了欧洲DAVINCI项目的支持，重点关注并行架构、交织器、类似Turbo的代码以及内存映射技术。" 非二进制（Nonbinary）LDPC码是编码理论中的一种扩展，相比于二进制LDPC码，它允许信息位和校验位取更多于两个值的域中的元素。这通常带来更好的纠错性能，尤其是在噪声较大的通信环境中。非二进制LDPC码的编码和译码过程比二进制形式更为复杂，需要处理更大域的运算。硬件设计对于实现高效非二进制LDPC码译码器至关重要，特别是对于高吞吐量的应用，如多媒体和电信领域。并行架构是解决这一问题的关键，它可以显著提高解码速度。然而，并行架构也带来了挑战，如并发读写操作可能导致内存块冲突，即多个读写操作同时指向同一内存位置，这会降低系统效率。为了解决这个问题，文献提出了一个内存映射方法，确保变量在内存银行中的映射总是没有冲突。这意味着在执行迭代解码时，不同的处理单元不会同时尝试访问或修改同一块内存。这种策略对于优化交织器设计特别有用，交织器在迭代解码过程中起到关键作用，通过打乱输入序列，增加错误纠正能力。论文中通过一个教学示例展示了所提方法的优势。交织器设计是提高并行解码性能的关键组成部分，因为它可以控制数据流以避免潜在的内存访问冲突。通过优化内存映射，不仅可以避免这些冲突，还可以最大化利用硬件资源，从而提高整个系统的吞吐量和效率。这篇文献提供了对非二进制LDPC码硬件实现和译码算法的深入洞察，特别是关注如何在并行架构中有效管理内存访问，以适应高性能通信系统的需求。其提出的内存映射策略对于未来的设计者和工程师来说，是优化非二进制LDPC码系统性能的一个宝贵工具。

资源详情

资源推荐

A memory Mapping Approach for Parallel Interleaver design with multiples read and write accesses

C. Chavet, P. Coussy

LabSTICC Lab., Université de Bretagne Sud

Abstract-For high throughput applications, turbo-like iterative

decoders are implemented with parallel architectures. However,

to be efficient parallel architectures require to avoid collision

accesses i.e. concurrent read/write accesses should not target

the same memory block. This consideration applies to the two

main classes of turbo-like codes which are Low Density Parity

Check (LDPC) and Turbo-Codes. In this paper we propose a

methodology which always finds a collision-free mapping of

the variables in the memory banks and which optimizes the

resulting interleaving architecture. Finally, we show through a

pedagogical example the interest our approach. This research

was supported by the European project DAVINCI.

Index Terms - Parallel architecture, interleavers, turbo-like

codes, memory mapping.

1. INTRODUCTION

In the multimedia and telecommunications domain, continuously

emerging customer services require severe performance to

implement the new communication standards. Indeed,

communication systems require high throughput -on the order of

several hundred Mb/s- accompanied by both low latency and

severe bit error rate BER constraints (e.g. wireless, fiber-optic

communication…). Owing to their impressive near-Shannon-limit

error correcting performance, turbo-like codes in their parallel or

serially concatenated versions [3], originally dedicated to channel

coding, or LDPC codes [5], are being currently reused in most of

digital communication systems (e.g. equalization, demodulation,

synchronization, MIMO…).

These coders are formed by two or more processing elements PE

(encoders/decoders) and one communication network composed of

steering components (multiplexers, butterflies, barrel shifters…) and

memory elements (registers, RAMs…). This network interleaves the

data blocks exchanged by the PEs according to a predefined rule

named interleaving law or permutation law. The turbo-like

decoding principle is based on an iterative algorithm using

decoders exchanging information in order to improve the error

correction performance through the iterations. The iterative nature

of these algorithms is a severe constraint to satisfy the

aforementioned requirements with an affordable implementation

complexity. A widespread solution is to realize the decoder in a

parallel fashion. On the one hand, this solution increases the

throughput since the latency of the system becomes the latency of

constituent sub-blocks [3]. On the other hand, the complexity and

the cost of the system are increased due to parallel nature of the

architecture.

By the way, depending on the interleaving law, different parallel

processing elements may try to simultaneously access the same

memory block (cf. Fig.

). This problem is known as the “collision”

problem [11]. In this case, three classes of solution are available:

The designer may

- define his own dedicated interleaving law in order to avoid such

collision problems, but the resulting architecture may not be

standard compliant.

- add extra memory elements and control logic in the communication

network in order to buffer and postpone the conflicting data.

- find a memory mapping avoiding any conflict access and taking into

account the cost of the architecture (i.e. interconnection network).

The paper is organized as follows: the second section presents the

existing solutions to design parallel interleaver architectures.

Mem 0

Mem 1

Mem 2

PE 0

PE 1

PE 2

Interconnection network

Fig. 1 Memory collision problem

The third section is dedicated to the problem formulation of the

interleaver design. In the fourth section we present the proposed

approach to automatically find a memory mapping solution that

avoids any conflict access. Finally, the last section presents

experimental results on a pedagogical example.

2. RELATED WORKS

An interleaving law is a permutation law, also referred as π, that

scrambles data to break up neighbourhood-relations [11]. It is a

key factor for turbo-codes performances, which varies from one

communication standard to another. Moreover for a given

standard, different interleaving rules can be used for different

modes through varying frame lengths and/or data rates [7]. In this

context, taking into account the aforementioned constraints and the

collision problems to design hardware implementations of parallel

turbo decoders require the integration of complex interconnection

network topology (cf. Fig.

) supporting the intensive interleaved

memory accesses. Indeed, in state-of-the-art parallel turbo-

decoding, interleaving is considered as a limiting factor for the

overall system performance and the architectural cost. To

successfully tackle these problems, different solutions exist.

Multiple solutions have been proposed in classical Single-Read /

Single-Write approaches. A first solution to get rid of collisions

with non prunable interleavers, consists in designing a specific

interleaver rule. In [11], the authors propose a deterministic

methodology to design collision-free interleavers. In [12] and [8]

the authors define collision-free permutations thanks to a

combination of a spatial and a temporal permutation. The authors

of [14] simply integrate the collision-free constraint in the design

of their interleaver. However, the multi-modes architectures

(depending on the frame length, the data-rate…) cannot be handled

by such approaches. Another solution consists in defining a

collision-free interleaver that preserves this property even when

pruned. In [7], the authors describe a design rule to obtain such

interleavers, with an incremental algorithm that generates

collision-free interleavers by adding new elements in successive

steps, to a small initial permutation. Of course, all these solutions

are viable if and only if the designer is free to choose the

permutation law to be used in the system. As a consequence, the

resulting architecture may not be standard compliant.

A second approach consists in adding extra memory elements in

the communication network. The aim is to buffer and to postpone

the conflicting data. In [19] the authors propose, when a collision

appears, to store the conflicting information in the communication

network until the targeted sub-block can process it. Of course, the

additional network buffering resources, and consequently the time

needed to interleave information, increase with the number of

parallel processors. This is a suboptimal strategy, in terms of

latency and thus throughput, which avoids collisions at the

下载后可阅读完整内容，剩余3页未读，立即下载

哼哼哈嘿111

粉丝: 0
资源: 2

并行交织器的内存映射策略：避免多读写冲突的非二进制LDPC码解码

Low Complexity X-EMS Algorithms for Nonbinary LDPC Codes

Hard-Information Bit-Reliability Based Decoding Algorithm for Majority-Logic Decodable Nonbinary LDPC Codes

4K LDPC 对比2K LDPC

matlab simulink ldpc,LDPC编码仿真

matlab simulink ldpc,matlab ldpc 编码解码

非二进制ldpc码和二进制LDPC有什么区别

ldpc码matlab仿真

Matlab实现LDPC编码

matlab 64进制ldpc

ldpc encode and decode ic

多元ldpc译码算法

ldpc译码误码率仿真

LDPC码国内外研究现状

ldpc程序代写 c语言

ldpc编解码verilog代码

ldpc ip核使用

配置中心.zip

基于java的物流管理系统报告的开题报告.docx

企业级SpringCould脚手架工程：Eureka、Ribbon、Hystrix、Zuul、Feign、分布式事务.zip

基于Java的微信小程序html2wxml转换接口设计源码

最新资源