Folded Reed–Solomon code

From Wikipedia the free encyclopedia

In coding theory, folded Reed–Solomon codes are like Reed–Solomon codes, which are obtained by mapping Reed–Solomon codewords over a larger alphabet by careful bundling of codeword symbols.

Folded Reed–Solomon codes are also a special case of Parvaresh–Vardy codes.

Using optimal parameters one can decode with a rate of R, and achieve a decoding radius of 1 − R.

The term "folded Reed–Solomon codes" was coined in a paper by V.Y. Krachkovsky with an algorithm that presented Reed–Solomon codes with many random "phased burst" errors.[1] The list-decoding algorithm for folded RS codes corrects beyond the bound for Reed–Solomon codes achieved by the GuruswamiSudan algorithm for such phased burst errors.

History[edit]

One of the ongoing challenges in Coding Theory is to have error correcting codes achieve an optimal trade-off between (Coding) Rate and Error-Correction Radius. Though this may not be possible to achieve practically (due to Noisy Channel Coding Theory issues), quasi optimal tradeoffs can be achieved theoretically.

Prior to Folded Reed–Solomon codes being devised, the best Error-Correction Radius achieved was , by Reed–Solomon codes for all rates .

An improvement upon this bound was achieved by Parvaresh and Vardy for rates

For the Parvaresh–Vardy algorithm can decode a fraction of errors.

Folded Reed–Solomon Codes improve on these previous constructions, and can be list decoded in polynomial time for a fraction of errors for any constant .

Definition[edit]

Consider a Reed–Solomon code of length and dimension and a folding parameter . Assume that divides .

Mapping for Reed–Solomon codes like this:

where is a primitive element in

.

The folded version of Reed Solomon code , denoted is a code of block length over . are just Reed Solomon codes with consecutive symbols from RS codewords grouped together.

Graphic description[edit]

Folding of Reed–Solomon code with folding parameter m=3

The above definition is made more clear by means of the diagram with , where is the folding parameter.

The message is denoted by , which when encoded using Reed–Solomon encoding, consists of values of at , where .

Then bundling is performed in groups of 3 elements, to give a codeword of length over the alphabet .

Something to be observed here is that the folding operation demonstrated does not change the rate of the original Reed–Solomon code.

To prove this, consider a linear code, of length , dimension and distance . The folding operation will make it a code. By this, the rate will be the same.

Folded Reed–Solomon codes and the singleton bound[edit]

According to the asymptotic version of the singleton bound, it is known that the relative distance , of a code must satisfy where is the rate of the code. As proved earlier, since the rate is maintained, the relative distance also meets the Singleton bound.

Why folding might help?[edit]

Decoding a folded Reed–Solomon code

Folded Reed–Solomon codes are basically the same as Reed Solomon codes, just viewed over a larger alphabet. To show how this might help, consider a folded Reed–Solomon code with . Decoding a Reed–Solomon code and folded Reed–Solomon code for the same fraction of errors are tasks of almost the same computational intensity: one can unfold the received word of the folded Reed–Solomon code, treat it as a received word of the original Reed–Solomon code, and run the Reed–Solomon list decoding algorithm on it. Obviously, this list will contain all the folded Reed–Solomon codewords within distance of the received word, along with some extras, which we can expurgate.

Also, decoding a folded Reed–Solomon code is an easier task. Suppose we want to correct a third of the errors. The decoding algorithm chosen must correct an error pattern that corrects every third symbol in the Reed–Solomon encoding. But after folding, this error pattern will corrupt all symbols over and will eliminate the need for error correction. This propagation of errors is indicated by the blue color in the graphical description. This proves that for a fixed fraction of errors the folding operation reduces the channel's flexibility to distribute errors, which in turn leads to a reduction in the number of error patterns that need to be corrected.

How folded Reed–Solomon (FRS) codes and Parvaresh Vardy (PV) codes are related[edit]

We can relate Folded Reed Solomon codes with Parvaresh Vardy codes which encodes a polynomial of degree with polynomials where where is an irreducible polynomial. While choosing irreducible polynomial and parameter we should check if every polynomial of degree at most satisfies since is just the shifted counterpart of where is the primitive element in Thus folded RS code with bundling together code symbols is PV code of order for the set of evaluation points

.

If we compare the folded RS code to a PV code of order 2 for the set of evaluation points

we can see that in PV encoding of , for every and every appears at and ,

Relation between PV codes and FRS codes

unlike in the folded FRS encoding in which it appears only once. Thus, the PV and folded RS codes have same information but only the rate of FRS is bigger by a factor of and hence the list decoding radius trade-off is better for folded RS code by just using the list decodability of the PV codes. The plus point is in choosing FRS code in a way that they are compressed forms of suitable PV code with similar error correction performance with better rate than corresponding PV code. One can use this idea to construct a folded RS codes of rate that are list decodable up to radius approximately for . [1]

Brief overview of list-decoding folded Reed–Solomon codes[edit]

A list decoding algorithm which runs in quadratic time to decode FRS code up to radius is presented by Guruswami. The algorithm essentially has three steps namely the interpolation step in which Welch–Berlekamp-style interpolation is used to interpolate the non-zero polynomial

after which all the polynomials with degree satisfying the equation derived in interpolation are found. In the third step the actual list of close-by codewords are known by pruning the solution subspace which takes time.

Linear-algebraic list decoding algorithm[edit]

Guruswami presents a time list decoding algorithm based on linear-algebra, which can decode folded Reed–Solomon code up to radius with a list-size of . There are three steps in this algorithm: Interpolation Step, Root Finding Step and Prune Step. In the Interpolation step it will try to find the candidate message polynomial by solving a linear system. In the Root Finding step, it will try to find the solution subspace by solving another linear system. The last step will try to prune the solution subspace gained in the second step. We will introduce each step in details in the following.

Step 1: The interpolation step[edit]

It is a Welch–Berlekamp-style interpolation (because it can be viewed as the higher-dimensional generalization of the Welch–Berlekamp algorithm). Suppose we received a codeword of the -folded Reed–Solomon code as shown below

We interpolate the nonzero polynomial

by using a carefully chosen degree parameter .

So the interpolation requirements will be

Then the number of monomials in is

Because the number of monomials in is greater than the number of interpolation conditions. We have below lemma

Lemma 1. satisfying the above interpolation condition can be found by solving a homogeneous linear system over with at most constraints and variables. Moreover this interpolation can be performed in operations over .[2]

This lemma shows us that the interpolation step can be done in near-linear time.

For now, we have talked about everything we need for the multivariate polynomial . The remaining task is to focus on the message polynomials .

Lemma 2. If a candidate message polynomial is a polynomial of degree at most whose Folded Reed-Solomon encoding agrees with the received word in at least columns with
then [3]

Here "agree" means that all the values in a column should match the corresponding values in codeword .

This lemma shows us that any such polynomial presents an algebraic condition that must be satisfied for those message polynomials that we are interested in list decoding.

Combining Lemma 2 and parameter , we have

Further we can get the decoding bound

We notice that the fractional agreement is

Step 2: The root-finding step[edit]

During this step, our task focus on how to find all polynomials with degree no more than and satisfy the equation we get from Step 1, namely

Since the above equation forms a linear system equations over in the coefficients of the polynomial

the solutions to the above equation is an affine subspace of . This fact is the key point that gives rise to an efficient algorithm - we can solve the linear system.

It is natural to ask how large is the dimension of the solution? Is there any upper bound on the dimension? Having an upper bound is very important in constructing an efficient list decoding algorithm because one can simply output all the codewords for any given decoding problem.

Actually it indeed has an upper bound as below lemma argues.

Lemma 3. If the order of is at least (in particular when is primitive), then the dimension of the solution is at most .[4]

This lemma shows us the upper bound of the dimension for the solution space.

Finally, based on the above analysis, we have below theorem

Theorem 1. For the folded Reed–Solomon code of block length and rate the following holds for all integers . Given a received word , in time, one can find a basis for a subspace of dimension at most that contains all message polynomials of degree less than whose FRS encoding differs from in at most a fraction
of the codeword positions.

When , we notice that this reduces to a unique decoding algorithm with up to a fraction of errors. In other words, we can treat unique decoding algorithm as a specialty of list decoding algorithm. The quantity is about for the parameter choices that achieve a list decoding radius of .

Theorem 1 tells us exactly how large the error radius would be.

Now we finally get the solution subspace. However, there is still one problem standing. The list size in the worst case is . But the actual list of close-by codewords is only a small set within that subspace. So we need some process to prune the subspace to narrow it down. This prune process takes time in the worst case. Unfortunately it is not known how to improve the running time because we do not know how to improve the bound of the list size for folded Reed-Solomon code.

Things get better if we change the code by carefully choosing a subset of all possible degree polynomials as messages, the list size shows to be much smaller while only losing a little bit in the rate. We will talk about this briefly in next step.

Step 3: The prune step[edit]

By converting the problem of decoding a folded Reed–Solomon code into two linear systems, one linear system that is used for the interpolation step and another linear system to find the candidate solution subspace, the complexity of the decoding problem is successfully reduced to quadratic. However, in the worst case, the bound of list size of the output is pretty bad.

It was mentioned in Step 2 that if one carefully chooses only a subset of all possible degree polynomials as messages, the list size can be much reduced. Here we will expand our discussion.

To achieve this goal, the idea is to limit the coefficient vector to a special subset , which satisfies below two conditions:

Condition 1. The set must be large enough ().

This is to make sure that the rate will be at most reduced by factor of .

Condition 2. The set should have low intersection with any subspace of dimension satisfying and Such a subset is called subspace-evasive subset.

The bound for the list size at worst case is , and it can be reduced to a relative small bound by using subspace-evasive subsets.

During this step, as it has to check each element of the solution subspace that we get from Step 2, it takes time in the worst case ( is the dimension of the solution subspace).

Dvir and Lovett improved the result based on the work of Guruswami, which can reduce the list size to a constant.

Here is only presented the idea that is used to prune the solution subspace. For the details of the prune process, please refer to papers by Guruswami, Dvir and Lovett, which are listed in the reference.

Summary[edit]

If we don't consider the Step 3, this algorithm can run in quadratic time. A summary for this algorithm is listed below.

Overview of linear-algebraic list decoding algorithm for FRS code
Steps
  1. Interpolation
  2. Root finding
  3. Prune
Runtime
Error radius
List size

See also[edit]

References[edit]

  1. ^ Krachkovsky, V.Y. (November 2003). "Reed-Solomon codes for correcting phased error bursts". IEEE Transactions on Information Theory. 49 (11): 2975–84. doi:10.1109/TIT.2003.819333.
  2. ^ Brander 2010, Proposition 5.11
  3. ^ Guruswami 2011
  4. ^ Guruswami 2011