Burrows–Wheeler Transform Based Lossless Text Compression Using Keys and Huffman Coding

Text compression is one of the most significant research fields, and various algorithms for text compression have already been developed. This is a significant issue, as the use of internet bandwidth is considerably increasing. This article proposes a Burrows–Wheeler transform and pattern matching-b...

Full description

Bibliographic Details
Main Authors: Md. Atiqur Rahman, Mohamed Hamada
Format: Article
Language:English
Published: MDPI AG 2020-10-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/10/1654
Description
Summary:Text compression is one of the most significant research fields, and various algorithms for text compression have already been developed. This is a significant issue, as the use of internet bandwidth is considerably increasing. This article proposes a Burrows–Wheeler transform and pattern matching-based lossless text compression algorithm that uses Huffman coding in order to achieve an excellent compression ratio. In this article, we introduce an algorithm with two keys that are used in order to reduce more frequently repeated characters after the Burrows–Wheeler transform. We then find patterns of a certain length from the reduced text and apply Huffman encoding. We compare our proposed technique with state-of-the-art text compression algorithms. Finally, we conclude that the proposed technique demonstrates a gain in compression ratio when compared to other compression techniques. A small problem with our proposed method is that it does not work very well for symmetric communications like Brotli.
ISSN:2073-8994