A Compression System for Unicode Files Using an Enhanced LZW Method

Rincy Thayyalakkal Anto and Rajesh Ramachandran

Pertanika Journal of Science & Technology, Volume 28, Issue 4, October 2020

DOI: https://doi.org/10.47836/pjst.28.4.16

Keywords: Compression algorithms, dictionary-based data compression, LZW, Unicode encoding, UTF-8

Published on: 21 October 2020

Data compression plays a vital role in computing: it reduces the space a file occupies as well as the time taken to access it. This work presents a method, based on the Lempel-Ziv-Welch (LZW) algorithm, for compressing and decompressing a UTF-8 encoded data stream. Because many applications now use Unicode text, a special-purpose LZW compression scheme is worthwhile. The system comprises a compression module, configured to compress Unicode data by creating the dictionary entries in Unicode format. This is accomplished with adaptive data compression tables that are built from the data being compressed and so reflect the characteristics of the most recent input. The decompression module is configured to decompress the compressed file using the encoded output together with the table of unique Unicode characters obtained from the compression module. A remarkable gain in compression can be obtained, with the knowledge gathered from the source during compression being used to guide the decompression process.
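The general idea described in the abstract can be illustrated with a minimal sketch of LZW operating on Unicode characters rather than bytes. This is not the authors' implementation: the function names `lzw_compress` and `lzw_decompress` are hypothetical, and the sketch assumes that the dictionary is seeded only with the unique characters found in the input and that this character table is handed to the decompressor alongside the codes. Bit-packing of the codes and any dictionary size limit are left out.

```python
def lzw_compress(text):
    """Return (alphabet, codes) for a Unicode string (illustrative sketch)."""
    # Assumption: seed the dictionary with the unique characters of the
    # input only, instead of all 256 byte values as in byte-oriented LZW.
    alphabet = sorted(set(text))
    table = {ch: i for i, ch in enumerate(alphabet)}
    codes = []
    current = ""
    for ch in text:
        candidate = current + ch
        if candidate in table:
            # Keep extending the match while it is still in the dictionary.
            current = candidate
        else:
            # Emit the longest known match and learn the new sequence.
            codes.append(table[current])
            table[candidate] = len(table)
            current = ch
    if current:
        codes.append(table[current])
    return alphabet, codes


def lzw_decompress(alphabet, codes):
    """Rebuild the text from the unique-character table and the codes."""
    if not codes:
        return ""
    table = list(alphabet)  # index -> string, rebuilt in the same order
    previous = table[codes[0]]
    out = [previous]
    for code in codes[1:]:
        if code < len(table):
            entry = table[code]
        elif code == len(table):
            # The code refers to the entry that is about to be created.
            entry = previous + previous[0]
        else:
            raise ValueError("invalid LZW code")
        out.append(entry)
        table.append(previous + entry[0])
        previous = entry
    return "".join(out)


if __name__ == "__main__":
    sample = "മലയാളം മലയാളം മലയാളം"
    alphabet, codes = lzw_compress(sample)
    print(len(sample), "characters ->", len(codes), "codes")
    print(lzw_decompress(alphabet, codes) == sample)
```

Working on characters means a multi-byte UTF-8 sequence is never split across dictionary entries, and the initial table holds only the characters that actually occur in the file. The cost is that the character table must be stored or transmitted with the compressed output.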

ISSN 0128-7680

e-ISSN 2231-8526

Article ID: JST-2004-2020
