
doi: 10.2298/sjee2203351a
Most algorithms of data compression were developed with English language as target text syntax. However, this paper approaches the problem of Yor?b? text files compression via the use of Discrete Wavelet Transform (DWT) and Huffman algorithm. Text files in Yor?b? language syntax are first converted into signal format that are then decomposed using DWT. The decomposed ASCII code representation of the text files are subsequently encoded using Huffman algorithm. Twenty different variants of DWTs taken from four families of wavelet filters (Haar, Daubechies, Symlets and bi-orthogonal) are considered to select the optimal DWT for Yor?b? text files compression. Furthermore, experiments are carried out in the proposed compression scheme with six different Yor?b? text files extracted from the open sources as input data sets. It is found that out of the twenty variants of DWT investigated, sym6 gives the best output for effective Yor?b? text files compression, due to its relatively high compression ratio, high compression factor and lowest compression error. Thus, sym6 as a wavelet transform is suitable for lossy text compression algorithm meant for Yor?b? language syntax text files.
yorùbá language syntax, Electrical engineering. Electronics. Nuclear engineering, text file, compression, wavelet transform, huffman coding, TK1-9971
yorùbá language syntax, Electrical engineering. Electronics. Nuclear engineering, text file, compression, wavelet transform, huffman coding, TK1-9971
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 2 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
