TiCA

A Tibetan Text Compression Algorithm

descriptionPublicationkeyboard_double_arrow_right Article 15 Oct 2020Publisher:ACMJournal:Proceedings of the 2nd International Conference on Artificial Intelligence and Advanced Manufacture

Authors: Suonan Jiancuo; Chen Shuo; Renqing Nuobu; Nima Zhaxi;

doi: 10.1145/3421766.3421868

TiCA

- Summary
- Metrics

Abstract

This paper proposes a Tibetan text compression algorithm (TiCA), which is based on the fact that each Tibetan syllable is composed of one to seven components and each component has a unique Unicode encoding. First of all, through statistical analysis of 20G Tibetan text corpus, a fault-tolerant mapping dictionary is established and used as the dictionary of the TiCA. The TiCA then compresses the Tibetan text according to the mapping dictionary by mapping the original code to a single code. Finally, the experimental comparison shows that the Tibetan text compression algorithm proposed in this paper has achieved excellent results both in the compression rate and time consuming.

Related Organizations

Sichuan University
China (People's Republic of)
Tibet University
China (People's Republic of)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average