Japanese / English

Detail of Publication

Text Language English
Authors Rina Buoy, Masakazu Iwamura, Sovila Srun, Koichi Kise
Title Language-Aware Non-Autoregressive Khmer Textline Recognition Using Khmer Subword Model
Journal Proc. International Conference on Pattern Recognition and Artificial Intelligence
Number of Pages 16 pages
Reviewed or not Reviewed
Presentation type Oral
Month & Year July 2024
Abstract Unlike the Latin script, Khmer does not use spaces between words, leading to text recognition typically being done at the textline level. This can involve a vast number of characters and results in high latency for a language-aware autoregressive (AR) decoder that generates one character at a time. On the other hand, a non-autoregressive (NAR) decoder generates all characters in parallel, but it is not language-aware. In this paper, we introduce an efficient Khmer textline recognition method based on a NAR decoder, ensuring low decoding latency while maintaining linguistic awareness. This is achieved by utilizing a Khmer-specific subword modeling called Khmer character clusters (KCC) that capture the syntactic, morphological, and orthographic aspects of the Khmer script. Therefore, instead of conventional character-level recognition, the proposed method recognizes all character clusters or subwords in parallel. The experimental results demonstrate that the proposed method outperforms the character-level baseline NAR model in terms of recognition accuracy while maintaining the same low latency. When compared with the character-level baseline AR model, the proposed method achieves comparable or improved recognition accuracy while also achieving significantly lower latency. When compared with the recent state-of-the-art (SOTA) NAR and AR Khmer text recognition methods, our proposed method achieves superior recognition performance.
Back to list