
Detail of Publication

Text Language: English
Authors: Rina Buoy, Masakazu Iwamura, Sovila Srun, Koichi Kise
Title: PARSTR: Partially Autoregressive Scene Text Recognition
Journal: International Journal on Document Analysis and Recognition (IJDAR)
Number of Pages: 14
Publisher: Springer
Address: Berlin, Germany
Peer Reviewed: Yes
Month & Year: May 2024
Abstract: An autoregressive (AR) decoder for scene text recognition (STR) requires numerous generation steps to decode a text image character by character but can yield high recognition accuracy. On the other hand, a non-autoregressive (NAR) decoder generates all characters in a single step but suffers from a loss of recognition accuracy because, unlike the AR decoder, it assumes that the predicted characters are conditionally independent. This paper presents a Partially Autoregressive Scene Text Recognition (PARSTR) method that unifies AR and NAR decoding within the same model. To reduce the number of decoding steps while maintaining recognition accuracy, we devise two decoding strategies, b-first and b-ahead, which reduce the decoding steps to approximately b and by a factor of b, respectively. The experimental results demonstrate that our PARSTR models with the devised decoding strategies offer a balanced trade-off between efficiency and recognition accuracy compared with the fully AR and NAR decoding approaches. Specifically, on public benchmark STR datasets, the decoding steps can be reduced to at most five under the b-first scheme and by a factor of five under the b-ahead scheme, with a reduction in total word recognition accuracy of at most 0.5%.
DOI: 10.1007/s10032-024-00470-1
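
The step-count arithmetic behind the two decoding strategies can be illustrated with a rough sketch. The snippet below only mirrors the approximate step counts stated in the abstract; it is not the authors' implementation, the function names are made up, and the exact scheduling of the b-first scheme is an assumption.

    import math

    def ar_steps(length: int) -> int:
        # Fully autoregressive: one generation step per character.
        return length

    def nar_steps(length: int) -> int:
        # Fully non-autoregressive: all characters in a single generation step.
        return 1

    def b_first_steps(length: int, b: int) -> int:
        # b-first (per the abstract): decoding finishes in approximately b steps,
        # regardless of text length; the exact schedule is not specified here.
        return min(length, b)

    def b_ahead_steps(length: int, b: int) -> int:
        # b-ahead (per the abstract): the step count is reduced by a factor of
        # about b, i.e. roughly ceil(length / b) steps.
        return math.ceil(length / b)

    if __name__ == "__main__":
        L, b = 25, 5
        print("AR     :", ar_steps(L))          # 25 steps
        print("NAR    :", nar_steps(L))         # 1 step
        print("b-first:", b_first_steps(L, b))  # about 5 steps
        print("b-ahead:", b_ahead_steps(L, b))  # 5 steps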