[1] X. L. Truong, “Integrating features and harnessing pre-trained visual-language models for enhancing VQA reading comprehension,” J. Comput. Sci. Cybern., vol. 41, no. 3, pp. 323–336, May 2025.