This repository contains the key code for ACM TOMM 2024 paper: Graph Pooling Inference Network for Text-based VQA
The framework of Graph Pooling Inference Network (GPIN).
If you find GPIN useful for your research and applications, please cite using this BibTeX:
@article{zhou2024graph,
title={Graph Pooling Inference Network for Text-based VQA},
author={Zhou, Sheng and Guo, Dan and Yang, Xun and Dong, Jianfeng and Wang, Meng},
journal={ACM Transactions on Multimedia Computing, Communications and Applications},
volume={20},
number={4},
pages={1--21},
year={2024},
publisher={ACM New York, NY}
}
This work is based on M4C.