zhousheng97

Follow

🐢

Focusing

Sheng Zhou zhousheng97

🐢

Focusing

Follow

VQA & MLLM.

14 followers · 9 following

Hefei University of Technology
China
11:41 (UTC +08:00)
https://zhousheng97.github.io/

Achievements

Achievements

zhousheng97/README.md

Hi there 👋

👩 I’m Sheng, a PhD student from China, currently studying as a visiting student at the National University of Singapore.
🧐 My focus is multimedia learning, especially VQA, and I’m currently exploring multimodal LLMs.
💬 As an ENFJ-A, I thrive on meaningful collaboration and communication.
📫 You can reach me at hzgn97@gmail.com—let’s connect!

Pinned Loading

EgoTextVQA EgoTextVQA Public

[CVPR 2025] 🌟🌟 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering

Python 24
ViTXT-GQA ViTXT-GQA Public

✨✨ Scene-Text Grounding for Text-Based Video Question Answering (arxiv)

Python 12 1
Awesome-MLLM-TextVQA Awesome-MLLM-TextVQA Public

✨✨Latest Research on Multimodal Large Language Models on Scene-Text VQA Tasks

6
GPIN GPIN Public

Graph Pooling Inference Network for Text-based VQA (ACM TOMM'2024)

Python 3
SSGN SSGN Public

Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA (IEEE TIP'2023)

Python 3