An intelligent video retrieval system leveraging Large Language Models (LLMs) and multimodal search, developed for the AIC2024 competition and accepted at the international SOICT 2024 conference.
The **LLM-Powered Video Search System** is an advanced multimodal video search solution that leverages Large Language Models (LLMs) to enhance video retrieval through text, image, and metadata queries. The project was developed for the AIC2024 competition and has been accepted at the international SOICT 2024 conference, with the aim of providing an intelligent and efficient video search system. Details about the paper can be found on Springer.
- **Multimodal Search Capabilities**
  - Text-based search: supports ASR (Automatic Speech Recognition), OCR, captions, and descriptive image queries for improved accuracy.
  - Image-based search: enables users to find specific video segments based on images.
  - Metadata-based search: provides a 7x7 matrix for tagging objects and color attributes for contextual search.
- **LLM-Powered Interaction**
  - Integrates LLMs (e.g., GPT-4) to handle natural language queries and deliver relevant, context-aware search results.
- **User-Friendly Interface**
  - A responsive user interface lets users view results as keyframes or full video segments and interact with detailed metadata.
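The 7x7 metadata matrix mentioned above can be sketched as follows. This is a minimal illustration, not the project's actual implementation: the grid size (7x7) comes from the description, while the cell-mapping helper, the detection tuple format, and the example frame size are assumptions.

```python
# Hedged sketch: tagging a keyframe's objects and colors on a 7x7 grid.
# Function names and the detection format are illustrative assumptions.

GRID = 7  # the 7x7 matrix described above


def cell_of(x, y, width, height, grid=GRID):
    """Map a pixel coordinate to its (row, col) cell on the grid."""
    col = min(int(x / width * grid), grid - 1)
    row = min(int(y / height * grid), grid - 1)
    return row, col


def build_grid_tags(detections, width, height):
    """detections: list of (label, color, center_x, center_y) tuples."""
    tags = {}
    for label, color, cx, cy in detections:
        cell = cell_of(cx, cy, width, height)
        tags.setdefault(cell, []).append((label, color))
    return tags


# Example: a red car near the bottom-left of a 1280x720 keyframe
tags = build_grid_tags([("car", "red", 150.0, 650.0)], 1280, 720)
```

A query such as "red car in the bottom-left" can then be answered by matching the requested label/color pair against the tags of the corresponding grid cells.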
- Back-end: Django
- Core Technologies: CLIP, Faiss, TFIDF
- Supporting Technologies: OpenCV, PyTorch, Transformers
- Development Tools: Docker, Git, Jupyter Notebook
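The retrieval core built on CLIP and Faiss boils down to nearest-neighbour search over embedding vectors. The sketch below is a NumPy stand-in, not the project's code: in the real system CLIP produces the embeddings and Faiss performs the search at scale; here the same inner-product lookup over L2-normalised vectors is shown with random vectors in place of real features.

```python
import numpy as np

def normalize(vecs):
    """L2-normalise vectors so that dot product equals cosine similarity."""
    return vecs / np.linalg.norm(vecs, axis=-1, keepdims=True)

def top_k(query_vec, index_vecs, k=3):
    """Return indices of the k index vectors most similar to the query."""
    scores = normalize(index_vecs) @ normalize(query_vec)
    return np.argsort(-scores)[:k]

# Stand-in data: 100 "keyframe embeddings" of dimension 512 (CLIP's size)
rng = np.random.default_rng(0)
keyframes = rng.normal(size=(100, 512))
# A query that is a slightly perturbed copy of keyframe 42
query = keyframes[42] + 0.01 * rng.normal(size=512)

print(top_k(query, keyframes))  # keyframe 42 should rank first
```

In production, `faiss_search.py` would replace the brute-force matrix product with a Faiss index, which gives the same ranking far faster over millions of vectors.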
- **Clone the Repository**

  ```shell
  git clone https://github.com/xndien2004/LLM_Powered_Video_Search.git
  cd AIC2024
  ```
- **Install Dependencies**

  Ensure Python and Django are installed, then install the remaining dependencies from `requirements.txt`:

  ```shell
  pip install -r requirements.txt
  ```
- **Configure `MEDIA_ROOT`**

  Open `settings.py` in the `AIC/` folder and set `MEDIA_ROOT` to point to your local `media` directory:

  ```python
  MEDIA_ROOT = '/path/to/your/media'
  ```

  You can download the dataset from Google Drive or Kaggle. Media for the app should be stored in the `media` directory. For more detailed instructions, see the Media format section.
- **Verify Paths in `viewAPI.py`**

  Ensure the paths in `app/viewAPI.py` are correct.
- **Run Migrations**

  Update the database schema:

  ```shell
  python manage.py migrate
  ```
- **Run the Application**

  Start the development server:

  ```shell
  python manage.py runserver
  ```

  By default, the app runs at `http://127.0.0.1:8000/`.
- Data Processing: video data is transcribed with ASR and segmented into keyframes via TransNetV2, then converted into image features and metadata.
- LLM-Powered Interaction: natural language queries are processed by the LLM and combined with image features and metadata to retrieve relevant video segments.
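Combining the modalities above means merging several ranked result lists (text, image, metadata) into one. The sketch below uses reciprocal rank fusion as one plausible scheme; the segment IDs, function name, and fusion method are illustrative assumptions, and the project's `combine_search.py` may well use a different strategy.

```python
# Hedged sketch: merging per-modality rankings with reciprocal rank fusion.
# Segment IDs and the constant k=60 (a common RRF default) are illustrative.

def rrf(rankings, k=60):
    """rankings: list of ranked lists of video-segment IDs (best first).

    Each occurrence of a segment contributes 1 / (k + rank + 1) to its
    fused score, so items ranked highly by several modalities win.
    """
    scores = {}
    for ranked in rankings:
        for rank, seg_id in enumerate(ranked):
            scores[seg_id] = scores.get(seg_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)


text_hits = ["v1_s3", "v2_s1", "v5_s0"]   # e.g. from ASR/OCR/TF-IDF search
image_hits = ["v2_s1", "v1_s3", "v9_s2"]  # e.g. from CLIP+Faiss search
meta_hits = ["v2_s1", "v7_s4"]            # e.g. from the 7x7 metadata grid

print(rrf([text_hits, image_hits, meta_hits]))
```

Segment `v2_s1` appears near the top of all three lists, so the fusion ranks it first even though the text search preferred `v1_s3`.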
```
LLM_Powered_Video_Search/
├── AIC/
│   ├── settings.py
├── app/
│   ├── admin.py
│   ├── data_utils.py
│   ├── migrations/
│   ├── static/
│   ├── templates/
│   ├── viewAPI.py
├── data_extraction/
│   ├── TransnetV2/
│   ├── audio/
│   ├── metadata/
├── docker-compose.yml
├── figs/
├── manage.py
├── requirements.txt
├── utils/
├── LLM/
├── video_retrieval/
├── faiss_search.py
├── combine_search.py
├── ...
```