Video Comprehension
This project focuses on building advanced algorithms for "intelligent video comprehension", enabling efficient search and retrieval of specific events across large, complex video datasets through Multimodal Learning (vision, language, and knowledge graphs) and Retrieval-Augmented Generation (RAG).