VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Authors: Editorial Team
Affiliations: MBZUAI
Published: July 30, 2024
DOI: 15.997566/mbzuai.00033

Introduction

Current methods in video understanding primarily rely on either image or video encoders, each with inherent limitations. Image…