video-text-retrieval

Here are 16 public repositories matching this topic...

ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

search retrieval ranking clip multimodality multimodal-learning multimodal activitynet retrieval-model msvd msrvtt video-text-retrieval lsmdc didemo video-clip-retrieval

Updated Apr 12, 2024
Python

Paranioar / Awesome_Matching_Pretraining_Transfering

Star

The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Insight.

Updated Dec 15, 2024

microsoft / UniVL

Star

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

video localization caption alignment segmentation coin multimodality joint multimodal-sentiment-analysis pretrain pretraining msrvtt video-text-retrieval video-text video-language youcookii retrieval-task caption-task

Updated Jul 25, 2024
Python

whwu95 / Cap4Video

Star

【CVPR'2023 Highlight & TPAMI】Cap4Video: What Can Auxiliary Captions Do for Text-Video Retrieval?

video-understanding cross-modal-learning video-text-retrieval video-language-understanding

Updated Nov 29, 2024
Python

salesforce / ALPRO

Star

Align and Prompt: Video-and-Language Pre-training with Entity Prompts

representation-learning vision-and-language video-question-answering video-text-retrieval video-language prompt-learning

Updated Sep 20, 2022
Python

m-bain / CondensedMovies

Star

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

retrieval dataset video-text-retrieval source-videos precomputed-features

Updated Sep 21, 2022
Python

xuguohai / X-CLIP

Star

An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"

multimodal activitynet msvd msrvtt video-text-retrieval lsmdc didemo

Updated Apr 6, 2024
Python

alipay / Ant-Multi-Modal-Framework

Star

Research Code for Multimodal-Cognition Team in Ant Group

video-editing multimodal-learning video-text-retrieval image-text-retrieval multimodal-llm

Updated Jul 11, 2024
Python

amazon-science / crossmodal-contrastive-learning

Star

CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021

natural-language-processing video computer-vision transformers video-captioning multi-modality contrastive-learning video-text-retrieval

Updated Feb 7, 2022
Python

LeapLabTHU / Cross-Modal-Adapter

Star

[arXiv] Cross-Modal Adapter for Text-Video Retrieval

adapter machine-learning deep-learning pytorch clip vision-and-language video-text-retrieval parameter-efficient-learning parameter-efficient-tuning

Updated Nov 21, 2022

RenShuhuai-Andy / TESTA

Star

[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

video-understanding video-qa video-text-retrieval long-video-understanding

Updated Jan 9, 2024
Python

knightyxp / DGL

Star

[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.

cross-modal-retrieval cross-modal-learning video-text-retrieval prompt-tuning parameter-efficient-tuning video-language-understanding

Updated Oct 14, 2024
Python

unitaryai / VTC

Star

VTC: Improving Video-Text Retrieval with User Comments

comments video-understanding multimodal-deep-learning video-text-retrieval vision-language-transformer vision-language-pretraining

Updated Nov 3, 2024
Python

rn-snehapriya / Automatic-Note-Taking-From-Video-Using-Tesseract-OCR

Star

Text from the video is extracted and saved into a .docx file in the form of notes.

machine-learning ocr tesseract-ocr video-to-text video-text-recognition video-text-retrieval video-text automatic-note-taking

Updated Mar 27, 2022
Jupyter Notebook

unitaryai / VTC-dataset

Star

dataset video-understanding video-text-retrieval vision-language-pretraining vision-language-dataset

Updated May 1, 2024
Python

Jazz1996 / tech_review

Star

Survey of state-of-art video-text retrieval methods.

video-text-retrieval

Updated Nov 14, 2020

Improve this page

Add a description, image, and links to the video-text-retrieval topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the video-text-retrieval topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

video-text-retrieval

Here are 16 public repositories matching this topic...

ArrowLuo / CLIP4Clip

Paranioar / Awesome_Matching_Pretraining_Transfering

microsoft / UniVL

whwu95 / Cap4Video

salesforce / ALPRO

m-bain / CondensedMovies

xuguohai / X-CLIP

alipay / Ant-Multi-Modal-Framework

amazon-science / crossmodal-contrastive-learning

LeapLabTHU / Cross-Modal-Adapter

RenShuhuai-Andy / TESTA

knightyxp / DGL

unitaryai / VTC

rn-snehapriya / Automatic-Note-Taking-From-Video-Using-Tesseract-OCR

unitaryai / VTC-dataset

Jazz1996 / tech_review

Improve this page

Add this topic to your repo