In the vast and dynamic world of sports, key moments often define games and fuel discussions among fans and analysts alike. However, sifting through hours of video to find these pivotal events is time-consuming and labor-intensive. By combining an audio-to-text model such as Whisper with an LLM, we can automate the detection of these key moments, transforming how content is curated and consumed. This approach not only enhances the viewer's experience by delivering concise summaries and highlights, but also opens new avenues for data-driven sports analysis and storytelling.
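At a high level the flow is: extract the video's audio track, transcribe it with Whisper, and ask an LLM to pick out the key moments from the transcript. Here is a minimal sketch of that flow, assuming the Hugging Face transformers pipeline and placeholder file names (this is an illustration, not the repository's exact code):

from transformers import pipeline

# transcribe the audio track with a local Whisper checkpoint
asr = pipeline(
    "automatic-speech-recognition",
    model="models/whisper-medium.en",
    chunk_length_s=30,       # chunking lets Whisper handle long recordings
    return_timestamps=True,
)
transcript = asr("match_audio.wav")["text"]

# build an LLM prompt asking for the key moments
prompt = (
    "Below is the commentary transcript of a match. "
    "List the key moments (goals, penalties, turning points).\n\n"
    + transcript
)
# `prompt` is then sent to the LLM of your choice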
1. Set up conda environment
1.1 Install Miniconda
wget https://repo.anaconda.com/miniconda/Miniconda3-py310_23.9.0-0-Linux-x86_64.sh
chmod +x Miniconda3-py310_23.9.0-0-Linux-x86_64.sh
# run the installer and follow the prompts
./Miniconda3-py310_23.9.0-0-Linux-x86_64.sh
source ~/.bashrc
# validate installation
conda env list
1.2 Create conda environment
conda create -n sport_info_retrieval python=3.10
conda activate sport_info_retrieval
pip install -r requirements.txt
2. Download and prepare pre-trained audio-to-text model
2.1 Download model from Hugging Face
See here, or run these commands in a Jupyter cell:
!pip install huggingface_hub
from huggingface_hub import login
login(token="put_your_hugging_face_token_here", add_to_git_credential=True)
model_hf_path = 'https://huggingface.co/openai/whisper-medium.en'  # adjust for the model you'd like to use
!git clone {model_hf_path}
# archive the folder so it can be copied to the target location
!zip -r whisper-medium.en.zip whisper-medium.en -x "whisper-medium.en/.git/*"
Unarchive the zip to get the model folder and copy it to sport_info_retrieval/models.
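Alternatively (an option not shown in the steps above), huggingface_hub can download the checkpoint straight into the target folder, skipping the git clone and zip round-trip; the local_dir path below is an assumption matching this repository's layout:

from huggingface_hub import snapshot_download

# download the Whisper checkpoint directly into the models folder
snapshot_download(
    repo_id="openai/whisper-medium.en",
    local_dir="sport_info_retrieval/models/whisper-medium.en",
)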
2.2 Adjust configuration - set AUDIO_TO_TEXT_MODEL_FOLDER and AUDIO_TO_TEXT_MODEL_NAME in config.yaml
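The exact keys live in config.yaml; the values below are an assumption matching the whisper-medium.en download above, so adjust them to your folder and model names:

AUDIO_TO_TEXT_MODEL_FOLDER: models
AUDIO_TO_TEXT_MODEL_NAME: whisper-medium.en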
2.3 Install system packages on Linux
sudo apt update
sudo apt install ffmpeg
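ffmpeg is what decodes the video and extracts the audio track that Whisper transcribes. For example, a command along these lines pulls a 16 kHz mono audio file out of a match recording (the file names here are placeholders):

ffmpeg -i path/to/match.mp4 -vn -ar 16000 -ac 1 match_audio.wav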
Finally, with the transcript prepared and a prompt assembled, the key moments are retrieved and displayed:

# ask the LLM to extract key moments for the prepared prompt
info = get_info(prompt)
# render the extracted moments for the processed video
display_info(info, VIDEO_NAME)
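For context, here is a minimal sketch of what a function like get_info might do under the hood, assuming an OpenAI-style chat API; the model name and the client setup are assumptions, not the repository's actual implementation:

from openai import OpenAI

def get_info(prompt: str) -> str:
    """Ask the LLM to extract key moments for the given prompt."""
    client = OpenAI()  # reads OPENAI_API_KEY from the environment
    response = client.chat.completions.create(
        model="gpt-4o-mini",  # placeholder; use whichever LLM you configured
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content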