powered by OpenAI's CLIP
CLIP is a powerful pre-trained model able to calculate the compatibility between an image and a text prompt. We can easily see how useful this model can be for an image search engine.
So, I decided to create a simple app that takes a directory of images as input, calculates their embeddings with CLIP, and stores them in an Elasticsearch index. Then, when the user enters a prompt such as "dog", we calculate the text embedding with CLIP and perform a cosine similarity query on our index. The paths of the matching images are then returned to the user.
Yep, I have not implemented a frontend yet, but it's definitely on my TODO list.
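Below is a rough sketch of that flow, assuming OpenAI's `clip` package and the `elasticsearch` Python client; the index name, field names, and mapping are illustrative and not necessarily what this repo uses.

```python
# Illustrative sketch of the pipeline (index/field names and the mapping are
# assumptions for this example, not necessarily the repo's actual code).
import clip
import torch
from PIL import Image
from elasticsearch import Elasticsearch

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)
es = Elasticsearch("http://localhost:9200")

# Create an index with a dense_vector field for the 512-dim CLIP embeddings.
es.indices.create(index="images", body={"mappings": {"properties": {
    "path": {"type": "keyword"},
    "embedding": {"type": "dense_vector", "dims": 512},
}}}, ignore=400)

# Indexing: encode an image and store its normalized embedding.
image = preprocess(Image.open("storage/dog.jpg")).unsqueeze(0).to(device)
with torch.no_grad():
    img_emb = model.encode_image(image)
img_emb = img_emb / img_emb.norm(dim=-1, keepdim=True)
es.index(index="images", body={"path": "storage/dog.jpg",
                               "embedding": img_emb[0].tolist()})

# Searching: encode the text prompt and rank images by cosine similarity.
tokens = clip.tokenize(["dog"]).to(device)
with torch.no_grad():
    txt_emb = model.encode_text(tokens)
txt_emb = txt_emb / txt_emb.norm(dim=-1, keepdim=True)
result = es.search(index="images", body={"query": {"script_score": {
    "query": {"match_all": {}},
    "script": {
        "source": "cosineSimilarity(params.q, 'embedding') + 1.0",
        "params": {"q": txt_emb[0].tolist()},
    },
}}})
print([hit["_source"]["path"] for hit in result["hits"]["hits"]])
```

The `+ 1.0` offset only keeps Elasticsearch scores non-negative; the ranking is the plain cosine similarity.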
Demo: `demo_vid.mp4`
## Set up the image directory
We must point the app to the image directory we want to index.
- Create a directory in the root of the repo named `storage`.
- Add any images you want to index and search over (`jpeg`, `jpg`, `png`).
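If you want a quick sanity check that your images will be picked up, a standalone snippet like the one below (not part of the app) lists the matching files under `storage`:

```python
# Standalone sanity check: list the files under ./storage with a supported extension.
from pathlib import Path

SUPPORTED = {".jpeg", ".jpg", ".png"}
images = [p for p in Path("storage").rglob("*") if p.suffix.lower() in SUPPORTED]
print(f"Found {len(images)} images, e.g. {images[:3]}")
```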
There are two possible ways of installing the app:
## Option 1: Docker Compose installation
- Make sure that in `backend/configs/config.yaml`, `Elastic.host` is set to `"elastic"`.
- Run `docker-compose up --build` in the repo root. (This takes ~5 minutes and creates a ~7 GB image.)
Since I have not implemented a frontend, we will use it as a command-line app.
- Run `docker ps` and note the container ID of the CLIP image (NOT the Elasticsearch one).
- Run `docker exec -it CONTAINER_ID /bin/bash`.
- In the container, run `python cmd.py`. You will have to wait for the model to be downloaded.
## Option 2: Local installation
- Install PyTorch using conda. I suggest following the installation procedure in the official docs.
- Install the remaining requirements with `pip install -r requirements.txt`.
- Make sure that in `backend/configs/config.yaml`, `Elastic.host` is set to `"localhost"`.
- Set the image directory path in `backend/configs/config.yaml`.
- Start the Elasticsearch container: `docker run -p 9200:9200 -p 9300:9300 -e "discovery.type=single-node" docker.elastic.co/elasticsearch/elasticsearch:7.14.0`
- Run `python backend/cmd.py`.
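Before running `backend/cmd.py`, you can optionally check that the single-node Elasticsearch container is reachable; this is just an illustrative snippet, not part of the repo:

```python
# Optional check that Elasticsearch is up on localhost:9200.
from elasticsearch import Elasticsearch

es = Elasticsearch("http://localhost:9200")
print(es.ping())                       # True if the node is reachable
print(es.info()["version"]["number"])  # should print 7.14.x
```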
## Limitations and TODOs
- An image cannot be added to the Elasticsearch index without recalculating all of the image embeddings.
- The Docker image is relatively big (~7 GB) since it includes the CUDA toolkit; there is no CPU-only option.
- No Elasticsearch persistence, which could easily be added through an Elasticsearch data volume.
- A web frontend instead of the current command-line utility.
- Currently, when initializing, we calculate each image embedding sequentially; batch encoding should be added to speed up the process (see the sketch below).
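For the last point, a batched encoder could look roughly like this; it is an assumption about how the TODO might be implemented, not existing repo code:

```python
# Sketch of batch image encoding: preprocess and encode images in chunks
# instead of one by one, which makes much better use of the GPU.
import clip
import torch
from PIL import Image

device = "cuda" if torch.cuda.is_available() else "cpu"
model, preprocess = clip.load("ViT-B/32", device=device)

def encode_images(paths, batch_size=32):
    embeddings = []
    for i in range(0, len(paths), batch_size):
        batch = torch.stack([preprocess(Image.open(p)) for p in paths[i:i + batch_size]])
        with torch.no_grad():
            emb = model.encode_image(batch.to(device))
        embeddings.append(emb / emb.norm(dim=-1, keepdim=True))
    return torch.cat(embeddings)
```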