The methods mentioned in most of the DSPy examples that use `ChromadbRM`, `ColBERTv2`, `MilvusRM`, `QdrantRM`, `WeaviateRM`, etc. fail to consider the points below:
- Not everybody is a seasoned programmer who can spin up local databases and web servers on a whim!
- LLM != `GPT-4` or `GPT-3.5` or `Grok` or `Claude` or any other big-shot, expensive, money-burning external API key that burns a big hole in a Dev's pocket and morale. I wholeheartedly respect and love all the investors and scientists who made it possible to develop and use all these great LLMs. But being a biologist who turned himself from coding DNA to coding LLMs, I believe Life (and a pair of Kanjoose Bengali Engineers 😉) always finds a way! (To get away with doing things easily and for free 😝)
- Sorry for diverting into the cavernous side alleys of insomniac thoughts.
- Another issue is that not everyone uses Python with OOP, and this can be a big OOPs!! for a lot of newcomers (= `nc`).

Well well, I am a member of `nc` myself (with a history of making complex programs in Blender's Geometry Nodes), and I have been blessed with guides, friends, and YouTubers like Arnav Singhvi & Herumb Shandilya from DSPy, Connor Shorten from Weaviate, and Tamoghna Das, my college senior who brought me into this world of LLMs in November 2022. And last but not least, Perplexity, for enabling millions like me who are a subset of `nc` to code without fear of syntax.
- Studied the Python library provided by Ollama and made an embedder:retriever function pair.
- Ran local Ollama and ChromaDB servers from a given directory, at the URLs http://localhost:11434/ and http://localhost:8000/ respectively.
- Used NLTK's `sent_tokenize` to chunk the text into sentences that are fed to ChromaDB.
- Made an Ollama-based LLM run in DSPy.
- Created the default `GenerateAnswer` and `GenerateSearchQuery` `Signature`s for RAG (never used `GenerateSearchQuery`, being a noob myself 😝).
- Created a `markdown` file that holds the data I am doing `RAG` on (in a later example I will show you how to do RAG on web search data).
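
The chunk, embed, store, and retrieve steps listed above could be sketched roughly like this. It is only a minimal sketch and not the exact code from this project: the function names, the embedding model `nomic-embed-text`, and the collection name `rag_notes` are all assumptions made for illustration, and the two servers are assumed to be running at the URLs given above. Plain functions are used on purpose, so no OOP is needed.

```python
from typing import Callable, List, Sequence

# Assumption: any embedding model already pulled into Ollama would work here.
OLLAMA_EMBED_MODEL = "nomic-embed-text"

Embedder = Callable[[str], List[float]]


def chunk_sentences(text: str) -> List[str]:
    """Split text into sentence-sized chunks with NLTK's sent_tokenize."""
    # Imported lazily; NLTK also needs its sentence tokenizer data downloaded once.
    from nltk.tokenize import sent_tokenize
    return [s.strip() for s in sent_tokenize(text) if s.strip()]


def ollama_embedder(text: str) -> List[float]:
    """Embed one string via the local Ollama server (http://localhost:11434/)."""
    import ollama  # the Python library provided by Ollama
    response = ollama.embeddings(model=OLLAMA_EMBED_MODEL, prompt=text)
    return list(response["embedding"])


def get_collection(name: str = "rag_notes"):
    """Open (or create) a collection on the local ChromaDB server (http://localhost:8000/)."""
    import chromadb
    client = chromadb.HttpClient(host="localhost", port=8000)
    return client.get_or_create_collection(name=name)


def index_chunks(collection, chunks: Sequence[str], embed: Embedder) -> None:
    """Store every chunk together with its embedding."""
    if not chunks:
        return  # nothing to add
    collection.add(
        ids=[f"chunk-{i}" for i in range(len(chunks))],
        documents=list(chunks),
        embeddings=[embed(chunk) for chunk in chunks],
    )


def retrieve(collection, query: str, embed: Embedder, k: int = 3) -> List[str]:
    """Return the k stored chunks closest to the query."""
    result = collection.query(query_embeddings=[embed(query)], n_results=k)
    return result["documents"][0]  # one result list per query; we sent one query
```

With both servers up, something like `col = get_collection()`, then `index_chunks(col, chunk_sentences(open("data.md").read()), ollama_embedder)`, then `retrieve(col, "your question", ollama_embedder)` should hand back the closest sentences (`data.md` is a hypothetical file name). Passing the embedder in as an argument keeps the pair easy to swap or to try out without a server.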