RAG---LLM-with-custom-Data

This repository contains implementations of the Retrieval-Augmented Generation (RAG) Architecture, leveraging various frameworks and tools for efficient retrieval and generation tasks. RAG Architecture combines the power of vectorized retrieval-based methods with the flexibility of generative models(like chatGPT,etc), offering enhanced capabilities for tasks such as question answering and text summarization.

Basic about RAG:

This architecture used pre-defined custom data split into small chunks and stored in Vector DB. When user asks a question, it goes through vector data base and uses similarity search (cosine similarity) to retrieved the relevant chunks. It then forwards those chunks to the LLM. The LLM combines the user's query with the extracted context to generate answer to the user's query using the custom data provided.

Files:

RAG_Langchain.ipynb

This Jupyter Notebook implements the RAG Architecture using the Langchain framework. The ipynb format is used to make the understanding of RAG Implementation as easy as possible. The following components are utilized:

Langchain Framework: Langchain facilitates the implementation of RAG Architecture, enabling efficient interaction between retrieval and generation modules.
Faiss: Faiss is employed for Vector DB, allowing fast and scalable similarity search for retrieving relevant information.
PyPDF: PyPDF is utilized for extracting text from PDF documents, enabling the system to process a wide range of document formats.
Sentence Transformer: Sentence Transformer is used for generating vector embeddings of text, facilitating similarity calculations and retrieval tasks.
Free ChatGPT API: The system leverages the Free ChatGPT API from RAPID API.

RAG_with_Rerank.ipynb

Sometimes the simple RAG arcitecture is not able to retrieve apt chunks to be passed into LLM. So an additional retrieval layer is implemented to improve retrieval accuracy. In this notebook we use Flashrank for reranking which is based on cross-encoders and is open sourced. In this notebook we encounter the problem when the required chunk is not extracted properly using the RAG Architecture. We then use Flashrank to state this issue.

FlashRank: FlashRank (https://pypi.org/project/FlashRank/) is utilized for topic ReRanking, enhancing retrieval results by leveraging cross-encoder techniques.

Document:

A sample document is used with the name 'Doc1.pdf' downloaded from this website : https://cartographicperspectives.org/index.php/journal/article/view/cp13-full/pdf.

Modules/Packages

Langchain: https://python.langchain.com/docs/get_started/introduction/
Faiss (Vector DB): (https://python.langchain.com/docs/integrations/vectorstores/faiss/)
PyPDF: (https://pypi.org/project/pypdf/)
Sentence Transformer: https://python.langchain.com/docs/integrations/text_embedding/sentence_transformers/
Free ChatGPT API(RapidAPI hub): (https://rapidapi.com/haxednet/api/chatgpt-api8)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Doc1.pdf		Doc1.pdf
RAG__Langchain.ipynb		RAG__Langchain.ipynb
RAG_with_ReRank.ipynb		RAG_with_ReRank.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAG---LLM-with-custom-Data

Basic about RAG:

Files:

RAG_Langchain.ipynb

RAG_with_Rerank.ipynb

Document:

Modules/Packages

About

Releases

Packages

Languages

sidd-tech/RAG---LLM-with-custom-Data

Folders and files

Latest commit

History

Repository files navigation

RAG---LLM-with-custom-Data

Basic about RAG:

Files:

RAG_Langchain.ipynb

RAG_with_Rerank.ipynb

Document:

Modules/Packages

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages