regenerate-embeddings

GitHub Action to regenerate OpenAI word embeddings and store them in a Supabase vector store via LangChain. Useful if you have a retrieval-augmented generation (RAG) system and want to update the word embeddings automatically when the knowledge base changes.

Inputs

`github-personal-access-token`

Required Github personal access token

`openai-api-key`

Required OpenAI API key

`supabase-anon-key`

Required Supabase anon key

`supabase-url`

Required Supabase url

`repository-owner-username`

Required GitHub username of the repository owner

`repository-name`

Required Name of the repository

`path-to-contents`

Required Path to the directory containing notes content relative to the root path

`directory-structure`

Required Either nested or flat

nested: path-to-contents points to a list of directories

flat: path-to-contents points to a list of files

Note: please have github-personal-access-token, openai-api-key, supabase-anon-key and supabase-url defined as environment variables. See the section below

Example usage

On the GitHub repository you're adding this action to, go to Settings > Environments and create a new environment called Dev
Add environment variables to the Dev environment by following these instructions
Create a .github/workflows directory in the root of the project
In .github/workflows, create a file called regenerate-embeddings.yml
Copy the following YAML into regenerate-embeddings.yml

Reference

name: Regenerate embeddings
run-name: Regenerate embeddings and store in Supabase
on: [push]
jobs:
  regenerate-embeddings:
    runs-on: ubuntu-latest
    environment: Dev
    steps:
      - name: Regenerate embeddings (flat notes)
        uses: K02D/regenerate-embeddings@v2.3
        with:
          repository-owner-username: "K02D"
          repository-name: "retrieval-augmented-generation"
          path-to-contents: "notes_flat"
          directory-structure: "flat"
          github-personal-access-token: ${{ secrets.GH_PERSONAL_ACCESS_TOKEN }}
          openai-api-key: ${{ secrets.OPENAI_API_KEY }}
          supabase-anon-key: ${{ secrets.SUPABASE_ANON_KEY }}
          supabase-url: ${{ secrets.SUPABASE_URL }}

This YAML

Assumes the environment variables added in step 2 are named GH_PERSONAL_ACCESS_TOKEN, OPENAI_API_KEY, SUPABASE_ANON_KEY, and SUPABASE_URL
Triggers the action on every push to the main branch

Pre-requisites

Create an OpenAI API key here if you don't have one. Use this for OPENAI_API_KEY
- OpenAI's API is used to generate the word embeddings
Create a supabase project here if you don't have one. Once created, go to Project Settings > API to get the project URL and anon api key. Use these for SUPABASE_URL and SUPABASE_ANON_KEY
- Supabase is used to store the word embeddings in a postgres vector database so relevant content is retrieved when a user enters a prompt. This relevant content augments the LLM's response
Initialize your database in your supabase project using LangChain's template (ref). On your project dashboard, go to SQL Editor > Quickstarts > LangChain and click RUN

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
dist		dist
.gitignore		.gitignore
README.md		README.md
action.yaml		action.yaml
client.ts		client.ts
index.ts		index.ts
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

regenerate-embeddings

Inputs

`github-personal-access-token`

`openai-api-key`

`supabase-anon-key`

`supabase-url`

`repository-owner-username`

`repository-name`

`path-to-contents`

`directory-structure`

Example usage

Pre-requisites

About

Releases 2

Packages

Languages

K02D/regenerate-embeddings

Folders and files

Latest commit

History

Repository files navigation

regenerate-embeddings

Inputs

github-personal-access-token

openai-api-key

supabase-anon-key

supabase-url

repository-owner-username

repository-name

path-to-contents

directory-structure

Example usage

Pre-requisites

About

Topics

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

`github-personal-access-token`

`openai-api-key`

`supabase-anon-key`

`supabase-url`

`repository-owner-username`

`repository-name`

`path-to-contents`

`directory-structure`

Packages