GitHub - fgtrzah/llmrfcpoc: combating the LLM FOMO, feeding the shiny object syndrome, for folly and partially for curiosity

Overview

The work-in-progress backend for the rfcindex frontend. RFC Index is a simple PoC of a system that leverages the IETF Datatracker API and tooling to facilitate keyword search over 9000+ RFC documents. At its core it explores basic use cases of prompt engineering, and it might expand in the future. Time permitting, a full roadmap or a detailed series might document its evolution as I think of use cases or useful functionality. The tl;dr: this is a gentle primer on using commonly known LLM APIs to facilitate exploration and comprehension of RFC documents.

Aside: definitely not suitable as a boilerplate or starting point for production GenAI pipelines or projects, so fork at your own peril / discretion.

Requirements

OpenAI API Key
GitHub App Registration

Optional

Auth0 configuration (this is easily removable and not central to most of the endpoints)

Setup & local development

You can run local builds with most Python virtual environment tools, plus a .env file that supplies the necessary secrets and config params.

mkcert localhost 127.0.0.1 (optional, but this project assumes HTTPS)
cp .env.example .env, then step through it and substitute values as needed. The settings module under config/settings.py is self-documenting and indicates which environment variables are required and which are optional
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt
uvicorn main:app --reload, or configure uvicorn programmatically inside an if __name__ == "__main__" block in main.py and then run python main.py
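
For the programmatic route, here is a minimal sketch of what that block could look like. It is not the code that ships in main.py. The helper names (uvicorn_options, serve) and the HOST / PORT variable names are assumptions made for illustration. The certificate file names are the defaults that mkcert localhost 127.0.0.1 writes to the current directory.

```python
import os
from collections.abc import Mapping
from pathlib import Path


def uvicorn_options(cert_dir: Path = Path("."), env: Mapping[str, str] = os.environ) -> dict:
    """Build keyword arguments for uvicorn.run() (hypothetical helper)."""
    options = {
        # HOST and PORT are assumed variable names, not taken from .env.example.
        "host": env.get("HOST", "127.0.0.1"),
        "port": int(env.get("PORT", "8000")),
        "reload": True,
    }
    # Default output files of `mkcert localhost 127.0.0.1`.
    cert = cert_dir / "localhost+1.pem"
    key = cert_dir / "localhost+1-key.pem"
    if cert.exists() and key.exists():
        # Serve over HTTPS only when both files are present.
        options["ssl_certfile"] = str(cert)
        options["ssl_keyfile"] = str(key)
    return options


def serve() -> None:
    """Start the app; uvicorn is imported lazily so the helper above has no hard dependency."""
    import uvicorn

    # The import string "main:app" is required for reload=True to work.
    uvicorn.run("main:app", **uvicorn_options())


# In main.py the guard would then be:
#     if __name__ == "__main__":
#         serve()
```

With this in place, python main.py falls back to plain HTTP when no mkcert files are found, which keeps the optional step optional.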

Credits / Inspo

instructor
whoever wrote the first RFC (idrk)
whoever decided how they'd be formatted (chef's kiss)

Roadmap

moderation API endpoint usage for taming hallucinations
monitoring the egress/ingress points in the code that sends OpenAI requests and handles API responses
hard requirements: trace all data flow to OpenAI whois info and stow it in SQLite, linked to the currently authenticated user's id
toxicity detection and phonetic vulgarity filtering
controlled chaos prevention
draft a ToS that is succinct and clearly emphasizes all data that is idempotently associable with the current user; ensure zero chance of liability and full transparency
trust but verify exhaustively
sanitize inputs (utilize a moderation DTO layer for the no-brainer injection prevention needs)
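
None of the above exists yet, but the input sanitizing and SQLite tracing items could start from something like the sketch below. Everything in it is hypothetical: the PromptDTO class, the openai_audit table, its columns, and the MAX_PROMPT_CHARS limit are placeholders, and it uses only the standard library rather than any moderation API.

```python
import sqlite3
import unicodedata
from dataclasses import dataclass
from datetime import datetime, timezone

MAX_PROMPT_CHARS = 2000  # assumed limit, pick whatever fits the endpoint


@dataclass(frozen=True)
class PromptDTO:
    """Hypothetical DTO: validates user input before it goes anywhere near OpenAI."""

    user_id: str
    text: str

    def __post_init__(self) -> None:
        # Drop control characters (Unicode category "Cc") except newline and tab, then trim.
        cleaned = "".join(
            ch for ch in self.text
            if ch in "\n\t" or unicodedata.category(ch) != "Cc"
        ).strip()
        if not cleaned:
            raise ValueError("prompt is empty after sanitizing")
        if len(cleaned) > MAX_PROMPT_CHARS:
            raise ValueError("prompt exceeds MAX_PROMPT_CHARS")
        # The dataclass is frozen, so bypass the generated __setattr__ once.
        object.__setattr__(self, "text", cleaned)


def open_audit_log(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the audit database; the schema is an assumption."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS openai_audit (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            user_id TEXT NOT NULL,
            endpoint TEXT NOT NULL,
            prompt TEXT NOT NULL,
            created_at TEXT NOT NULL
        )
        """
    )
    return conn


def record_request(conn: sqlite3.Connection, dto: PromptDTO, endpoint: str) -> int:
    """Store one outbound request, linked to the authenticated user's id."""
    cur = conn.execute(
        # Parameterized query: user text is never spliced into the SQL string.
        "INSERT INTO openai_audit (user_id, endpoint, prompt, created_at) VALUES (?, ?, ?, ?)",
        (dto.user_id, endpoint, dto.text, datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()
    return cur.lastrowid
```

A request handler would build the DTO first, call record_request, and only then forward dto.text to OpenAI, so every outbound prompt has a row tied to a user id.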

MISC

some fake keys can be found in LFS: I was testing code analysis with CodeQL to come up with a sensible security policy and bolster code quality
feedback and PRs are always welcome
