GitHub - jSwords91/ai-vision: A small project showcasing AI vision capabilities using your webcam

AI Vision

A project to showcase various AI vision capabilites.

Included:

Image captioning based on your webcam
- Live
This is a real-time scene description. The main challenge here, aside from the engineering, is the UX. How do you transcribe a scene in real-time and provide a decent UX? Open question...
- Snapshot
This describes the scene at a single point in time
Image Generation
- Through prompting i.e. "Generate an image of..."
- Based off your webcam

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
__pycache__		__pycache__
backend		backend
frontend		frontend
images		images
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt