gpt2-tensorflow-localchat
is a simple CLI chat-mode framework for Python, built for running GPT-2 models locally with TensorFlow. The tool provides an easy way to interact with GPT-2 models, fine-tune them on custom datasets, or use them in unique, real-time applications.
Explore more about this project and its developments on GitHub: gpt2-tensorflow-localchat
- CLI-based interaction with GPT-2 models.
- Local deployment of TensorFlow models for privacy and control.
- Support for multiple command scripts to demonstrate various capabilities.
├── .gitignore
├── README.md
└── src/
    ├── Model-Battle.py       # Battle between models: experimental feature
    ├── Model-Localtalk.py    # Main script for local chat interactions
    ├── encoder.py            # Manages text encoding and decoding
    ├── model.py              # Core TensorFlow model definitions
    ├── olddemo.py            # Old demonstration scripts for reference
    ├── sample.py             # Sampling utilities for generating text
    └── start_localtalk.sh    # Script to start the local chat environment (*)
(*) The purpose of the bash script is to keep TensorFlow's log output from interfering with the CLI output.
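As a rough illustration of what such a wrapper does (this is a hedged sketch, not the actual contents of start_localtalk.sh), the key step is setting TF_CPP_MIN_LOG_LEVEL before launching the Python script:

```shell
#!/usr/bin/env bash
# Hypothetical sketch of a launcher like start_localtalk.sh.
# Silences TensorFlow's C++ logging so it doesn't clutter the chat CLI:
# 0 = all logs, 1 = hide INFO, 2 = hide WARNING, 3 = errors only
export TF_CPP_MIN_LOG_LEVEL=3

MAIN="src/Model-Localtalk.py"   # path assumed from the project tree above
# Launch the chat script only if it exists, so the sketch is safe to run
if [ -f "$MAIN" ]; then
  python3 "$MAIN" "$@"
fi
```

The real script may set additional variables or activate a virtual environment; the logging level is the essential part.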
- Python 3.x
- TensorFlow 1.15 or higher (with compat.v1 API support)
- An environment supporting bash scripts (Linux/Unix)
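A quick way to verify these prerequisites is a small check script. This is an illustrative sketch (the helper names are my own, not part of the project):

```python
import sys

# Check that the interpreter is Python 3.x
def check_python(min_major=3):
    return sys.version_info.major >= min_major

# Check that TensorFlow is installed and exposes the compat.v1 API
# (present in TF 1.15+ and all TF 2.x releases)
def check_tensorflow():
    try:
        import tensorflow as tf
        return hasattr(tf, "compat") and hasattr(tf.compat, "v1")
    except ImportError:
        return False

print("Python 3.x:       ", check_python())
print("TF with compat.v1:", check_tensorflow())
```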
- Clone the repository:
git clone https://github.com/FlyingFathead/gpt2-tensorflow-localchat.git
- Navigate into the project directory:
cd gpt2-tensorflow-localchat
- Install required Python libraries:
pip install -r requirements.txt
(Note: requirements.txt is not included yet. Use your own local TF model files.)
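Until requirements.txt is added, a plausible sketch based on the prerequisites and changelog above (TensorFlow with compat.v1, GPUtil for GPU selection) might look like this; treat the exact pins as assumptions:

```
tensorflow>=1.15
GPUtil
numpy
```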
To start a local chat with the model:
./src/start_localtalk.sh
This script sets the appropriate TensorFlow logging level and starts an interactive chat session using Model-Localtalk.py.
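At its core, a chat session like this is a read-generate-print loop. The sketch below shows the general shape, with a placeholder standing in for the actual GPT-2 sampling call; none of these names come from the project itself:

```python
# Minimal sketch of a CLI chat loop in the style of Model-Localtalk.py.
# generate_reply is a stand-in: the real script would encode the history,
# run the TensorFlow model, and decode the sampled tokens.
def generate_reply(history):
    return "(model reply)"  # placeholder output

def chat_loop(get_input, max_turns=None):
    history = []  # list of (role, text) pairs forming the context
    turns = 0
    while max_turns is None or turns < max_turns:
        user = get_input()
        if user in ("/quit", None):  # exit command or closed input
            break
        history.append(("User", user))
        reply = generate_reply(history)
        history.append(("Model", reply))
        print("Model:", reply)
        turns += 1
    return history

if __name__ == "__main__":
    # Scripted demo instead of reading from stdin
    scripted = iter(["hello", "/quit"])
    chat_loop(lambda: next(scripted))
```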
v0.17
- local chat now uses GPUtil to look for the best available CUDA GPU

v0.16
- /clear to clear out the context memory

v0.15
- bugfixes, /swap for role-swapping between the user and the model

v0.10
- initial commit
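To illustrate what commands like /clear and /swap from the changelog do conceptually, here is a hedged sketch of a command dispatcher; the function and the history representation are assumptions for illustration, not the project's actual code:

```python
# Hypothetical dispatcher for in-chat commands. `state["history"]` is
# assumed to be a list of (role, text) pairs making up the context memory.
def handle_command(cmd, state):
    if cmd == "/clear":
        state["history"] = []  # wipe the context memory
        return "context cleared"
    if cmd == "/swap":
        # swap User/Model roles throughout the stored history
        state["history"] = [
            ("Model" if role == "User" else "User", text)
            for role, text in state["history"]
        ]
        return "roles swapped"
    return None  # not a command; treat as normal chat input
```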
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
Distributed under the MIT License. See LICENSE for more information. Parts of the model-loading code have been forked from OpenAI's GPT-2 source code.
- Project Link: https://github.com/FlyingFathead/gpt2-tensorflow-localchat
- Project Creator: Flyingfathead on GitHub