michaelnny

Michael Hu michaelnny

Achievements

Llama3-FunctionCalling Llama3-FunctionCalling Public

Fine-tune Llama3 model to support function calling

Jupyter Notebook 25 1
InstructLLaMA InstructLLaMA Public

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to Instru…

Jupyter Notebook 44 9
RAG-LLaMA RAG-LLaMA Public archive

A clean and simple implementation of Retrieval Augmented Generation (RAG) to enhanced LLaMA chat model to answer questions from a private knowledge base. We use Tesla user manuals to build the know…

Jupyter Notebook 3 1
alpha_zero alpha_zero Public

A PyTorch implementation of DeepMind's AlphaZero agent to play Go and Gomoku board games

Python 79 18
muzero muzero Public archive

A PyTorch implementation of DeepMind's MuZero agent

Python 27 3
deep_rl_zoo deep_rl_zoo Public archive

A collection of Deep Reinforcement Learning algorithms implemented with PyTorch to solve Atari games and classic control tasks like CartPole, LunarLander, and MountainCar.

Python 104 11