name | description | authors | links | colaboratory | update |
---|---|---|---|---|---|
YOLOv5 | You Only Look Once | Glenn Jocher |
|
31.07.2022 | |
Anycost GAN | Interactive natural image editing |
|
20.07.2022 | ||
Disco Diffusion | A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations | 14.07.2022 | |||
Dream Fields | Zero-Shot Text-Guided Object Generation |
|
13.07.2022 | ||
GFPGAN | Towards Real-World Blind Face Restoration with Generative Facial Prior |
|
13.07.2022 | ||
Make-A-Scene | Scene-Based Text-to-Image Generation with Human Priors | 01.07.2022 | |||
DALL·E Mini | Generate images from a text prompt | 29.06.2022 | |||
OPT | Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet |
|
29.06.2022 | ||
GPEN | GAN Prior Embedded Network for Blind Face Restoration in the Wild |
|
26.06.2022 | ||
HuggingArtists | Choose your favorite Artist and train a language model to write new lyrics based on their unique voice | Aleksey Korshuk | 25.06.2022 | ||
Customizing a Transformer Encoder | We will learn how to customize the encoder to employ new network architectures | Chen Chen | 22.06.2022 | ||
MTTR | End-to-End Referring Video Object Segmentation with Multimodal Transformers | 20.06.2022 | |||
SwinIR | Image Restoration Using Swin Transformer | 17.06.2022 | |||
VRT | A Video Restoration Transformer | 15.06.2022 | |||
Detic | Detecting Twenty-thousand Classes using Image-level Supervision | 07.06.2022 | |||
AMARETTO | Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease | 01.06.2022 | |||
T0 | Multitask Prompted Training Enables Zero-Shot Task Generalization |
|
29.05.2022 | ||
LaMa | Resolution-robust Large Mask Inpainting with Fourier Convolutions |
|
25.05.2022 | ||
StyleGAN-NADA | Zero-Shot non-adversarial domain adaptation of pre-trained generators |
|
20.05.2022 | ||
Parallel WaveGAN | State-of-the-art non-autoregressive models to build your own great vocoder | Tomoki Hayashi |
|
16.05.2022 | |
Text2Mesh | Text-Driven Neural Stylization for Meshes | 14.05.2022 | |||
T5 | Text-To-Text Transfer Transformer | 11.05.2022 | |||
XLS-R | Self-supervised Cross-lingual Speech Representation Learning at Scale | 10.05.2022 | |||
CLIPDraw | Synthesize drawings to match a text prompt |
|
28.04.2022 | ||
Real-ESRGAN | Extend the powerful ESRGAN to a practical restoration application, which is trained with pure synthetic data | 24.04.2022 | |||
FILM | A frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion | 07.04.2022 | |||
Deep Painterly Harmonization | Algorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve | 07.04.2022 | |||
LDM | High-Resolution Image Synthesis with Latent Diffusion Models | 04.04.2022 | |||
Demucs | Hybrid Spectrogram and Waveform Source Separation | Alexandre Défossez | 23.03.2022 | ||
CLIPasso | Semantically-Aware Object Sketching | 21.03.2022 | |||
AlphaFold | Highly accurate protein structure prediction | 16.03.2022 | |||
VideoGPT | A conceptually simple architecture for scaling likelihood based generative modeling to natural videos | 02.03.2022 | |||
Disentangled Lifespan Face Synthesis | LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively | 22.02.2022 | |||
ArcaneGAN | Process video in the style of the Arcane animated series | Alexander Spirin | 17.02.2022 | ||
Mask2Former | Masked-attention Mask Transformer for Universal Image Segmentation | 09.02.2022 | |||
SpecVQGAN | Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors |
|
03.02.2022 | ||
JoJoGAN | One Shot Face Stylization | 02.02.2022 | |||
DFL-Colab | This project provides you IPython Notebook to use DeepFaceLab | chervonij | 20.01.2022 | ||
Pose with Style | Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN |
|
19.01.2022 | ||
Taming Transformers for High-Resolution Image Synthesis | We combine the efficiancy of convolutional approaches with the expressivity of transformers by introducing a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer | 13.01.2022 | |||
FuseDream | Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization | 02.01.2022 | |||
GLIDE | Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models | 22.12.2021 | |||
Music Composer | Synthesizing symbolic music in MIDI format using the Music Transformer model | bazanovvanya | 20.12.2021 | ||
encoder4editing | Designing an Encoder for StyleGAN Image Manipulation | 02.12.2021 | |||
StyleCariGAN | Caricature Generation via StyleGAN Feature Map Modulation |
|
30.11.2021 | ||
CartoonGAN | The implementation of the cartoon GAN model with PyTorch | Tobias Sunderdiek | 24.11.2021 | ||
SimSwap | An efficient framework, called Simple Swap, aiming for generalized and high fidelity face swapping | 24.11.2021 | |||
RVM | Robust High-Resolution Video Matting with Temporal Guidance |
|
24.11.2021 | ||
AnimeGANv2 | An improved version of AnimeGAN - it prevents the generation of high-frequency artifacts by simply changing the normalization of features in the network |
|
17.11.2021 | ||
YOLOv3 | You Only Look Once | Glenn Jocher |
|
14.11.2021 | |
SOAT | StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN | 13.11.2021 | |||
Arnheim | Generative Art Using Neural Visual Grammars and Dual Encoders | 11.11.2021 | |||
StyleGAN 2 | Generation of faces, cars, etc. | Mikael Christensen | 05.11.2021 | ||
ruDALL-E | Generate images from texts in Russian | Alex Shonenkov |
|
03.11.2021 | |
ByteTrack | Multi-Object Tracking by Associating Every Detection Box | 30.10.2021 | |||
StyleGAN3 | Alias-Free Generative Adversarial Networks |
|
19.10.2021 | ||
GPT-2 | Retrain an advanced text generating neural network on any text dataset using gpt-2-simple! | Max Woolf | 18.10.2021 | ||
IC-GAN | Instance-Conditioned GAN |
|
01.10.2021 | ||
Skillful Precipitation Nowcasting Using Deep Generative Models of Radar | Open-sourced dataset and model snapshot for precipitation nowcasting | 29.09.2021 | |||
Text2Animation | Generate images from text phrases with VQGAN and CLIP with animation and keyframes | 29.09.2021 | |||
Live Speech Portraits | Real-Time Photorealistic Talking-Head Animation |
|
26.09.2021 | ||
Open-Unmix | A deep neural network reference implementation for music source separation, applicable for researchers, audio engineers and artists | 23.07.2021 | |||
textgenrnn | Generate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexity | Max Woolf | 13.07.2021 | ||
First Order Motion Model for Image Animation | Transferring facial movements from video to image | Aliaksandr Siarohin | 30.06.2021 | ||
TediGAN | Framework for multi-modal image generation and manipulation with textual descriptions | 30.06.2021 | |||
GANs N' Roses | Stable, Controllable, Diverse Image to Image Translation | 19.06.2021 | |||
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes | A method to stylize images by optimizing parameterized brushstrokes instead of pixels | 02.06.2021 | |||
Pixel2Style2Pixel | Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation |
|
01.06.2021 | ||
Fine-tuning a BERT | We will work through fine-tuning a BERT model using the tensorflow-models PIP package | 24.05.2021 | |||
ReStyle | A Residual-Based StyleGAN Encoder via Iterative Refinement |
|
21.05.2021 | ||
Motion Representations for Articulated Animation | Novel motion representations for animating articulated objects consisting of distinct parts | 29.04.2021 | |||
SAM | Age Transformation Using a Style-Based Regression Model |
|
26.04.2021 | ||
SkinDeep | Remove Body Tattoo Using Deep Learning | Vijish Madhavan | 24.04.2021 | ||
Geometry-Free View Synthesis | Is a geometric model required to synthesize novel views from a single image? |
|
22.04.2021 | ||
NeRViS | An algorithm for full-frame video stabilization by first estimating dense warp fields | 11.04.2021 | |||
NeX | View synthesis based on enhancements of multiplane image that can reproduce NeXt-level view-dependent effects in real time | 25.03.2021 | |||
Big Sleep | Text to image generation, using OpenAI's CLIP and a BigGAN | Phil Wang | 17.03.2021 | ||
Deep Daze | Text to image generation using OpenAI's CLIP and Siren | Phil Wang | 17.03.2021 | ||
Talking Head Anime from a Single Image | The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose | Pramook Khungurn |
|
23.02.2021 | |
Multitrack MusicVAE | The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord | 17.02.2021 | |||
NFNet | An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets | 17.02.2021 | |||
bsuite | A collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives |
|
13.02.2021 | ||
Wav2Lip | A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild | 12.02.2021 | |||
CLIP | A neural network which efficiently learns visual concepts from natural language supervision | 29.01.2021 | |||
Adversarial Patch | A method to create universal, robust, targeted adversarial image patches in the real world | Tom Brown | 27.01.2021 | ||
MSG-Net | Multi-style Generative Network with a novel Inspiration Layer, which retains the functionality of optimization-based approaches and has the fast speed of feed-forward networks | 25.01.2021 | |||
Toon-Me | A fun project to toon portrait images | Vijish Madhavan | 22.01.2021 | ||
Neural Style Transfer | Implementation of Neural Style Transfer in Keras 2.0+ | Somshubra Majumdar | 22.01.2021 | ||
SkyAR | A vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable styles | Zhengxia Zou | 18.01.2021 | ||
Big GAN | Large Scale GAN Training for High Fidelity Natural Image Synthesis | 12.01.2021 | |||
GrooVAE | Some applications of machine learning for generating and manipulating beats and drum performances | 08.01.2021 | |||
MusicXML Documentation | The goal of this notebook is to explore one of the magenta libraries for music | 08.01.2021 | |||
SVG VAE | A colab demo for the SVG VAE model | Raphael Gontijo Lopes | 08.01.2021 | ||
PIFuHD | Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization | 05.01.2021 | |||
Neural Magic Eye | Learning to See and Understand the Scene Behind an Autostereogram | 01.01.2021 | |||
Flow-edge Guided Video Completion | Method first extracts and completes motion edges, and then uses them to guide piecewise-smooth flow completion with sharp edges | 30.12.2020 | |||
ArtLine | A Deep Learning based project for creating line art portraits | Vijish Madhavan |
|
24.12.2020 | |
WikiArt (stylegan2-ada) | Generation of paintings of different styles and genres | Doron Adler | 08.12.2020 | ||
GANSpace | A simple technique to analyze GANs and create interpretable controls for image synthesis, such as change of viewpoint, aging, lighting, and time of day | 06.12.2020 | |||
SeFa | A closed-form approach for unsupervised latent semantic factorization in GANs | 06.12.2020 | |||
Stylized Neural Painting | An image-to-painting translation method that generates vivid and realistic painting artworks with controllable styles | 01.12.2020 | |||
DeOldify (video) | Colorize your own videos! | Jason Antic |
|
13.11.2020 | |
DeOldify (photo) | Colorize your own photos! |
|
13.11.2020 | ||
MakeItTalk | A method that generates expressive talking-head videos from a single facial image with audio as the only input | 10.11.2020 | |||
LaSAFT | Latent Source Attentive Frequency Transformation for Conditioned Source Separation | Woosung Choi | 01.11.2020 | ||
Lifespan Age Transformation Synthesis | Multi-domain image-to-image generative adversarial network architecture, whose learned latent space models a continuous bi-directional aging process |
|
31.10.2020 | ||
HiGAN | Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis |
|
14.10.2020 | ||
InterFaceGAN | Interpreting the Latent Space of GANs for Semantic Face Editing |
|
13.10.2020 | ||
Faceswap-GAN | A minimum demo for faceswap-GAN v2.2 | shaoanlu | 12.09.2020 | ||
Instance-aware Image Colorization | Novel deep learning framework to achieve instance-aware colorization | Jheng-Wei Su | 30.08.2020 | ||
Person Remover | Project that combines Pix2Pix and YOLO arhitectures in order to remove people or other objects from photos | 22.08.2020 | |||
Rewriting a Deep Generative Model | We ask if a deep network can be reprogrammed to follow different rules, by enabling a user to directly change the weights, instead of training with a data set |
|
31.07.2020 | ||
BERT score | An automatic evaluation metric for text generation | Tianyi Zhang | 17.07.2020 | ||
HiDT | A generative image-to-image model and a new upsampling scheme that allows to apply image translation at high resolution | 17.07.2020 | |||
Analyzing Tennis Serve | We'll use the Video Intelligence API to analyze a tennis serve, including the angle of the arms and legs during the serve | Dale Markowitz | 14.07.2020 | ||
SIREN | Implicit Neural Representations with Periodic Activation Functions | 24.06.2020 | |||
PIFu | Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization | 18.06.2020 | |||
3D Ken Burns | A reference implementation of 3D Ken Burns Effect from a Single Image using PyTorch - given a single input image, it animates this still image with a virtual camera scan and zoom subject to motion parallax | Manuel Romero | 13.06.2020 | ||
MusicVAE | A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music | 02.06.2020 | |||
Background Matting | The notebook is split into three parts: required setup, running the algorithm on photos, and running it on videos | Andrey Ryabtsev | 18.05.2020 | ||
Jukebox | A neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles | Christine Payne | 04.05.2020 | ||
3D Photo Inpainting | Method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view | 04.05.2020 | |||
Motion Supervised co-part Segmentation | A self-supervised deep learning method for co-part segmentation | 07.04.2020 | |||
RNN for Predictive Maintenance | LSTM network in order to predict remaining useful life of aircraft engines | Umberto Griffo | 03.04.2020 | ||
Onsets and Frames | Onsets and Frames is an automatic music transcription framework with piano and drums models | 02.04.2020 | |||
Classification of chest vs. adominal X-rays | The goal of this tutorial is to build a deep learning classifier to accurately differentiate between chest and abdominal X-rays | tmoneyx01 | 07.03.2020 | ||
Lung X-Rays Semantic Segmentation | This lesson applies a U-Net for Semantic Segmentation of the lung fields on chest x-rays | tmoneyx01 | 07.03.2020 | ||
WikiArt (stylegan2) | Generation of paintings of different styles and genres | Doron Adler | 27.01.2020 | ||
Earth Engine Python API and Folium Interactive Mapping | This notebook demonstrates how to setup the Earth Engine and provides several examples for visualizing Earth Engine processed data interactively using the folium library | Qiusheng Wu | 20.01.2020 | ||
Train a GPT-2 Model on Tweets | Train the model on your downloaded tweets, and generate massive amounts of Tweets from it | Max Woolf | 16.01.2020 | ||
Traffic counting | Making Road Traffic Counting App based on Computer Vision and OpenCV | Andrey Nikishaev | 10.01.2020 | ||
Siamese NN | Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task | Tomasz Latkowski |
|
19.12.2019 | |
Learning to Paint | Learning to Paint With Model-based Deep Reinforcement Learning | Manuel Romero | 17.12.2019 | ||
Imaging-AMARETTO | An imaging genomics software tool to systematically interrogate multi-omics networks for relevance to radiography and histopathology imaging biomarkers of clinical outcomes with application to studies of brain tumors | 29.11.2019 | |||
Face toolbox | A collection of deep learning frameworks ported to Keras for face detection, face segmentation, face parsing, iris detection, and face verification | shaoanlu | 03.10.2019 | ||
Generating Piano Music with Transformer | This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer | 16.09.2019 | |||
Few-shot face translation | A GAN based approach for one model to swap them all: model is capable of producing faces that has its gaze direction, glasses, and hiar occlusions being consistent with given source face | shaoanlu |
|
02.09.2019 | |
Waifu2x | This is the Google Colab implementation of tsurumeso's chainer implementation of waifu2x | Margesh Phirke | 23.08.2019 | ||
GMCNN | Generative Multi-column Convolutional Neural Networks inpainting model in Keras | Tomasz Latkowski | 09.08.2019 | ||
XLNet | Generalized Autoregressive Pretraining for Language Understanding | Zhilin Yang | 28.06.2019 | ||
Breast Cancer detection | A Neural Network for detecting breast cancer in cell scans! | Peter Teoh | blog post | 28.04.2019 | |
GazeML | Eye region landmarks detection | shaoanlu | 03.04.2019 | ||
BERT with TPU | Using a free Colab Cloud TPU to fine-tune sentence and sentence-pair classification tasks built on top of pretrained BERT models and run predictions on tuned model | Sourabh Bajaj | 29.03.2019 | ||
automl-gs on a TPU | Give an input CSV file and a target field you want to predict to automl-gs, and get a trained high-performing machine learning or deep learning model plus native Python code pipelines allowing you to integrate that model into any prediction workflow | Max Woolf | 26.03.2019 | ||
GANSynth | This notebook is a demo GANSynth, which generates audio with Generative Adversarial Networks | Jesse Engel | 25.02.2019 | ||
Edge Detection | Edge detection in OpenCV and skimage | Yuhuang Hu | 15.02.2019 | ||
BERT on TF Hub | Predicting Movie Review Sentiment with BERT on TF Hub | Dale Markowitz | 12.02.2019 | ||
Mask R-CNN | Code and visualizations to test, debug, and evaluate the Mask R-CNN model | 22.01.2019 | |||
RSNA Pneumonia Detection Challenge (Kaggel API) | The basics of parsing the competition dataset, training using a detector basd on the Mask-RCNN algorithm for object detection and instance segmentation | tmoneyx01 | 03.09.2018 | ||
HoF | This notebook will walk you step by step through the process of using a pre-trained model to detect faces in an image | Lucas Persona | 23.04.2018 | ||
Group Normalization | A simple alternative to BN. GN divides the channels into groups and computes within each group the mean and variance for normalization. | shaoanlu | 26.03.2018 | ||
Latent Constraints | Conditional Generation from Unconditional Generative Models | 27.11.2017 | |||
Performance RNN | This notebook shows you how to generate new performed compositions from a trained model | 11.07.2017 | |||
NSynth | This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them | 06.04.2017 |
name | description | authors | links | colaboratory | update |
---|---|---|---|---|---|
Hello, many worlds | This tutorial shows how a classical neural network can learn to correct qubit calibration errors | Michael Broughton | 28.07.2022 | ||
Diffusers | provides pretrained diffusion models across multiple modalities, such as vision and audio, and serves as a modular toolbox for inference and training of diffusion models | Hugging Face | 28.07.2022 | ||
Building Your Own Federated Learning Algorithm | We discuss how to implement federated learning algorithms without deferring to the tff.learning API | Zachary Charles | 27.07.2022 | ||
Federated Learning for Image Classification | We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow | Krzysztof Ostrowski |
|
27.07.2022 | |
Federated Learning for Text Generation | We start with a RNN that generates ASCII characters, and refine it via federated learning | Krzysztof Ostrowski | 27.07.2022 | ||
Custom Federated Algorithms, Part 1: Introduction to the Federated Core | This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer | Krzysztof Ostrowski | 27.07.2022 | ||
Custom Federated Algorithms, Part 2: Implementing Federated Averaging | This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer | Krzysztof Ostrowski | 27.07.2022 | ||
TFF for Federated Learning Research: Model and Update Compression | We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm | Weikang Song | 27.07.2022 | ||
High-performance simulations with TFF | This tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenarios | Krzysztof Ostrowski | 27.07.2022 | ||
High-performance Simulation with Kubernetes | This tutorial will describe how to set up high-performance simulation using a TFF runtime running on Kubernetes | Jason Roselander | 27.07.2022 | ||
Accelerate | A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision | Hugging Face | 27.07.2022 | ||
YOLOv5 on Custom Objects | This notebook shows training on your own custom objects | Jacob Solawetz | 20.07.2022 | ||
Word2Vec | Word2Vec is not a singular algorithm, rather, it is a family of model architectures and optimizations that can be used to learn word embeddings from large datasets | 19.07.2022 | |||
dm_control | DeepMind Infrastructure for Physics-Based Simulation |
|
18.07.2022 | ||
MuJoCo | A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment | 18.07.2022 | |||
Epistemic Neural Networks | A library for neural networks that know what they don't know | 12.07.2022 | |||
Integrated gradients | This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique |
|
30.06.2022 | ||
SberSwap | A new face swap method for image and video domains | 29.06.2022 | |||
BIG-bench | A collaborative benchmark intended to probe large language models and extrapolate their future capabilities |
|
27.06.2022 | ||
Neural style transfer | This tutorial uses deep learning to compose one image in the style of another image | Billy Lamberta | 26.06.2022 | ||
Introduction to the TensorFlow Models NLP library | You will learn how to build transformer-based models for common NLP tasks including pretraining, span labelling and classification using the building blocks from NLP modeling library | Chen Chen | 22.06.2022 | ||
Cirq | A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum circuits | 21.06.2022 | |||
Actor-Critic | This tutorial demonstrates how to implement the Actor-Critic method using TensorFlow to train an agent on the Open AI Gym CartPole-V0 environment | Mark Daoust |
|
09.06.2022 | |
Transfer learning and fine-tuning | You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network | François Chollet | 07.06.2022 | ||
CycleGAN | This notebook demonstrates unpaired image to image translation using conditional GAN's | Billy Lamberta | 07.06.2022 | ||
Image captioning | Given an image our goal is to generate a caption | Billy Lamberta | 07.06.2022 | ||
TorchGeo | PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data | 05.06.2022 | |||
Evidently | An open-source framework to evaluate, test and monitor ML models in production | 31.05.2022 | |||
Transformer | This tutorial trains a Transformer model to translate Portuguese to English | Billy Lamberta |
|
24.05.2022 | |
MyoSuite | A collection of musculoskeletal environments and tasks simulated with the MuJoCo physics engine and wrapped in the OpenAI gym API to enable the application of Machine Learning to bio-mechanic control problems | 23.05.2022 | |||
Detectron2 | FAIR's next-generation platform for object detection and segmentation | Yuxin Wu | 21.05.2022 | ||
The Autodiff Cookbook | You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics | 13.05.2022 | |||
Image segmentation | This tutorial focuses on the task of image segmentation, using a modified U-Net | Billy Lamberta | 09.05.2022 | ||
Text generation with RNN | This tutorial demonstrates how to generate text using a character-based RNN | Billy Lamberta | 02.05.2022 | ||
Simple audio recognition | This tutorial will show you how to build a basic speech recognition network that recognizes ten different words | 26.04.2022 | |||
Autoencoders | This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection | 05.04.2022 | |||
highway-env | A collection of environments for autonomous driving and tactical decision-making tasks | Edouard Leurent |
|
19.03.2022 | |
Text classification with RNN | This text classification tutorial trains a recurrent neural network on the IMDB large movie review dataset for sentiment analysis | Billy Lamberta | 17.03.2022 | ||
Real-Time Voice Cloning | SV2TTS with a vocoder that works in real-time | 07.03.2022 | |||
Silero Models | Pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple | Silero team | 27.02.2022 | ||
NeMo | A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis | 23.02.2022 | |||
Classify text with BERT | This tutorial contains complete code to fine-tune BERT to perform sentiment analysis on a dataset of plain-text IMDB movie reviews |
|
22.02.2022 | ||
NMT with attention | This notebook trains a seq2seq model for Spanish to English translation | Billy Lamberta |
|
22.02.2022 | |
GLUE using BERT on TPU | This tutorial contains complete end-to-end code to train models on a TPU | 22.02.2022 | |||
Data augmentation | This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation | Billy Lamberta | 22.02.2022 | ||
Word embeddings | This tutorial contains an introduction to word embeddings | Billy Lamberta | 15.01.2022 | ||
RuDOLPH | A fast and light text-image-text transformer designed for a quick and easy fine-tuning setup for the solution of various tasks: from generating images by text description and image classification to visual question answering and more | 14.01.2022 | |||
DeepDream | This tutorial contains a minimal implementation of DeepDream: an experiment that visualizes the patterns learned by a neural network | Billy Lamberta | 13.01.2022 | ||
MLP | The most basic neural network architectures, a multilayer perceptron, also known as a feedforward network | Ben Trevett | 26.12.2021 | ||
AlexNet | A neural network model that uses convolutional neural network layers and was designed for the ImageNet challenge | Ben Trevett | 26.12.2021 | ||
VGG | Very Deep Convolutional Networks for Large-Scale Image Recognition | Ben Trevett | 26.12.2021 | ||
LeNet | A neural network model that uses convolutional neural network layers and was designed for classifying handwritten characters | Ben Trevett | 26.12.2021 | ||
FLAML | Lightweight Python library that finds accurate machine learning models automatically, efficiently and economically | 17.12.2021 | |||
NL-Augmenter | A collaborative effort intended to add transformations of datasets dealing with natural language |
|
15.12.2021 | ||
Image classification | This tutorial shows how to classify images of flowers | Billy Lamberta | 28.11.2021 | ||
ruGPT3 | Example of inference of RuGPT3XL | Anton Emelyanov | 17.11.2021 | ||
CompilerGym | A reinforcement learning toolkit for compiler optimizations | 16.11.2021 | |||
Pix2Pix | This notebook demonstrates image to image translation using conditional GAN's | Billy Lamberta | 10.11.2021 | ||
DeepStyle | The Neural Style algorithm synthesizes a pastiche by separating and combining the content of one image with the style of another image using convolutional neural networks |
|
01.10.2021 | ||
EfficientNetV2 | A family of image classification models, which achieve better parameter efficiency and faster training speed than prior arts | 24.09.2021 | |||
Droidlet | A modular embodied agent architecture and platform for building embodied agents |
|
15.09.2021 | ||
GPT-J-6B | A 6 billion parameter, autoregressive text generation model trained on The Pile | 15.09.2021 | |||
Sentence Transformers | Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co |
|
13.09.2021 | ||
Lucid Sonic Dreams | Syncs GAN-generated visuals to music | Mikael Alafriz | 24.08.2021 | ||
Haiku | A library built on top of JAX designed to provide simple, composable abstractions for machine learning research | 17.06.2021 | |||
CNN | This tutorial demonstrates training a simple Convolutional Neural Network to classify CIFAR images | Billy Lamberta | 21.05.2021 | ||
Custom GPT-2 + Tokenizer | Train a custom GPT-2 model for free on a GPU using aitextgen! | Max Woolf | 17.05.2021 | ||
Train a GPT-2 Text-Generating Model | Retrain an advanced text generating neural network on any text dataset for free on a GPU using Colaboratory using aitextgen! | Max Woolf | 17.05.2021 | ||
EasyNMT | Easy to use, state-of-the-art machine translation for more than 100+ languages | Nils Reimers |
|
26.04.2021 | |
Deep-MAC | Welcome to the Novel class segmentation demo | Vighnesh Birodkar | 02.04.2021 | ||
GPT Neo | An implementation of model & data parallel GPT2 & GPT3 -like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library | EleutherAI |
|
28.03.2021 | |
CVAE | This notebook demonstrates how train a Variational Autoencoder on the MNIST dataset | Billy Lamberta | , | 22.03.2021 | |
DCGAN | This tutorial demonstrates how to generate images of handwritten digits using a Deep Convolutional Generative Adversarial Network | Billy Lamberta | 12.03.2021 | ||
Adversarial FGSM | This tutorial creates an adversarial example using the Fast Gradient Signed Method attack. This was one of the first and most popular attacks to fool a neural network. | Billy Lamberta | 12.03.2021 | ||
GAN steerability | We will navigate in GAN latent space to simulate various camera transformations |
|
04.03.2021 | ||
TF-Ranking | End-to-end walkthrough of training a TensorFlow Ranking neural network model which incorporates sparse textual features | Rama Kumar |
|
04.02.2021 | |
TensorNetwork | A library for easy and efficient manipulation of tensor networks | Chase Roberts |
|
21.01.2021 | |
Spleeter | Deezer source separation library including pretrained models | 10.01.2021 | |||
Semantic Segmentation | Pytorch implementation for Semantic Segmentation/Scene Parsing on MIT ADE20K dataset |
|
21.08.2020 | ||
CoVoST | A Large-Scale Multilingual Speech-To-Text Translation Corpus | 07.08.2020 | |||
Eager Few Shot Object Detection | Fine tuning of a RetinaNet architecture on very few examples of a novel class after initializing from a pre-trained COCO checkpoint | kmindspark | 11.07.2020 | ||
YOLOv4 | This tutorial will help you build YOLOv4 easily in the cloud with GPU enabled so that you can run object detections in milliseconds! | Aleksey Bochkovskiy |
|
25.06.2020 | |
Context R-CNN Demo | This notebook will walk you step by step through the process of using a pre-trained model to build up a contextual memory bank for a set of images, and then detect objects in those images+context using Context R-CNN | pkulzc | 17.06.2020 | ||
GAN Dissection | Visualizing and Understanding Generative Adversarial Networks | 04.05.2020 | |||
Imagededup | This package provides functionality to make use of hashing algorithms that are particularly good at finding exact duplicates as well as convolutional neural networks which are also adept at finding near duplicates | 03.10.2019 | |||
SentencePiece | An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training | 03.07.2019 | |||
Transfer Learning in NLP | This notebook accompanies the tutorial given at NAACL 2019 on Transfer Learning in Natural Language Processing |
|
03.06.2019 | ||
RSNA Pneumonia Detection Challenge (MD.ai API) | The basics of parsing the competition dataset, training using a detector basd on the Mask-RCNN algorithm for object detection and instance segmentation | tmoneyx01 | 29.08.2018 | ||
Python Data Science Handbook | Jupyter notebook version of the Python Data Science Handbook by Jake VanderPlas | Jake Vanderplas | 14.08.2017 |
(generated by generate_markdown.py based on research.json and tutorials.json)