YOLOv5 |
You Only Look Once |
Glenn Jocher |
|
|
31.07.2022 |
Anycost GAN |
Interactive natural image editing |
|
|
|
20.07.2022 |
Disco Diffusion |
A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations |
|
|
|
14.07.2022 |
Dream Fields |
Zero-Shot Text-Guided Object Generation |
|
|
|
13.07.2022 |
GFPGAN |
Towards Real-World Blind Face Restoration with Generative Facial Prior |
|
|
|
13.07.2022 |
Make-A-Scene |
Scene-Based Text-to-Image Generation with Human Priors |
|
|
|
01.07.2022 |
DALL·E Mini |
Generate images from a text prompt |
|
|
|
29.06.2022 |
OPT |
Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet |
|
|
|
29.06.2022 |
GPEN |
GAN Prior Embedded Network for Blind Face Restoration in the Wild |
|
|
|
26.06.2022 |
HuggingArtists |
Choose your favorite Artist and train a language model to write new lyrics based on their unique voice |
Aleksey Korshuk |
|
|
25.06.2022 |
Customizing a Transformer Encoder |
We will learn how to customize the encoder to employ new network architectures |
Chen Chen |
|
|
22.06.2022 |
MTTR |
End-to-End Referring Video Object Segmentation with Multimodal Transformers |
|
|
|
20.06.2022 |
SwinIR |
Image Restoration Using Swin Transformer |
|
|
|
17.06.2022 |
VRT |
A Video Restoration Transformer |
|
|
|
15.06.2022 |
Detic |
Detecting Twenty-thousand Classes using Image-level Supervision |
|
|
|
07.06.2022 |
AMARETTO |
Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease |
|
|
|
01.06.2022 |
T0 |
Multitask Prompted Training Enables Zero-Shot Task Generalization |
|
|
|
29.05.2022 |
LaMa |
Resolution-robust Large Mask Inpainting with Fourier Convolutions |
|
|
|
25.05.2022 |
StyleGAN-NADA |
Zero-Shot non-adversarial domain adaptation of pre-trained generators |
|
|
|
20.05.2022 |
Parallel WaveGAN |
State-of-the-art non-autoregressive models to build your own great vocoder |
Tomoki Hayashi |
|
|
16.05.2022 |
Text2Mesh |
Text-Driven Neural Stylization for Meshes |
|
|
|
14.05.2022 |
T5 |
Text-To-Text Transfer Transformer |
|
|
|
11.05.2022 |
XLS-R |
Self-supervised Cross-lingual Speech Representation Learning at Scale |
|
|
|
10.05.2022 |
CLIPDraw |
Synthesize drawings to match a text prompt |
|
|
|
28.04.2022 |
Real-ESRGAN |
Extend the powerful ESRGAN to a practical restoration application, which is trained with pure synthetic data |
|
|
|
24.04.2022 |
FILM |
A frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion |
|
|
|
07.04.2022 |
Deep Painterly Harmonization |
Algorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve |
|
|
|
07.04.2022 |
LDM |
High-Resolution Image Synthesis with Latent Diffusion Models |
|
|
|
04.04.2022 |
Demucs |
Hybrid Spectrogram and Waveform Source Separation |
Alexandre Défossez |
|
|
23.03.2022 |
CLIPasso |
Semantically-Aware Object Sketching |
|
|
|
21.03.2022 |
AlphaFold |
Highly accurate protein structure prediction |
|
|
|
16.03.2022 |
VideoGPT |
A conceptually simple architecture for scaling likelihood based generative modeling to natural videos |
|
|
|
02.03.2022 |
Disentangled Lifespan Face Synthesis |
LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively |
|
|
|
22.02.2022 |
ArcaneGAN |
Process video in the style of the Arcane animated series |
Alexander Spirin |
|
|
17.02.2022 |
Mask2Former |
Masked-attention Mask Transformer for Universal Image Segmentation |
|
|
|
09.02.2022 |
SpecVQGAN |
Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors |
|
- , , , , ,
- , , , ,
- project
- , ,
|
|
03.02.2022 |
JoJoGAN |
One Shot Face Stylization |
|
|
|
02.02.2022 |
DFL-Colab |
This project provides you IPython Notebook to use DeepFaceLab |
chervonij |
|
|
20.01.2022 |
Pose with Style |
Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN |
|
|
|
19.01.2022 |
Taming Transformers for High-Resolution Image Synthesis |
We combine the efficiancy of convolutional approaches with the expressivity of transformers by introducing a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer |
|
|
|
13.01.2022 |
FuseDream |
Training-Free Text-to-Image Generation with Improved CLIP+GAN Space Optimization |
|
|
|
02.01.2022 |
GLIDE |
Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models |
|
|
|
22.12.2021 |
Music Composer |
Synthesizing symbolic music in MIDI format using the Music Transformer model |
bazanovvanya |
|
|
20.12.2021 |
encoder4editing |
Designing an Encoder for StyleGAN Image Manipulation |
|
|
|
02.12.2021 |
StyleCariGAN |
Caricature Generation via StyleGAN Feature Map Modulation |
|
|
|
30.11.2021 |
CartoonGAN |
The implementation of the cartoon GAN model with PyTorch |
Tobias Sunderdiek |
|
|
24.11.2021 |
SimSwap |
An efficient framework, called Simple Swap, aiming for generalized and high fidelity face swapping |
|
|
|
24.11.2021 |
RVM |
Robust High-Resolution Video Matting with Temporal Guidance |
|
|
|
24.11.2021 |
AnimeGANv2 |
An improved version of AnimeGAN - it prevents the generation of high-frequency artifacts by simply changing the normalization of features in the network |
|
|
|
17.11.2021 |
YOLOv3 |
You Only Look Once |
Glenn Jocher |
|
|
14.11.2021 |
SOAT |
StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN |
|
|
|
13.11.2021 |
Arnheim |
Generative Art Using Neural Visual Grammars and Dual Encoders |
|
|
|
11.11.2021 |
StyleGAN 2 |
Generation of faces, cars, etc. |
Mikael Christensen |
|
|
05.11.2021 |
ruDALL-E |
Generate images from texts in Russian |
Alex Shonenkov |
|
|
03.11.2021 |
ByteTrack |
Multi-Object Tracking by Associating Every Detection Box |
|
|
|
30.10.2021 |
StyleGAN3 |
Alias-Free Generative Adversarial Networks |
|
- , , , , ,
- , , , , ,
- project
|
|
19.10.2021 |
GPT-2 |
Retrain an advanced text generating neural network on any text dataset using gpt-2-simple! |
Max Woolf |
|
|
18.10.2021 |
IC-GAN |
Instance-Conditioned GAN |
|
|
|
01.10.2021 |
Skillful Precipitation Nowcasting Using Deep Generative Models of Radar |
Open-sourced dataset and model snapshot for precipitation nowcasting |
|
|
|
29.09.2021 |
Text2Animation |
Generate images from text phrases with VQGAN and CLIP with animation and keyframes |
|
|
|
29.09.2021 |
Live Speech Portraits |
Real-Time Photorealistic Talking-Head Animation |
|
|
|
26.09.2021 |
Open-Unmix |
A deep neural network reference implementation for music source separation, applicable for researchers, audio engineers and artists |
|
|
|
23.07.2021 |
textgenrnn |
Generate text using a pretrained neural network with a few lines of code, or easily train your own text-generating neural network of any size and complexity |
Max Woolf |
|
|
13.07.2021 |
First Order Motion Model for Image Animation |
Transferring facial movements from video to image |
Aliaksandr Siarohin |
|
|
30.06.2021 |
TediGAN |
Framework for multi-modal image generation and manipulation with textual descriptions |
|
|
|
30.06.2021 |
GANs N' Roses |
Stable, Controllable, Diverse Image to Image Translation |
|
|
|
19.06.2021 |
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes |
A method to stylize images by optimizing parameterized brushstrokes instead of pixels |
|
|
|
02.06.2021 |
Pixel2Style2Pixel |
Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation |
|
|
|
01.06.2021 |
Fine-tuning a BERT |
We will work through fine-tuning a BERT model using the tensorflow-models PIP package |
|
|
|
24.05.2021 |
ReStyle |
A Residual-Based StyleGAN Encoder via Iterative Refinement |
|
|
|
21.05.2021 |
Motion Representations for Articulated Animation |
Novel motion representations for animating articulated objects consisting of distinct parts |
|
|
|
29.04.2021 |
SAM |
Age Transformation Using a Style-Based Regression Model |
|
|
|
26.04.2021 |
SkinDeep |
Remove Body Tattoo Using Deep Learning |
Vijish Madhavan |
|
|
24.04.2021 |
Geometry-Free View Synthesis |
Is a geometric model required to synthesize novel views from a single image? |
|
|
|
22.04.2021 |
NeRViS |
An algorithm for full-frame video stabilization by first estimating dense warp fields |
|
|
|
11.04.2021 |
NeX |
View synthesis based on enhancements of multiplane image that can reproduce NeXt-level view-dependent effects in real time |
|
|
|
25.03.2021 |
Big Sleep |
Text to image generation, using OpenAI's CLIP and a BigGAN |
Phil Wang |
|
|
17.03.2021 |
Deep Daze |
Text to image generation using OpenAI's CLIP and Siren |
Phil Wang |
|
|
17.03.2021 |
Talking Head Anime from a Single Image |
The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose |
Pramook Khungurn |
|
|
23.02.2021 |
Multitrack MusicVAE |
The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord |
|
|
|
17.02.2021 |
NFNet |
An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets |
|
|
|
17.02.2021 |
bsuite |
A collection of carefully-designed experiments that investigate core capabilities of an RL agent with two main objectives |
|
|
|
13.02.2021 |
Wav2Lip |
A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild |
|
|
|
12.02.2021 |
CLIP |
A neural network which efficiently learns visual concepts from natural language supervision |
|
|
|
29.01.2021 |
Adversarial Patch |
A method to create universal, robust, targeted adversarial image patches in the real world |
Tom Brown |
|
|
27.01.2021 |
MSG-Net |
Multi-style Generative Network with a novel Inspiration Layer, which retains the functionality of optimization-based approaches and has the fast speed of feed-forward networks |
|
|
|
25.01.2021 |
Toon-Me |
A fun project to toon portrait images |
Vijish Madhavan |
|
|
22.01.2021 |
Neural Style Transfer |
Implementation of Neural Style Transfer in Keras 2.0+ |
Somshubra Majumdar |
|
|
22.01.2021 |
SkyAR |
A vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable styles |
Zhengxia Zou |
|
|
18.01.2021 |
Big GAN |
Large Scale GAN Training for High Fidelity Natural Image Synthesis |
Google |
|
|
12.01.2021 |
GrooVAE |
Some applications of machine learning for generating and manipulating beats and drum performances |
|
|
|
08.01.2021 |
MusicXML Documentation |
The goal of this notebook is to explore one of the magenta libraries for music |
|
|
|
08.01.2021 |
SVG VAE |
A colab demo for the SVG VAE model |
Raphael Gontijo Lopes |
|
|
08.01.2021 |
PIFuHD |
Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization |
|
|
|
05.01.2021 |
Neural Magic Eye |
Learning to See and Understand the Scene Behind an Autostereogram |
|
|
|
01.01.2021 |
Flow-edge Guided Video Completion |
Method first extracts and completes motion edges, and then uses them to guide piecewise-smooth flow completion with sharp edges |
|
|
|
30.12.2020 |
ArtLine |
A Deep Learning based project for creating line art portraits |
Vijish Madhavan |
|
|
24.12.2020 |
WikiArt (stylegan2-ada) |
Generation of paintings of different styles and genres |
Doron Adler |
|
|
08.12.2020 |
GANSpace |
A simple technique to analyze GANs and create interpretable controls for image synthesis, such as change of viewpoint, aging, lighting, and time of day |
|
|
|
06.12.2020 |
SeFa |
A closed-form approach for unsupervised latent semantic factorization in GANs |
|
|
|
06.12.2020 |
Stylized Neural Painting |
An image-to-painting translation method that generates vivid and realistic painting artworks with controllable styles |
|
|
|
01.12.2020 |
DeOldify (video) |
Colorize your own videos! |
Jason Antic |
|
|
13.11.2020 |
DeOldify (photo) |
Colorize your own photos! |
|
|
|
13.11.2020 |
MakeItTalk |
A method that generates expressive talking-head videos from a single facial image with audio as the only input |
|
|
|
10.11.2020 |
LaSAFT |
Latent Source Attentive Frequency Transformation for Conditioned Source Separation |
Woosung Choi |
|
|
01.11.2020 |
Lifespan Age Transformation Synthesis |
Multi-domain image-to-image generative adversarial network architecture, whose learned latent space models a continuous bi-directional aging process |
|
|
|
31.10.2020 |
HiGAN |
Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis |
|
|
|
14.10.2020 |
InterFaceGAN |
Interpreting the Latent Space of GANs for Semantic Face Editing |
|
|
|
13.10.2020 |
Faceswap-GAN |
A minimum demo for faceswap-GAN v2.2 |
shaoanlu |
|
|
12.09.2020 |
Instance-aware Image Colorization |
Novel deep learning framework to achieve instance-aware colorization |
Jheng-Wei Su |
|
|
30.08.2020 |
Person Remover |
Project that combines Pix2Pix and YOLO arhitectures in order to remove people or other objects from photos |
|
|
|
22.08.2020 |
Rewriting a Deep Generative Model |
We ask if a deep network can be reprogrammed to follow different rules, by enabling a user to directly change the weights, instead of training with a data set |
|
|
|
31.07.2020 |
BERT score |
An automatic evaluation metric for text generation |
Tianyi Zhang |
|
|
17.07.2020 |
HiDT |
A generative image-to-image model and a new upsampling scheme that allows to apply image translation at high resolution |
|
|
|
17.07.2020 |
Analyzing Tennis Serve |
We'll use the Video Intelligence API to analyze a tennis serve, including the angle of the arms and legs during the serve |
Dale Markowitz |
|
|
14.07.2020 |
SIREN |
Implicit Neural Representations with Periodic Activation Functions |
|
|
|
24.06.2020 |
PIFu |
Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization |
|
|
|
18.06.2020 |
3D Ken Burns |
A reference implementation of 3D Ken Burns Effect from a Single Image using PyTorch - given a single input image, it animates this still image with a virtual camera scan and zoom subject to motion parallax |
Manuel Romero |
|
|
13.06.2020 |
MusicVAE |
A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music |
|
|
|
02.06.2020 |
Background Matting |
The notebook is split into three parts: required setup, running the algorithm on photos, and running it on videos |
Andrey Ryabtsev |
|
|
18.05.2020 |
Jukebox |
A neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles |
Christine Payne |
|
|
04.05.2020 |
3D Photo Inpainting |
Method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view |
|
|
|
04.05.2020 |
Motion Supervised co-part Segmentation |
A self-supervised deep learning method for co-part segmentation |
|
|
|
07.04.2020 |
RNN for Predictive Maintenance |
LSTM network in order to predict remaining useful life of aircraft engines |
Umberto Griffo |
|
|
03.04.2020 |
Onsets and Frames |
Onsets and Frames is an automatic music transcription framework with piano and drums models |
|
|
|
02.04.2020 |
Classification of chest vs. adominal X-rays |
The goal of this tutorial is to build a deep learning classifier to accurately differentiate between chest and abdominal X-rays |
tmoneyx01 |
|
|
07.03.2020 |
Lung X-Rays Semantic Segmentation |
This lesson applies a U-Net for Semantic Segmentation of the lung fields on chest x-rays |
tmoneyx01 |
|
|
07.03.2020 |
WikiArt (stylegan2) |
Generation of paintings of different styles and genres |
Doron Adler |
|
|
27.01.2020 |
Earth Engine Python API and Folium Interactive Mapping |
This notebook demonstrates how to setup the Earth Engine and provides several examples for visualizing Earth Engine processed data interactively using the folium library |
Qiusheng Wu |
|
|
20.01.2020 |
Train a GPT-2 Model on Tweets |
Train the model on your downloaded tweets, and generate massive amounts of Tweets from it |
Max Woolf |
|
|
16.01.2020 |
Traffic counting |
Making Road Traffic Counting App based on Computer Vision and OpenCV |
Andrey Nikishaev |
|
|
10.01.2020 |
Siamese NN |
Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task |
Tomasz Latkowski |
|
|
19.12.2019 |
Learning to Paint |
Learning to Paint With Model-based Deep Reinforcement Learning |
Manuel Romero |
|
|
17.12.2019 |
Imaging-AMARETTO |
An imaging genomics software tool to systematically interrogate multi-omics networks for relevance to radiography and histopathology imaging biomarkers of clinical outcomes with application to studies of brain tumors |
|
|
|
29.11.2019 |
Face toolbox |
A collection of deep learning frameworks ported to Keras for face detection, face segmentation, face parsing, iris detection, and face verification |
shaoanlu |
|
|
03.10.2019 |
Generating Piano Music with Transformer |
This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer |
|
|
|
16.09.2019 |
Few-shot face translation |
A GAN based approach for one model to swap them all: model is capable of producing faces that has its gaze direction, glasses, and hiar occlusions being consistent with given source face |
shaoanlu |
|
|
02.09.2019 |
Waifu2x |
This is the Google Colab implementation of tsurumeso's chainer implementation of waifu2x |
Margesh Phirke |
|
|
23.08.2019 |
GMCNN |
Generative Multi-column Convolutional Neural Networks inpainting model in Keras |
Tomasz Latkowski |
|
|
09.08.2019 |
XLNet |
Generalized Autoregressive Pretraining for Language Understanding |
Zhilin Yang |
|
|
28.06.2019 |
Breast Cancer detection |
A Neural Network for detecting breast cancer in cell scans! |
Peter Teoh |
blog post |
|
28.04.2019 |
GazeML |
Eye region landmarks detection |
shaoanlu |
|
|
03.04.2019 |
BERT with TPU |
Using a free Colab Cloud TPU to fine-tune sentence and sentence-pair classification tasks built on top of pretrained BERT models and run predictions on tuned model |
Sourabh Bajaj |
|
|
29.03.2019 |
automl-gs on a TPU |
Give an input CSV file and a target field you want to predict to automl-gs, and get a trained high-performing machine learning or deep learning model plus native Python code pipelines allowing you to integrate that model into any prediction workflow |
Max Woolf |
|
|
26.03.2019 |
GANSynth |
This notebook is a demo GANSynth, which generates audio with Generative Adversarial Networks |
Jesse Engel |
|
|
25.02.2019 |
Edge Detection |
Edge detection in OpenCV and skimage |
Yuhuang Hu |
|
|
15.02.2019 |
BERT on TF Hub |
Predicting Movie Review Sentiment with BERT on TF Hub |
Dale Markowitz |
|
|
12.02.2019 |
Mask R-CNN |
Code and visualizations to test, debug, and evaluate the Mask R-CNN model |
|
|
|
22.01.2019 |
RSNA Pneumonia Detection Challenge (Kaggel API) |
The basics of parsing the competition dataset, training using a detector basd on the Mask-RCNN algorithm for object detection and instance segmentation |
tmoneyx01 |
|
|
03.09.2018 |
HoF |
This notebook will walk you step by step through the process of using a pre-trained model to detect faces in an image |
Lucas Persona |
|
|
23.04.2018 |
Group Normalization |
A simple alternative to BN. GN divides the channels into groups and computes within each group the mean and variance for normalization. |
shaoanlu |
|
|
26.03.2018 |
Latent Constraints |
Conditional Generation from Unconditional Generative Models |
|
|
|
27.11.2017 |
Performance RNN |
This notebook shows you how to generate new performed compositions from a trained model |
|
|
|
11.07.2017 |
NSynth |
This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them |
|
|
|
06.04.2017 |