- A Mathematical Theory of Deep Convolutional Neural Networks for Feature Extraction - [Arxiv] [QA]
- Deep Residual Learning for Image Recognition - [Arxiv] [QA]
- MovieQA: Understanding Stories in Movies through Question-Answering - [Arxiv] [QA]
- Explaining NonLinear Classification Decisions with Deep Taylor Decomposition - [Arxiv] [QA]
- A Type Theory for Probabilistic and Bayesian Reasoning - [Arxiv] [QA]
- Sequence Level Training with Recurrent Neural Networks - [Arxiv] [QA]
- All you need is a good init - [Arxiv] [QA]
- Unsupervised Deep Embedding for Clustering Analysis - [Arxiv] [QA]
- Supersizing Self-supervision: Learning to Grasp from 50K Tries and 700 Robot Hours - [Arxiv] [QA]
- Quantization based Fast Inner Product Search - [Arxiv] [QA]
- Inverting Visual Representations with Convolutional Networks - [Arxiv] [QA]
- You Only Look Once: Unified, Real-Time Object Detection - [Arxiv] [QA]
- Visualizing and Understanding Recurrent Networks - [Arxiv] [QA]
- A Critical Review of Recurrent Neural Networks for Sequence Learning - [Arxiv] [QA]
- Unsupervised Visual Representation Learning by Context Prediction - [Arxiv] [QA]
- Visual Semantic Role Labeling - [Arxiv] [QA]
- Contextual Action Recognition with R*CNN - [Arxiv] [QA]