Understanding Convolutional Networks with multi-layered Deconvolutional Network:
Texture synthesis using Convolutional Neural Networks:
JEPA pre-trained video embedding for vision tasks:
DETR (Object detection with vision transformers):
Deep Supervision Distillation:
Deformable convolutions:
DeformableDETR (Deformable Attention):
SwinTransformer (Shifted Window Attention):
Dino-DETR:
Anchor-Queries (DAB-DETR):
Object detection on compressed video:
Classifying JPEG compressed images (directly on DCT):
Original label-smoothing paper:
Online Label-smoothing:
Hungarian-matching loss for single-shot detection:
Focal Loss:
GIoU:
Original Knowledge Distillation paper:
Original Deep Supervision paper:
L2 Feature Distillation:
Theoretical underpinnings of Knowledge Distillation:
Feature Distillation + Knowledge Distillation + Deep Supervision:
Attention is All you need:
GLoVe embedding:
FastText embedding:
LLM’s escalation risks in political situations:
GPT-1:
BERT:
Show attend and tell:
CLIP:
Image-to-text pretraining for one-shot object detection tasks:
Auto-encoding Variational Bayes:
GAN:
DeepConvolutionalGAN:
Style Transfer:
Conditional GAN:
Deep Convolutional GAN:
PatchGAN and U-Net Generator:
CycleGAN:
Original Diffusion Paper:
XGBoost:
Why Neural Networks don't perform well on Tabular: