-
Exploring the Landscape of Convolutional Neural Networks
A review of CNN architectures
-
Paper Summary: GIT - A Generative Image-to-text Transformer for Vision and Language
Transformer-based generative vision-language model