An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

This is my review of the ViT paper, which applies the Transformer to vision tasks. The full notes are at the link below.

https://ruddy-sheet-75d.notion.site/An-Image-is-Worth-16x16-Words-Transformers-for-Image-Recognition-at-Scale-8f616a55fa6a428a97845727882c1b02?pvs=4

 


If you spot any errors in these notes, please let me know in the comments. I would appreciate it!