Decoder model vs Encoder-Decoder model: RQ presentation

2024. 10. 5. 20:11ㆍAI/LLM

Research Question: Both decoder-only and encoder-decoder models can be trained as generative LMs. Why has the latter lost popularity since T5?

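Conclusion: For everyday NLP usage, the advantages of decoder-only models (lower resource requirements, faster) win out. However, recent large multimodal models (LMMs) that emphasize multimodality cannot be served by an existing pretrained decoder alone, so a separate encoder is needed to extract embeddings from video, images, and knowledge graphs. Google's recent Gemini is given as an example of an encoder-decoder model.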
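To make the structural difference concrete, here is a minimal sketch of the attention masks each architecture uses. It is an illustration under simplifying assumptions, not taken from the slides: the function names (`causal_mask`, `full_mask`, `decoder_only_masks`, `encoder_decoder_masks`) are hypothetical, and `1` means "this position may attend to that position". A decoder-only model runs one stack with a single causal mask over the concatenated input and output, while an encoder-decoder model needs a bidirectional encoder mask, a causal decoder mask, and a cross-attention mask.

```python
def causal_mask(n):
    """Lower-triangular mask: position i may attend to positions 0..i."""
    return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]


def full_mask(rows, cols):
    """Unrestricted mask: every row position may attend to every column."""
    return [[1] * cols for _ in range(rows)]


def decoder_only_masks(src_len, tgt_len):
    """One stack: input and output tokens share a single causal sequence."""
    n = src_len + tgt_len
    return {"self": causal_mask(n)}


def encoder_decoder_masks(src_len, tgt_len):
    """Two stacks: bidirectional encoder, causal decoder, plus cross-attention."""
    return {
        "encoder_self": full_mask(src_len, src_len),  # input sees all of the input
        "decoder_self": causal_mask(tgt_len),         # output sees only earlier output
        "cross": full_mask(tgt_len, src_len),         # output sees all encoder states
    }


if __name__ == "__main__":
    for name, mask in decoder_only_masks(2, 2).items():
        print("decoder-only", name, mask)
    for name, mask in encoder_decoder_masks(2, 2).items():
        print("encoder-decoder", name, mask)
```

In the multimodal setting described above, the encoder side is where embeddings for non-text inputs would come from, and the cross-attention mask is one way the decoder could read them. That mapping is an interpretation of the conclusion, not a description of any specific model.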

Full slides: https://docs.google.com/presentation/d/e/2PACX-1vSnsDr0TNXWm5af78wurw8n9huI_K2VsD1Xa6Rq4F1y8noJoHUdbjXXwiRH9Bqg6QiqJWue1lGRgVqB/pub?start=false&loop=false&delayms=3000

