'grpo' 태그의 글 목록

[논문 리뷰] MMSearch-R1: Incentivizing LMMs to Search

https://arxiv.org/abs/2506.20670 MMSearch-R1: Incentivizing LMMs to SearchRobust deployment of large multimodal models (LMMs) in real-world scenarios requires access to external knowledge sources, given the complexity and dynamic nature of real-world information. Existing approaches such as retrieval-augmented generation (RAG) aarxiv.orgAbstract현실 세계 시나리오에서 대형 멀티모달 모델(LMMs)의 안정적인 배포를 위해, 현실 세계 정..

논문 2025.06.30

« 2025/07 »

일

월

화

수

목

금

토

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

khseon7 님의 블로그

grpo 1

티스토리툴바