'dapo' 태그의 글 목록

[논문 리뷰] DAPO: An Open-Source LLM Reinforcement Learning System at Scale

https://arxiv.org/abs/2503.14476 DAPO: An Open-Source LLM Reinforcement Learning System at ScaleInference scaling empowers LLMs with unprecedented reasoning ability, with reinforcement learning as the core technique to elicit complex reasoning. However, key technical details of state-of-the-art reasoning LLMs are concealed (such as in OpenAI o1 blogarxiv.orgIntroductionTest-time scaling은 더 긴 Cha..

논문 2025.04.28

« 2025/07 »

일

월

화

수

목

금

토

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

khseon7 님의 블로그

dapo 1

티스토리툴바