arxiv:2606.13707
Jinyang Wu
Jinyang23
AI & ML interests
large language models, reasoning, agentic rl
Recent Activity
updated a model about 18 hours ago
Jinyang23/OPID-ALFWorld-1.7B published a model about 18 hours ago
Jinyang23/OPID-ALFWorld-1.7B upvoted a paper about 19 hours ago
OPID: On-Policy Skill Distillation for Agentic Reinforcement LearningOrganizations
None yet