AnIdealRing's picture

1 8

AnIdealRing

SmartDazi

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 18 hours ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

upvoted a paper about 2 months ago

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

upvoted a paper 4 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

View all activity

Organizations

models 0

None public yet

datasets 0

None public yet