AnIdealRing
SmartDazi
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 18 hours ago
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning
upvoted
a
paper
about 2 months ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
upvoted
a
paper
4 months ago
R-Zero: Self-Evolving Reasoning LLM from Zero Data