4

Penny

pennypanpan

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

upvoted a paper 3 months ago

Agentic Design of Compositional Machines

upvoted a paper 3 months ago

Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards

Organizations

None yet

Papers 1

arxiv:2302.01687

models 0

None public yet

datasets 0

None public yet