Zhang Xu's picture

4 4

Zhang Xu

texzhang

·

CheungXu

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

upvoted a paper 19 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

upvoted a paper 25 days ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

View all activity

Organizations

None yet

Collections 2

models 0

None public yet

datasets 0

None public yet