Hai Ye

oceanpty

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

authored a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

upvoted a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

View all activity

Organizations

None yet

upvoted a paper 5 days ago

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

Paper • 2512.17532 • Published 9 days ago • 63

authored a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14 • 164

upvoted a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14 • 164

liked 3 models about 1 month ago

upvoted a collection about 1 month ago

MiroThinker-v1.0

Collection

Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling • 8 items • Updated 12 days ago • 41

updated a model 4 months ago

oceanpty/self-j-vicuna-13b-v1.5-student

8B • Updated Sep 3 • 5

published a model 4 months ago

oceanpty/self-j-vicuna-13b-v1.5-student

8B • Updated Sep 3 • 5

updated a model 4 months ago

oceanpty/self-j-vicuna-13b-v1.5-kd

Updated Aug 25

published a model 4 months ago

oceanpty/self-j-vicuna-13b-v1.5-kd

Updated Aug 25

upvoted a collection 5 months ago

MiroThinker-v0.1

Collection

High performance in deep research and tool use. • 7 items • Updated Sep 8 • 36

upvoted a paper 8 months ago

100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models

Paper • 2505.00551 • Published May 1 • 36

upvoted a paper 10 months ago

Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization

Paper • 2502.16825 • Published Feb 24 • 7

commented a paper 10 months ago

Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization

Paper • 2502.16825 • Published Feb 24 • 7 •

authored a paper 12 months ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published Jan 5 • 45

upvoted a paper 12 months ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published Jan 5 • 45

updated 2 models 12 months ago

oceanpty/Self-J-lla31-8b-inst-base-yi-1-5-16k-chat-threshold-1

8B • Updated Dec 31, 2024 • 12

oceanpty/Self-J-lla31-8b-inst-ref-lla31-70b-base-yi-1-5-16k-chat-threshold-1

8B • Updated Dec 31, 2024 • 4

authored a paper 12 months ago

Preference-Guided Reflective Sampling for Aligning Language Models

Paper • 2408.12163 • Published Aug 22, 2024

Hai Ye

AI & ML interests

Recent Activity

Organizations

oceanpty's activity