5 3 10

adaface PRO

adaface-neurips

adaface-neurips

AI & ML interests

None yet

Recent Activity

new activity about 11 hours ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16:doesn't do kv caching on transformers

new activity 4 days ago

nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4:Errors loading state_dict via transformers: size mismatch for down_proj

upvoted a paper 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

View all activity

Organizations

None yet

New activity in nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 about 11 hours ago

doesn't do kv caching on transformers

#14 opened 1 day ago by

adaface-neurips

New activity in nvidia/Llama-4-Scout-17B-16E-Instruct-NVFP4 4 days ago

Errors loading state_dict via transformers: size mismatch for down_proj

#2 opened about 1 month ago by

rustyjelly

upvoted a paper 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114

liked a model 6 months ago

mistralai/Magistral-Small-2506

24B • Updated Jul 28 • 14.3k • 607

liked a dataset 6 months ago

open-thoughts/OpenThoughts3-1.2M

Viewer • Updated Jun 9 • 1.2M • 13.6k • 193

updated 2 Spaces 7 months ago

AdaFace-Animate

🎨

Generate personalized animated videos from face images

Adaface

😻

AdaFace: Face Encoder for 0-Shot Diffusion Personalization

New activity in yzwang/X2I-subject-driven 7 months ago

How is the GRIT-Entity-New dataset constructed?

#1 opened 12 months ago by

onion-liu

updated a model 7 months ago

adaface-neurips/adaface-models

Updated May 24

upvoted a paper 7 months ago

From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems

Paper • 2505.15685 • Published May 21 • 3

updated a model 7 months ago

adaface-neurips/adaface-animate-models

Updated May 22

liked a model 8 months ago

lodestones/Chroma

Text-to-Image • Updated Oct 23 • 1.29k

published 2 models 8 months ago

adaface-neurips/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Apr 15

adaface-neurips/Qwen2.5-1.5B-Open-R1-GRPO

Updated Apr 15

updated a model 8 months ago

adaface-neurips/Qwen2.5-1.5B-Open-R1-Distill

2B • Updated Apr 15 • 5

published a model 8 months ago

adaface-neurips/Qwen2.5-1.5B-Open-R1-Distill

2B • Updated Apr 15 • 5

upvoted an article 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

260

liked a model 9 months ago

nitrosocke/Ghibli-Diffusion

Text-to-Image • Updated Aug 3, 2023 • 2.84k • 783

published a model 9 months ago

adaface-neurips/adaface-animate-models

Updated May 22

published a Space 9 months ago

AdaFace-Animate

🎨

Generate personalized animated videos from face images

adaface PRO

AI & ML interests

Recent Activity

Organizations

adaface-neurips's activity

doesn't do kv caching on transformers

Errors loading state_dict via transformers: size mismatch for down_proj

AdaFace-Animate

Adaface

How is the GRIT-Entity-New dataset constructed?

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

AdaFace-Animate