SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published Feb 20, 2025 • 156
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4, 2025 • 253
Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 209
multilingual vision models Collection Some papers I read for understanding vision models and also adding multilingual capabilities to them • 14 items • Updated Dec 11, 2024 • 2
Maya: An Instruction Finetuned Multilingual Multimodal Model Paper • 2412.07112 • Published Dec 10, 2024 • 28
M-RewardBench: Evaluating Reward Models in Multilingual Settings Paper • 2410.15522 • Published Oct 20, 2024 • 12
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 174
Federated Learning driven Large Language Models for Swarm Intelligence: A Survey Paper • 2406.09831 • Published Jun 14, 2024 • 1
Cohere Labs Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated Jul 31, 2025 • 56