J C

dark-pen

AI & ML interests

None yet

Recent Activity

liked a model about 15 hours ago

CATIE-AQ/FAT5-small

Organizations

None yet

dark-pen's activity

upvoted a paper about 12 hours ago

MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning

Paper • 2410.18035 • Published Oct 23, 2024 • 1

upvoted a collection about 12 hours ago

Gemma-2-9B-it-Advanced

Merges of the advanced research fine tunes of gemma-2 9B it • 3 items • Updated Oct 20, 2024 • 3

upvoted a paper 4 days ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 16

upvoted a collection 5 days ago

VideoChat-Flash

Faster and more powerful VideoChat. • 4 items • Updated 22 days ago • 8

upvoted a paper 5 days ago

Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes

Paper • 2410.16930 • Published Oct 22, 2024 • 7

upvoted a collection 5 days ago

TinySQL

"Convert English query to a SQL command" models and training data. • 26 items • Updated 6 days ago • 2

upvoted a paper 7 days ago

Know When to Fuse: Investigating Non-English Hybrid Retrieval in the Legal Domain

Paper • 2409.01357 • Published Sep 2, 2024 • 3

upvoted a collection 7 days ago

DeepSeek-R1-ReDistill

Re-distilled DeepSeek R1 models • 4 items • Updated 4 days ago • 10

upvoted 2 collections 8 days ago

Models - Gaming

3 items • Updated Mar 9, 2024 • 1

Papers - Reasoning

32 items • Updated 8 days ago • 1

upvoted 2 collections 9 days ago

Social Deduction LLM (AAMAS 2025)

Pretrained models for "Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning" (AAMAS 2025 Version) • 2 items • Updated 14 days ago • 1

Sa2VA model zoo

4 items • Updated 20 days ago • 28

upvoted a collection 16 days ago

MoDE

Collection of pretrained MoDE Diffusion Policies. Variants include finetuned versions for all CALVIN benchmarks and LIBERO 90. • 9 items • Updated Dec 19, 2024 • 2

upvoted a collection 20 days ago

[MASK] is All You Need

Code, dataset, and pretrained model • 5 items • Updated Nov 29, 2024 • 9

upvoted 3 collections 21 days ago

Cosmos

The collection of Cosmos models • 31 items • Updated 17 days ago • 254

CorDA

models and data for CorDA • 9 items • Updated Jul 15, 2024 • 1

Hymba

A series of Hybrid Small Language Models. • 2 items • Updated 17 days ago • 27

upvoted 2 collections 23 days ago

🛰️🌍 Geospatial Datasets

A curated collection of diverse geospatial and satellite imagery datasets. • 54 items • Updated Mar 6, 2024 • 21

Glider Checkpoints

Official checkpoints for Glider (arXiv/2410.07172) reproduction. • 5 items • Updated Oct 19, 2024 • 2

upvoted a paper 23 days ago

Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation

Paper • 2501.04144 • Published 27 days ago • 18