Return of the Encoder: Maximizing Parameter Efficiency for SLMs (arXiv:2501.16273)
ARWKV: Pretrain is not what we need, an RNN-Attention-Based Language Model Born from Transformer (arXiv:2501.15570)
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity (arXiv:2501.16295)
iFormer: Integrating ConvNet and Transformer for Mobile Application (arXiv:2501.15369)
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models (arXiv:2501.13629)
SRMT: Shared Memory for Multi-agent Lifelong Pathfinding (arXiv:2501.13200)
IntellAgent: A Multi-Agent Framework for Evaluating Conversational AI Systems (arXiv:2501.11067)
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament (arXiv:2501.13007)
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding (arXiv:2501.13106)
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning (arXiv:2501.12948)