Pratyay Banerjee's picture

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

HCI, Computer Vision, Object Detection, Pattern Recognition, NLP, Supervised Learning

Recent Activity

upvoted a paper about 12 hours ago

Improved Training Technique for Latent Consistency Models

upvoted a paper about 12 hours ago

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

upvoted a paper about 12 hours ago

Preference Leakage: A Contamination Problem in LLM-as-a-judge

View all activity

Organizations

Neilblaze's activity

upvoted 19 papers about 12 hours ago

Improved Training Technique for Latent Consistency Models

Paper • 2502.01441 • Published 22 days ago • 8

SafeRAG: Benchmarking Security in Retrieval-Augmented Generation of Large Language Model

Paper • 2501.18636 • Published 28 days ago • 28

Preference Leakage: A Contamination Problem in LLM-as-a-judge

Paper • 2502.01534 • Published 22 days ago • 38

Can LLMs Maintain Fundamental Abilities under KV Cache Compression?

Paper • 2502.01941 • Published 22 days ago • 14

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Paper • 2502.02508 • Published 21 days ago • 22

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 20 days ago • 53

Step Back to Leap Forward: Self-Backtracking for Boosting Reasoning of Language Models

Paper • 2502.04404 • Published 20 days ago • 22

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 18 days ago • 117

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published 15 days ago • 136

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published 15 days ago • 124

LLM Pretraining with Continuous Concepts

Paper • 2502.08524 • Published 13 days ago • 27

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published 14 days ago • 44

Distillation Scaling Laws

Paper • 2502.08606 • Published 13 days ago • 45

CoT-Valve: Length-Compressible Chain-of-Thought Tuning

Paper • 2502.09601 • Published 12 days ago • 14

Logical Reasoning in Large Language Models: A Survey

Paper • 2502.09100 • Published 13 days ago • 22

MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency

Paper • 2502.09621 • Published 12 days ago • 27

Jailbreaking to Jailbreak

Paper • 2502.09638 • Published 16 days ago • 4

ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation

Paper • 2502.09411 • Published 12 days ago • 17

Large Language Diffusion Models

Paper • 2502.09992 • Published 12 days ago • 80

upvoted a paper about 13 hours ago

How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training

Paper • 2502.11196 • Published 9 days ago • 21