Ajith V Prabhakar

ajithprabhakar

https://www.ajithp.com

AI & ML interests

NLP, Responsible AI, Generative AI

Recent Activity

commented on a paper about 18 hours ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

commented on a paper 7 days ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

commented on a paper 12 days ago

Qwen2.5-1M Technical Report

Organizations

Posts 2

Post

Hi All,
My latest blog post is a comprehensive guide to LLM benchmarking. It covers:
➟ 20+ key benchmarks, from MMLU to TruthfulQA
➟ How each benchmark assesses different LLM capabilities
➟ Why benchmarking matters for real-world AI applications
➟ Future trends in AI evaluation
Read the blog here: https://wp.me/p7Qix-wO

Please let me know your thoughts, suggestions, and comments.

Post

Can AI cheat or lie?

In this blog post, we explore research by experts from MIT, Australian Catholic University, and the Center for AI Safety to better understand the nature of AI deception, its various forms, and the risks it poses. We examine real-world examples and the underlying mechanisms that enable AI systems to deceive.

Learn more at: https://ajithp.com/2024/05/12/ai-deception-risks-real-world-examples-and-proactive-solutions/

models

None public yet

datasets

None public yet

Collections 1

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

OneLLM: One Framework to Align All Modalities with Language

Generative Multimodal Models are In-Context Learners

The LLM Surgeon
