Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

liked a model 3 days ago

openbmb/MiniCPM-o-2_6

reacted to chansung's post with 👍 4 days ago

Simple summarization of Evolving Deeper LLM Thinking (Google DeepMind)

The process starts by posing a question.

1) The LLM generates initial responses.
2) These generated responses are evaluated according to specific criteria (program-based checker).
3) The LLM critiques the evaluated results.
4) The LLM refines the responses based on the evaluation, critique, and original responses.

The refined response is then fed back into step 2). If it meets the criteria, the process ends. Otherwise, the algorithm generates more responses based on the refined ones (with some being discarded, some remaining, and some responses potentially being merged).

Through this process, it demonstrated excellent performance in complex scheduling problems (travel planning, meeting scheduling, etc.). It's a viable method for finding highly effective solutions in specific scenarios. However, there are two major drawbacks:

🤔 An excessive number of API calls is required. (While the cost might not be very high, it leads to significant latency.)
🤔 The evaluator is program-based. (This limits its use as a general method. It could potentially be modified/implemented using LLM as Judge, but that would introduce additional API costs for evaluation.)

https://arxiv.org/abs/2501.09891
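The generate, check, critique, and refine loop summarized in the post can be sketched roughly as follows. This is only an illustrative toy, not the paper's implementation: the meeting constraints in `ALLOWED`, and the functions `generate`, `check`, `refine`, and `evolve`, are hypothetical names, and the LLM calls are replaced by small deterministic stand-ins so the example runs without any API. The merging of responses mentioned in the post is left out.

```python
from typing import Dict, List, Optional, Tuple

Plan = Dict[str, int]  # meeting name -> start hour

# Hypothetical constraints for a toy meeting-scheduling task.
ALLOWED = {"standup": [9, 10], "review": [9, 10, 11], "retro": [10, 11]}


def check(plan: Plan) -> List[str]:
    """Program-based checker: return the list of violated constraints."""
    problems = []
    for name, hour in plan.items():
        if hour not in ALLOWED[name]:
            problems.append(f"{name}: hour {hour} not allowed")
    seen: Dict[int, str] = {}
    for name, hour in plan.items():
        if hour in seen:
            problems.append(f"{name}: clashes with {seen[hour]}")
        else:
            seen[hour] = name
    return problems


def generate(n: int) -> List[Plan]:
    """Stand-in for the LLM's initial responses: n naive plans."""
    return [
        {name: hours[i % len(hours)] for name, hours in ALLOWED.items()}
        for i in range(n)
    ]


def refine(plan: Plan, problems: List[str]) -> Plan:
    """Stand-in for the LLM critique + refine step: fix the first problem."""
    name = problems[0].split(":")[0]
    taken = {hour for other, hour in plan.items() if other != name}
    free = [hour for hour in ALLOWED[name] if hour not in taken]
    fixed = dict(plan)
    fixed[name] = free[0] if free else ALLOWED[name][0]
    return fixed


def evolve(
    population_size: int = 2, keep: int = 1, max_generations: int = 5
) -> Tuple[Optional[Plan], int]:
    """Evaluate, select, and refine until a plan passes the checker."""
    population = generate(population_size)
    for generation in range(max_generations):
        # Step 2: evaluate every candidate; fewer violations ranks higher.
        scored = sorted(
            ((check(p), p) for p in population), key=lambda item: len(item[0])
        )
        problems, best = scored[0]
        if not problems:
            return best, generation  # criteria met, stop
        # Discard the weakest candidates, keep the rest, and add refined copies.
        survivors = scored[:keep]
        population = [refine(p, probs) for probs, p in survivors]
        population += [p for _, p in survivors]
    return None, max_generations
```

Calling `evolve()` returns a plan together with the number of refinement rounds it took, or `None` if no plan passed within `max_generations`. With a real LLM, each call to `generate` and `refine` would be one or more API requests per candidate per generation, which is where the latency concern raised in the post comes from.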

liked a Space 4 days ago

hf-audio/open_asr_leaderboard

Organizations

Collections 4

models 2

theainerd/Wav2Vec2-large-xlsr-hindi

Automatic Speech Recognition • Updated May 31, 2023 • 2.12M • 5

theainerd/wav2vec2-large-xlsr-53-odia

Automatic Speech Recognition • Updated Mar 24, 2021 • 1.63k • 3

datasets

None public yet