Spaces:

open-llm-leaderboard
/

open_llm_leaderboard

Running on CPU Upgrade

App Files Files Community

1098

Pending queue is currently on hold

#346

by lvkaokao - opened Oct 30, 2023

Discussion

lvkaokao

Oct 30, 2023

Hi, I had submitted model a few days ago. But it hasn't been evaluated now.

clefourrier

Open LLM Leaderboard org Oct 30, 2023

Hi @lvkaokao !
We are currently doing a very big upgrade of the leaderboard so all evaluations are halted till the end of the month. @SaylorTwift did a small announcement about this on twitter a week ago.
Thank you very much for your patience, we're going as fast as we can! 🤗

lvkaokao

Oct 30, 2023

•

edited Oct 30, 2023

hi, @clefourrier

Thanks for your reply!

clefourrier

Open LLM Leaderboard org Oct 30, 2023

(Let's not close this issue yet so that other people wondering the same thing get the information too if that works for you :) )

clefourrier changed discussion title from why does the pending queue take long time? to Pending queue is currently on hold Oct 30, 2023

lunaalice

Nov 1, 2023

(Let's not close this issue yet so that other people wondering the same thing get the information too if that works for you :) )

hhh

Dampfinchen

Nov 2, 2023

•

edited Nov 2, 2023

Any updates? It was supposed to resume at the end of october, but we have november now.

Personally I would prefer a stable and reliable leaderboard over some updates. This is the best and only way to evaluate models for people who don't have the hardware power themselves.

clefourrier

Open LLM Leaderboard org Nov 2, 2023

Hi @Dampfinchen !
We are in the last stages of the update, and expect to do an announcement next week.

Side note, but the update should be increasing the leaderboard's reliability quite a bit :)

pszemraj

Nov 3, 2023

We are in the last stages of the update, and expect to do an announcement next week.

Thanks 🙏 I just wanted to chime in and say it would be great if status updates could be here or somewhere else on this space instead of Twitter or other even more soul-destroying social media (LinkedIn … 🤢)

ehartford

Nov 4, 2023

thank you for your work

Olofp

Nov 6, 2023

Yes, agree, you are doing excellent work.
I saw the queue of models and there’s like 200.
Any chance we could bump the testing of
DeepSeeks Code 6.7B and 34Bmodels and ofcourse Openchat 3.5 7B which I feel( but could of course be wrong) might be big in terms of results..I realize it is not completely fair to all who are patiently waiting in line. BUt that is what I am looking out for at moment.
Again thanks for great work on this leaderboard!!! 🙏

ehartford

Nov 6, 2023

Yes, agree, you are doing excellent work.
I saw the queue of models and there’s like 200.
Any chance we could bump the testing of
DeepSeeks Code 6.7B and 34Bmodels and ofcourse Openchat 3.5 7B which I feel( but could of course be wrong) might be big in terms of results..I realize it is not completely fair to all who are patiently waiting in line. BUt that is what I am looking out for at moment.
Again thanks for great work on this leaderboard!!! 🙏

lol you and the rest of us buddy!

Olofp

Nov 8, 2023

Hi @clefourrier ,
Noticed the queue of models seems to be stuck. Are you actively adding new models to leaderboard?
How many can you actually handle / day? Are there any resources that would help from community.
I’d like to ask/ suggest again if you might bump deepseek coder, openchat 3.5 and maybe..? Yi? in queue, as I think many are eager to see results.

Thank you for providing an excellent resource! 👍

clefourrier

Open LLM Leaderboard org Nov 8, 2023

Hi @Olofp ,
Thank you for your concerns! The Yi base models (7B and 34B) are already in the leaderboard :)
We are starting the queue again (slowly because our cluster is quite full), as our update is taking a couple of days more than we thought and we don't want to keep you folks waiting!

But stay tuned for something very cool tomorrow if all goes well

clefourrier

Open LLM Leaderboard org Nov 9, 2023

•

edited Nov 9, 2023

@lvkaokao @lunaalice @Dampfinchen @pszemraj @ehartford @Olofp

Upgrade is finally out, and the queue restarted! Find out more here 🔥

Thank you all very much for your patience! 🤗

clefourrier changed discussion status to closed Nov 9, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment