Adding Evaluation Results
#6 opened 12 months ago by leaderboard-pr-bot
The model produces nonsense
9
#4 opened about 1 year ago by Pkoosha
The model seems not to have general ability
6
#3 opened about 1 year ago by yuansiwe
Evaluation of long sequence of conversation
5
#1 opened about 1 year ago by cooee-ashutosh