Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

GGUF Quants with iMatrix for https://huggingface.co/brucethemoose/Yi-34B-200K-DARE-megamerge-v8

iMatrix made with 2500 batches of 32 tokens made on wiki.train.raw

Benchs made with LlamaCPP :

  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,Hellaswag,84.5,,400,2024-01-26 00:00:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,Hellaswag_Bin,79,,400,2024-01-26 00:00:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,Arc-Challenge,57.52508361,,299,2024-01-26 05:40:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,Arc-Easy,78.59649123,,570,2024-01-26 05:40:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,MMLU,40.89456869,,313,2024-01-26 05:40:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,Thruthful-QA,34.76132191,,817,2024-01-26 05:40:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,Winogrande,77.9795,,1267,2024-01-26 05:40:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,wikitext,5.0681,512,512,2024-01-26 00:00:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,wikitext,4.5052,2048,2048,2024-01-26 00:00:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,wikitext,4.3656,4096,4096,2024-01-26 00:00:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
  • Yi-34B-200K-DARE-megamerge-v8-b1952-iMat-c32_ch2500-Q4_K_M.gguf,-,wikitext,4.3190,8192,8192,2024-01-26 00:00:00,,34b,Yi,2000000,,,GGUF,Brucethemoose,Nexesenex,
Downloads last month
486
GGUF
Model size
34.4B params
Architecture
llama

2-bit

3-bit

4-bit

8-bit

Inference API
Unable to determine this model's library. Check the docs .