ENERGY-DRINK-LOVE
/

DataVortexS_dpov3

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Edit model card

ENERGY-DRINK-LOVE/DataVortexS_dpov3

Our Team

Youjin Chung
Jingyeom Kim

Model

Base Model

Edentns/DataVortexS-10.7B-dpo-v1.11

Hardware and Software

Hardware: A100 * 8 for training our model
Deepspeed library & Huggingface TRL Trainer

Dataset

DPO_dataset
- 자체 제작 dpo dataset(AI-hub dataset 활용)
- OpenOrca DPO 등 영어 데이터셋 번역(ENERGY-DRINK-LOVE/translate_share_gpt_dedup_llama_SFT_1024, 자체모델 활용)

Training Method

DPO

Benchmark

Ko LM Eval Harness

Ko-LLM-Leaderboard

(240316기준 7등)

Average	Ko-ARC	Ko-HellaSwag	Ko-MMLU	Ko-TruthfulQA	Ko-CommonGen V2
60.18	56.23	69.15	52.76	67.87	54.9

Downloads last month: 1,702

Safetensors

Model size

10.9B params

Tensor type

BF16

·

Inference Examples

Text Generation

This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for ENERGY-DRINK-LOVE/DataVortexS_dpov3

Base model

LDCC/LDCC-SOLAR-10.7B

Finetuned

Edentns/DataVortexS-10.7B-dpo-v1.11

Finetuned

(1)

this model

Evaluation results

Metadata error: specify a dataset to view leaderboard