Model Details

Model and Training Details

Preprocessing

  • preprocessed and packed the sft dataset with trl.trainer.ConstantLengthDataset

Results

image/png

Compute Infrastructure

The model is trained using 4 * RTX 3090 - 24GB

Model Card Authors

Yiyu (Michael) Ren

Model Card Contact

Email: [email protected]

Framework versions

  • PEFT 0.8.2
Downloads last month
23
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no pipeline_tag.

Model tree for renyiyu/llama-2-7b-sft-lora

Adapter
(1837)
this model