---
base_model: unsloth/llama-3-8b-bnb-4bit
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- llama
- trl
---
# Uploaded model
- Finetuned to generate pandas code from a dataframe and a user query.
- About 100 datasets were taken from Kaggle: https://www.kaggle.com/datasets?search=Tabular+data
- These datasets were used to generate 390 pairs of data queries and pandas code answers with llama3-70b: https://www.kaggle.com/code/samsonleegh/sampling-data-qns-and-pandas-ans-from-dataset
- llama3-8b-4bit was finetuned with LoRA 16 adapters on 350 query-answer pairs: https://colab.research.google.com/drive/1ZFBZGAK-fNcgt6RYSp0npnDWOeBAXEgf?usp=sharing
- ROUGE scores of the original and finetuned models were compared on 40 query-answer pairs.
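
The 350 training pairs and 40 evaluation pairs add up to the 390 generated pairs, which suggests a simple holdout split. The sketch below shows one way such a split could be made. The helper `split_pairs`, the seed, and the placeholder records are hypothetical; the linked notebooks may split the data differently.

```python
import random


def split_pairs(pairs, n_eval=40, seed=0):
    """Shuffle query/answer pairs and hold out n_eval of them for evaluation."""
    if n_eval > len(pairs):
        raise ValueError("n_eval exceeds the number of pairs")
    shuffled = list(pairs)  # copy so the caller's list is left untouched
    random.Random(seed).shuffle(shuffled)
    return shuffled[n_eval:], shuffled[:n_eval]


# Placeholder records standing in for the 390 generated pairs (not real data).
pairs = [{"query": f"query {i}", "answer": f"df.head({i})"} for i in range(390)]

train_pairs, eval_pairs = split_pairs(pairs)
print(len(train_pairs), len(eval_pairs))  # 350 40
```
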
## ROUGE Score Comparison
| Metric | llama3-8b | llama3-8b finetuned |
|------------|----------------|----------------|
| ROUGE-1 | 0.4415 | 0.6585 |
| ROUGE-2 | 0.2480 | 0.4810 |
| ROUGE-L | 0.3155 | 0.5552 |
| ROUGE-Lsum | 0.3013 | 0.5570 |
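
For readers unfamiliar with the metrics in the table, the following is a minimal pure-Python sketch of ROUGE-N and ROUGE-L F1 scores. It assumes whitespace tokenization and no stemming, so it will not reproduce the figures above, which were presumably computed with a standard ROUGE package. The function names and the example strings are illustrative only.

```python
from collections import Counter


def _ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def _f1(overlap, n_pred, n_ref):
    """F1 from an overlap count and the prediction/reference sizes."""
    if overlap == 0 or n_pred == 0 or n_ref == 0:
        return 0.0
    precision = overlap / n_pred
    recall = overlap / n_ref
    return 2 * precision * recall / (precision + recall)


def rouge_n(prediction, reference, n=1):
    """ROUGE-N F1 based on clipped n-gram overlap."""
    pred = _ngrams(prediction.split(), n)
    ref = _ngrams(reference.split(), n)
    overlap = sum((pred & ref).values())  # & keeps the minimum count per n-gram
    return _f1(overlap, sum(pred.values()), sum(ref.values()))


def _lcs_len(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(prediction, reference):
    """ROUGE-L F1 based on the longest common subsequence."""
    pred, ref = prediction.split(), reference.split()
    return _f1(_lcs_len(pred, ref), len(pred), len(ref))


# Illustrative strings only; these are not taken from the evaluation set.
reference = "df.groupby('city')['sales'].sum().sort_values(ascending=False)"
prediction = "df.groupby('city')['sales'].sum()"
print(rouge_n(prediction, reference, n=1), rouge_l(prediction, reference))
```

Because pandas expressions often contain no spaces, whitespace tokenization treats a whole method chain as a single token; a tokenizer that splits on punctuation would give finer-grained scores.
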
- **Developed by:** samsonleegh
- **License:** apache-2.0
- **Finetuned from model:** unsloth/llama-3-8b-bnb-4bit

This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)