samsonleegh/lora_pandas_model


	---
	base_model: unsloth/llama-3-8b-bnb-4bit
	language:
	- en
	license: apache-2.0
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- llama
	- trl
	---

	# Uploaded model
	- Fine-tuned to generate pandas code from a dataframe and a user query.
	- About 100 tabular datasets were taken from Kaggle: https://www.kaggle.com/datasets?search=Tabular+data
	- These datasets were used to generate 390 pairs of data queries and pandas code answers with llama3-70b: https://www.kaggle.com/code/samsonleegh/sampling-data-qns-and-pandas-ans-from-dataset
	- llama3-8b-4bit was fine-tuned with LoRA 16 adapters on 350 query-answer pairs: https://colab.research.google.com/drive/1ZFBZGAK-fNcgt6RYSp0npnDWOeBAXEgf?usp=sharing
	- ROUGE scores of the original and fine-tuned models were compared on 40 query-answer pairs.
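
The data preparation described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the prompt template, the helper names `build_prompt` and `split_pairs`, and the placeholder pairs are all hypothetical, and treating the 40 evaluation pairs as a hold-out from the 390 generated pairs is an assumption (350 + 40 = 390 suggests it, but the card does not say so).

```python
import random


def build_prompt(columns, sample_rows, query):
    """Format a dataframe preview and a user query into one prompt string.

    Hypothetical template: the card does not state the real prompt format.
    """
    header = ", ".join(columns)
    lines = [", ".join(str(row[c]) for c in columns) for row in sample_rows]
    preview = "\n".join([header] + lines)
    return (
        "### Dataframe preview:\n"
        f"{preview}\n\n"
        "### Query:\n"
        f"{query}\n\n"
        "### Pandas code:\n"
    )


def split_pairs(pairs, n_train=350, seed=0):
    """Shuffle query/answer pairs, then split into train and eval lists."""
    shuffled = list(pairs)
    random.Random(seed).shuffle(shuffled)
    return shuffled[:n_train], shuffled[n_train:]


# Placeholder data standing in for the 390 generated pairs (not real data).
pairs = [{"query": f"query {i}", "answer": f"df.head({i})"} for i in range(390)]
train_pairs, eval_pairs = split_pairs(pairs)

prompt = build_prompt(
    columns=["city", "sales"],
    sample_rows=[{"city": "Oslo", "sales": 120}, {"city": "Lima", "sales": 95}],
    query="What is the total sales per city?",
)
print(len(train_pairs), len(eval_pairs))
print(prompt)
```

With a real dataframe `df`, the same helper could be fed `list(df.columns)` and `df.head().to_dict("records")`.
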
	## ROUGE Score Comparison
	| Metric | llama3-8b | llama3-8b finetuned |
	|------------|----------------|----------------|
	| ROUGE-1 | 0.4415 | 0.6585 |
	| ROUGE-2 | 0.2480 | 0.4810 |
	| ROUGE-L | 0.3155 | 0.5552 |
	| ROUGE-Lsum | 0.3013 | 0.5570 |
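
For readers unfamiliar with the metrics in the table, here is a simplified pure-Python sketch of how ROUGE-1, ROUGE-2, and ROUGE-L F1 scores can be computed for one prediction and one reference. It is an illustration only: it tokenizes on whitespace, whereas standard ROUGE packages normalize and tokenize text differently, so it will not reproduce the numbers above. The card does not say which ROUGE implementation was used, ROUGE-Lsum (which works on newline-separated segments) is not covered, and the example strings are made up.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def f1(overlap, pred_total, ref_total):
    """F1 from an overlap count and the prediction/reference totals."""
    if overlap == 0 or pred_total == 0 or ref_total == 0:
        return 0.0
    precision = overlap / pred_total
    recall = overlap / ref_total
    return 2 * precision * recall / (precision + recall)


def rouge_n(prediction, reference, n):
    """ROUGE-N F1: clipped n-gram overlap between prediction and reference."""
    pred = ngrams(prediction.split(), n)
    ref = ngrams(reference.split(), n)
    overlap = sum((pred & ref).values())  # & keeps the minimum count per n-gram
    return f1(overlap, sum(pred.values()), sum(ref.values()))


def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            curr.append(prev[j - 1] + 1 if x == y else max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(prediction, reference):
    """ROUGE-L F1 based on the longest common subsequence of tokens."""
    pred, ref = prediction.split(), reference.split()
    return f1(lcs_length(pred, ref), len(pred), len(ref))


# Made-up example pair; not taken from the evaluation set.
reference = "result = df.groupby('city')['sales'].sum()"
prediction = "result = df.groupby('city')['sales'].mean()"

print(rouge_n(prediction, reference, 1))
print(rouge_n(prediction, reference, 2))
print(rouge_l(prediction, reference))
```

Because these scores measure token overlap with a reference answer, a higher score means the generated code looks more like the reference, not necessarily that it runs or returns the correct result.
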
	- Developed by: samsonleegh
	- License: apache-2.0
	- Finetuned from model: unsloth/llama-3-8b-bnb-4bit

	This Llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)