Model Card for Model ID

meditron-7b-chat is a finetuned version of epfl-llm/meditron-7b using SFT Training on the Alpaca Dataset. This model can answer information about different excplicit ideas in medicine (see epfl-llm/meditron-7b for more info)

Model Description

Prompt Template

### Instruction:

<prompt> (without the <>)

### Response:

How to Get Started with the Model

Use the code sample provided in the original post to interact with the model.

from transformers import AutoTokenizer,AutoModelForCausalLM
 
model_id = "malhajar/meditron-7b-chat"
model = AutoModelForCausalLM.from_pretrained(model_name_or_path,
                                             device_map="auto",
                                             torch_dtype=torch.float16,
                                             revision="main")

tokenizer = AutoTokenizer.from_pretrained(model_id)

question: "what is tract infection?"
# For generating a response
prompt = '''
### Instruction:
{question} 

### Response:'''
input_ids = tokenizer(prompt, return_tensors="pt").input_ids
output = model.generate(inputs=input_ids,max_new_tokens=512,pad_token_id=tokenizer.eos_token_id,top_k=50, do_sample=True,
        top_p=0.95)
response = tokenizer.decode(output[0])

print(response)

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 49.59
AI2 Reasoning Challenge (25-Shot) 50.77
HellaSwag (10-Shot) 75.37
MMLU (5-Shot) 40.49
TruthfulQA (0-shot) 48.56
Winogrande (5-shot) 73.16
GSM8k (5-shot) 9.17
Downloads last month
116
Safetensors
Model size
6.74B params
Tensor type
BF16
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for malhajar/meditron-7b-chat

Finetuned
(28)
this model
Adapters
1 model
Quantizations
5 models

Dataset used to train malhajar/meditron-7b-chat

Collection including malhajar/meditron-7b-chat

Evaluation results