Edit model card

QuantFactory Banner

QuantFactory/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor-GGUF

This is quantized version of kimhyeongjun/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor created using llama.cpp

Original Model Card

kimhyeongjun/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor

This is my personal toy project for Chuseok(Korean Thanksgiving Day).

This model is a fine-tuned version of NousResearch/Hermes-3-Llama-3.1-8B on the Korean_synthetic_financial_dataset_21K.

Model description

Everything happened automatically without any user intervention.

Based on finance PDF data collected directly from the web, we refined the raw data using the 'meta-llama/Meta-Llama-3.1-70B-Instruct-FP8' model. After generating synthetic data based on the cleaned data, we further evaluated the quality of the generated data using the 'meta-llama/Llama-Guard-3-8B' and 'RLHFlow/ArmoRM-Llama3-8B-v0.1' models. We then used 'Alibaba-NLP/gte-large-en-v1.5' to extract embeddings and applied Faiss to perform Jaccard distance-based nearest neighbor analysis to construct the final dataset of 21k, which is diverse and sophisticated.

๋ชจ๋“  ๊ณผ์ •์€ ์‚ฌ์šฉ์ž์˜ ๊ฐœ์ž… ์—†์ด ์ž๋™์œผ๋กœ ์ง„ํ–‰๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

์›น์—์„œ ์ง์ ‘ ์ˆ˜์ง‘ํ•œ ๊ธˆ์œต ๊ด€๋ จ PDF ๋ฐ์ดํ„ฐ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ, ๋ˆ์ด ์—†์–ด์„œ 'meta-llama/Meta-Llama-3.1-70B-Instruct-FP8' ๋ชจ๋ธ์„ ํ™œ์šฉํ•˜์—ฌ Raw ๋ฐ์ดํ„ฐ๋ฅผ ์ •์ œํ•˜์˜€์Šต๋‹ˆ๋‹ค. ์ •์ œ๋œ ๋ฐ์ดํ„ฐ๋ฅผ ๋ฐ”ํƒ•์œผ๋กœ ํ•ฉ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ์ƒ์„ฑํ•œ ํ›„, 'meta-llama/Llama-Guard-3-8B' ๋ฐ 'RLHFlow/ArmoRM-Llama3-8B-v0.1' ๋ชจ๋ธ์„ ํ†ตํ•ด ์ƒ์„ฑ๋œ ๋ฐ์ดํ„ฐ์˜ ํ’ˆ์งˆ์„ ์‹ฌ์ธต์ ์œผ๋กœ ํ‰๊ฐ€ํ•˜์˜€์Šต๋‹ˆ๋‹ค. ์ด์–ด์„œ 'Alibaba-NLP/gte-large-en-v1.5'๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ž„๋ฒ ๋”ฉ์„ ์ถ”์ถœํ•˜๊ณ , Faiss๋ฅผ ์ ์šฉํ•˜์—ฌ ์ž์นด๋“œ ๊ฑฐ๋ฆฌ ๊ธฐ๋ฐ˜์˜ ๊ทผ์ ‘ ์ด์›ƒ ๋ถ„์„์„ ์ˆ˜ํ–‰ํ•จ์œผ๋กœ์จ ๋‹ค์–‘ํ•˜๊ณ  ์ •๊ตํ•œ ์ตœ์ข… ๋ฐ์ดํ„ฐ์…‹ 21k์„ ์ง์ ‘ ๊ตฌ์„ฑํ•˜์˜€์Šต๋‹ˆ๋‹ค.

Task duration

3days (20240914~20240916)

evaluation

Nothing (I had to take the Thanksgiving holiday off.)

sample

image/png

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu121
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
233
GGUF
Model size
8.03B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this modelโ€™s pipeline type. Check the docs .

Model tree for QuantFactory/Hermes-3-Llama-3.1-8B-Kor-Finance-Advisor-GGUF

Quantized
(37)
this model