tenyx/TenyxChat-7B-v1


sarath-shekkizhar committed on Jan 6, 2024

Commit 268e444 · 1 Parent(s): 75d20ad

adding model card

Files changed (1)

README.md +8 -3


@@ -1,8 +1,13 @@
 ---
-license: {apache-2.0}
-base_model: {openchat/openchat_3.5}
+license: apache-2.0
+language:
+- en
+library_name: transformers
+tags:
+- tenyx-fine-tuning
+- dpo
+- tenyxchat
 ---
 # TenyxChat: Language Model Alignment using Tenyx Fine-tuning
 Introducing TenyxChat, a series of ChatGPT-like models trained to function as useful assistants through preference tuning, using Tenyx's recently released advanced fine-tuning technology ([VentureBeat article](https://venturebeat.com/ai/tenyx-aims-to-fix-llms-catastrophic-forgetting-problem/)). Our first chat model in the series, TenyxChat-7B-v1, is trained using the [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290) framework on the open-source AI feedback dataset [UltraFeedback](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized).
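
Reviewer note: the README text above names DPO as the training framework but does not show the objective. As a rough illustration only, the per-pair DPO loss from the linked paper can be sketched as below. This is not Tenyx's training code; the function names, the `beta=0.1` default, and the log-probability values are assumptions chosen for the example.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value; the source does not state the beta used
) -> float:
    """DPO loss for one (chosen, rejected) preference pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    """
    # How much more (or less) likely each response is under the policy
    # than under the reference model.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # The loss shrinks as the policy favors the chosen response more
    # strongly than the reference does.
    margin = beta * (chosen_logratio - rejected_logratio)
    return -log_sigmoid(margin)


if __name__ == "__main__":
    # Hypothetical log-probabilities, not taken from any real model.
    print(dpo_loss(-10.0, -12.0, -10.0, -12.0))  # policy equals reference
    print(dpo_loss(-9.0, -13.0, -10.0, -12.0))   # policy prefers the chosen response
```

When the policy matches the reference model the margin is zero and the loss equals log 2; it falls below that as the policy shifts probability toward the preferred response.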