Thanos-1B Model Card

🧠 Multifaceted Skill-of-Mind | πŸ€– Thanos-3B | πŸ€– Thanos-8B | πŸ’» Github | πŸ“„ Arxiv | πŸ“• PDF

🚨 Disclaimer: All models and dataset are intended to be used for research purposes only.

Model Description

  • Repository: Code
  • Paper: Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model
  • Point of Contact: Young-Jun Lee

Model Details

  • Model: Thanos-series is a fully open-source, skill-of-mind-infused LLM designed to help general conversational agents respond in a more human-like way.
  • Date: Thanos-series was trained in 2024.
  • Training Dataset: 100K Multifaceted Skill-of-Mind
  • Architecture: Thanos-1B was trained on top of LLaMA-3.2-1B.

How to Use

License and Recommendations

🚨 Thanos-1B is intended to be used for research purposes only.

Acknowledgement

This work was supported by a grant of the KAIST-KT joint research project through AI Tech Lab, Institute of convergence Technology, funded by KT [Project No. G01230605, Development of Task-oriented Persona-based Dialogue Generation Combining Multi-modal Interaction and Knowledge Modeling].

Citation

If you find the resources in this repository useful, please cite our work:

@misc{lee2024thanosenhancingconversationalagents,
      title={Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model}, 
      author={Young-Jun Lee and Dokyong Lee and Junyoung Youn and Kyeongjin Oh and Ho-Jin Choi},
      year={2024},
      eprint={2411.04496},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2411.04496}, 
}
Downloads last month
147
Safetensors
Model size
1.24B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for passing2961/Thanos-1B

Finetuned
(287)
this model
Finetunes
1 model
Merges
4 models
Quantizations
2 models

Dataset used to train passing2961/Thanos-1B

Collection including passing2961/Thanos-1B