THUdyh commited on
Commit
7f5b8f1
1 Parent(s): e0a240f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -3
README.md CHANGED
@@ -1,3 +1,34 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - THUdyh/Oryx-Image-Data
5
+ base_model:
6
+ - Qwen/Qwen2-7B-Instruct
7
+ pipeline_tag: text-generation
8
+ ---
9
+ # Oryx-7B
10
+
11
+ ## Model Summary
12
+
13
+ The Oryx-Image models are 7/34B parameter models trained on [Oryx-Image-Data](https://huggingface.co/datasets/THUdyh/Oryx-Image-Data), based on Qwen2 language model with a context window of 32K tokens.
14
+
15
+ Oryx offers an on-demand solution to seamlessly and efficiently process visual inputs with arbitrary spatial sizes and temporal lengths.
16
+
17
+ - **Repository:** https://github.com/liuzuyan/oryx
18
+ - **Languages:** English, Chinese
19
+ - **Paper:** Coming Soon
20
+
21
+
22
+ ### Model Architecture
23
+
24
+ - **Architecture:** Pre-trained [Oryx-ViT](https://huggingface.co/THUdyh/Oryx-ViT) + Qwen2-7B
25
+ - **Data:** a mixture of 4M image data
26
+ - **Precision:** BFloat16
27
+
28
+ #### Hardware & Software
29
+
30
+ - **Hardware:** 64 * NVIDIA Tesla A100
31
+ - **Orchestration:** HuggingFace Trainer
32
+ - **Code:** Pytorch
33
+
34
+ ## Citation