h2oai
/

h2ogpt-4096-llama2-70b-chat

Text Generation

text-generation-inference

Model card Files Files and versions Community

arnocandel commited on Aug 11, 2023

Commit

c1575b4

·

1 Parent(s): 67f60b6

Update README.md

Files changed (1) hide show

README.md +33 -1

README.md CHANGED Viewed

@@ -20,4 +20,36 @@ Try it live on our [h2oGPT demo](https://gpt.h2o.ai) with side-by-side LLM compa
 See how it compares to other models on our [LLM Leaderboard](https://evalgpt.ai/)!
-See more at [H2O.ai](https://h2o.ai/)

 See how it compares to other models on our [LLM Leaderboard](https://evalgpt.ai/)!
+See more at [H2O.ai](https://h2o.ai/)
+## Model Architecture
+```
+LlamaForCausalLM(
+  (model): LlamaModel(
+    (embed_tokens): Embedding(32000, 8192, padding_idx=0)
+    (layers): ModuleList(
+      (0-79): 80 x LlamaDecoderLayer(
+        (self_attn): LlamaAttention(
+          (q_proj): Linear4bit(in_features=8192, out_features=8192, bias=False)
+          (k_proj): Linear4bit(in_features=8192, out_features=1024, bias=False)
+          (v_proj): Linear4bit(in_features=8192, out_features=1024, bias=False)
+          (o_proj): Linear4bit(in_features=8192, out_features=8192, bias=False)
+          (rotary_emb): LlamaRotaryEmbedding()
+        )
+        (mlp): LlamaMLP(
+          (gate_proj): Linear4bit(in_features=8192, out_features=28672, bias=False)
+          (up_proj): Linear4bit(in_features=8192, out_features=28672, bias=False)
+          (down_proj): Linear4bit(in_features=28672, out_features=8192, bias=False)
+          (act_fn): SiLUActivation()
+        )
+        (input_layernorm): LlamaRMSNorm()
+        (post_attention_layernorm): LlamaRMSNorm()
+      )
+    )
+    (norm): LlamaRMSNorm()
+  )
+  (lm_head): Linear(in_features=8192, out_features=32000, bias=False)
+)
+```