Hugging Face
fromthesky/PLDR-LLM-v51-104M
Text Generation
PyTorch
tiiuae/falcon-refinedweb
English
large-language-model
power-law-decoder-representations
power-law-graph-attention
pldr-llm
kv-cache
g-cache
kvg-cache
arxiv:2502.13502
arxiv:2306.01116
arxiv:2101.00027
License: apache-2.0
main
1 contributor
History: 3 commits
Latest commit: fromthesky, "Updated readme." (eff20f1), 1 day ago
.gitattributes | Safe | 1.52 kB | initial commit | 3 days ago
PLDRv51-104M-model-checkpoint.pth | pickle | 417 MB | LFS | initial commit. | 1 day ago
    Detected Pickle imports (5): "torch._utils._rebuild_tensor_v2", "collections.OrderedDict", "torch.Tensor", "torch._tensor._rebuild_from_type_v2", "torch.FloatStorage"
PLDRv51_104M_hyperparameters.py | Safe | 786 Bytes | initial commit. | 1 day ago
README.md | Safe | 3.41 kB | Updated readme. | 1 day ago
refinedweb-tokenizer-pldrllm-kvg-cache-paper.tar.gz | Safe | 616 kB | LFS | initial commit. | 1 day ago
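
The "Detected Pickle imports" entry for the checkpoint lists the globals that the pickle stream references. As a rough illustration of how such a list can be read out of a pickle without executing it, here is a minimal sketch built on the standard-library pickletools module. The function name pickle_globals and the sample payload are hypothetical and are not taken from this repository or from the Hub's scanner; the sketch only decodes the classic GLOBAL opcode.

```python
import collections
import pickle
import pickletools


def pickle_globals(data: bytes) -> list[str]:
    """Return the dotted names referenced by GLOBAL opcodes in a pickle stream.

    The stream is only parsed, never unpickled, so nothing is imported or run.
    Assumption: the stream uses the classic GLOBAL opcode (protocols 0-3).
    Protocol 4+ streams use STACK_GLOBAL, which this sketch does not decode.
    """
    found = []
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            dotted = f"{module}.{name}"
            if dotted not in found:
                found.append(dotted)
    return found


# Hypothetical stand-in for a checkpoint: a plain OrderedDict, pickled with protocol 2.
sample = pickle.dumps(collections.OrderedDict(weight=[1.0, 2.0]), protocol=2)
print(pickle_globals(sample))
```

To try this on a real checkpoint, note that zip-format files written by recent versions of torch.save typically keep the pickle stream in an archive member named data.pkl. Those bytes could be read with zipfile and passed to pickle_globals. Treat that layout as an assumption to verify against the actual file.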