Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
73.5
TFLOPS
13
1
37
Googlefan
googlefan
Follow
21world's profile picture
bradex's profile picture
Maaz66's profile picture
4 followers
·
0 following
https://googlefan.net/
advictrius85
googlefan256
AI & ML interests
None yet
Recent Activity
reacted
to
mitkox
's
post
with 👍
about 24 hours ago
llama.cpp is 26.8% faster than ollama. I have upgraded both, and using the same settings, I am running the same DeepSeek R1 Distill 1.5B on the same hardware. It's an Apples to Apples comparison. Total duration: llama.cpp 6.85 sec <- 26.8% faster ollama 8.69 sec Breakdown by phase: Model loading llama.cpp 241 ms <- 2x faster ollama 553 ms Prompt processing llama.cpp 416.04 tokens/s with an eval time 45.67 ms <- 10x faster ollama 42.17 tokens/s with an eval time of 498 ms Token generation llama.cpp 137.79 tokens/s with an eval time 6.62 sec <- 13% faster ollama 122.07 tokens/s with an eval time 7.64 sec llama.cpp is LLM inference in C/C++; ollama adds abstraction layers and marketing. Make sure you own your AI. AI in the cloud is not aligned with you; it's aligned with the company that owns it.
updated
a model
5 days ago
neody/r1-14b-awq
liked
a model
6 days ago
tencent/Hunyuan3D-2
View all activity
Organizations
models
4
Sort: Recently updated
googlefan/cycleqd-test-model
Updated
Dec 5, 2024
•
2
googlefan/sbv2_personal_models
Updated
Nov 25, 2024
•
1
googlefan/sbv2_onnx_models
Updated
Sep 23, 2024
•
1
googlefan/my_first_lm
Updated
Jun 18, 2024
•
1
datasets
6
Sort: Recently updated
googlefan/lami-voice
Viewer
•
Updated
Nov 22, 2024
•
424
•
39
googlefan/test-cc
Viewer
•
Updated
Nov 20, 2024
•
50.5k
•
13
googlefan/kusanagi-audio-tts
Viewer
•
Updated
Oct 3, 2024
•
532k
•
45
•
1
googlefan/kusanagi-audio
Viewer
•
Updated
Sep 19, 2024
•
653k
•
73
•
1
googlefan/guanaco-jp-audio
Viewer
•
Updated
Sep 17, 2024
•
16.4k
•
35
•
1
googlefan/sakura-audio
Viewer
•
Updated
Sep 13, 2024
•
500
•
33