Csaba Kecskemeti PRO
csabakecskemeti
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 hour ago
DevQuasar/allenai.Llama-3.1-Tulu-3-405B-GGUF
replied to
their
post
about 3 hours ago
Check out my idea:
LLmaaS - Local LLM as a Service
With LLmaaS, I propose leveraging locally running LLMs as a service, providing a standardized way for websites to access and utilize them for LLM-powered operations directly on the user’s device.
Demo, code, more detailed description.
https://devquasar.com/llmaas/
https://github.com/csabakecskemeti/LLmaaS
https://youtu.be/OOWGr8jcP5Q
Call for contributors
Join me a develop the LLmaaS proxy to make this a generic purpose tool to leverage local LLMs on web. Build in security measures.
I'm looking for help to make the proxy more generic support multiple local LLM services without any change on the HTML side.
Also looking for ideas how to make the HTML par more modular and easy to use.
updated
a model
about 3 hours ago
DevQuasar/mkurman.Qwen2.5-14B-DeepSeek-R1-1M-GGUF
Organizations
csabakecskemeti's activity
CUDA out of memory error during fp8 to bf16 model conversion + fix
1
#17 opened about 1 month ago
by
sszymczyk
Is this tested?
5
#1 opened about 2 months ago
by
csabakecskemeti
Generate on V100 questions
5
#10 opened about 1 month ago
by
csabakecskemeti
Is this a LORA adapter?
2
#1 opened about 1 month ago
by
csabakecskemeti
New activity in
DevQuasar/huihui-ai.Llama-3.3-70B-Instruct-abliterated-finetuned-GGUF
about 1 month ago
Checksum fails on /huihui-ai.Llama-3.3-70B-Instruct-abliterated-finetuned.Q4_K_M-00004-of-00004.gguf
2
#1 opened about 1 month ago
by
JoshGreifer
How the scores are calculated
3
#1028 opened 2 months ago
by
csabakecskemeti
Phi3 or Mistral?
2
#3 opened 2 months ago
by
csabakecskemeti
[bot] Conversion to Parquet
#1 opened 3 months ago
by
parquet-converter
I think the Q8_0 is corrupted.
2
#1 opened 3 months ago
by
remghoost
Having issues running this
2
#1 opened 3 months ago
by
csabakecskemeti
Weight size VS VRAM requirements
7
#8 opened 3 months ago
by
mindkrypted
Update README.md
#1 opened 5 months ago
by
bbqddt
update pad_token?
3
#1 opened 3 months ago
by
gatorand
Possible non-GGUF release?
1
#1 opened 8 months ago
by
Azazelle