Post 497
How to deploy compressed ML models in your pipeline? We wrote a series of blog posts on these topics; hope they are helpful:
- Standard Model Compression in ML Pipeline: https://www.pruna.ai/blog/standard-model-compression-ml-pipeline
- Boost Your Replicate Models with Pruna AI: A Step-by-Step Guide: https://www.pruna.ai/blog/guide-replicate-pruna-ai
- Pruna + Triton: A Winning Combination for High-Performance AI Deployments: https://www.pruna.ai/blog/pruna-triton-combination
Feel free to join our Discord (https://discord.com/invite/rskEr4BZJx) if you have questions ;)
Post 1882
We compressed SmolLMs into 135 variants (see https://huggingface.co/PrunaAI?search_models=smolLM) with different quantization configurations using pruna (https://docs.pruna.ai/en/latest/). We wrote a blog post summarizing our findings (see https://www.pruna.ai/blog/smollm2-smaller-faster): small LMs can be made even smaller! :)
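For readers unfamiliar with what a quantization configuration actually controls, here is a minimal sketch of symmetric int8 weight quantization in plain Python. This illustrates only the general idea behind such configurations (bit width and scaling); it is not pruna's implementation, which supports several quantizers and fuses the math into optimized kernels — see the docs linked above for the real API.

```python
# Illustrative sketch only (NOT pruna's code): symmetric int8 weight
# quantization. Floats are mapped to integers in [-127, 127] plus one
# shared scale factor, shrinking storage roughly 4x vs float32.

def quantize_int8(weights):
    """Map a list of floats to int8 values plus a dequantization scale."""
    max_abs = max(abs(w) for w in weights) or 1.0
    scale = max_abs / 127.0  # one quantization step in float units
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize_int8(q, scale):
    """Recover approximate float weights from the int8 values."""
    return [v * scale for v in q]

weights = [0.42, -1.27, 0.0, 0.635]
q, scale = quantize_int8(weights)
approx = dequantize_int8(q, scale)
# Rounding error is bounded by the size of one quantization step.
assert all(abs(a - w) <= scale for a, w in zip(approx, weights))
```

Fewer bits (int4, int2, ...) shrink the model further but widen the quantization step, which is exactly the size/quality trade-off the blog post measures across the 135 SmolLM variants.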