louisbrulenaudet/Romulus-cpt-Llama-3.1-8B-v0.1 Text Generation • Updated Sep 11, 2024 • 21 • 11
jpacifico/Chocolatine-3B-Instruct-DPO-Revised-Q4_K_M-GGUF Text Generation • Updated Aug 4, 2024 • 36 • 6
jpacifico/Chocolatine-3B-Instruct-DPO-Revised Text Generation • Updated Oct 15, 2024 • 1.15k • 27
I can't believe this... Phi-3.5-mini (3.8B) running in-browser at ~90 tokens/second on WebGPU w/ Transformers.js and ONNX Runtime Web! 🤯 Since everything runs 100% locally, no messages are sent to a server, a huge win for privacy!
- Demo: webml-community/phi-3.5-webgpu
- Source code: https://github.com/huggingface/transformers.js-examples/tree/main/phi-3.5-webgpu