Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon May 9, 2024 • 12
Shears: Unstructured Sparsity with Neural Low-rank Adapter Search Paper • 2404.10934 • Published Apr 16, 2024
A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities Paper • 2205.10358 • Published May 19, 2022
An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs Paper • 2306.16601 • Published Jun 28, 2023 • 4
Intel/distilbert-base-uncased-finetuned-sst-2-english-int8-static-inc Text Classification • Updated Jun 27, 2023 • 35 • 5