Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper • 2410.05355 • Published Oct 7 • 31
Post: Falcon Mamba is now available in llama.cpp! Check out the GGUF files uploaded here: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
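To try the GGUF weights from Python, one option is the llama-cpp-python bindings. This is a minimal sketch, assuming a binding version recent enough to include Falcon Mamba support; the repo id and quant filename below are illustrative guesses, so pick an actual GGUF file from the collection above.

```python
# Minimal sketch: run a Falcon Mamba GGUF via llama-cpp-python.
# Assumes: pip install llama-cpp-python huggingface_hub, and a llama.cpp
# build recent enough to support the Falcon Mamba architecture.
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="tiiuae/falcon-mamba-7b-Q4_K_M-GGUF",  # hypothetical repo id; check the collection for real GGUF repos
    filename="*Q4_K_M.gguf",                       # assumed quant filename pattern
    n_ctx=2048,
)

out = llm("The Falcon Mamba architecture is", max_tokens=64)
print(out["choices"][0]["text"])
```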
Post: FalconMamba 7B, a new model from TII (Technology Innovation Institute), is out!
- Blogpost: https://huggingface.co/blog/falconmamba
- Collection: tiiuae/falconmamba-7b-66b9a580324dd1598b0f6d4a
- Playground: tiiuae/falcon-mamba-playground
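For plain transformers usage, loading follows the standard AutoModel pattern. A minimal sketch, assuming a recent transformers release with FalconMamba support and the tiiuae/falcon-mamba-7b checkpoint id from the collection:

```python
# Minimal sketch: generate with FalconMamba via transformers.
# Assumes enough GPU memory for the 7B model in bfloat16.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tiiuae/falcon-mamba-7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("The attention-free Mamba architecture", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```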
Post: Check out quantized weights from ISTA-DAS Lab directly on their organisation page: https://huggingface.co/ISTA-DASLab, with official weights for AQLM (2-bit quantization) & QMoE (sub-1-bit MoE quantization). Read more about these techniques below:
- AQLM paper: Extreme Compression of Large Language Models via Additive Quantization (2401.06118)
- QMoE paper: QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models (2310.16795)
Some useful links:
- AQLM repo: https://github.com/Vahe1994/AQLM
- How to use AQLM with transformers: https://huggingface.co/docs/transformers/quantization#aqlm
- How to use AQLM with PEFT: https://huggingface.co/docs/peft/developer_guides/quantization#aqlm-quantizaion
Great work from @BlackSamorez and team!
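Since the post links the AQLM guides for both transformers and PEFT, here is a hedged sketch of attaching LoRA adapters on top of an AQLM-quantized checkpoint. The repo id ISTA-DASLab/Llama-2-7b-AQLM-2Bit-1x16-hf and the target_modules list are assumptions; check the organisation page for the exact model names.

```python
# Minimal sketch: LoRA fine-tuning setup on an AQLM-quantized model.
# Assumes: pip install aqlm[gpu] transformers peft accelerate, and that the
# repo id below exists (assumption -- verify on the ISTA-DASLab org page).
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "ISTA-DASLab/Llama-2-7b-AQLM-2Bit-1x16-hf",  # AQLM quantization config ships in the checkpoint
    torch_dtype="auto",
    device_map="auto",
)

# LoRA adapters train in higher precision on top of the frozen 2-bit weights.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],  # assumed projection names for a Llama-style model
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```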
Post: Try out Mixtral 2-bit on a free-tier Google Colab notebook right now! https://colab.research.google.com/drive/1-xZmBRXT5Fm3Ghn4Mwa2KRypORXb855X?usp=sharing
The AQLM method has recently been merged into the transformers main branch. The 2-bit model can be found here: BlackSamorez/Mixtral-8x7b-AQLM-2Bit-1x16-hf-test-dispatch, and you can read more about the method here: https://huggingface.co/docs/transformers/main/en/quantization#aqlm
Great work @BlackSamorez and team!
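Outside the Colab, the same 2-bit checkpoint can be run locally. A minimal sketch, assuming pip install aqlm[gpu] accelerate and a transformers build that includes the AQLM integration (main branch at the time of the post):

```python
# Minimal sketch: inference with the 2-bit AQLM Mixtral checkpoint.
# The quantization config is read from the checkpoint itself, so loading
# is the standard from_pretrained call.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "BlackSamorez/Mixtral-8x7b-AQLM-2Bit-1x16-hf-test-dispatch"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

inputs = tokenizer("2-bit quantization lets Mixtral", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=48)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

At roughly 2 bits per weight, the 8x7B mixture fits in a fraction of the memory of the fp16 model, which is what makes the free-tier Colab demo possible.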
Distributed Inference and Fine-tuning of Large Language Models Over The Internet Paper • 2312.08361 • Published Dec 13, 2023 • 25
Petals: Collaborative Inference and Fine-tuning of Large Models Paper • 2209.01188 • Published Sep 2, 2022 • 2
Do Pedestrians Pay Attention? Eye Contact Detection in the Wild Paper • 2112.04212 • Published Dec 8, 2021 • 1
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale Paper • 2208.07339 • Published Aug 15, 2022 • 4
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 27