Post
3661
Is GPT-4o everything you expected? ๐ค
@OpenAI has gone omni (GPT-4"o" ๐), a multimodal LLM, it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. ๐ค๐ธโ๏ธ
1๏ธโฃ Based on the examples seen:
Inputs possible are Text โ๏ธ, Text + Image ๐๐ผ๏ธ, Text + Audio ๐๐ง, Text + Video ๐๐ฅ, Audio ๐ง
and outputs possible are Image ๐ผ๏ธ, Image + Text ๐ผ๏ธ๐, Text ๐, Audio ๐ง
2๏ธโฃ 88.7% on MMLU ๐; 90.2% on HumanEval (best in class) ๐ฅ
3๏ธโฃ Up to 50% cheaper ๐ธ and 2x faster โก than GPT-4 Turbo
4๏ธโฃ GPT-4o will be available in the free tier of ChatGPT ๐
5๏ธโฃ Near real-time audio with 320ms on average, similar to human conversation ๐ฃ๏ธ**
6๏ธโฃ New tokenizer with a 200k token vocabulary ๐ (previously 100k vocabulary) leading to 1.1x - 4.4x fewer tokens needed across 20 languages ๐
7๏ธโฃ Tokenizer compression and more efficient across non-English languages (3-5 times fewer tokens for major Indian languages ๐ฎ๐ณ)
๐Open questions:
- What is the context length? โ
- Why does GPT-4 still exist, if GPT-4o is better, faster, and cheaper? ๐คจ
Blog: https://openai.com/index/hello-gpt-4o/ ๐
Available today:https://chatgpt.com/ ๐
I just wanted it to be cheaper, and more accessible! ๐
Still not open source, but a price reduction is welcome! ๐ฐ
Also, something fun happened, for the first 10-15 mins all search engines were correcting GPT-4o to GPT-4 ๐
Also, also, GPT-4o is the model which was powering the GPT2 chatbot in the LMsys arena (ELO 1310 vs. 1253 for GPT-4 Turbo) ๐
@OpenAI has gone omni (GPT-4"o" ๐), a multimodal LLM, it accepts as input any combination of text, audio, and image and generates any combination of text, audio, and image outputs. ๐ค๐ธโ๏ธ
1๏ธโฃ Based on the examples seen:
Inputs possible are Text โ๏ธ, Text + Image ๐๐ผ๏ธ, Text + Audio ๐๐ง, Text + Video ๐๐ฅ, Audio ๐ง
and outputs possible are Image ๐ผ๏ธ, Image + Text ๐ผ๏ธ๐, Text ๐, Audio ๐ง
2๏ธโฃ 88.7% on MMLU ๐; 90.2% on HumanEval (best in class) ๐ฅ
3๏ธโฃ Up to 50% cheaper ๐ธ and 2x faster โก than GPT-4 Turbo
4๏ธโฃ GPT-4o will be available in the free tier of ChatGPT ๐
5๏ธโฃ Near real-time audio with 320ms on average, similar to human conversation ๐ฃ๏ธ**
6๏ธโฃ New tokenizer with a 200k token vocabulary ๐ (previously 100k vocabulary) leading to 1.1x - 4.4x fewer tokens needed across 20 languages ๐
7๏ธโฃ Tokenizer compression and more efficient across non-English languages (3-5 times fewer tokens for major Indian languages ๐ฎ๐ณ)
๐Open questions:
- What is the context length? โ
- Why does GPT-4 still exist, if GPT-4o is better, faster, and cheaper? ๐คจ
Blog: https://openai.com/index/hello-gpt-4o/ ๐
Available today:https://chatgpt.com/ ๐
I just wanted it to be cheaper, and more accessible! ๐
Still not open source, but a price reduction is welcome! ๐ฐ
Also, something fun happened, for the first 10-15 mins all search engines were correcting GPT-4o to GPT-4 ๐
Also, also, GPT-4o is the model which was powering the GPT2 chatbot in the LMsys arena (ELO 1310 vs. 1253 for GPT-4 Turbo) ๐