Joseph [open/acc] Pollack's picture

Joseph [open/acc] Pollack

Tonic

AI & ML interests

๐Ÿค–Making robots to help people learn things quicker ๐Ÿ‘ฉ๐Ÿปโ€๐Ÿš€๐Ÿš€

Recent Activity

updated a Space about 19 hours ago
Tonic/Phi-4
updated a collection about 23 hours ago
cool models
liked a dataset about 23 hours ago
HumanLLMs/Human-Like-DPO-Dataset
View all activity

Articles

Organizations

MISATO-dataset's profile picture Masakhane NLP's profile picture LangChain Agents Hub's profile picture LangChain Chains Hub's profile picture BigScience Biomedical Datasets's profile picture LangChainDatasets's profile picture OpenVINO Toolkit's profile picture Gradio-Blocks-Party's profile picture DeepGHS's profile picture The introspector project's profile picture Pseudo Lab's profile picture LangChain Hub Prompts's profile picture The Waifu Research Department's profile picture Blog-explorers's profile picture Tonic AI's profile picture OpenLLM France's profile picture Multi๐Ÿค–Transformers's profile picture Qwen's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture SaprotHub's profile picture The Hydra Project's profile picture Copyleft Cultivars's profile picture Argilla Explorers's profile picture the collabage patch's profile picture Social Post Explorers's profile picture C4AI Community's profile picture AIffl : AI For French Language's profile picture M4-ai's profile picture takara.ai's profile picture Dev Mode Explorers's profile picture Quasar Research's profile picture Chinese LLMs on Hugging Face's profile picture Hugging Face for Legal's profile picture Hugging Face Discord Community's profile picture Seq-to-Pheno's profile picture Data Tonic (Alignment Lab)'s profile picture Nerdy Face's profile picture Intelligent Estate's profile picture open/ acc's profile picture

Tonic's activity

reacted to hexgrad's post with ๐Ÿ‘€ 2 days ago
view post
Post
5745
๐Ÿ“ฃ Looking for labeled, high-quality synthetic audio/TTS data ๐Ÿ“ฃ Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. โค๏ธ

More details at hexgrad/Kokoro-82M#21
ยท
posted an update 4 days ago
view post
Post
1519
microsoft just released Phi-4 , check it out here : Tonic/Phi-4

hope you like it :-)
reacted to anakin87's post with โค๏ธ about 1 month ago
view post
Post
1618
Tulu 3 SFT Mixture by AllenAI is a massive, good, multilingual dataset for fine-tuning Language Models.

Unfortunately, it was missing the "language" column.

I added it using the good old fastText.

Check out the dataset here ๐Ÿ‘‰ anakin87/tulu-3-sft-mixture-with-language

  • 1 reply
ยท
reacted to merve's post with ๐Ÿง ๐Ÿ˜Žโค๏ธ๐Ÿ‘€ about 2 months ago
view post
Post
3157
your hugging face profile now has your recent activities ๐Ÿค—
posted an update 2 months ago
view post
Post
3525
๐Ÿ™‹๐Ÿปโ€โ™‚๏ธhey there folks,

periodic reminder : if you are experiencing โš ๏ธ500 errors โš ๏ธ or โš ๏ธ abnormal spaces behavior on load or launch โš ๏ธ

we have a thread ๐Ÿ‘‰๐Ÿป https://discord.com/channels/879548962464493619/1295847667515129877

if you can record the problem and share it there , or on the forums in your own post , please dont be shy because i'm not sure but i do think it helps ๐Ÿค—๐Ÿค—๐Ÿค—
  • 2 replies
ยท
reacted to davidberenstein1957's post with ๐Ÿš€๐Ÿค—๐Ÿ‘€ 2 months ago
view post
Post
3094
Vector Search (most) datasets on the Hugging Face Hub ๐Ÿ”ฆ

Powered by: Polars, DuckDB, Gradio and model2vec (lightning-fast embeddings by Stรฉphan Tulkens).

Should work fast enough for datasets up to 100K.

davidberenstein1957/vectorsearch-hub-datasets
reacted to prithivMLmods's post with ๐Ÿ‘€โค๏ธ 2 months ago
view post
Post
5750
New Style, New Mix, New Drop ๐Ÿงค

๐ŸงจFlux LoRA DLC: prithivMLmods/FLUX-LoRA-DLC

๐ŸŽ†Glowing-Body: prithivMLmods/Glowing-Body-Flux-LoRA
๐ŸŽ†Electric-Blue: prithivMLmods/Electric-Blue-Flux-LoRA
๐ŸŽ†Intense-Red: prithivMLmods/Intense-Red-Flux-LoRA
๐ŸŽ†Clouds-Illusion: prithivMLmods/Clouds-Illusion-Flux-LoRA
๐ŸŽ†Digital-Yellow: prithivMLmods/Digital-Yellow-Flux-LoRA

๐ŸงจFlux LoRA Collection: prithivMLmods/flux-lora-collections-66dd5908be2206cfaa8519be

.
.
.
@prithivMLmods
reacted to m-ric's post with ๐Ÿ˜Žโค๏ธ๐Ÿš€๐Ÿง  2 months ago
view post
Post
2439
> Oasis: First Real-Time Video Game Without a Game Engine! ๐ŸŽฎ

DecartAI & Etched just released Oasis - a fully AI-generated video game running at 20 FPS (frames per second). The model takes keyboard inputs and generates everything - physics, rules, graphics - on the fly, without any game engine.

โšก๏ธ What makes this special? Current text-to-video models (Mochi-1, Sora, Kling) generate about 1 frame every 10-20 seconds (that's the kind of device I had to play LoL back in the day, thus my low rankings). Oasis is 200 times faster, making it the first playable AI-generated game.

โš™๏ธ Under the hood, it uses a vision transformer to encode space and a diffusion model to generate frames. The secret sauce is "dynamic noising" - a technique that keeps the video stable between frames.

Key insights:
โšก๏ธ Generates 20 FPS, vs 0.2 FPS for other DIT-based video models
โ€ฃ The specialized hardware Sohu developed by Etched allows to handle 10x more player than H100

๐ŸŽฎ Features real game mechanics
โ€ฃ Movement, jumping, item management
โ€ฃ Physics and lighting
โ€ฃ Procedurally generated worlds

โš ๏ธ Current limitations
โ€ฃ Blurry graphics at a distance
โ€ฃ Objects sometimes change appearance
โ€ฃ Memory issues in long sessions

Try it yourself, the playable demo is impressive! ๐Ÿ‘‰ https://oasis.decart.ai/welcome
Code ๐Ÿ‘‰ https://github.com/etched-ai/open-oasis
Read it in full ๐Ÿ‘‰ https://oasis-model.github.io/
reacted to davanstrien's post with ๐Ÿš€โค๏ธ 2 months ago
replied to nbroad's post 2 months ago