s3nh's picture

s3nh

s3nh

AI & ML interests

Quantization, LLMs, Deep Learning for good. Follow me if you like my work. Patreon.com/s3nh

Recent Activity

Organizations

ESPnet's profile picture Gradio-Blocks-Party's profile picture Lajonbot's profile picture The Waifu Research Department's profile picture AblateIt's profile picture Blog-explorers's profile picture BangumiBase's profile picture CyberHarem's profile picture HydraLM's profile picture GOAT.AI's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Social Post Explorers's profile picture Spinner-GPT-4's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Smol Community's profile picture

s3nh's activity

reacted to YannisTevissen's post with ๐Ÿ‘๐Ÿค— 10 days ago
reacted to sayakpaul's post with ๐Ÿ”ฅ about 1 month ago
view post
Post
4215
Commits speak louder than words ๐Ÿคช

* 4 new video models
* Multiple image models, including SANA & Flux Control
* New quantizers -> GGUF & TorchAO
* New training scripts

Enjoy this holiday-special Diffusers release ๐Ÿค—
Notes: https://github.com/huggingface/diffusers/releases/tag/v0.32.0
reacted to merve's post with ๐Ÿง  about 1 month ago
view post
Post
1792
A complete RAG pipeline includes a reranker, which ranks the documents to find the best document ๐Ÿ““
Same goes for multimodal RAG, multimodal rerankers which we can integrate to multimodal RAG pipelines!
Learn how to build a complete multimodal RAG pipeline with vidore/colqwen2-v1.0 as retriever, lightonai/MonoQwen2-VL-v0.1 as reranker, Qwen/Qwen2-VL-7B-Instruct as VLM in this notebook that runs on a GPU as small as L4 ๐Ÿ”ฅ https://huggingface.co/learn/cookbook/multimodal_rag_using_document_retrieval_and_reranker_and_vlms
  • 1 reply
ยท
reacted to fdaudens's post with ๐Ÿค— about 1 month ago
view post
Post
1308
๐Ÿค Want to share your AI models while protecting your work? Licenses are key!

Fascinating to see that nearly 60% of models on the Hub use Apache & MIT licenses.

Explore the viz here: huggingface/open-source-ai-year-in-review-2024
reacted to Lewdiculous's post with โž• about 1 month ago
view post
Post
4491
Hello fellow LLMers, just a quick notice that some of my activity will be moved into the AetherArchitectural Commuity and split with @Aetherarchio .

[here] https://huggingface.co/AetherArchitectural

All activity should be visible in the left side of my profile.
  • 1 reply
ยท
reacted to fdaudens's post with ๐Ÿ‘ about 1 month ago
view post
Post
1381
๐Ÿ” From instruction-following to creative storytelling, dive into 2024's most impactful AI datasets! These gems are shaping everything from scientific research to video understanding.

Check it out: huggingface/open-source-ai-year-in-review-2024
replied to louisbrulenaudet's post about 1 month ago
reacted to louisbrulenaudet's post with ๐Ÿค— about 1 month ago
view post
Post
1863
Iโ€™ve published a new dataset to simplify model merging ๐Ÿค—

This dataset facilitates the search for compatible architectures for model merging with @arcee_aiโ€™s mergekit, streamlining the automation of high-performance merge searches ๐Ÿ“–

Dataset : louisbrulenaudet/mergekit-configs
  • 1 reply
ยท
reacted to nyuuzyou's post with ๐Ÿ‘ about 1 month ago
view post
Post
1512
โœˆ๏ธ Aircraft Dataset & Generation Model nyuuzyou/aircraft-images & nyuuzyou/AircraftFLUX-LoRA

Dataset Features:
โ€ข 165,340 high-res aircraft images with metadata
โ€ข Machine-generated English captions
โ€ข Detailed aircraft specs, registration & flight info
โ€ข Environmental context descriptions

LoRA model specializes in:
โ€ข Realistic aircraft generation
โ€ข Accurate technical details for unpopular airplanes compared to black-forest-labs/FLUX.1-schnell
โ€ข Proper airline liveries
โ€ข Contextual aviation scenes
replied to danielhanchen's post about 1 month ago
reacted to danielhanchen's post with ๐Ÿค—๐Ÿ‘ about 1 month ago
reacted to stefan-it's post with โค๏ธ about 1 month ago
view post
Post
1363
My latest project is the outcome of the last 2+ years working with TPUs from the amazing TPU Research Cloud (TRC) program and training Encoder-only LMs with the TensorFlow Model Garden library.

๐Ÿ‘‰ Link: https://github.com/stefan-it/model-garden-lms

An overview of some features:

- Cheatsheet for setting-up a TPU VM Pod (with all necessary dependencies) to pretrain LMs with TF Model Garden
- Conversion scripts that convert TF Model Garden weights to Hugging Face Transformers-compatible models
- Supported architectures include BERT, BERT with Token Dropping and TEAMS

I also released BERT-based models pretrained on the great Hugging Face FineWeb and FineWeb-Edu datasets (10BT subset). With more to come!

๐Ÿ‘‰ Model Hub Link: https://huggingface.co/model-garden-lms

If you find these resources useful, please give them a like!

Made from Bavarian Oberland with โค๏ธ and ๐Ÿฅจ.
reacted to lucifertrj's post with ๐Ÿ‘€ about 1 month ago
view post
Post
518
Image Prompt Engineering Guide:
โžก๏ธ Artistic styling for Image generation
โžก๏ธ Prompt weighting using the parentheses method to generate realistic images.
โžก๏ธ Advanced features like style and positioning control[experimental].
โžก๏ธ Image placement on the generated AI image using Recraft V3 Mockup.

Watch: https://www.youtube.com/watch?v=d3nUG28-jIc
replied to AtAndDev's post about 1 month ago
reacted to davidberenstein1957's post with ๐Ÿ”ฅ about 1 month ago
replied to davidberenstein1957's post about 1 month ago
view reply

Looking great, cznnot wait to test, thank you ๐Ÿค—

reacted to davidberenstein1957's post with โค๏ธ๐Ÿ”ฅ about 1 month ago
view post
Post
4204
Introducing the Synthetic Data Generator, a user-friendly application that takes a no-code approach to creating custom datasets with Large Language Models (LLMs). The best part: A simple step-by-step process, making dataset creation a non-technical breeze, allowing anyone to create datasets and models in minutes and without any code.

Blog: https://huggingface.co/blog/synthetic-data-generator
Space: argilla/synthetic-data-generator
  • 4 replies
ยท