nicolo's picture

nicolo

nicolollo
ยท

AI & ML interests

None yet

Recent Activity

upvoted a collection about 17 hours ago
Qwen2.5-1M
upvoted a collection about 17 hours ago
Qwen2.5-VL
updated a model 1 day ago
nicolollo/test1-Q4_K_M-GGUF
View all activity

Organizations

Hugging Face Discord Community's profile picture

nicolollo's activity

reacted to merve's post with โค๏ธ๐Ÿš€๐Ÿ”ฅ 25 days ago
view post
Post
4828
supercharge your LLM apps with smolagents ๐Ÿ”ฅ

however cool your LLM is, without being agentic it can only go so far

enter smolagents: a new agent library by Hugging Face to make the LLM write code, do analysis and automate boring stuff!

Here's our blog for you to get started https://huggingface.co/blog/smolagents
reacted to lewtun's post with ๐Ÿ‘ about 1 month ago
view post
Post
6789
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute ๐Ÿ”ฅ

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

๐Ÿ“ˆ Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

๐ŸŽ„ Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

๐Ÿงญ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here's the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!
  • 2 replies
ยท