Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
48.7
TFLOPS
4
20
26
Nicolay Rusnachenko
nicolay-r
Follow
hooya26's profile picture
0-kodiya-0's profile picture
alvdansen's profile picture
82 followers
Β·
4 following
https://nicolay-r.github.io/
nicolayr_
nicolay-r
nicolay-r
AI & ML interests
Information Retrievalγ»Medical Multimodal NLP (πΌ+π) Research Fellow @BU_Researchγ»software developer http://arekit.ioγ»PhD in NLP
Recent Activity
posted
an
update
about 9 hours ago
π’ Qwen so far released the 2.5-MAX that claims to outperform DeepSeek-R1. And here is how you can start applying it for handling CSV / JSONL data. The model is compatible with OpenAI API so here is my wrapper for it: π https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/openai_156.py π All you have to do is to set base-url: https://dashscope-intl.aliyuncs.com/compatible-mode/v1 and API key of the platform. βοΈ Below is the link to the complete example (see screenshot): https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_qwen_25_max_chat.sh π° Source: https://www.alibabacloud.com/help/en/model-studio/developer-reference/what-is-qwen-llm πΊ Official Sandbox Demo: https://huggingface.co/spaces/Qwen/Qwen2.5-Max-Demo π Paper: https://arxiv.org/abs/2412.15115
reacted
to
singhsidhukuldeep
's
post
with π
about 12 hours ago
Exciting Research Alert: Revolutionizing Complex Information Retrieval! A groundbreaking paper from researchers at MIT, AWS AI, and UPenn introduces ARM (Alignment-Oriented LLM-based Retrieval Method), a novel approach to tackle complex information retrieval challenges. >> Key Innovations Information Alignment The method first decomposes queries into keywords and aligns them with available data using both BM25 and embedding similarity, ensuring comprehensive coverage of information needs. Structure Alignment ARM employs a sophisticated mixed-integer programming solver to identify connections between data objects, exploring relationships beyond simple semantic matching. Self-Verification The system includes a unique self-verification mechanism where the LLM evaluates and aggregates results from multiple retrieval paths, ensuring accuracy and completeness. >> Performance Highlights The results are impressive: - Outperforms standard RAG by up to 5.2 points in execution accuracy on Bird dataset - Achieves 19.3 points higher F1 scores compared to existing approaches on OTT-QA - Reduces the number of required LLM calls while maintaining superior retrieval quality >> Technical Implementation The system uses a three-step process: 1. N-gram indexing and embedding computation for all data objects 2. Constrained beam decoding for information alignment 3. Mixed-integer programming optimization for structure exploration This research represents a significant step forward in making complex information retrieval more efficient and accurate. The team's work demonstrates how combining traditional optimization techniques with modern LLM capabilities can solve challenging retrieval problems.
reacted
to
JingzeShi
's
post
with π€
about 14 hours ago
Welcome to the Doge Face Open Source Community! π Our goal is to explore the foundation of embodied intelligence for the next two years, which is indispensable β small language models. π¬ We aim to open-source code and documentation to give everyone more time to slack off while working or studying! π€ π Repository name on Github: https://github.com/SmallDoges/small-doge π Organization name on Hugging Face: https://huggingface.co/SmallDoge
View all activity
Organizations
None yet
nicolay-r
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a Space
about 15 hours ago
Running
366
π’
Qwen2.5 Max Demo
liked
a model
5 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
β’
Updated
3 days ago
β’
235k
β’
315
liked
a model
8 days ago
deepseek-ai/DeepSeek-R1
Text Generation
β’
Updated
3 days ago
β’
953k
β’
β’
6.29k
liked
a Space
20 days ago
Running
on
CPU Upgrade
327
π₯
Open Medical-LLM Leaderboard
liked
a model
20 days ago
johnsnowlabs/JSL-MedLlama-3-8B-v2.0
Text Generation
β’
Updated
Apr 30, 2024
β’
11.9k
β’
30
liked
a model
4 months ago
meta-llama/Llama-3.2-3B-Instruct
Text Generation
β’
Updated
Oct 24, 2024
β’
1.49M
β’
β’
950
liked
3 models
7 months ago
hyy-33/hyy33-WASSA-2024-Track-2
Updated
Jul 9, 2024
β’
2
google/gemma-2-9b-it
Text Generation
β’
Updated
Aug 27, 2024
β’
389k
β’
β’
642
google/gemma-2-27b-it
Text Generation
β’
Updated
Aug 27, 2024
β’
176k
β’
513
liked
4 models
8 months ago
Qwen/Qwen2-7B-Instruct
Text Generation
β’
Updated
Aug 21, 2024
β’
818k
β’
610
mistralai/Mistral-7B-Instruct-v0.3
Text Generation
β’
Updated
Aug 21, 2024
β’
1.7M
β’
β’
1.3k
microsoft/Phi-3-small-8k-instruct
Text Generation
β’
Updated
Aug 30, 2024
β’
24.8k
β’
160
microsoft/Phi-3-mini-4k-instruct
Text Generation
β’
Updated
Sep 20, 2024
β’
903k
β’
β’
1.13k
liked
2 models
9 months ago
xtuner/llava-phi-3-mini-hf
Image-to-Text
β’
Updated
Apr 25, 2024
β’
5.94k
β’
49
xtuner/llava-llama-3-8b-v1_1
Image-Text-to-Text
β’
Updated
Apr 28, 2024
β’
50
β’
120
liked
5 models
10 months ago
AIRI-Institute/OmniFusion
Updated
Apr 10, 2024
β’
56
google-bert/bert-base-uncased
Fill-Mask
β’
Updated
Feb 19, 2024
β’
80.7M
β’
2.09k
google/gemma-1.1-2b-it
Text Generation
β’
Updated
Jun 27, 2024
β’
90.3k
β’
154
google/gemma-2b-it
Text Generation
β’
Updated
Sep 27, 2024
β’
107k
β’
β’
703
google/gemma-7b-it
Text Generation
β’
Updated
Aug 14, 2024
β’
56.4k
β’
1.15k
Load more