Nicolay Rusnachenko

nicolay-r

https://nicolay-r.github.io/

AI & ML interests

Information Retrieval・Medical Multimodal NLP (🖼+📝) Research Fellow @BU_Research・software developer http://arekit.io・PhD in NLP

Recent Activity

posted an update about 9 hours ago

📢 Qwen so far released the 2.5-MAX that claims to outperform DeepSeek-R1. And here is how you can start applying it for handling CSV / JSONL data. The model is compatible with OpenAI API so here is my wrapper for it: 🌌 https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/openai_156.py 🚀 All you have to do is to set base-url: https://dashscope-intl.aliyuncs.com/compatible-mode/v1 and API key of the platform. ↗️ Below is the link to the complete example (see screenshot): https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/llm_qwen_25_max_chat.sh 📰 Source: https://www.alibabacloud.com/help/en/model-studio/developer-reference/what-is-qwen-llm 📺 Official Sandbox Demo: https://huggingface.co/spaces/Qwen/Qwen2.5-Max-Demo 📜 Paper: https://arxiv.org/abs/2412.15115

reacted to singhsidhukuldeep's post with 🚀 about 12 hours ago

Exciting Research Alert: Revolutionizing Complex Information Retrieval! A groundbreaking paper from researchers at MIT, AWS AI, and UPenn introduces ARM (Alignment-Oriented LLM-based Retrieval Method), a novel approach to tackle complex information retrieval challenges. >> Key Innovations Information Alignment The method first decomposes queries into keywords and aligns them with available data using both BM25 and embedding similarity, ensuring comprehensive coverage of information needs. Structure Alignment ARM employs a sophisticated mixed-integer programming solver to identify connections between data objects, exploring relationships beyond simple semantic matching. Self-Verification The system includes a unique self-verification mechanism where the LLM evaluates and aggregates results from multiple retrieval paths, ensuring accuracy and completeness. >> Performance Highlights The results are impressive: - Outperforms standard RAG by up to 5.2 points in execution accuracy on Bird dataset - Achieves 19.3 points higher F1 scores compared to existing approaches on OTT-QA - Reduces the number of required LLM calls while maintaining superior retrieval quality >> Technical Implementation The system uses a three-step process: 1. N-gram indexing and embedding computation for all data objects 2. Constrained beam decoding for information alignment 3. Mixed-integer programming optimization for structure exploration This research represents a significant step forward in making complex information retrieval more efficient and accurate. The team's work demonstrates how combining traditional optimization techniques with modern LLM capabilities can solve challenging retrieval problems.

reacted to JingzeShi's post with 🤗 about 14 hours ago

Welcome to the Doge Face Open Source Community! 🚀 Our goal is to explore the foundation of embodied intelligence for the next two years, which is indispensable – small language models. 🔬 We aim to open-source code and documentation to give everyone more time to slack off while working or studying! 🤗 👉 Repository name on Github: https://github.com/SmallDoges/small-doge 👉 Organization name on Hugging Face: https://huggingface.co/SmallDoge

View all activity

Organizations

None yet

nicolay-r's activity

liked a Space about 15 hours ago

Running

366

🐢

Qwen2.5 Max Demo

liked a model 5 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-7B

Text Generation • Updated 3 days ago • 235k • 315

liked a model 8 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 3 days ago • 953k • • 6.29k

liked a Space 20 days ago

Running on CPU Upgrade

327

🥇

Open Medical-LLM Leaderboard

liked a model 20 days ago

johnsnowlabs/JSL-MedLlama-3-8B-v2.0

Text Generation • Updated Apr 30, 2024 • 11.9k • 30

liked a model 4 months ago

meta-llama/Llama-3.2-3B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.49M • • 950

liked 3 models 7 months ago

liked 4 models 8 months ago

Qwen/Qwen2-7B-Instruct

Text Generation • Updated Aug 21, 2024 • 818k • 610

mistralai/Mistral-7B-Instruct-v0.3

Text Generation • Updated Aug 21, 2024 • 1.7M • • 1.3k

microsoft/Phi-3-small-8k-instruct

Text Generation • Updated Aug 30, 2024 • 24.8k • 160

microsoft/Phi-3-mini-4k-instruct

Text Generation • Updated Sep 20, 2024 • 903k • • 1.13k

liked 2 models 9 months ago

xtuner/llava-phi-3-mini-hf

Image-to-Text • Updated Apr 25, 2024 • 5.94k • 49

xtuner/llava-llama-3-8b-v1_1

Image-Text-to-Text • Updated Apr 28, 2024 • 50 • 120

liked 5 models 10 months ago

AIRI-Institute/OmniFusion

Updated Apr 10, 2024 • 56

google-bert/bert-base-uncased

Fill-Mask • Updated Feb 19, 2024 • 80.7M • 2.09k

google/gemma-1.1-2b-it

Text Generation • Updated Jun 27, 2024 • 90.3k • 154

google/gemma-2b-it

Text Generation • Updated Sep 27, 2024 • 107k • • 703

google/gemma-7b-it

Text Generation • Updated Aug 14, 2024 • 56.4k • 1.15k