dushuai

dushuai112233

AI & ML interests

None yet

Recent Activity

liked a dataset 4 days ago
m-a-p/COIG-CQIA
reacted to singhsidhukuldeep's post with 👀 5 days ago
Exciting Research Alert: Enhancing Dense Retrieval with Deliberate Thinking

I just came across a fascinating new paper titled "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search" that introduces DEBATER (Deliberate Thinking based Dense Retriever), a novel approach to improving information retrieval with large language models. The research team from Northeastern University and Tsinghua University has developed a method that significantly outperforms existing dense retrieval systems by enabling LLMs to "think deliberately" before generating document representations.

>> Technical Details

DEBATER enhances LLM-based retrievers through two key mechanisms:

1. Chain-of-Deliberation (CoD): This approach delays the computation of document embeddings by performing several steps of reasoning. It incorporates a sequence of prompt tokens that stimulate the reasoning capability of LLMs, encouraging the model to think step by step before producing the final document embedding.

2. Self Distillation (SD): This mechanism distills knowledge from the different thinking steps into the final document representation. It identifies the most informative thinking steps and integrates them into a unified text embedding.

The implementation uses cosine similarity to measure the similarity between queries and documents. During training, DEBATER calculates similarity scores between the query representation and the document representations at each thinking step, then selects the most useful thinking step from CoD (see the sketch after this post).

>> Performance

What's particularly impressive is that DEBATER-4B outperforms larger 7B-scale LLM-based dense retrievers while using significantly fewer parameters. In experiments on the BEIR benchmark, DEBATER achieved more than a 2% improvement over baseline retrievers. The researchers found that an appropriate thinking depth (around 4-8 steps) effectively activates the reasoning capabilities of LLM-based retrievers.
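Here is a minimal sketch of the step-selection idea described above: score the document embedding produced at each Chain-of-Deliberation step against the query by cosine similarity and keep the best one. All names, dimensions, and the NumPy setup are my own illustration, not the paper's actual implementation.

```python
import numpy as np

def cosine_sim(query: np.ndarray, steps: np.ndarray) -> np.ndarray:
    """Cosine similarity between one query vector and a batch of
    per-step document embeddings (one row per thinking step)."""
    q = query / np.linalg.norm(query)
    s = steps / np.linalg.norm(steps, axis=-1, keepdims=True)
    return s @ q

def select_best_step(query_emb: np.ndarray, step_embs: np.ndarray) -> int:
    """Pick the Chain-of-Deliberation step whose embedding scores
    highest against the query (the 'most useful' step, which Self
    Distillation then folds into the final representation)."""
    return int(np.argmax(cosine_sim(query_emb, step_embs)))

# Toy usage: 6 hypothetical thinking steps with 768-dim embeddings.
rng = np.random.default_rng(0)
query = rng.standard_normal(768)
steps = rng.standard_normal((6, 768))
print("Most useful thinking step:", select_best_step(query, steps))
```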
reacted to singhsidhukuldeep's post with 👍 5 days ago
Exciting New Tool for Knowledge Graph Extraction from Plain Text!

I just came across a groundbreaking new tool called KGGen that tackles a major challenge in the AI world: the scarcity of high-quality knowledge graph data.

KGGen is an open-source Python package that leverages language models to extract knowledge graphs (KGs) from plain text. What makes it special is its innovative approach to clustering related entities, which significantly reduces sparsity in the extracted KGs.

The technical approach is fascinating:

1. KGGen uses a multi-stage process involving an LLM (GPT-4o in their implementation) to extract entities and relations from source text
2. It aggregates graphs across sources to reduce redundancy
3. Most importantly, it applies iterative LM-based clustering to refine the raw graph (a sketch of this idea follows the post)

The clustering stage is particularly innovative: it identifies which nodes and edges refer to the same underlying entities or concepts, normalizing variations in tense, plurality, stemming, and capitalization (e.g., "labors" clustered with "labor").

The researchers from Stanford and the University of Toronto also introduced MINE (Measure of Information in Nodes and Edges), the first benchmark for evaluating KG extractors. When tested against existing methods like OpenIE and GraphRAG, KGGen outperformed them by up to 18%.

For anyone working with knowledge graphs, RAG systems, or KG embeddings, this tool addresses the fundamental challenge of data scarcity that has been holding back progress in graph-based foundation models.

The package is available via pip install kg-gen, making it accessible to everyone. This could be a game-changer for knowledge graph applications!
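A minimal sketch of the clustering idea from step 3, under loud assumptions: the real KGGen prompts an LLM to judge whether two nodes name the same concept, whereas the normalize() heuristic below (lowercasing plus naive plural stripping) merely stands in for that judgment. All function names and the toy triples are hypothetical, not the kg-gen API.

```python
def normalize(entity: str) -> str:
    """Crude stand-in for KGGen's LM-based judgment: lowercase and
    strip a trailing plural 's' so variants like 'Labors' and 'labor'
    fall into the same cluster. This only illustrates normalizing
    tense, plurality, and capitalization."""
    e = entity.strip().lower()
    if e.endswith("s") and len(e) > 3:
        e = e[:-1]
    return e

def cluster_entities(triples):
    """Rewrite (head, relation, tail) triples onto canonical entity
    names, reducing sparsity in the raw extracted graph."""
    canonical = {}  # normalized form -> first surface form seen
    merged = []
    for head, rel, tail in triples:
        h = canonical.setdefault(normalize(head), head)
        t = canonical.setdefault(normalize(tail), tail)
        merged.append((h, rel, t))
    return merged

raw = [
    ("Labors", "regulated_by", "labor laws"),
    ("labor", "performed_by", "workers"),
    ("Worker", "protected_by", "labor laws"),
]
for triple in cluster_entities(raw):
    print(triple)  # 'labor'/'Labors' and 'Worker'/'workers' now coincide
```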

Organizations

None yet
