Memory

Ksgk-fy 's Collections

Exciting Papers

Fine Dataset

Cognition

Embodied AI

Memory

What I don't understand

updated Oct 22, 2024

Prompt is text-based memory. System II prompting is updating memory. Parametric memory is long-term, while prompt-based are short-tem.

Upvote

What You Say = What You Want? Teaching Humans to Articulate Requirements for LLMs

Paper • 2409.08775 • Published Sep 13, 2024
OmniQuery: Contextually Augmenting Captured Multimodal Memory to Enable Personal Question Answering

Paper • 2409.08250 • Published Sep 12, 2024 • 1
Synthetic continued pretraining

Paper • 2409.07431 • Published Sep 11, 2024 • 2
WonderWorld: Interactive 3D Scene Generation from a Single Image

Paper • 2406.09394 • Published Jun 13, 2024 • 3
xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

Paper • 2405.13792 • Published May 22, 2024 • 1
Note Projecting document chunk embedding vector directly into hidden space for xRAG ! Explicit memory is expensive and dumb for RAG, mid-term memory relies on a 'projector', long-term memory updates on the langauge decoding part of the model. I guess that could be the next step here.
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language Model

Paper • 2401.02051 • Published Jan 4, 2024 • 1
Yo'LLaVA: Your Personalized Language and Vision Assistant

Paper • 2406.09400 • Published Jun 13, 2024 • 1
Note Addition of new concept into VLM via soft-prompt tuning. Extra id token in vocabulary plus k visual feature embeddings enables customizing VLM towards personalized knowledge.
LLMs + Persona-Plug = Personalized LLMs

Paper • 2409.11901 • Published Sep 18, 2024 • 32
Note Same thing. Personalization with soft-prompt user embedding, this one is on text-modality, less exciting than Yo'LLaVA in some sense.
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization

Paper • 2409.12903 • Published Sep 19, 2024 • 22
A-VL: Adaptive Attention for Large Vision-Language Models

Paper • 2409.14846 • Published Sep 23, 2024
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization

Paper • 2406.05981 • Published Jun 10, 2024 • 13
Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12, 2024 • 60
Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3, 2024 • 20
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published Oct 14, 2024 • 27
Agent-as-a-Judge: Evaluate Agents with Agents

Paper • 2410.10934 • Published Oct 14, 2024 • 19
Retrospective Learning from Interactions

Paper • 2410.13852 • Published Oct 17, 2024 • 9
SPIN: Self-Supervised Prompt INjection

Paper • 2410.13236 • Published Oct 17, 2024 • 1
DAG-aware Transformer for Causal Effect Estimation

Paper • 2410.10044 • Published Oct 13, 2024 • 1
SMART: Self-learning Meta-strategy Agent for Reasoning Tasks

Paper • 2410.16128 • Published Oct 21, 2024 • 1
SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 113
Note Interesting approach to address the (more horrible) context length issue for video processing -- explicit memory embedding using "previous prediction" (instead of re-processing previous frames, or caching previous attention KV values ....) Makes immediate sense ...

Upvote