49 176 1033

Jade

euclaise

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

upvoted a paper 2 days ago

PLDR-LLMs Learn A Generalizable Tensor Operator That Can Replace Its Own Deep Neural Net At Inference

upvoted a paper 2 days ago

Liger: Linearizing Large Language Models to Gated Recurrent Structures

View all activity

Organizations

Posts 1

Post

Memphis: Advancing language model reasoning without relying on proprietary model outputs

Memphis is a series of models which advance human-data models, offering good performance without relying on proprietary model outputs (e.g. GPT-generated datasets). I've developed a new iterative finetuning procedure to improve the reasoning ability of these models beyond what is possible using only SFT on the same data.

Currently, I've released two models: Memphis-CoT-3B, and Memphis-scribe-3B.

To create these models, I've created new datasets:
- euclaise/reddit-instruct : A dataset of instruction/QA-like data scraped from Reddit. A curated version, filtered using Lilac and neural embedding models, is available at euclaise/reddit-instruct-curated
- euclaise/TinyCoT : TinyCoT is a mtea-dataset that aggregates a variety of different human-sourced reasoning data. It is a curated version of my previous MegaCoT dataset euclaise/MegaCoT, which contains 629k responses which get cut down to 28k for TinyCoT. There's also an intermediate version euclaise/MiniCoT, which has 129k responses.

Memphis-CoT is trained on reddit-instruct, a filtered version of oasst2 sablo/oasst2_curated, and TinyCoT. Multiple iterations were performed on TinyCoT, while reddit-instruct and oasst2 were only used for the initial model.

Memphis-scribe further finetunes Memphis-CoT on more creative tasks. It was finetuned from Memphis-CoT on 18 different datasets, including datasets like euclaise/WritingPrompts_curated, lemonilia/LimaRP, and more.

To prevent catastrophic forgetting, I used weight averaging between iterations.

- euclaise/Memphis-CoT-3B
- euclaise/Memphis-scribe-3B

Collections 4

models 15

datasets 30

euclaise/gsm8k_multiturn

Viewer • Updated Jul 6, 2024 • 8.79k • 85 • 13

euclaise/reddit-instruct-curated

Viewer • Updated Feb 1, 2024 • 11k • 118 • 19

euclaise/logician

Viewer • Updated Jan 30, 2024 • 7.94k • 119 • 6

euclaise/SuperMC

Viewer • Updated Jan 25, 2024 • 278k • 102 • 1

euclaise/MegaCoT

Viewer • Updated Jan 23, 2024 • 629k • 429 • 5

euclaise/MiniCoT

Viewer • Updated Jan 23, 2024 • 129k • 754 • 6

euclaise/TinyCoT

Viewer • Updated Jan 23, 2024 • 27.7k • 101 • 9

euclaise/naturalinstructions2_preferences

Viewer • Updated Jan 20, 2024 • 2.04k • 120 • 1

euclaise/reddit-instruct

Viewer • Updated Jan 19, 2024 • 86.8k • 92 • 23

euclaise/WritingPrompts_curated

Viewer • Updated Jan 19, 2024 • 66.3k • 102 • 9

Jade

AI & ML interests

Recent Activity

Organizations

Posts 1

Collections 4

malmaud/onestop_qa

tasksource/ScienceQA_text_only

EleutherAI/logiqa

metaeval/reclor

longface/logicLM

allenai/cosmos_qa

EleutherAI/logiqa

tasksource/spartqa-mchoice

models 15

euclaise/ReMask-3B

euclaise/crow-1b-attempt1

euclaise/Memphis-CoT-3B

euclaise/Memphis-scribe-3B

euclaise/Memphis-scribe-3B-alpha

euclaise/gpt-neox-122m-minipile-digits

euclaise/Echo-3B

euclaise/Ferret-3B

euclaise/Echo-3B-q6-gguf

euclaise/Ferret_7B

datasets 30

euclaise/gsm8k_multiturn

euclaise/reddit-instruct-curated

euclaise/logician

euclaise/SuperMC

euclaise/MegaCoT

euclaise/MiniCoT

euclaise/TinyCoT

euclaise/naturalinstructions2_preferences

euclaise/reddit-instruct

euclaise/WritingPrompts_curated

Jade

AI & ML interests

Recent Activity

Organizations

Posts 1

Collections 4

models 15 Sort: Recently updated

datasets 30 Sort: Recently updated

models 15

datasets 30