Adam

adamo1139

AI & ML interests

Local training and inference.

Recent Activity

reacted to v2ray's post with 👍 4 days ago
GPT4chan Series Release

GPT4chan is a series of models I trained on the https://huggingface.co/datasets/v2ray/4chan dataset, which is based on https://huggingface.co/datasets/lesserfield/4chan-datasets. The dataset contains mostly posts from 2023. Not every board is included; for example, /pol/ is NOT included. To see which boards are included, visit https://huggingface.co/datasets/v2ray/4chan/tree/main/boards.

This release contains 2 model sizes, 8B and 24B. The 8B model is based on https://huggingface.co/meta-llama/Llama-3.1-8B and the 24B model is based on https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501.

Why did I make these models? Because for a long time after the original gpt-4chan model, there haven't been any serious fine-tunes on 4chan datasets. 4chan is a good data source since it contains coherent replies and nice topics. It's fun to talk to an AI-generated version of 4chan and get instant replies without needing to actually visit 4chan. You can also, to some extent, analyze the content and behavior of 4chan posts by probing the model's outputs.

Disclaimer: The GPT4chan models should only be used for research purposes; the outputs they generate do not represent my views on the subjects. Moderate the responses before posting them online.

Model links:

Full model:
- https://huggingface.co/v2ray/GPT4chan-8B
- https://huggingface.co/v2ray/GPT4chan-24B

Adapter:
- https://huggingface.co/v2ray/GPT4chan-8B-QLoRA
- https://huggingface.co/v2ray/GPT4chan-24B-QLoRA

AWQ:
- https://huggingface.co/v2ray/GPT4chan-8B-AWQ
- https://huggingface.co/v2ray/GPT4chan-24B-AWQ

FP8:
- https://huggingface.co/v2ray/GPT4chan-8B-FP8
updated a model 8 days ago
adamo1139/Qwen2-VL-7B-Sydney
liked a model 10 days ago
m-a-p/YuE-s1-7B-anneal-en-cot
Organizations

None yet

adamo1139's activity

New activity in rhymes-ai/Aria about 2 months ago

Base model not released

11
#2 opened 4 months ago by adamo1139
New activity in AI-Safeguard/Ivy-VL-llava 2 months ago

Wrong licensing

#1 opened 2 months ago by adamo1139
New activity in adamo1139/Yi-1.5-34B-32K-rebased-1406 3 months ago

Still active?

9
#1 opened 4 months ago by DazzlingXeno
New activity in rhymes-ai/Allegro 4 months ago

Expected speed

1
#3 opened 4 months ago by adamo1139
New activity in allenai/Molmo-7B-D-0924 4 months ago

batch inference supported?

6
#7 opened 5 months ago by chenkq
New activity in adamo1139/Yi-34B-200K-AEZAKMI-v2 7 months ago
New activity in teknium/OpenHermes-2.5-Mistral-7B 7 months ago

How to do batch inference?

1
#34 opened 9 months ago by abhijeet-ta
New activity in deepseek-ai/DeepSeek-V2-Lite 8 months ago

mixtral format?

5
#1 opened 9 months ago by KnutJaegersberg
New activity in LLM360/K2 9 months ago
New activity in szymonrucinski/Curie-7B-v1 9 months ago

Share the dataset used?

#2 opened 9 months ago by adamo1139
New activity in 01-ai/Yi-1.5-34B-32K 9 months ago

Plans for 200K?

5
#1 opened 9 months ago by adamo1139
New activity in mightbe/Qwen1.5-32B-llamafied 9 months ago
New activity in adamo1139/toxic-dpo-natural-v1 10 months ago
New activity in 01-ai/Yi-9B-200K 11 months ago