Adam's picture

Adam

adamo1139

·

AI & ML interests

Local training and inference.

Recent Activity

reacted to v2ray's post with 👍 4 days ago

GPT4chan Series Release GPT4chan is a series of models I trained on https://huggingface.co/datasets/v2ray/4chan dataset, which is based on https://huggingface.co/datasets/lesserfield/4chan-datasets. The dataset contains mostly posts from 2023. Not every board is included, for example, /pol/ is NOT included. To see which boards are included, visit https://huggingface.co/datasets/v2ray/4chan/tree/main/boards. This release contains 2 models sizes, 8B and 24B. The 8B model is based on https://huggingface.co/meta-llama/Llama-3.1-8B and the 24B model is based on https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501. Why I made these models? Because for a long time after the original gpt-4chan model, there aren't any serious fine-tunes on 4chan datasets. 4chan is a good data source since it contains coherent replies and nice topics. It's fun to talk to an AI generated version of 4chan and get instant replies, and without the need to actually visit 4chan. You can also sort of analyze the content and behavior of 4chan posts by probing the model's outputs. Disclaimer: The GPT4chan models should only be used for research purposes, the outputs they generated do not represent the view of me on the subjects. Moderate the responses before sending it online. Model links: Full model: - https://huggingface.co/v2ray/GPT4chan-8B - https://huggingface.co/v2ray/GPT4chan-24B Adapter: - https://huggingface.co/v2ray/GPT4chan-8B-QLoRA - https://huggingface.co/v2ray/GPT4chan-24B-QLoRA AWQ: - https://huggingface.co/v2ray/GPT4chan-8B-AWQ - https://huggingface.co/v2ray/GPT4chan-24B-AWQ FP8: - https://huggingface.co/v2ray/GPT4chan-8B-FP8

updated a model 8 days ago

adamo1139/Qwen2-VL-7B-Sydney

liked a model 10 days ago

m-a-p/YuE-s1-7B-anneal-en-cot

View all activity

Organizations

None yet

adamo1139's activity

New activity in adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF 14 days ago

Failed to regenerate message

#1 opened 2 months ago by

New activity in rhymes-ai/Aria about 2 months ago

Base model not released

#2 opened 4 months ago by

New activity in AI-Safeguard/Ivy-VL-llava 2 months ago

Wrong licensing

#1 opened 2 months ago by

New activity in adamo1139/Yi-1.5-34B-32K-rebased-1406 3 months ago

Still active?

#1 opened 4 months ago by

New activity in adamo1139/magpie-ultra-v0.1-shareGPT-Conversations 3 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 3 months ago by

New activity in rhymes-ai/Allegro 4 months ago

Expected speed

#3 opened 4 months ago by

New activity in neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic 4 months ago

Can you please add Nemotron 70B static?

#1 opened 4 months ago by

New activity in allenai/Molmo-7B-D-0924 4 months ago

batch inference supported?

#7 opened 5 months ago by

commented a paper 4 months ago

Hermes 3 Technical Report

Paper • 2408.11857 • Published Aug 15, 2024 • 44 •

New activity in adamo1139/Yi-34B-200K-AEZAKMI-v2 7 months ago

Adding Evaluation Results

#4 opened 7 months ago by

leaderboard-pr-bot

New activity in teknium/OpenHermes-2.5-Mistral-7B 7 months ago

How to do batch inference?

#34 opened 9 months ago by

New activity in LoneStriker/DeepSeek-Coder-V2-Instruct-GGUF 8 months ago

How good is the gguf?

#3 opened 8 months ago by

Tom-Neverwinter

New activity in deepseek-ai/DeepSeek-V2-Lite 8 months ago

mixtral format?

#1 opened 9 months ago by

KnutJaegersberg

New activity in LLM360/K2 9 months ago

huggyllama/llama-65b

#1 opened 9 months ago by

KnutJaegersberg

New activity in szymonrucinski/Curie-7B-v1 9 months ago

Share the dataset used?

#2 opened 9 months ago by

New activity in 01-ai/Yi-1.5-34B-32K 9 months ago

Plans for 200K?

#1 opened 9 months ago by

New activity in adamo1139/Lumina-T2I-quantized 9 months ago

can you provide generation examples? is the quantized version coherent?

#1 opened 9 months ago by

New activity in mightbe/Qwen1.5-32B-llamafied 9 months ago

It seems to be a Chat finetune

#1 opened 9 months ago by

New activity in adamo1139/toxic-dpo-natural-v1 10 months ago

[bot] Conversion to Parquet

#1 opened 10 months ago by

parquet-converter

New activity in 01-ai/Yi-9B-200K 11 months ago

GPU Memory Constraints for 01-ai/Yi-9B-200K Model

#3 opened 11 months ago by