Adam
adamo1139
51 followers
·
34 following
AI & ML interests
Local training and inference.
Recent Activity
reacted to v2ray's post with 👍 4 days ago
GPT4chan Series Release

GPT4chan is a series of models I trained on the https://huggingface.co/datasets/v2ray/4chan dataset, which is based on https://huggingface.co/datasets/lesserfield/4chan-datasets. The dataset contains mostly posts from 2023. Not every board is included; for example, /pol/ is NOT included. To see which boards are included, visit https://huggingface.co/datasets/v2ray/4chan/tree/main/boards.

This release contains two model sizes, 8B and 24B. The 8B model is based on https://huggingface.co/meta-llama/Llama-3.1-8B and the 24B model is based on https://huggingface.co/mistralai/Mistral-Small-24B-Base-2501.

Why did I make these models? Because for a long time after the original gpt-4chan model, there haven't been any serious fine-tunes on 4chan datasets. 4chan is a good data source since it contains coherent replies and nice topics. It's fun to talk to an AI-generated version of 4chan and get instant replies without needing to actually visit 4chan. You can also, to some extent, analyze the content and behavior of 4chan posts by probing the model's outputs.

Disclaimer: The GPT4chan models should only be used for research purposes; the outputs they generate do not represent my views on the subjects. Moderate the responses before posting them online.

Model links:

Full model:
- https://huggingface.co/v2ray/GPT4chan-8B
- https://huggingface.co/v2ray/GPT4chan-24B

Adapter:
- https://huggingface.co/v2ray/GPT4chan-8B-QLoRA
- https://huggingface.co/v2ray/GPT4chan-24B-QLoRA

AWQ:
- https://huggingface.co/v2ray/GPT4chan-8B-AWQ
- https://huggingface.co/v2ray/GPT4chan-24B-AWQ

FP8:
- https://huggingface.co/v2ray/GPT4chan-8B-FP8
updated a model 8 days ago
adamo1139/Qwen2-VL-7B-Sydney
liked a model 10 days ago
m-a-p/YuE-s1-7B-anneal-en-cot
Organizations
None yet
adamo1139's activity
New activity in
adamo1139/Danube3-4b-4chan-HESOYAM-2510-GGUF
14 days ago
Failed to regenerate message
1
#1 opened 2 months ago by
PeterCastler
New activity in
rhymes-ai/Aria
about 2 months ago
Base model not released
11
#2 opened 4 months ago by
adamo1139
New activity in
AI-Safeguard/Ivy-VL-llava
2 months ago
Wrong licensing
#1 opened 2 months ago by
adamo1139
New activity in
adamo1139/Yi-1.5-34B-32K-rebased-1406
3 months ago
Still active?
9
#1 opened 4 months ago by
DazzlingXeno
New activity in
adamo1139/magpie-ultra-v0.1-shareGPT-Conversations
3 months ago
Librarian Bot: Add language metadata for dataset
#1 opened 3 months ago by
librarian-bot
New activity in
rhymes-ai/Allegro
4 months ago
Expected speed
1
#3 opened 4 months ago by
adamo1139
New activity in
neuralmagic/Llama-3.1-Nemotron-70B-Instruct-HF-FP8-dynamic
4 months ago
Can you please add Nemotron 70B static?
3
#1 opened 4 months ago by
nickandbro
New activity in
allenai/Molmo-7B-D-0924
4 months ago
batch inference supported?
6
#7 opened 5 months ago by
chenkq
commented on a paper 4 months ago
Hermes 3 Technical Report
Paper • 2408.11857 • Published Aug 15, 2024 • 44 • 8
New activity in
adamo1139/Yi-34B-200K-AEZAKMI-v2
7 months ago
Adding Evaluation Results
#4 opened 7 months ago by
leaderboard-pr-bot
New activity in
teknium/OpenHermes-2.5-Mistral-7B
7 months ago
How to do batch inference?
1
#34 opened 9 months ago by
abhijeet-ta
New activity in
LoneStriker/DeepSeek-Coder-V2-Instruct-GGUF
8 months ago
How good is the gguf?
3
#3 opened 8 months ago by
Tom-Neverwinter
New activity in
deepseek-ai/DeepSeek-V2-Lite
8 months ago
mixtral format?
5
#1 opened 9 months ago by
KnutJaegersberg
New activity in
LLM360/K2
9 months ago
huggyllama/llama-65b
4
#1 opened 9 months ago by
KnutJaegersberg
New activity in
szymonrucinski/Curie-7B-v1
9 months ago
Share the dataset used?
#2 opened 9 months ago by
adamo1139
New activity in
01-ai/Yi-1.5-34B-32K
9 months ago
Plans for 200K?
5
#1 opened 9 months ago by
adamo1139
New activity in
adamo1139/Lumina-T2I-quantized
9 months ago
can you provide generation examples? is the quantized version coherent?
2
#1 opened 9 months ago by
MayensGuds
New activity in
mightbe/Qwen1.5-32B-llamafied
9 months ago
It seems to be a Chat finetune
2
#1 opened 9 months ago by
adamo1139
New activity in
adamo1139/toxic-dpo-natural-v1
10 months ago
[bot] Conversion to Parquet
#1 opened 10 months ago by
parquet-converter
New activity in
01-ai/Yi-9B-200K
11 months ago
GPU Memory Constraints for 01-ai/Yi-9B-200K Model
2
#3 opened 11 months ago by
microcn