Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -7,58 +7,133 @@ sdk: static
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
10 |
-
## Who We Are
|
11 |
|
12 |
-
|
13 |
|
14 |
-
|
15 |
|
16 |
-
|
|
|
|
|
|
|
17 |
|
18 |
-
|
|
|
|
|
|
|
19 |
|
20 |
-
|
21 |
|
22 |
-
|
|
|
|
|
23 |
|
24 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
25 |
|
26 |
-
|
27 |
|
28 |
-
**
|
|
|
|
|
|
|
|
|
|
|
|
|
29 |
|
30 |
-
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
-
|
36 |
-
* Image Tagging
|
37 |
-
* Character Extraction
|
38 |
|
39 |
-
|
|
|
|
|
40 |
|
41 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
42 |
|
43 |
-
|
|
|
|
|
44 |
|
45 |
-
|
46 |
-
|
47 |
-
* Swift data filtering (comic exclusion, monochrome image exclusion, multi-character image exclusion, irrelevant character exclusion, etc.)
|
48 |
-
* Rapid data saving (local, cloud; with metadata, saved in stable diffusion dataset format, etc.)
|
49 |
-
* Quick building of processing pipelines (connecting multiple aforementioned stages)
|
50 |
|
51 |
-
|
|
|
|
|
52 |
|
53 |
-
|
54 |
|
55 |
-
|
|
|
|
|
|
|
56 |
|
57 |
-
|
58 |
|
59 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
60 |
|
61 |
-
|
62 |
|
63 |
-
|
|
|
64 |
|
|
|
|
7 |
pinned: false
|
8 |
---
|
9 |
|
|
|
10 |
|
11 |
+
# 🌟 Who We Are
|
12 |
|
13 |
+
**DeepGHS** (*Deep Generative anime Hobbyist Syndicate*) is a passion-driven, non-profit community building anime/2D-focused infrastructure—because even otakus deserve robust tooling! 🎨
|
14 |
|
15 |
+
We obsess over **ALL things anime-tech**:
|
16 |
+
- 🖼️ Multimodal datasets (images, video, text, audio, 3D)
|
17 |
+
- 🤖 AI models that actually *get* anime aesthetics
|
18 |
+
- 🔧 Developer tools to make weeb R&D 10x easier
|
19 |
|
20 |
+
**Our crew** includes:
|
21 |
+
- **narugo1992** (self-proclaimed Jerry Mouse 🐭 | Software Engineering PhD | 25+ years of coding/teamwork wizardry)
|
22 |
+
- A global squad of AI nerds + anime addicts from top labs/universities 🎓
|
23 |
+
- Pro artists who know "moe" isn’t just a typo 🎨
|
24 |
|
25 |
+
*Fun fact: We hire based on* ***passion for anime*** *first, technical skills second. (Sorry, resume bots!)*
|
26 |
|
27 |
+
**Why trust us?**
|
28 |
+
✅ 100% non-profit | ✅ Fully open-source | ✅ Zero paywalls (forever!)
|
29 |
+
💖 Backed by sponsors & collaborators who *get* our vision
|
30 |
|
31 |
+
---
|
32 |
+
|
33 |
+
# 🚀 Our Projects
|
34 |
+
|
35 |
+
## 📊 Datasets
|
36 |
+
|
37 |
+
### **Danbooru2024 Series**
|
38 |
+
- **Original**: [Full-resolution images](https://huggingface.co/datasets/deepghs/danbooru2024)
|
39 |
+
- **Compressed**: [WebP format (4M pixels)](https://huggingface.co/datasets/deepghs/danbooru2024-webp-4Mpixel)
|
40 |
+
- **SFW Edition**: [Safe-for-work subset](https://huggingface.co/datasets/deepghs/danbooru2024-sfw)
|
41 |
+
|
42 |
+
### **Sankaku Collection**
|
43 |
+
- **Original**: [Full dataset](https://huggingface.co/datasets/deepghs/sankaku_full)
|
44 |
+
- **Compressed**: [WebP format (4M pixels)](https://huggingface.co/datasets/deepghs/sankaku-webp-4Mpixel)
|
45 |
+
|
46 |
+
### **Other Booru-style Repositories**
|
47 |
+
- [Gelbooru Full](https://huggingface.co/datasets/deepghs/gelbooru_full)
|
48 |
+
- [Yande.re Full](https://huggingface.co/datasets/deepghs/yande_full)
|
49 |
+
- [Rule34.xxx Full](https://huggingface.co/datasets/deepghs/rule34_full)
|
50 |
+
- [Konachan Full](https://huggingface.co/datasets/deepghs/konachan_full)
|
51 |
+
- [Anime-Pictures Dataset](https://huggingface.co/datasets/deepghs/anime_pictures_full)
|
52 |
+
- [Zerochan Full](https://huggingface.co/datasets/deepghs/zerochan_full)
|
53 |
+
|
54 |
+
### **Functional Datasets**
|
55 |
+
- [Character Similarity Dataset](https://huggingface.co/datasets/deepghs/character_similarity) - For cross-character recognition
|
56 |
+
- [Anime Face Detection](https://huggingface.co/datasets/deepghs/anime_face_detection) - Labeled facial data
|
57 |
+
- [Anime Head Detection](https://huggingface.co/datasets/deepghs/anime_head_detection) - Head position annotations
|
58 |
+
|
59 |
+
### **Anime Series Datasets**
|
60 |
+
- [BangumiBase](https://huggingface.co/BangumiBase) - Character-centric frame extractions
|
61 |
+
|
62 |
+
### **Search Systems**
|
63 |
+
- [Reverse Image Search](https://huggingface.co/spaces/deepghs/search_image_by_image) 🔍
|
64 |
+
- [Danbooru Character Lookup](https://huggingface.co/spaces/deepghs/danbooru_character_search) 🕵️♂️
|
65 |
+
|
66 |
+
---
|
67 |
|
68 |
+
## 🤖 Models
|
69 |
|
70 |
+
### **Classification Models**
|
71 |
+
- [Anime/Real Classifier](https://huggingface.co/deepghs/anime_real_cls) - Distinguish anime from real images
|
72 |
+
- [Image Type Classifier](https://huggingface.co/deepghs/anime_classification) - Categorize artwork styles
|
73 |
+
- [Furry Detection](https://huggingface.co/deepghs/anime_furry) - Identify anthropomorphic characters
|
74 |
+
- [Aesthetic Scorer](https://huggingface.co/deepghs/anime_aesthetic) - Predict visual appeal
|
75 |
+
- [Style Era Classifier](https://huggingface.co/deepghs/anime_style_ages) - Decade recognition
|
76 |
+
- [Live Demo](https://huggingface.co/spaces/deepghs/anime_image_classification) 🧪
|
77 |
|
78 |
+
### **Detection Models**
|
79 |
+
- [Face Detection](https://huggingface.co/deepghs/anime_face_detection) - Precise facial localization
|
80 |
+
- [Head Detection](https://huggingface.co/deepghs/anime_head_detection) - Head position estimation
|
81 |
+
- [Person Detection](https://huggingface.co/deepghs/anime_person_detection) - Full-body recognition
|
82 |
+
- [NSFW Censor](https://huggingface.co/deepghs/anime_censor_detection) - Content moderation
|
83 |
+
- [Detection Demo](https://huggingface.co/spaces/deepghs/anime_object_detection) 🎯
|
|
|
|
|
84 |
|
85 |
+
### **Specialized Models**
|
86 |
+
- [CCIP](https://huggingface.co/deepghs/ccip) - Character similarity encoding ([Demo](https://huggingface.co/spaces/deepghs/ccip))
|
87 |
+
- [WD Tagger Enhanced](https://huggingface.co/deepghs/wd14_tagger_with_embeddings) - Tagger with embeddings
|
88 |
|
89 |
+
---
|
90 |
+
|
91 |
+
## 🛠️ Open-Source Projects
|
92 |
+
|
93 |
+
### **Core Libraries**
|
94 |
+
- [dghs-imgutils](https://github.com/deepghs/imgutils) - Foundational anime image processing toolkit
|
95 |
+
- [waifuc](https://github.com/deepghs/waifuc) - Python framework for building data pipelines
|
96 |
+
- [sdeval](https://github.com/deepghs/sdeval) - Quantitative evaluation for Stable Diffusion outputs
|
97 |
|
98 |
+
### **Utilities**
|
99 |
+
- [hfutils](https://github.com/deepghs/hfutils) - Enhanced HuggingFace Hub interface
|
100 |
+
- [cheesechaser](https://github.com/deepghs/cheesechaser) - Targeted dataset sampling tool
|
101 |
|
102 |
+
### **Training Systems**
|
103 |
+
- [cyberharem](https://github.com/deepghs/cyberharem) - Automated LoRA training pipeline
|
|
|
|
|
|
|
104 |
|
105 |
+
---
|
106 |
+
|
107 |
+
# 📬 Let’s Connect!
|
108 |
|
109 |
+
**For collabs, sponsorships, or just anime-tech chatter**:
|
110 |
|
111 |
+
- 💬Discord: [https://discord.gg/EAW4WqFdKY](https://discord.gg/EAW4WqFdKY) (We don’t bite—unless you’re a bug!)
|
112 |
+
- 🤗HuggingFace: [https://huggingface.co/deepghs](https://huggingface.co/deepghs)
|
113 |
+
- 💻GitHub: [https://github.com/deepghs](https://github.com/deepghs)
|
114 |
+
- 📧Email: [[email protected]](mailto:[email protected])
|
115 |
|
116 |
+
---
|
117 |
|
118 |
+
# 🎯 Want to Join?
|
119 |
+
|
120 |
+
**We want YOU if**:
|
121 |
+
1. You can code *and* name 10 JoJo stands 💪
|
122 |
+
2. You’re ready to build—not just consume—anime tech
|
123 |
+
|
124 |
+
**How to apply**:
|
125 |
+
- Pitch us via [Discord](https://discord.gg/EAW4WqFdKY)/[Email]([email protected]) with:
|
126 |
+
- Your anime-tech portfolio
|
127 |
+
- What you’ll bring to our dojo 🥋
|
128 |
+
- OR: Directly request to join our [HuggingFace Org](https://huggingface.co/deepghs) , with the same things above
|
129 |
+
|
130 |
+
*Note: We do light vetting to protect our community—it’s faster than a Naruto run, promise!*
|
131 |
+
|
132 |
+
---
|
133 |
|
134 |
+
# 🔮 What’s Next?
|
135 |
|
136 |
+
More tools, datasets, and *~magic~* in development!
|
137 |
+
**Pro tip**: Watch our repos ⭐ + Join Discord 👀 = Never miss an update!
|
138 |
|
139 |
+
~~Btw, the above magic was polished by deepseek-R1. Gotta say, this LLM is a copywriting wizard.~~
|