narugo commited on
Commit
35c83af
·
verified ·
1 Parent(s): 91ff54f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +108 -33
README.md CHANGED
@@ -7,58 +7,133 @@ sdk: static
7
  pinned: false
8
  ---
9
 
10
- ## Who We Are
11
 
12
- We are a team focused on building infrastructure for anime data information, including images, text, audiovisuals, and more.
13
 
14
- Our goal is to automate all common processes for handling anime information, including data acquisition, data filtering, training, step selection, and platform deployment, in order to save manpower and optimally balance quality and performance requirements.
15
 
16
- Our team comprises a Ph.D. in Software Engineering, a Ph.D. candidate in Computer Vision, professionals in art and design, and several AI waifu enthusiasts.
 
 
 
17
 
18
- We are a purely non-profit team, and all our work is completely open, without any form of charge.
 
 
 
19
 
20
- ## Our Technical Outputs
21
 
22
- ### dghs-imgutils
 
 
23
 
24
- Project Link: https://github.com/deepghs/imgutils
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
25
 
26
- Project Documentation: https://deepghs.github.io/imgutils/main/index.html
27
 
28
- **This is a library for various common operations on anime images**, including but not limited to:
 
 
 
 
 
 
29
 
30
- * Tachie (Difference) Detection and Clustering
31
- * Contrastive Character Image Pretraining
32
- * Object Detection
33
- * Edge Detection / Lineart Generation
34
- * Monochrome Image Detection
35
- * Truncated Image Check
36
- * Image Tagging
37
- * Character Extraction
38
 
39
- Check out the documentation for more features.
 
 
40
 
41
- ### Waifuc
 
 
 
 
 
 
 
42
 
43
- Project Link: https://github.com/deepghs/waifuc
 
 
44
 
45
- A **data pipeline framework based on dghs-imgutils**, supporting:
46
- * Fast data retrieval (local disk, danbooru, pixiv, zerochan, etc.)
47
- * Swift data filtering (comic exclusion, monochrome image exclusion, multi-character image exclusion, irrelevant character exclusion, etc.)
48
- * Rapid data saving (local, cloud; with metadata, saved in stable diffusion dataset format, etc.)
49
- * Quick building of processing pipelines (connecting multiple aforementioned stages)
50
 
51
- *Note: This tool is currently a work in progress, although it's in use. It hasn't been released on PyPI and lacks comprehensive documentation. These aspects will be addressed soon.*
 
 
52
 
53
- ### Model Zoo
54
 
55
- We manage our models and datasets on Huggingface: https://huggingface.co/deepghs
 
 
 
56
 
57
- ### Anything More?
58
 
59
- In fact, our plans go beyond what's mentioned here. Other tools are continuously improving and will soon be released. Stay tuned!
 
 
 
 
 
 
 
 
 
 
 
 
 
 
60
 
61
- ### How to Find Us?
62
 
63
- If you are interested in anime waifus or models/tools development, or have some good ideas or feedbacks, just join our discord server: https://discord.gg/EAW4WqFdKY
 
64
 
 
 
7
  pinned: false
8
  ---
9
 
 
10
 
11
+ # 🌟 Who We Are
12
 
13
+ **DeepGHS** (*Deep Generative anime Hobbyist Syndicate*) is a passion-driven, non-profit community building anime/2D-focused infrastructure—because even otakus deserve robust tooling! 🎨
14
 
15
+ We obsess over **ALL things anime-tech**:
16
+ - 🖼️ Multimodal datasets (images, video, text, audio, 3D)
17
+ - 🤖 AI models that actually *get* anime aesthetics
18
+ - 🔧 Developer tools to make weeb R&D 10x easier
19
 
20
+ **Our crew** includes:
21
+ - **narugo1992** (self-proclaimed Jerry Mouse 🐭 | Software Engineering PhD | 25+ years of coding/teamwork wizardry)
22
+ - A global squad of AI nerds + anime addicts from top labs/universities 🎓
23
+ - Pro artists who know "moe" isn’t just a typo 🎨
24
 
25
+ *Fun fact: We hire based on* ***passion for anime*** *first, technical skills second. (Sorry, resume bots!)*
26
 
27
+ **Why trust us?**
28
+ ✅ 100% non-profit | ✅ Fully open-source | ✅ Zero paywalls (forever!)
29
+ 💖 Backed by sponsors & collaborators who *get* our vision
30
 
31
+ ---
32
+
33
+ # 🚀 Our Projects
34
+
35
+ ## 📊 Datasets
36
+
37
+ ### **Danbooru2024 Series**
38
+ - **Original**: [Full-resolution images](https://huggingface.co/datasets/deepghs/danbooru2024)
39
+ - **Compressed**: [WebP format (4M pixels)](https://huggingface.co/datasets/deepghs/danbooru2024-webp-4Mpixel)
40
+ - **SFW Edition**: [Safe-for-work subset](https://huggingface.co/datasets/deepghs/danbooru2024-sfw)
41
+
42
+ ### **Sankaku Collection**
43
+ - **Original**: [Full dataset](https://huggingface.co/datasets/deepghs/sankaku_full)
44
+ - **Compressed**: [WebP format (4M pixels)](https://huggingface.co/datasets/deepghs/sankaku-webp-4Mpixel)
45
+
46
+ ### **Other Booru-style Repositories**
47
+ - [Gelbooru Full](https://huggingface.co/datasets/deepghs/gelbooru_full)
48
+ - [Yande.re Full](https://huggingface.co/datasets/deepghs/yande_full)
49
+ - [Rule34.xxx Full](https://huggingface.co/datasets/deepghs/rule34_full)
50
+ - [Konachan Full](https://huggingface.co/datasets/deepghs/konachan_full)
51
+ - [Anime-Pictures Dataset](https://huggingface.co/datasets/deepghs/anime_pictures_full)
52
+ - [Zerochan Full](https://huggingface.co/datasets/deepghs/zerochan_full)
53
+
54
+ ### **Functional Datasets**
55
+ - [Character Similarity Dataset](https://huggingface.co/datasets/deepghs/character_similarity) - For cross-character recognition
56
+ - [Anime Face Detection](https://huggingface.co/datasets/deepghs/anime_face_detection) - Labeled facial data
57
+ - [Anime Head Detection](https://huggingface.co/datasets/deepghs/anime_head_detection) - Head position annotations
58
+
59
+ ### **Anime Series Datasets**
60
+ - [BangumiBase](https://huggingface.co/BangumiBase) - Character-centric frame extractions
61
+
62
+ ### **Search Systems**
63
+ - [Reverse Image Search](https://huggingface.co/spaces/deepghs/search_image_by_image) 🔍
64
+ - [Danbooru Character Lookup](https://huggingface.co/spaces/deepghs/danbooru_character_search) 🕵️♂️
65
+
66
+ ---
67
 
68
+ ## 🤖 Models
69
 
70
+ ### **Classification Models**
71
+ - [Anime/Real Classifier](https://huggingface.co/deepghs/anime_real_cls) - Distinguish anime from real images
72
+ - [Image Type Classifier](https://huggingface.co/deepghs/anime_classification) - Categorize artwork styles
73
+ - [Furry Detection](https://huggingface.co/deepghs/anime_furry) - Identify anthropomorphic characters
74
+ - [Aesthetic Scorer](https://huggingface.co/deepghs/anime_aesthetic) - Predict visual appeal
75
+ - [Style Era Classifier](https://huggingface.co/deepghs/anime_style_ages) - Decade recognition
76
+ - [Live Demo](https://huggingface.co/spaces/deepghs/anime_image_classification) 🧪
77
 
78
+ ### **Detection Models**
79
+ - [Face Detection](https://huggingface.co/deepghs/anime_face_detection) - Precise facial localization
80
+ - [Head Detection](https://huggingface.co/deepghs/anime_head_detection) - Head position estimation
81
+ - [Person Detection](https://huggingface.co/deepghs/anime_person_detection) - Full-body recognition
82
+ - [NSFW Censor](https://huggingface.co/deepghs/anime_censor_detection) - Content moderation
83
+ - [Detection Demo](https://huggingface.co/spaces/deepghs/anime_object_detection) 🎯
 
 
84
 
85
+ ### **Specialized Models**
86
+ - [CCIP](https://huggingface.co/deepghs/ccip) - Character similarity encoding ([Demo](https://huggingface.co/spaces/deepghs/ccip))
87
+ - [WD Tagger Enhanced](https://huggingface.co/deepghs/wd14_tagger_with_embeddings) - Tagger with embeddings
88
 
89
+ ---
90
+
91
+ ## 🛠️ Open-Source Projects
92
+
93
+ ### **Core Libraries**
94
+ - [dghs-imgutils](https://github.com/deepghs/imgutils) - Foundational anime image processing toolkit
95
+ - [waifuc](https://github.com/deepghs/waifuc) - Python framework for building data pipelines
96
+ - [sdeval](https://github.com/deepghs/sdeval) - Quantitative evaluation for Stable Diffusion outputs
97
 
98
+ ### **Utilities**
99
+ - [hfutils](https://github.com/deepghs/hfutils) - Enhanced HuggingFace Hub interface
100
+ - [cheesechaser](https://github.com/deepghs/cheesechaser) - Targeted dataset sampling tool
101
 
102
+ ### **Training Systems**
103
+ - [cyberharem](https://github.com/deepghs/cyberharem) - Automated LoRA training pipeline
 
 
 
104
 
105
+ ---
106
+
107
+ # 📬 Let’s Connect!
108
 
109
+ **For collabs, sponsorships, or just anime-tech chatter**:
110
 
111
+ - 💬Discord: [https://discord.gg/EAW4WqFdKY](https://discord.gg/EAW4WqFdKY) (We don’t bite—unless you’re a bug!)
112
+ - 🤗HuggingFace: [https://huggingface.co/deepghs](https://huggingface.co/deepghs)
113
+ - 💻GitHub: [https://github.com/deepghs](https://github.com/deepghs)
114
+ - 📧Email: [[email protected]](mailto:[email protected])
115
 
116
+ ---
117
 
118
+ # 🎯 Want to Join?
119
+
120
+ **We want YOU if**:
121
+ 1. You can code *and* name 10 JoJo stands 💪
122
+ 2. You’re ready to build—not just consume—anime tech
123
+
124
+ **How to apply**:
125
+ - Pitch us via [Discord](https://discord.gg/EAW4WqFdKY)/[Email]([email protected]) with:
126
+ - Your anime-tech portfolio
127
+ - What you’ll bring to our dojo 🥋
128
+ - OR: Directly request to join our [HuggingFace Org](https://huggingface.co/deepghs) , with the same things above
129
+
130
+ *Note: We do light vetting to protect our community—it’s faster than a Naruto run, promise!*
131
+
132
+ ---
133
 
134
+ # 🔮 What’s Next?
135
 
136
+ More tools, datasets, and *~magic~* in development!
137
+ **Pro tip**: Watch our repos ⭐ + Join Discord 👀 = Never miss an update!
138
 
139
+ ~~Btw, the above magic was polished by deepseek-R1. Gotta say, this LLM is a copywriting wizard.~~