Update app.py
app.py
CHANGED
@@ -175,16 +175,18 @@ with gr.Blocks(theme=theme, css=css) as demo:
 ## 🎶YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
 ## Model card:
 - Model name: `{model_name}`
-
-
-
-
-
-
-
-
-
-
+<details>
+<summary>(Details)</summary>
+
+| **Component** | **Details** |
+|--------------------------|--------------------------------------------------|
+| Encoder backbone | Perceiver-TF + Mixture of Experts (2/8) |
+| Decoder backbone | Multi-channel T5-small |
+| Tokenizer | MT3 tokens with Singing extension |
+| Dataset | YourMT3 dataset |
+| Augmentation strategy | Intra-/Cross dataset stem augment, No Pitch-shifting |
+| FP Precision | BF16-mixed for training, FP16 for inference |
+</details>
 
 ## Caution:
 - Currently running on CPU, and it takes longer than 3 minutes for a 30-second input. Please try [GPU-HuggingFace-demo](mimbres/YourMT3) for fast inference.
@@ -250,5 +252,5 @@ with gr.Blocks(theme=theme, css=css) as demo:
 # Play
 play_video_button.click(play_video, inputs=youtube_url, outputs=youtube_player)
 with gr.Column(scale=1):
-    Log(log_file, dark=True, xterm_font_size=12, elem_id='mylog')
+    logger = Log(log_file, dark=True, xterm_font_size=12, every=None, elem_id='mylog')
 demo.launch(debug=True)
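
The first hunk folds the model-card table inside an HTML `<details>` element so that it is collapsed by default. As a rough illustration only, the sketch below assembles a string of the same shape in plain Python. The helper name `build_model_card`, the shortened separator row, and the placeholder model name are assumptions made for this example and do not come from `app.py`. The blank line after `<summary>` mirrors the empty added line in the diff; many Markdown renderers need it to parse the table as Markdown instead of raw HTML.

```python
def build_model_card(model_name: str, rows: list[tuple[str, str]]) -> str:
    """Return Markdown for a model card whose table sits inside <details>.

    Hypothetical helper for illustration; not part of app.py.
    """
    lines = [
        "## Model card:",
        f"- Model name: `{model_name}`",
        "<details>",
        "<summary>(Details)</summary>",
        "",  # blank line, as in the diff, so the table below is parsed as Markdown
        "| **Component** | **Details** |",
        "|---|---|",
    ]
    # One table row per (component, details) pair.
    lines += [f"| {component} | {details} |" for component, details in rows]
    lines.append("</details>")
    return "\n".join(lines)


card = build_model_card(
    "example-model",  # placeholder; the real value is filled in by app.py
    [
        ("Encoder backbone", "Perceiver-TF + Mixture of Experts (2/8)"),
        ("Decoder backbone", "Multi-channel T5-small"),
    ],
)
print(card)
```

A string built this way could then be handed to a Markdown component such as `gr.Markdown` inside the `gr.Blocks` context shown in the hunk headers.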