mimbres commited on
Commit
3d63af2
·
verified ·
1 Parent(s): b441c5d

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +13 -11
app.py CHANGED
@@ -175,16 +175,18 @@ with gr.Blocks(theme=theme, css=css) as demo:
175
  ## 🎶YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
176
  ## Model card:
177
  - Model name: `{model_name}`
178
- <details>
179
- <summary>Details</summary>
180
-
181
- - Encoder backbone: Perceiver-TF + Mixture of Experts (2/8)
182
- - Decoder backbone: Multi-channel T5-small
183
- - Tokenizer: MT3 tokens with Singing extension
184
- - Dataset: YourMT3 dataset
185
- - Augmentation strategy: Intra-/Cross dataset stem augment, No Pitch-shifting
186
- - FP Precision: BF16-mixed for training, FP16 for inference
187
- </details>
 
 
188
 
189
  ## Caution:
190
  - Currently running on CPU, and it takes longer than 3 minutes for a 30-second input. Please try [GPU-HuggingFace-demo](mimbres/YourMT3) for fast inference.
@@ -250,5 +252,5 @@ with gr.Blocks(theme=theme, css=css) as demo:
250
  # Play
251
  play_video_button.click(play_video, inputs=youtube_url, outputs=youtube_player)
252
  with gr.Column(scale=1):
253
- Log(log_file, dark=True, xterm_font_size=12, elem_id='mylog')
254
  demo.launch(debug=True)
 
175
  ## 🎶YourMT3+: Multi-instrument Music Transcription with Enhanced Transformer Architectures and Cross-dataset Stem Augmentation
176
  ## Model card:
177
  - Model name: `{model_name}`
178
+ <details>
179
+ <summary>(Details)</summary>
180
+
181
+ | **Component** | **Details** |
182
+ |--------------------------|--------------------------------------------------|
183
+ | Encoder backbone | Perceiver-TF + Mixture of Experts (2/8) |
184
+ | Decoder backbone | Multi-channel T5-small |
185
+ | Tokenizer | MT3 tokens with Singing extension |
186
+ | Dataset | YourMT3 dataset |
187
+ | Augmentation strategy | Intra-/Cross dataset stem augment, No Pitch-shifting |
188
+ | FP Precision | BF16-mixed for training, FP16 for inference |
189
+ </details>
190
 
191
  ## Caution:
192
  - Currently running on CPU, and it takes longer than 3 minutes for a 30-second input. Please try [GPU-HuggingFace-demo](mimbres/YourMT3) for fast inference.
 
252
  # Play
253
  play_video_button.click(play_video, inputs=youtube_url, outputs=youtube_player)
254
  with gr.Column(scale=1):
255
+ logger = Log(log_file, dark=True, xterm_font_size=12, every=None, elem_id='mylog')
256
  demo.launch(debug=True)