---
license: mit
---


# Intro
The Chinese National Pentatonic Mode Recognition Model is trained on the Chinese National Pentatonic Mode Dataset, which combines manual annotation with computational analysis. The dataset collects and annotates audio files representing the five primary modes of traditional Chinese music: Gong, Shang, Jiao, Zhi, and Yu, covering five-tone, six-tone, and seven-tone scales. The modes are annotated in detail, and the dataset is accompanied by an in-depth analysis of methods for identifying Chinese national pentatonic modes. The model uses feature extraction, spectral analysis, and pattern recognition to identify and classify these modes in music. Beyond supporting the digital preservation of traditional Chinese music, it offers data support and a technical framework for analyzing and retrieving the features of that music.

## Demo
<https://huggingface.co/spaces/ccmusic-database/CNPM>

## Usage
```python
from modelscope import snapshot_download

# Download the model repository and get the path of the local copy
model_dir = snapshot_download("ccmusic-database/CNPM")
```
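The image URLs under Best result point into a run directory named `vit_l_16_cqt_2024-12-03_12-31-17`, which suggests that run directories encode the backbone, the input feature, and a timestamp. The following is a minimal sketch of splitting such a name into its parts. It assumes that other run directories in the snapshot follow the same `<backbone>_<feature>_<timestamp>` pattern, which this card does not confirm; `parse_run_dir` and `Run` are hypothetical helper names, not part of `modelscope`.

```python
import re
from datetime import datetime
from typing import NamedTuple, Optional

# Assumed pattern: <backbone>_<feature>_<YYYY-MM-DD_HH-MM-SS>, inferred from
# the single directory name "vit_l_16_cqt_2024-12-03_12-31-17".
RUN_DIR = re.compile(
    r"^(?P<backbone>.+)_(?P<feature>mel|cqt|chroma)_"
    r"(?P<stamp>\d{4}-\d{2}-\d{2}_\d{2}-\d{2}-\d{2})$"
)


class Run(NamedTuple):
    backbone: str    # e.g. "vit_l_16"
    feature: str     # "mel", "cqt", or "chroma"
    stamp: datetime  # timestamp embedded in the directory name


def parse_run_dir(name: str) -> Optional[Run]:
    """Split a run directory name; return None if it does not match."""
    m = RUN_DIR.match(name)
    if m is None:
        return None
    stamp = datetime.strptime(m["stamp"], "%Y-%m-%d_%H-%M-%S")
    return Run(m["backbone"], m["feature"], stamp)


if __name__ == "__main__":
    print(parse_run_dir("vit_l_16_cqt_2024-12-03_12-31-17"))
```

Applied to each entry of `os.listdir(model_dir)`, the helper would skip anything that does not match, such as the README itself.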

## Maintenance
```bash
git clone [email protected]:ccmusic-database/CNPM
cd CNPM
```

## Results
|      Backbone      | Size(M) |     Mel     |             CQT             |   Chroma    |
| :----------------: | :-----: | :---------: | :-------------------------: | :---------: |
|      vit_l_32      |  306.5  |    0.680    |            0.769            |    0.399    |
|      vit_l_16      |  304.3  | **_0.823_** | [**_0.859_**](#best-result) | **_0.549_** |
|                    |         |             |                             |             |
|      vgg11_bn      |  132.9  | **_0.807_** |         **_0.843_**         | **_0.609_** |
|   regnet_y_16gf    |  83.6   |    0.590    |            0.832            |    0.535    |
|  wide_resnet50_2   |  68.9   |    0.694    |            0.757            |    0.531    |
|      alexnet       |  61.1   |    0.742    |            0.744            |    0.542    |
| shufflenet_v2_x2_0 |   7.4   |    0.473    |            0.720            |    0.266    |
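To make the table easier to compare programmatically, here is a small sketch that copies its scores into a dictionary and looks up the strongest backbone per input feature and the strongest feature per backbone. The scores are taken verbatim from the table; the names `RESULTS`, `best_backbone`, and `best_feature` are illustrative only and are not part of the evaluation code.

```python
# Scores copied from the results table: backbone -> (Mel, CQT, Chroma).
RESULTS = {
    "vit_l_32": (0.680, 0.769, 0.399),
    "vit_l_16": (0.823, 0.859, 0.549),
    "vgg11_bn": (0.807, 0.843, 0.609),
    "regnet_y_16gf": (0.590, 0.832, 0.535),
    "wide_resnet50_2": (0.694, 0.757, 0.531),
    "alexnet": (0.742, 0.744, 0.542),
    "shufflenet_v2_x2_0": (0.473, 0.720, 0.266),
}
FEATURES = ("Mel", "CQT", "Chroma")


def best_backbone(feature: str) -> tuple[str, float]:
    """Return the backbone with the highest score for one input feature."""
    col = FEATURES.index(feature)
    name = max(RESULTS, key=lambda b: RESULTS[b][col])
    return name, RESULTS[name][col]


def best_feature(backbone: str) -> str:
    """Return the input feature on which one backbone scores highest."""
    scores = RESULTS[backbone]
    col = max(range(len(FEATURES)), key=lambda i: scores[i])
    return FEATURES[col]


if __name__ == "__main__":
    for feature in FEATURES:
        print(feature, best_backbone(feature))
    for backbone in RESULTS:
        print(backbone, best_feature(backbone))
```

On these numbers, `vit_l_16` leads for Mel and CQT, `vgg11_bn` leads for Chroma, and every backbone scores highest on CQT input.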

### Best result
<table>
    <tr>
        <th>Loss curve</th>
        <td><img src="https://www.modelscope.cn/models/ccmusic-database/CNPM/resolve/master/vit_l_16_cqt_2024-12-03_12-31-17/loss.jpg"></td>
    </tr>
    <tr>
        <th>Training and validation accuracy</th>
        <td><img src="https://www.modelscope.cn/models/ccmusic-database/CNPM/resolve/master/vit_l_16_cqt_2024-12-03_12-31-17/acc.jpg"></td>
    </tr>
    <tr>
        <th>Confusion matrix</th>
        <td><img src="https://www.modelscope.cn/models/ccmusic-database/CNPM/resolve/master/vit_l_16_cqt_2024-12-03_12-31-17/mat.jpg"></td>
    </tr>
</table>


## Dataset
<https://huggingface.co/datasets/ccmusic-database/CNPM>

## Mirror
<https://www.modelscope.cn/models/ccmusic-database/CNPM>

## Evaluation
<https://github.com/monetjoe/ccmusic_eval>

## Cite
```bibtex
@dataset{zhaorui_liu_2021_5676893,
  author       = {Monan Zhou and Shenyang Xu and Zhaorui Liu and Zhaowen Wang and Feng Yu and Wei Li and Baoqiang Han},
  title        = {CCMusic: an Open and Diverse Database for Chinese Music Information Retrieval Research},
  month        = {mar},
  year         = {2024},
  publisher    = {HuggingFace},
  version      = {1.2},
  url          = {https://huggingface.co/ccmusic-database}
}
```