UsefulSensors/moonshine-base

Automatic Speech Recognition

eustlb (HF staff) committed 2 days ago

Commit 82daa1b (verified) · 1 parent: fec8e73

Update README.md

Files changed (1)

README.md +0 -38

@@ -93,44 +93,6 @@ We anticipate that Moonshine models’ transcription capabilities may be used fo
 There are also potential dual-use concerns that come with releasing Moonshine. While we hope the technology will be used primarily for beneficial purposes, making ASR technology more accessible could enable more actors to build capable surveillance technologies or scale up existing surveillance efforts, as the speed and accuracy allow for affordable automatic transcription and translation of large volumes of audio communication. Moreover, these models may have some capabilities to recognize specific individuals out of the box, which in turn presents safety concerns related both to dual use and disparate performance. In practice, we expect that the cost of transcription is not the limiting factor of scaling up surveillance projects.
-## Setup
-* Install `uv` for Python environment management
-  - Follow instructions [here](https://github.com/astral-sh/uv)
-* Create and activate virtual environment
-  ```shell
-    uv venv env_moonshine
-    source env_moonshine/bin/activate
-  ```
-* Install the `useful-moonshine` package from this github repo
-  ```shell
-  uv pip install transformers torchaudio
-  ```
-* Test transcribing an audio file
-  ```python
-  from transformers import AutoModelForSpeechSeq2Seq, AutoConfig, PreTrainedTokenizerFast
-  import torchaudio
-  import sys
-  audio, sr = torchaudio.load(sys.argv[1])
-  if sr != 16000:
-    audio = torchaudio.functional.resample(audio, sr, 16000)
-  model = AutoModelForSpeechSeq2Seq.from_pretrained('usefulsensors/moonshine-tiny', trust_remote_code=True)
-  tokenizer = PreTrainedTokenizerFast.from_pretrained('usefulsensors/moonshine-tiny')
-  tokens = model(audio)
-  print(tokenizer.decode(tokens[0], skip_special_tokens=True))
-  ```
 ## Citation
 If you benefit from our work, please cite us:
 ```