---
title: Amanu
emoji: πŸ‘
colorFrom: yellow
colorTo: purple
sdk: gradio
sdk_version: 3.44.4
app_file: app.py
pinned: false
---
# This repo's goal is to support the transcription and annotation of audio recordings.
## Parts
- `audio.py`: Everything related to audio preprocessing and analysis.
- `transcription.py`: All code for transcribing audio using fast-whisper.
- `diarization.py`: Everything related to speaker diarization with pyannote.
- `textformatting.py`: Everything related to formatting the text into specific output formats.
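
As an illustration of what the output formatting in `textformatting.py` could look like, here is a minimal sketch that renders transcript segments as SRT subtitles. The `Segment` class and the `srt_timestamp` and `to_srt` functions are hypothetical names chosen for this example, not the module's actual contents, and SRT is only one possible target format.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One transcribed span (hypothetical structure); times are in seconds."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Convert seconds to the SRT timestamp format HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text.strip()}\n"
        )
    return "\n".join(cues)
```

Calling `to_srt([Segment(0.0, 1.25, "Hello")])` yields a single cue numbered `1` that spans `00:00:00,000 --> 00:00:01,250`.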
## UI parts
1. Transcription.
2. Diarization.
3. Revision.
4. Output formatting.
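
One way the transcription and diarization steps could feed the revision step is to label each transcript segment with the speaker whose diarization turn overlaps it the most. The sketch below assumes plain tuples for both inputs; the tuple layouts and the `assign_speakers` name are assumptions for illustration, not the app's actual data model.

```python
def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Length in seconds of the intersection of two time intervals (0 if disjoint)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(segments, turns):
    """Label each transcript segment with the best-overlapping speaker.

    segments: list of (start, end, text) from the transcription step (assumed layout).
    turns:    list of (start, end, speaker) from the diarization step (assumed layout).
    Returns a list of (speaker, text); segments with no overlap get "UNKNOWN".
    """
    labeled = []
    for seg_start, seg_end, text in segments:
        best_speaker, best_overlap = "UNKNOWN", 0.0
        for turn_start, turn_end, speaker in turns:
            shared = overlap(seg_start, seg_end, turn_start, turn_end)
            if shared > best_overlap:
                best_speaker, best_overlap = speaker, shared
        labeled.append((best_speaker, text))
    return labeled
```

Segments that fall outside every diarization turn are kept and marked `UNKNOWN`, so they can still be corrected by hand during revision.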
## How to access the service?
Users log in with a username and password that I assign. I manage these credentials manually.
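
A minimal sketch of how manually managed credentials could be checked, assuming they are stored in an environment variable. The variable name `AMANU_USERS`, its `user:password` format, and the `check_login` function are hypothetical. Gradio's `launch()` accepts a callable for its `auth` parameter, so a function like this could be passed as `demo.launch(auth=check_login)`.

```python
import hmac
import os


def load_users() -> dict[str, str]:
    """Parse AMANU_USERS (hypothetical), e.g. "alice:secret1,bob:secret2"."""
    raw = os.environ.get("AMANU_USERS", "")
    users = {}
    for entry in raw.split(","):
        name, sep, password = entry.partition(":")
        if sep and name.strip():
            users[name.strip()] = password
    return users


def check_login(username: str, password: str) -> bool:
    """Return True only if the username exists and the password matches."""
    expected = load_users().get(username)
    if expected is None:
        return False
    # compare_digest avoids leaking information through comparison timing.
    return hmac.compare_digest(password.encode(), expected.encode())
```

Keeping the credentials in an environment variable (or a Space secret) keeps them out of the repository. Storing password hashes instead of plain passwords would be a safer variant of the same idea.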
## Pricing
1. Calculate the fixed cost of a server running for a long period of time.
2. Check whether hibernating the server during idle periods can save some money.
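
The two pricing questions above can be compared with a small calculation. This is a rough sketch: the hourly rate used below is a placeholder, not a real quote, and the `monthly_cost` function and its parameters are assumptions for illustration.

```python
def monthly_cost(hourly_rate: float, active_hours_per_day: float,
                 days: int = 30, idle_hourly_rate: float = 0.0) -> float:
    """Estimate the monthly server cost.

    hourly_rate:          price per hour while the server is running.
    active_hours_per_day: hours per day the server is awake (0 to 24).
    idle_hourly_rate:     price per hour while hibernating (assumed 0 by default).
    """
    if not 0 <= active_hours_per_day <= 24:
        raise ValueError("active_hours_per_day must be between 0 and 24")
    active_hours = active_hours_per_day * days
    idle_hours = 24 * days - active_hours
    return hourly_rate * active_hours + idle_hourly_rate * idle_hours


# Placeholder rate of 0.50 per hour; replace with the provider's actual price.
always_on = monthly_cost(0.50, active_hours_per_day=24)
with_hibernation = monthly_cost(0.50, active_hours_per_day=4)
savings = always_on - with_hibernation
```

With these placeholder numbers, the always-on server costs 360.0 per month and the hibernating one 60.0, so the saving depends almost entirely on how many hours per day the service is actually in use.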