parakeet-unified-en-0.6b / explainability.md
aandrusenko's picture
Create explainability.md
604d215
Field Response
Intended Task/Domain: Speech Transcription
Model Type: FastConformer-RNNT
Intended Users: This model is intended for developers and data scientists building interactive call centers, virtual assistants, and language learning assistants.
Output: Transcribed text with timestamps and confidence scores
Describe how the model works: Model transcribes audio input into text for the input language
Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of: Age, Gender, National Origin
Technical Limitations & Mitigation: Transcripts may not be 100% accurate. Accuracy varies depending on the characteristics of the input audio, such as domain, use case, accent, noise, speech type, and speech context.
Verified to have met prescribed NVIDIA quality standards: Yes
Performance Metrics: Word Error Rate (WER), Silence Robustness (Characters/mins of silent audio), Latency (in milliseconds), Throughput (Total audio processed per unit of time)
Potential Known Risks: Not recommended for word-for-word transcription as accuracy varies based on the characteristics of input audio (domain, use case, accent, noise, speech type, and context of speech)
Licensing: Use of this model is governed by the NVIDIA Open Model License Agreement