Field | Response
:------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------
Intended Task/Domain: | Speech Transcription
Model Type: | FastConformer-RNNT
Intended Users: | This model is intended for developers and data scientists building interactive call centers, virtual assistants, and language learning assistants.
Output: | Transcribed text with timestamps and confidence scores
Describe how the model works: | The model transcribes audio input into text in the language of the input.
Name the adversely impacted groups this has been tested to deliver comparable outcomes regardless of: | Age, Gender, National Origin
Technical Limitations & Mitigation: | Transcripts may not be 100% accurate. Accuracy varies depending on the characteristics of the input audio, such as domain, use case, accent, noise, speech type, and speech context.
Verified to have met prescribed NVIDIA quality standards: | Yes
Performance Metrics: | Word Error Rate (WER), Silence Robustness (characters per minute of silent audio), Latency (in milliseconds), Throughput (total audio processed per unit of time)
Potential Known Risks: | Not recommended for word-for-word transcription as accuracy varies based on the characteristics of input audio (domain, use case, accent, noise, speech type, and context of speech)
Licensing: | Use of this model is governed by the [NVIDIA Open Model License Agreement](https://www.nvidia.com/en-us/agreements/enterprise-software/nvidia-open-model-license/)
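
The table lists Word Error Rate (WER) as a performance metric but does not say how it is computed. As a rough illustration, the sketch below uses the common definition, (substitutions + deletions + insertions) divided by the number of reference words, computed with a word-level edit distance. The function name `word_error_rate` is hypothetical, and the whitespace tokenization with no case or punctuation normalization is an assumption, not necessarily what was used to evaluate this model.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are split on whitespace and compared exactly;
    real evaluations usually normalize case and punctuation first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, WER under this definition can exceed 1.0 when the hypothesis is much longer than the reference.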