Category "speech-recognition"

Standard answers for speech in an AI

I am building a project that includes software which should answer questions such as "Hi", "What's your name?", "How are you?", etc. Are there standard answers I c

Why has SpeechRecognizer suddenly stopped working?

My Android app (Xamarin) uses speech recognition. This has worked fine on smartphones with Android 8, 11 and 12. For the past few weeks, the speech recognition of my

Azure pronunciation/fluency assessment for non-native English speaker

I am using Azure Speech's fluency assessment. https://docs.microsoft.com/en-in/azure/cognitive-services/speech-service/how-to-pronunciation-assessment?pivots=pr

SpeechRecognizer immediately gives onError 5 and then 7, without waiting a couple of seconds for input

I have the following code to start the SpeechRecognizer: fun startVoiceRecognitionActivityNoUI(value: VOICE_COMMANDS) { Log.i(TAG, "SPEECH Results startV

I keep getting a permission denial error whenever I try to access a file in a speech recognition program

I'm trying out speech recognition and using it as input for some statements while having the program "speak" back to me using the playsound and gTTS modules. Bu

Android webkitSpeechRecognition .isFinal variable not showing correct value

I'm attempting to do some voice recognition stuff on mobile. Here's some code: var recognition = new webkitSpeechRecognition(); recognition.onresult = functi

Google speech to text: Extra language set to "kn" but not working

I am trying to use the Google Speech to Text API for the Kannada language and I am setting EXTRA_LANGUAGE to "kn" for this, but it is not working. Below is the code

Get alternative suggestions during speech recognition

I would like to use offline speech-to-text recognition, mostly for the German language. In particular, I want to use Mozilla DeepSpeech (a TensorFlow implementation o

Phonetic Speech Recognition

I'm trying to get Latin speech recognition, for which I'll need not word recognition but phonetic vowel-and-consonant recognition (since Latin has o

Cannot Train Wav2vec XLSR Model With Common Voice Data

I am trying to train a transformer ASR model with wav2vec XLSR in the Danish language, but whenever I try to pull the Danish dataset with the datasets library it's

KALDI: steps/make_mfcc.sh: no such file conf/mfcc.conf

I am very new to Kaldi, so this is probably my own mistake; any help is very much appreciated. I am working with my own dataset. I have cloned the voxforge setup and use

Python pocketsphinx RequestError: missing PocketSphinx module: ensure that PocketSphinx is set up correctly

I am trying to make a Python app that can record audio and translate it into English text using PyAudio, SpeechRecognition and PocketSphinx. I'm running on a Ma