Questions tagged [speech-recognition]

1

votes
1

answer
791

Views

Speech recognition and speech to text on Android without any Google services

I am trying to make an application which uses speech to text for command input. I am making this for Android 6.0.1 and 5.1.1. I started by trying the examples with Google services, but none of them seem to work on my devices. Most of the time there is no clear error or a crash. I have gotten the no...
Timo Loomets
0

votes
0

answer
2

Views

Error converting speech to text using speechrecognition

I am creating a code on speech to text in Python 3 #!C:/Python3.6/python.exe import speech_recognition as sr import os r = sr.Recognizer() with sr.Microphone() as source: audio = r.listen(source) text = r.recognize_google(audio) print(text) This works fine when I run this in jupyter or standalone py...
Rohit
1

votes
4

answer
2.8k

Views

speech_recognition module stuck in “say something” - python

I am trying a python script for speech recognition, i have installed the required pyaudio and SpeechRecognition modules in my enivronment. The program was running fine till yesterday, but now it is stuck in 'say something'. Below is my code. import speech_recognition as sr print 'say something1' r =...
Sleez
1

votes
1

answer
218

Views

Sidekit code freezes during UBM creation after creating features

I have been trying to run the UBM.EM_Split() function. I created a feature file feat.h5 (3.8 MB) which stores features from 24 audio files. I tried to use this feature file as input for the feature_list argument in the function. However, the code has been running for over 72 hours with no output or...
nahomyaja
1

votes
1

answer
110

Views

measuring rate of speech in realtime

I'm looking for a quick and simple way to measure the rate at which I am speaking in real time. Course grained approaches or approximations are sufficient. The idea is to write a simple app/widget that at least tells you to speed up or slow down while speaking. Measuring things like pitch and volume...
imichaelmiers
1

votes
0

answer
44

Views

Using single letters in Microsoft Speech SDK

When I just used the letters, D got confused with B, and a lot of the letters got confused with other letters, so I replaced any letter that was working incorrectly with a word or name, but as you can see, it is half of them. I was wondering if there are any solutions where it is possible to say 'D'...
Roland Ohlsson
10

votes
2

answer
4k

Views

Grammar in Google Web Speech API

Can I improve google speech API recognition by give him a words list (in my case the request of user is very predictable) to make recognition more accurate? Thanks.
Rnd_d
1

votes
0

answer
44

Views

Speech Recognition for kids is not accurate

I need a little bit help with my android app developed with Unity. It is an educational app for kids 2-4 years old. The idea is when kid press a button and speaks a letter - image of the letter is displayed on the screen. We used steaming speech recognition BUT the result is not good – most of the...
Victoria Radeva
1

votes
0

answer
282

Views

Android Audio Record Voice Activity Detection

How to implement voice activity detection for voice signal received in form of byteArray. How to check whether the received signal has speech or not? The current implementation of android code which receives audio in form of byte Array is : public class AudioRecordActivity extends AppCompatActivi...
Prateek Ratnaker
1

votes
0

answer
203

Views

What services required to use webkitSpeechRecognition in Chrome Android

I am using Android board with Android 4.4.2 On OSboard not installed any services of google. But now I want to use webkitSpeechRecognition in chrome browser on this board. I installed Chrome Android but not working. What webkit is required for it to work? Help? Microphone allowed and have icon of n...
NTHsync
1

votes
0

answer
182

Views

Add Speech recognition support for iOS 9 and older?

What is the best option to implement speech recognition in iOS 9? I have implemented SFSpeechRecognizer (Speech framework) for iOS 10+ but I also require support for iOS 9. Can somebody suggest a good approach for the same ?
Gurp321
1

votes
0

answer
73

Views

Alexa - Exclude specific word form slot / utterance

I'm having an issue with Alexa failing to accurately recognise an AMAZON.YesIntent. The problem is that I have a separate intent where the user supplies a single character in answer to a question (i.e. options from a list). This has a custom slot type 'ALPHANUMERIC', which only contains the charact...
David Fulton
1

votes
0

answer
205

Views

Speech Recognition Engine not recognizing speech

im trying to create a dynamic speech recognizer but for some reason it is not working. i have tried to use the emulaterecognize function and the application works fine but it doesn't work when i speak. this means that the list of words are properly added and the speech recognized event functions pro...
Andre Falzon
1

votes
0

answer
149

Views

Pass audio from file to Chrome using selenium

I'm trying to pass .wav file to Google Web Speech API Demonstration with '--use-file-for-fake-audio-capture=/path/to/file.wav' Using Web Speech API requires selecting a language and clicking the microphone icon. In result, I expect the .wav file to be recognized by Chrome's speech recognition. My c...
miszo
1

votes
1

answer
262

Views

How to do speech to text with microsoft cognition API?

I have this node.js code to upload a wav file to to speech to text. But it does not work. I got back this {'RecognitionStatus':'InitialSilenceTimeout','Offset':58100000,'Duration':0} code var data = fse.readFileSync('assets/batman.wav'); data = data.toString(); var options = { host : 'speech.platfor...
omega
1

votes
0

answer
35

Views

iOS - Is there a way to detect popular keywords from a user's text input and sort by popularity or trending?

I am building an app that will allow users to type into a UITextInput field. I want to be able to scan what they type and extract possible keywords/phrases that could be used to populate other things in the app, and also have them sorted by popularity/trending. Example: The user types the following...
JimmyJammed
1

votes
0

answer
34

Views

Can i use speech recognition while making a call in ios?

I am using linphone to make calls. I was wondering if I can implement speech recognition while making call? what i done so far ? I have shared a sample code .. It recognises speech to text while done as a separate app. But I get below error if I use it with linphone. Error Domain=kAFAssistantErrorDo...
starterMac
1

votes
0

answer
117

Views

python error and pyaudio

hi i have a code that gives me an ALSA error so here is the code and i tried everything that i have known .farther more i installed and tried everything so if you can help it would be a fantastic opportunity for us to know each other so i can learn alot from you because i am still a beginner so here...
Majd Barghouti
1

votes
0

answer
322

Views

Flutter: send raw file to google speech api

I'm trying to send a raw file to the google speech api but I don't know how to load the raw file in order to put it into the RecognitionAudio json, any ideas? This is the code I have right now: import 'package:flutter/services.dart'; import 'package:googleapis/speech/v1.dart'; import 'package:google...
jordiPons
1

votes
0

answer
94

Views

When will the extension for speechrecognition (webspeech api) for firefox browser be released?

I want to know the tentative dates and also if a workaround is available right now for it to work on firefox
Jdoe
1

votes
1

answer
90

Views

Logging and deque operation problems in Tensorflow Android Speech Recognition Sample

I'm studying tensorflow speech command sample. The Android codebase I use is the same on tensorflow GitHub android sample and mainly focus on SpeechActivity.java and RecognizeCommands.java. I didn't change anything except logging messages. As far as I know, (1) SpeechActivity.java will pass model...
Jean Lin
1

votes
1

answer
508

Views

How to improve the accuracy for speech to text conversion using recognize_sphinx API in python

Can you please help us improve the accuracy of speech to text conversion using recognize_sphinx API in python? Please find the below code, which needs to improve the accuracy base! import speech_recognition as sr #obtain path to 'english.wav' in the same folder as this script from os import path...
vinay4747
1

votes
0

answer
31

Views

I am trying to run cmu sphinx4 by calling it in android but there is a file not found exception for means file

This is th output after running android application for speech input. File not found exception for means in acoustic model path
Roma Jain
1

votes
0

answer
104

Views

Can I use the SpeechRecognition API on localhost?

While integrating the SpeechRecognition to a user interface, I always get SpeechRecognitionError events when trying to call the .play() method. I have tried to allow the microphone, but it's always marked as blocked when I refresh the page, and I see no way to allow it. Is there any way to test this...
Yanick Rochon
1

votes
0

answer
375

Views

Client-side Speech Recognition on a mobile browser?

I am working on a project that is targeted for browsers on smart phones. And I can't seem to find any way to do a client-side speech recognition, as the mobile version of chrome doesn't even support their own Web Speech API. Does anybody know how to have speech recognition working on a mobile browse...
Felix
1

votes
0

answer
221

Views

Why Google Speech Recognition API returns a part of converted text of audio

Google Speech Recognition doesn't work only for few seconds audio. So I split my audio file to chunks. This is the class of splitting audio. class Split_audio(): def __init__(self): ''' Constructor ''' def create_folder(self,audio): ''' Create folder for chunks ''' #name of the folder: exemple audio...
K.cyrine
1

votes
1

answer
240

Views

Why does the second call to my function get triggered twice? (Swift iOS timer/speechrecognition)

self.adaResponseApi runs twice when the timer hits 1.5 seconds after the last recorded speech input. It should only run once. It is specifically running from the 1.5 interval instantiation and not from the first instantiation, which is triggered when the user specifically-stops speech input. rec...
Vortex Ring State
1

votes
1

answer
413

Views

python can't find module speech_recognition

So I installed the speech_recognition library but when I try to import it it says it can't find it. This is the code I'm using. import speech_recognition as sr r = sr.Recognizer() with sr.Microphone() as source: audio = r.listen(source) try: print('You said: ' + r.recognize(audio)) except: print('i...
H_raven
1

votes
0

answer
72

Views

Find correlation of two voice samples using python_speech_features

This is for a simple automatic command and control system. Commands will be sent over a noisy channel vocoded for voice signals. Ordinary tones or DTMF will not work because the vocoder does not do well with tones or music etc. The transmitter will send a recording of the phrase. At the receiving...
calvinfan
1

votes
1

answer
405

Views

How to increase the voice listen time in Google Recognizer Intent(Speech Recognition) Android

I did try giving time in millisecond to these below given extras Recognizer Intent.EXTRA_SPEECH_INPUT_POSSIBLY_COMPLETE_SILENCE_LENGTH_MILLIS Recognizer Intent.EXTRA_SPEECH_INPUT_COMPLETE_SILENCE_LENGTH_MILLIS Recognizer Intent.EXTRA_SPEECH_INPUT_MINIMUM_LENGTH_MILLIS But doesn't effect the voice l...
1

votes
0

answer
109

Views

System.Speech.Synthesis.SpeechSynthesizer methods throw “asynchronous operation cannot be started at this time” exception."

This question is very similar to this (unanswered) one: SpeechSynthesizer in ASP.NET - async error I am trying to use System.Speech methods in an ASP MVC project but if I do something like this: [HttpGet] public ActionResult SystemSpeechInstalledVoices() { var synt = new System.Speech.Synthesis.Spee...
Ada
1

votes
0

answer
167

Views

SAPI does not implement phonetic alphabet selection. Speech Command App

In my speech command app, I am loading files from an external source, processing that data and loading them into a list of possible commands for execution When I run the app, I get the message in the console Main.exe Information: 0: SAPI does not implement phonetic alphabet selection. I tried soluti...
Jamie Law
1

votes
0

answer
137

Views

Detect fluency from google speech api results

Trying to determine fluency of a speaker using google speech (to text) api. So far i have found that api (betav1) can show the time taken to speak a word ( its starting time and ending time ). And from wikipedia, Oral fluency or speaking fluency is a measurement both of production and reception of...
Sadi Mahmud
1

votes
0

answer
52

Views

System sound doesn't work properly

I tried to implement a speech recognizer in the background. And it works all fine. The service runs in background and listens to words the user say. My problem is: If a music player is running and the speech recognizer restarts, the system sound stream is muted to prevent the beep sound of speech r...
miningian
1

votes
0

answer
236

Views

Bad accuracy with pocketsphinx and error integrating spanish dictioraries

I want a program that recognize certain words. I have this code, that works well using recognize_google and I can change it easily to spanish. I want to use recognize_sphinx beacuse it´s free and works offline but it doesn´t reconize what Im saying in english, not even close. And I don´t know ho...
majo
1

votes
0

answer
42

Views

Objective c - using Speech recognition causes Video playback on iPhone to fail

In my app, I have Video playback and also speech recognition. The Video playback plays fine if I do not use Speech recognition. I am also able to get Air play options as 'iPhone' and 'Apple TV' (I have Apple TV on network). But when I use speech recognition and exit out of it, then the Video does no...
Curious101
1

votes
0

answer
51

Views

issue when running the sample SpeechToText_WPF_sample

When we try running the sample SpeechToText_WPF_sample (https://github.com/Azure-Samples/Cognitive-Speech-STT-Windows) with a free subscription key, somtimes it works but frequently we are blocked with : --- Start speech recognition using microphone with ShortPhrase mode in en-US language ---- --- M...
Wajdi TORKHANI
1

votes
0

answer
63

Views

How to concatenate phone hmm model to a composite word or sentence hmm model

I want to do the embedded training for speech recognition. In the beginning, I want to use the monophone with 3-states, as the paper decripted, I can concatenate all the phones in one word or sentence to make a composited hmm model, and do embedded training on the composited hmm model. like this pi...
YonF
1

votes
0

answer
322

Views

Google DialogFlow Confidence

I'm trying to utilize dialogflow to build automated phone systems(IVR). Below is a sample result from dialog flow. DialogFlow spits out two confidence scores are two confidence scores speechRecognitionConfidence and intentDetectionConfidence. What is the minimum threshold value for speechRecogniti...
Fits
1

votes
2

answer
228

Views

google-speech-api and overriding phone number recognition

Does anyone know if there is a way to manipulate the recognition of phone numbers when using the Google Speech API? I am trying to implement a transcription scenario where a caller will say a string of letters and numbers, but the logic out of the box seems to be to try to fit any sequence of numbe...
justishar

View additional questions