FAQ Database Discussion Community

Find each word in audio files

I have watches a way of doing this with audacity by using sound finder option. But since audacity is only gui it cannot be used with terminal commands. So is there a program that does the same work but in command interface like sox for example.

Is Speech-to-Text-to-Translation an Impossible Dream?

Theoretically, one could use a laptop's or tablet's or phone's microphone to capture spoken words, convert that to words on the screen and then, by accessing an API such as google translate, see "a" (not "the" - hardly ever, anyway) rough "draft" of a translation of those words (say, from...

google speech recognition api in hindi

I am looking for a Google Speech Recognition API, but I was not able to find any. Google has started service of Hindi Speech Recognition, as an example:- If I say "Kahan jaa rahe ho", then it searches for "Kahan jaa rahe ho" which means it converts speech to text....

Trim audio file and get the part between silence

My goal is to get the parts of audio file that contains non-noise sounds by using SoX. I have read the effects of SoX and found noisered and silence which I consider helpful. The problem is that I have not found command that can trim the audio file based on...

C# Get SpeechRecognitionEngine to recognize sentences [duplicate]

This question already has an answer here: C# Speech Recognizing Multiple Words together? (Recognize a sentence) 2 answers If I give Microsoft's SpeechRecognitionEngine a Grammar, it will only recognize individual choices in that grammar instead of combinations of choices. Is there any built in way to get it to...

Speech Recognition Handler

This is my first time posting here. I'm new to Android Studio. I'd like to create a handler for google speech recognition. This is the main class package com.example.recnonocimientodevoz; import android.app.Activity; import android.os.Bundle; import android.view.View; import android.widget.Button; import android.widget.EditText; public class Index extends Activity { private Button boton; private EditText...

ValueError: The truth value of an array with more than one element is ambiguous. Use a.any()

Here's my code: from scipy.io import wavfile fName = 'file.wav' fs, signal = wavfile.read(fName) signal = signal / max(abs(signal)) # scale signal assert min(signal) >= -1 and max(signal) <= 1 And the error is: Traceback (most recent call last): File = "vad.py", line 10, in <module> signal = signal /...

Android TTS sound Leaked Service Connection and speak deprecated

I am using android tts in my class to just say a message like so: Problem: I get leaked service connection public class WorkTimerNotification extends ActionBarActivity implements TextToSpeech.OnInitListener { TextToSpeech tts; @Override protected void onCreate(Bundle savedInstanceState) { super.onCreate(savedInstanceState); setContentView(R.layout.activity_work_timer_notification); //line1 tts = new TextToSpeech(this,this); @Override public void onInit(int status) {...

How do I add variables to Speech Recognition?

I've done some coding on a virtual assistant, and I've hit a block. Here's the code: private void Form1_Load_1(object sender, System.EventArgs e) { SrgsDocument commands = new SrgsDocument(); SrgsRule displayList = new SrgsRule("selectList"); SrgsOneOf listTypes = new SrgsOneOf(new string[] { "hi " + name + ", my name is" ,...

How to include variables in a string? [duplicate]

This question already has an answer here: Python string formatting: % vs. .format 11 answers I'm doing some coursework and I need to determine a character's name. This is what I have so far: charOne=input("Please input your first character's name: ") charTwo=input("Please input your second character's name: ") So...

Continuous Speech Recognition Android - Without Gaps

I have an activity that implements RecognitionListener. To make it continuous, every time onEndOfSpeech() I start the listener again: speech.startListening(recognizerIntent); But, it takes some time (around half a second) till it starts, so there is this half a second gap, where nothing is listening. Therefore, I miss words that were...