How to detect language and convert speech to text? Is Google APi support it? If yes then can anyone put any example of code?
Please help needed to convert speech to text?
Thanks
If you use C# 3.5 and higher u can Add a reference to the System.Speech namespace using the Add Reference in the Solution Explorer.
Then take a look at this Articles:
Speech recognition, speech to text, text to speech, and speech synthesis in C#
C# Speech to Text
Related
please help me in speech recognition using HMM (hidden markov models) or MFCC ( Mel Frequency Cepstral Coefficient ) by longage c# or c++
I want to recognize word "one", "two"... to "ten")
When I say one ===> show MessageBox write one
You should use a toolkit for this purpose like HTK, Kaldi, etc. which are open-source or you could use a free API like Google Speech API, Microsoft Speech API (SAPI), etc.
It is not really easy to do speech recognition using HMM from scratch. BTW, MFCC is not a machine learning tool like HMM. MFCC is a method of feature extraction which is used to prepare observations for HMM training and decoding.
i know how to use speech recognition in C# but the problem is how to add a special word or name into windows speech dictionary database?
in windows 7 and 8 you can do it easily using:
Opening Speech Dictionary > Add new word > Enter the Text of word > Record the pronunciation of the word by Microphone
and then,it's OK! the word will add to database.
we also can edit the word using the Speech Dictionary.
does anyone know how can we do these steps with .NET and programming?
EDIT:
its very simple, windows speech dictionary has limited database , how can we add some other words into this via .NET C#?
for example name "Salad" doesn't exist in windows speech dictionary. how can i add this word and its pronunciation into windows speech dictionary?
sorry i'm a bit new to this great site.
You'll need to use the SAPI Automation APIs (aka SpeechLib) to access the ISpLexicon interfaces.
In particular, ISpLexicon::AddPronunciation will add a new word (and its associated pronunciation) to the user lexicon.
I just want to know if there is any built in libraries or external libraries in Java or C# that allow me to take an audio file and parse it and extract the text from it.
You can use the built in .NET speech recognition APIs to accomplish this. MSDN has a set of complete samples that read an audio file and then write the recognised speech to the console. With a little bit of work they can be modified to output to a plain text file:
http://msdn.microsoft.com/en-us/library/system.speech.recognition.speechrecognitionengine.setinputtoaudiostream.aspx
How can I perform speech recognition on speech coming from an audio file (.mp3, wav) instead of the microphone ?
I want to be able to do that from C#.NET and Delphi.
This article answers your question specifically:
Using WAV File Input with SR Engines
http://msdn.microsoft.com/en-us/library/ms717071(VS.85).aspx
See the following articles for general info:
http://msdn.microsoft.com/en-us/magazine/cc163663.aspx
http://en.wikipedia.org/wiki/Speech_Application_Programming_Interface
http://msdn.microsoft.com/en-us/library/ms723627(VS.85).aspx
Can anybody help me with Text to Speech Engine in C#
Microsoft Speech SDK
http://msdn.microsoft.com/en-us/library/aa920210.aspx
Just for simple t2s, it should be very simple, like
SpVoice objSpeech = new SpVoice();
objSpeech.Speak("Hello world",SpeechVoiceSpeakFlags.SVSFlagsAsync);
objSpeech.WaitUntilDone(Timeout.Infinite);