c# - How to use SAPI to do speech to text? - c#

I want to learn the SAPI api for c# to do speech to text recognition.
I can't find any tutorials. Can anyone show me where an example is or can show it here?
It needs to have aupport to get the speech from a webcam mic.
Thanks.

API documentation: http://msdn.microsoft.com/en-us/library/ms723627(v=vs.85).aspx
Using SAPI in C#: http://windowscoding.com/blogs/blake/archive/2006/11/01/How-to-use-Microsofts-Speech-API-in-a-managed-application.aspx
You're right, there aren't many/any tutorials. But there's the basics - and there are plenty of C/C++ resources. Give it a go and if you have a specific question, that's a far better thing to ask on StackOverflow.

I know it's a little late for this question, but hopefully it can help people later. I want to do the same thing recently, using micphone as voice input, then sapi do the recognition part. I actually found the example code in sapi's own documents, which is written by C#. As default install location, it should be in the folder C:\Program Files\Microsoft SDKs\Speech\v11.0\Docs. It works pretty well.

Related

FFmpeg library in WPF

I am building a softwere that can decode and encode using Wpf and c#, and I want to do that using, FFmpeg libary. I saw the ffmpeg autogen warpper but I couldn't find any documantion for its function.
Can someone please find me one? Or better, give me better warrper.
I already tried FFMpegCore, its working but very slow. For Exaple it takes 20 seconds to decode a specific frame.(And i have monstruos computer)
If will be happy if ou could give me better warpper, or a guide for ffmpeg.autogen.
Thanks in advance.
FFmpeg framework is powerful, from the other hand is so large which makes it complex. If you want to mess with it, you will have to read a lot before start coding. I would recommend starting with the original documentation and examples that can be found here :-
FFmpeg Libraries Documentation
FFmpeg C Examples
And then move to some open source examples -based on what you want- coded in C# with FFmpeg.Autogen bindings.
You can check my recent FFmpeg.cs implementation that is for decoding (audio/video/subtitle frames).
FFmpeg.Autogen C# Example
I've been building my own media player to cater for my needs and I'm using
Flyleaf:https://github.com/SuRGeoNix/Flyleaf which is based on FFMPEG. There are samples in the code to get you started.
I've also seen this: FFME: https://github.com/unosquare/ffmediaelement

C# developer starting SICP - Suggested Dev Environment?

I've finally started reading SICP.
I downloaded GNU/MIT Scheme it appears that I need to work in the Edwin editor which I'm sure is great when you get used to it, however I'd really prefer not to have to learn a new editor while also learning Scheme.
I was hoping there would be a Visual Studio pluggin or something similar.
I'd actual be happy just using notepad++ and just compiling using the command line.
Has anyone got any suggestions?
Take a look at this question (a bit out of date, DrScheme is now called DrRacket).
I also see this page, which looks reasonable.
This page asks about DrRacket directly.
Edwin is pretty much the same as emacs, and that's worth learning. Any text editor will do. If you're really looking for the gui look and feel, try DrRacket? http://www.racket-lang.org/
If you are writing in mit-scheme, and are familiar with Emacs there is xscheme for Emacs
Neil Van Dyke also has a page where he has the book in texinfo format for emacs users.
sicp texinfo
You also can find videos of lectures for the book from the 80s on youtube or going to the MIT ocw site.
video lectures

Edit M4V Video Metadata

So far my google-fu has failed me, so I'm hoping someone here can help:
How can I programmatically modify video metadata (specifically in my case, m4v video files)?
I'm most familiar with Ruby and C#, but I can be fairly language agnostic on this -- that is, if there's a specific language that is good at modifying file metadata, I have no problems spending the required time to learn it well enough to accomplish what I'm trying to do.
This question is pretty similar to:
View/edit ID3 data for MP3 files
I have searched for the library so you dont have to:
http://download.banshee.fm/taglib-sharp/
Even tho it has linux on its name it is for mono, so it should compile fine in Visual Studio :)
Hope it helps.
Consider asking http://www.ffmpeg.org/contact.html in #ffmpeg
I don't think they use ID3 tags.

namespace extension.. galaxy file system toolkit...gmail shell drive like utility

i was looking for some namespace extention to extend using c# (.net) but didn't find much help online except Galaxy Filesystem tooklkit. which are vc++ based but comes with c# and java wrapper classes... which helps me alot to start and i did.
i have extended that enough now and made installer to install. it get installed successfully but don't know why, when i open it, system get stuck... :( i thought my modified version might have some problem so i tried to run Galaxy filesystem toolkit's author original version and it responded in same way as mine do :D :(
now feeling bit helpless as even author is not responding my queries regarding my queries for some reason...
any help would be really appreciated....
FYI: i need to have Gmail drive like stuff...
Do not implement shell extensions with C# or anything that uses a framework.
If you do, you will break a lot of applications!
See here for why you must not do that.

How to write an application that reads text and produces speech

I'm an IT student. I want to write an web browser for blind people. How can I use C# or java to write an application to pronounce some text from a XML file (Text to Speech)?
You can use the SpeechSynthesizer class from the .Net Framework:
Add a reference to System.Speech.dll
Add a using statement for System.Speech.Synthesis
Use this code:
SpeechSynthesizer synthesizer = new SpeechSynthesizer();
synthesizer.Speak("Hello world! How are you?");
For .NET, have a look at the Speech APIs. There's a quick introduction to it here.
Hope that's enough to get you started.
It might be worth looking at accessibility guidelines for the web, this is a good place to start: http://www.w3.org/TR/WCAG10/
This will at least indicate what people are doing to support accessibility which will give you an idea of what your application should do. (eg reading ALT and TITLE tags)
C# can certainly be used to parse web pages and in addition to the text to speech built into the .NET framework there will be 3rd party libraries that you can integrate with.
You can also have a look at existing screen reader applications to give you some inspiration: eg: http://www.freedomscientific.com/products/fs/jaws-product-page.asp
For Java, this question gives you some Text To Speach options, but writing a whole web browser has a lot more to it than just the text to speech. Unless you are looking for something specifically cross platform (which I'm guessing not since you include .NET as an option), Windows comes with TTS accessibility options and of course a web browser built in.
With .NET you can use it's Speech API and interact with Internet Explorer to get you the web browser functionality. Definitely the shortest route, but it may not bring much over the built in Windows abilities.

Categories

Resources