I have developed an ANPR application which requires an OCR engine. I am trying to use tesseract as the OCR engine. But I really cant find a proper step by step tutorial or guidlines as to how to include tessnet2 it in my C#.Net Project. I have already trained tesseract v3.01. can someone help with this issue please?
Thanks
You can't use 3.01 data with Tesseract 2.04 engine -- they're not compatible as Tesseract Wiki states. You would need Tesseract 3.0x engine. There is a .NET wrapper for 3.01: tesseract-ocr-dotnet.
Related
There are some different between emgu cv library , documentation, sample files. emgv.cv.capture is not supported in latest emgu cv library. I cannot access the _capture.start(), _capture.pause(), _capture.stop() functions. I want solution for this.
I am going to assume that your question was not meant as rudely as you stated it. I will suggest that you check again. In the 3.1 documentation and examples not only is Capture there so is Start and Stop.
Doug
surf algorithm is included in Emgu CV 3.1 or not? if included then how i can use it. please Explain in detail am new in Emgu CV. Am trying to implement surf algorithm using c# but can't understand how to implement it.
For version 3.1, the name changed from SURFDetector to SURF
these are the links for version 3.0 and 3.1:
http://www.emgu.com/wiki/files/3.0.0-rc1/document/html/8e5f3e80-3ebb-c21d-3e5a-0b792d2d2a1a.htm
http://www.emgu.com/wiki/files/3.1.0/document/html/dbec25e2-17bb-2070-fad3-454b3b4c0b6d.htm
Check the API files would help a lot.
I am developing an OCR application in EmguCV. My target is to achieve the same results as done by Capture2Text.
Capture2Text uses Tesseract engine for OCR and Leptonica library for Preprocessing. As we know EmguCV is using Tesseract engine for OCR therefore we are left with Preprocessing.
Preprocessing in Capture2Text is done by calling the functions of Leptonica in the following order with specific values. The actual code file is available in the leptonica_util.c uner the folder ...\Capture2Text_v3.5\Capture2Text\SourceCode\leptonica_util
Read in source image
Convert to grey scale
Perform auto negate image
Scale the image (linear interpolation)
Apply unsharp mask
Perform Otsu Binarize
Out of these two possible options, which one is better to follow and how ?
Use Leptonica library in EmguCV by including it in the code and then calling the same functions with same parameters.
Try to find and use equivalent functions already available in EmguCV.
I have tried to go for option 1, but could not do it properly. For 2nd option I am not sure where to look specifically in EmguCV.
Finally, I managed to solve the problem by using Leptonica library in EmguCV. The main issue I found was the mismatch between 'bitness' of Leptonica libraries and EmguCV installation on my system.
What worked for me was to set both of them to be used in a 32 bit environment and build them for x86 machine.
Now the OCR recognition level is same as in Capture2Text (which is very accurate for my application).
I have been reviewing replacements for the Office 2007 MODI OCR (OneNote's 2010 solution has lesser quality/results than 2007 :-( ). I notice that Windows 7 contains an OCR library once you install the optional tiff filter
The OCR component gets installed to
%programfiles%\Common Files\microsoft shared\OCR\7.0\xocr3.psp.dll
but I don't see any API for it?
Does anyone see how this can be interfaced preferably in C#?
ANSWER: Found the soluation, once the optional tiff ifilter win7 feature is installed, i can then get a textoutput of a screenshot using the code/exe on http://www.codeproject.com/KB/cs/IFilter.aspx. Also if add the same [HKEY_CLASSES_ROOT.tiff\PersistentHandler] for .png and .jpg then OCR also works for jpg and png's.
Tessnet OCR is a good solution, but pretty old (last release from 2009). There are couple of very good free OCR solutions available for .NET:
Asprise C# OCR SDK. Very good and fast one.
Microsoft Research Project Hawaii Web-based (cloud) OCR solution with full docs and samples (discontinued 2013)
Bing OCR Web based (cloud) OCR replacement for above. (discontinued March 2014)
Try TessNet, using the suggestions I made to the Poster in this post (enlarge image, use separate process):
c# OCR can't recognize digits (tesseract 2)
I was exploring the windows 7 dlls and I found 3 libraries that might be useful: thocr.psp.dll ,xocr3.psp.dll, and ximage3b.dll. In this website and other similar websites I found out that ximage3b is a Windows system ocr engine. I have been looking for documentation online but I have not been succesful, but hey! at least I know that it's there, I will give you guys an update if I find out how to use it with C#/C/C++.
I'd like to process a directory of mpg's in a batch to have a thumbnail using C#/.NET.
Does anyone have any good suggestions on how I could do this?
I know it's not C# .NET but ffmpeg is a great tool to do exactly this. Can be run as a command line tool from any language.
Here's a small tutorial to get you started.
I did this a few years ago, but I seem to have lost the source. Anyway, the route-of-least-resistance I found was to use DirectShow, there is an interop wrapper for managed code, namely directshow.net. You'll want to use IMediaDet's GetBitmapBits from the Windows Media Format SDK.
There is an example on CodeProject: Extract Frames from Video Files