Xamarin Tesseract OCR binding for Android

xamarin,monodroid,ocr,tesseract
I would like to use Tesseract OCR in Xamarin.Android and Xamarin.iOS applications. I found the binding for iOS (https://github.com/jherby2k/Xamarin-Tesseract-OCR-iOS-Unified). Is there an equivalent for Android?...

Swift Import Obj-C Framework

ios,swift,xcode6,tesseract
I am having trouble importing an Obj-C framework into a Swift project. Beginning with an empty Swift project, here is everything I did: drag and drop the Tesseract framework into Xcode ("Copy items if needed" was checked); drag and drop a random .m file into Xcode; Xcode generated a Bridging...

How to use user-words in Tesseract (Java)?

java,ocr,tesseract,tess4j,config-spec
I am using Tesseract for OCR purposes and I have added a few additional words to "fin.user-words" (I would like to avoid creating a new word list and replacing tessdata/fin.word-dawg with it). Now, I succeeded in doing it at the command prompt: >tesseract image.png result -l fin TestConfig where TestConfig (Tesseract configuration file...

Preprocessing image for Tesseract OCR with OpenCV

opencv,image-processing,ocr,tesseract
I'm trying to develop an app that uses Tesseract to recognize text from documents photographed with a phone's camera. I'm using OpenCV to preprocess the image for better recognition, applying a Gaussian blur and a threshold method for binarization, but the result is pretty bad. Here is the image...

OCR implementing for multiple languages

android,ocr,tesseract,hindi
I have implemented an OCR application for Android using tess-two. It runs successfully, though it gives only an 80% result. Now I want to implement the same Android application for other languages such as Hindi, Chinese, French, etc. I tried to edit the code of simple-android-OCR by Gautam Gupta. Please...

Missing 'strcasestr.cpp' file when compiling Tesseract 3.03 training tools

make,ocr,tesseract,autoconf
I have managed to build Tesseract 3.03 rc1 from source. But when I try to build the training tools, which are the very feature I want from 3.03, I get the following error. It seems there should be a strcasestr.cpp file in the vs2010 folder. But the downloaded source...

Cleaning up an image for OCR with ImageMagick and 'textcleaner'

imagemagick,ocr,tesseract,imagemagick-convert
I have the following image that I'd like to prepare for OCR with Tesseract: The objective is to clean up the image and remove all of the noise. I'm using the textcleaner script that uses ImageMagick with the following parameters: ./textcleaner -g -e normalize -f 30 -o 12 -s...

Is there true type font file for 'raster font'?

fonts,tesseract,raster,true-type-fonts,python-tesseract
I am using Tesseract to do OCR on some screenshots. The characters in the screenshots are in raster fonts, but Tesseract requires a TrueType font file for training. I can find many TrueType font files in the Windows/Fonts folder. I am wondering if there's one for raster fonts?...

php tesseract with http post no response

php,html,ocr,tesseract
I am developing a PHP page on the web server. It works in the following three steps: get an image uploaded from an HTML form with the POST method; execute tesseract to change the image into text; print the text on the screen. Now...

Progress/cancel callback in Tesseract using ETEXT_DESC

android,c++,tesseract,tess-two
Is there a way to specify a progress and cancel callback in Tesseract? I'm using Tesseract in Android, using the tess-two project. There is already a previous question - Android Tesseract progress callback. However, the answers there imply that it's not possible. I have another crucial detail to add -...

Android Tesseract OCR on Android Studio [closed]

android,eclipse,android-studio,ocr,tesseract
For a while I have been trying to include Tesseract in my Android app in Android Studio (using this tutorial). Since it did not work after many tries (missing allheaders.h), I contacted the creators (Gautam Gupta for the blog and Robert Theis for the OCR), and they told me to try it in Eclipse. Since I...

Camera Preview and OCR

xamarin,monodroid,android-camera,ocr,tesseract
I am new to android development - I'm using Xamarin. I am trying to write an application that initiates the camera preview, and then constantly scans the incoming frames for text (I am using Xamarin.Tesseract from NuGet). In other words, I don't want to make the user take a photo...

Zygote Error at runtime in Android Studio Tesseract OCR app

android,android-studio,tesseract
I was building a variation of Simple Android OCR in Android Studio with the help of Tesseract OCR. After the camera is used, the application stops and gives the following errors. How can this be solved? E/Zygote﹕ Zygote: error closing descriptor libcore.io.ErrnoException: close failed: EBADF (Bad file number) at libcore.io.Posix.close(Native...

How to tell if a string of characters makes intelligible words

java,android,statistics,tesseract,linguistics
So, I'm working on a simple mobile app project (mostly for fun) that uses an OCR library (tesseract) on Android to scan a camera picture, do some stuff with the text, and return it to the user. What I'm wondering is if anyone out there knows of a way to...

How to use jai-imageio in IntelliJ plugin

java,image-processing,intellij-idea,tesseract,jai
I'm developing a plugin for IntelliJ that needs to use Tesseract. When I run it as a console application, it works fine. But when I run it as a plugin I get the following exception: SEVERE: Need to install JAI Image I/O package. https://java.net/projects/jai-imageio/ java.lang.RuntimeException: Need to install JAI...

android - using the tess-two library

android,ocr,tesseract,tess-two
I am following this tutorial and managed to build the library just fine. My State Now: I take a photo, save it to the external memory (here is the directory path) static String directoryPath = Environment.getExternalStorageDirectory().toString() + "/saved_images"; In the directory there are currently only pictures I took in jpg...

tesseract-ocr works on EC2, not lambda

amazon-ec2,tesseract,amazon-lambda
My goal is to run tesseract-ocr in AWS Lambda. I've built an EC2 instance that attempts to mirror the Lambda environment. Executing tesseract without parameters succeeds in both environments. However, any attempt at substantive image processing, e.g. this code: tess = child_process.exec('tesseract input.tif output -l eng -psm 1 hocr', function(error,...

An invalid parameter error at msvcr120.dll (Building Tesseract Lib in 64bit Windows)

c++,tesseract,tiff,libtiff,leptonica
I have already raised this on the Tesseract forum, but I am raising it here as well in the hope of getting a clue about the error, as this is my favorite forum for solving problems. I have a problem somehow related to the Tesseract lib. The problem is...

Tesseract board detection

java,android,image-processing,tesseract
I'm working with Tesseract on Android using the tess-two wrapper. I've read the documentation for the library, but I'm having trouble recognizing a square in my image. I'd like to recognize the outermost square in a sudoku board, for instance. There is an example in OpenCV but I cannot...

Tesseract character recognition problems in Android (but not on iOS?)

android,ios,ocr,tesseract,tess-two
I've built an application that uses Tesseract (V3.03 rc1) to identify some specific text strings. These are, unfortunately, printed in a custom font that requires that I build my own traineddata file. I've built the application on both iOS (using https://github.com/gali8/Tesseract-OCR-iOS for inspiration) and Android (using https://github.com/rmtheis/tess-two/ for inspiration as...

Tesseract 3.03: 'boxchar.lo' is not a valid libtool object

gcc,tesseract,autotools,libtool
I have been trying to compile Tesseract 3.03 rc1 these days. I have tried Cygwin, MinGW+MSYS, and MSYS2+MinGW-w64, and now I am using Xubuntu 15.04. The 3.03 rc1 source is downloaded from here. I have successfully compiled Tesseract with make install. But when I try to compile the training...

Tesseract OCR in C# Code

c#,ocr,tesseract
I am trying to develop Optical Character Recognition (OCR) for Bangla with Tesseract. I'm at an early stage now. I found some links about it, but every one of them points to this Google Code link. Actually I want to know how I can use Tesseract in my C# code and before that how...

How does Tesseract use OpenCL?

parallel-processing,opencl,gpu,tesseract
I am working on a project that requires me to speed up the process of text recognition using Tesseract. I came across an article which said Tesseract is working in conjunction with OpenCL to offload some of the compute intensive tasks onto the CPU or GPUs available. Is there a...

500 Internal server error when I try to run my php script [closed]

php,tesseract
When I try to run this PHP script <?php require_once 'TesseractOCR/TesseractOCR.php'; require_once 'aws_signed_request.php'; define("public_key", "****"); define("private_key", "****"); define("associate_tag", "****"); if (isset($_GET['ASIN'])) { $ASIN = $_GET['ASIN']; $link = "http://charts.camelcamelcamel.com/us/" . $ASIN . "/sales-rank.png?force=1&zero=0&w=400&h=400&legend=1&ilt=1&tp=1m&fo=0&lang=en"; $localurl = "../imgs/" . $ASIN . ".png"; $slocalurl = "../imgs/" . $ASIN . "-2.png"; $highrank =...

Seven Segment Digital Data Recognition using Tesseract / Java

java,tesseract,image-recognition,tess4j,seven-segment-display
I am trying to recognize seven-segment digital text from an image using Tess4J. My input is here. I have done some normalization as follows: 1] cropped the image; 2] converted it to binary. I wish to remove the jagged edges of the text from the image. How can I...

Can I get Tesseract accuracy?

python-2.7,tesseract
I'm trying to use Tesseract to recognize numbers in images with some noise. Is it possible to get Tesseract's accuracy in a specified output through Python?

Next step in image preprocessing for OCR with Tesseract (tess4j)

java,image-processing,ocr,tesseract,tess4j
I've been trying to use Tesseract to identify some digits in a series of images, and after scouring for advice I've made a number of improvements. So far I've attempted the following steps: binarize the image at an appropriate threshold to pick out the numbers; restrict Tesseract to digits only...
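
The first step named in the excerpt above, binarizing at a threshold, can be sketched in plain Python. This is only an illustration under stated assumptions, not the asker's code: the function name `binarize`, the default threshold of 128, and the tiny sample pixel grid are all hypothetical, and a real pipeline would load the image with an imaging library before handing it to Tesseract.

```python
def binarize(gray, threshold=128):
    """Return a black-and-white copy of a grayscale image.

    `gray` is a list of rows, each a list of 0-255 intensities.
    Pixels at or above `threshold` become 255 (white); the rest become 0 (black).
    The default of 128 is an assumed starting point, not a recommended value.
    """
    return [[255 if px >= threshold else 0 for px in row] for row in gray]


# Hypothetical 2x3 grayscale "image" used only to show the behaviour.
sample = [[12, 200, 130], [127, 128, 255]]
binary = binarize(sample)
```

The right threshold depends on the images: too low and background noise survives, too high and thin strokes of the digits disappear, so it is usually worth trying several values on representative samples.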

Bing translation error while using tesseract ocr in android for real-time text detection and translation

android,translation,ocr,tesseract
I am using Robert Theis' experimental app (namely, android-ocr) to achieve real-time OCR and translation (using Bing translator.) In class CaptureActivity.java, in function handleOcrContinuousDecode (which is the function for real-time OCR), I have created a TranslateAsycnTask.java object which passes the translated-text to be displayed through the ViewFinderView.java like this: The...

OCR - How to train a new Tesseract model?

machine-learning,ocr,tesseract,text-mining
I am using Tesseract to recognize characters from screenshots. But it seems many models are trained for images like the one below. This image is very different from a screenshot. Does anyone know where I can find trained data for screenshots? Or could anyone tell me how to train a model for...

Tesseract integration to my Project on Xcode 6.3 iOS 8.3

ios,ocr,tesseract,objective
I've been digging through the web and Stack Overflow questions, but none of them solves my problem. I'm trying to use Tesseract OCR in my iOS project, but the integration did not go as I expected. I followed the instructions in this blog and did everything, but still I...

NPE during concurrent thread access of a single tess4j instance

tesseract,tess4j
I am working with Tesseract 3.0.2 and tess4j 1.4.1. This is not working in a thread-safe manner; I get an NPE. I am using Grizzly/Jersey/Spring. @Service("textExtractorService") public class TextExtractorServiceImpl implements TextExtractorService { Logger LOGGER = Logger.getLogger(TextExtractorServiceImpl.class); private final Tesseract instance = Tesseract.getInstance(); // JNA Interface ... .. } ... ......

Tesseract on Linux crashes Glassfish

java,linux,tesseract,tess4j
We are using Tess4J/Tesseract to perform OCR in a webapp. On Windows everything works fine, but when deployed on a Linux machine the program crashes, kills the GlassFish process and outputs a dump file: hs_err_pidXXXXX.log. # # A fatal error has been detected by the Java Runtime Environment: # #...