ocr – Row Coding

How do I segment a document using Tesseract then output the resulting bounding boxes and labels

November 28, 2023 by Tarik

Success. Many thanks to the people at the Pattern Recognition and Image Analysis Research Lab (PRImA) for producing tools to handle this. You can obtain them freely on their website or GitHub. Below I give the full solution for a Mac running OS X 10.10 and using the Homebrew package manager. I use Wine to run Windows … Read more

What kind of OCR Java library should I use in Android? [closed]

September 16, 2023 by Tarik

I don’t know how good it is (it definitely needs to be trained first), but there is Ron Cemer’s Java OCR library.

Extracting code from photograph of T-shirt via OCR

September 13, 2023 by Tarik

You can probably type faster than you can clean up images and install OCR engines: #!/usr/bin/perl (my$d=q[AA GTCAGTTCCT CGCTATGTA ACACACACCA TTTGTGAGT ATGTAACATA CTCGCTGGC TATGTCAGAC AGATTGATC GATCGATAGA ATGATAGATC GAACGAGTGA TAGATAGAGT GATAGATAGA GAGAGA GATAGAACGA TC GATAGAGAGA TAGATAGACA G ATCGAGAGAC AGATA GAACGACAGA TAGATAGAT TGAGTGATAG ACTGAGAGAT AGATAGATTG ATAGATAGAT AGATAGATAG ACTGATAGAT AGAGTGATAG ATAGAATGAG AGATAGACAG ACAGACAGAT AGATAGACAG AGAGACAGAT TGATAGATAG ATAGATAGAT TGATAGATAG … Read more

Converting YUV->RGB(Image processing)->YUV during onPreviewFrame in android?

September 12, 2023 by Tarik

Although the documentation suggests that you can set the format in which the image data arrives from the camera, in practice you often have a choice of one: NV21, a YUV format. For lots of information on this format see http://www.fourcc.org/yuv.php#NV21 and for information on the theory behind converting it to RGB see http://www.fourcc.org/fccyvrgb.php. There … Read more
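To make the NV21 layout a little more concrete, here is a minimal sketch of a per-pixel NV21 to RGB conversion in plain Python. It is not the code from the original answer. The function name `nv21_to_rgb`, the helper `clamp`, and the choice of integer BT.601 video-range coefficients are assumptions made for illustration; a real camera may use a different range, and production code would normally do this natively rather than in a Python loop. The frame width and height are assumed to be even.

```python
def clamp(value):
    """Limit a channel value to the 0..255 range."""
    return max(0, min(255, value))


def nv21_to_rgb(data, width, height):
    """Convert an NV21 frame to a row-major list of (r, g, b) tuples.

    NV21 stores a full-resolution Y plane (width * height bytes) followed by
    a half-resolution chroma plane of interleaved V, U byte pairs, where one
    pair is shared by each 2x2 block of pixels.
    """
    frame_size = width * height
    pixels = []
    for row in range(height):
        for col in range(width):
            y = data[row * width + col]
            # One chroma row covers two pixel rows and is `width` bytes long.
            chroma_index = frame_size + (row // 2) * width + (col // 2) * 2
            v = data[chroma_index] - 128      # V comes first in NV21
            u = data[chroma_index + 1] - 128
            # Assumed: BT.601 video-range coefficients, scaled by 256.
            c = max(y - 16, 0)
            r = clamp((298 * c + 409 * v + 128) >> 8)
            g = clamp((298 * c - 100 * u - 208 * v + 128) >> 8)
            b = clamp((298 * c + 516 * u + 128) >> 8)
            pixels.append((r, g, b))
    return pixels


# A 2x2 frame: four Y bytes, then a single V, U pair with neutral chroma.
white_frame = bytes([235, 235, 235, 235, 128, 128])
print(nv21_to_rgb(white_frame, 2, 2))
```

Converting back to YUV after processing would apply the inverse of the same matrix, so whichever coefficients are chosen should be used consistently in both directions.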

Android OCR Library [closed]

September 9, 2023 by Tarik

Look at ABBYY’s Android OCR lib (paid) or the Tesseract JNI wrapper (free). Also look at this Stack Overflow post.

Detect if an OCR text image is upside down

August 31, 2023 by Tarik

Python3/OpenCV4 script to align scanned documents. Rotate the document and sum the rows. When the document is rotated by 0 or 180 degrees, there will be a lot of black pixels in the image. Use a score-keeping method. Score each image for its likeness to a zebra pattern. The image with the best score … Read more
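The excerpt is cut off, so the following is only a sketch of the row-sum scoring idea, not the original script. It uses plain Python lists instead of OpenCV, treats ink pixels as 1 and background as 0, and the names `zebra_score`, `rotate_90`, and `best_rotation` are hypothetical. The score used here (sum of squared differences between adjacent row sums) is one possible way to measure "zebra likeness" and is an assumption. Note that this score is identical for 0 and 180 degrees, so on its own it separates horizontal text from vertical text but does not tell upright from upside down.

```python
def row_sums(image):
    """Sum the pixel values of each row (a horizontal projection profile)."""
    return [sum(row) for row in image]


def zebra_score(image):
    """Score how strongly row sums alternate between adjacent rows.

    Horizontal text lines give alternating dense and empty rows, so the
    squared differences between neighbouring row sums are large.
    """
    sums = row_sums(image)
    return sum((b - a) ** 2 for a, b in zip(sums, sums[1:]))


def rotate_90(image):
    """Rotate a list-of-lists image clockwise by 90 degrees."""
    return [list(row) for row in zip(*image[::-1])]


def best_rotation(image):
    """Return the candidate angle (0 or 90) whose rotation scores highest."""
    candidates = {0: image, 90: rotate_90(image)}
    return max(candidates, key=lambda angle: zebra_score(candidates[angle]))


# Tiny synthetic "page": rows alternate between ink (1) and blank (0).
striped = [
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
print(zebra_score(striped), zebra_score(rotate_90(striped)))
print(best_rotation(striped), best_rotation(rotate_90(striped)))
```

With a real scan, the same scoring could be applied to a range of small candidate angles rather than just two, keeping the angle with the best score.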

Split text lines in scanned document

August 22, 2023 by Tarik

From your input image, you need to make the text white and the background black. You then need to compute the rotation angle of your bill. A simple approach is to find the minAreaRect of all white points (findNonZero). Then you can rotate your bill so that the text is horizontal. Now you … Read more

Set Tesseract font for OCR

August 15, 2023 by Tarik

As of now, this option is not available. The current version is Tesseract 5.

Using Tesseract for handwriting recognition

August 14, 2023 by Tarik

In short, you would have to train the Tesseract engine to recognize the handwriting. Take a look at this link: Tesseract handwriting with dictionary training. This is what the linked post says: It’s possible to train Tesseract to recognize handwriting. Here are the instructions: https://tesseract-ocr.github.io/tessdoc/Training-Tesseract But don’t expect very good results. Academics have typically gotten … Read more

Detect text area in an image using python and opencv

August 8, 2023 by Tarik

There are multiple ways to go about detecting text in an image. I recommend looking at this question here, for it may answer your case as well. Although it is not in Python, the code can be easily translated from C++ to Python (just look at the API and convert the methods from C++ to … Read more