You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Added ability to perform image preprocessing. This would perform grayscale conversion, denoising and binarization. This should help improve OCR accuracy.
Responsible for doing image preprocessing. Image preprocessing is need to make the image clearer, crispier and easier to read.
3
+
4
+
OCR performs better if the image is:
5
+
- grayscale
6
+
- denoised
7
+
- binarised
8
+
9
+
grayscale conversion: Convert from RGB space to grayscale space. Thus converting a color image into a black-and-white image. Each pixel represents intensity rather than color. Reduces complexity from 3 color channels to 1.
10
+
denoised: Applies filters and removes dots, specks and blurs. Currently MedianBlur algorithm is being used with Pillow.
11
+
binarised: Convert grayscale image to binary. Removed background clutter. Each pixel becomes either black(0) or white(255). Dark becomes darker and light becomes lighter.
12
+
13
+
TODO:
14
+
- DPI Normalization: to make the image crispier and easy to read
15
+
- Contour detection
16
+
17
+
Currently we are using Pillow, which is basic. We can move to opencv which has better denoising and binarization support. Also it supports contour detection and DPI normalization.
0 commit comments