How to use simple OCR to read Japanese text from an image.

Reading Japanese text from an image is very easy. Using this image as an example, it is simple to use an OCR utility to read the text. Install the tesseract utility.

[root@localhost Pictures]# dnf in tesseract

Then download the Japanese language data from GitHub.

And put the files into the /usr/share/tesseract/tessdata/ directory.

[root@localhost Pictures]# ls -hula /usr/share/tesseract/tessdata/
total 41M
drwxr-xr-x. 4 root        root         129 Aug 19 08:00 .
drwxr-xr-x. 3 root        root          22 Aug 19 07:59 ..
drwxr-xr-x. 2 root        root        4.0K Apr  1  2022 configs
-rw-r--r--. 1 root        root        4.0M Aug 19 07:33 eng.traineddata
-rw-r--r--. 1 jcartwright jcartwright  35M Aug 19 08:00 jpn.traineddata
-rw-r--r--. 1 jcartwright jcartwright 2.9M Aug 19 07:59 jpn_vert.traineddata
-rw-r--r--. 1 root        root         572 Dec 27  2019 pdf.ttf
drwxr-xr-x. 2 root        root          98 Apr  1  2022 tessconfigs

Then we are all set to try this out. This works quite well, to be honest.

(jcartwright@localhost) 192.168.1.5 Pictures  $ tesseract japaneseadsammydavisjrsuntorywhiskywhitealksdf_465_683_int.jpg stdout -l jpn --dpi 150
Detected 7 diacritics
 
 
 
選 ぶ ウ イ ス キ ー で 、 男 が 分 か る 。
 
 
 
ゥ サ ン ト ソ ー ホ ワ イ f ト

This works very well to find and print the correct Japanese characters. Even on this image, it worked very well.

This is a great example of the usage of the Linux command line to solve interesting problems.

Leave a Comment Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

S3 bucket find Using Google Dorks 🌍 Here are a couple of examples: site:http://amazonaws.com inurl:". s3.amazonaws.com/" site:http://s3.amazonaws.com intitle:index.

how to run the command?

Oh WordPress ruined the command formatting. I think you can figure this out.

Both things can also be achieved with just yt-dlp. Print chapter titles: yt-dlp --print "%(chapters.:.title)#l" https://www.youtube.com/watch?v=o1jv509M8Zg Print 15 latest videos:…

I used to administer a nextstep machine (color pizza slab, 24MB ram). Wrote display postscript demos and screensavers for it.…

yeah good example of how simple raycasting a-la wolfenstein is i write fun stuf in bash but i'm less cool…

mpv --config=no --audio-device=pulse/alsa_output.usb-0c76_USB_PnP_Audio_Device-00.analog-stereo --quiet --vo=tct --lavfi-complex='[aid1]asplit[ao][a1];[a1]showcqt[vo]' /media/sdc2/Projects/Music/NSF/amazingmusic/NSF_Archive/Chiptune_Artists/Originals/* showcqt ftw! thanks!

Hi.. does the launcher comes in english..also in game options not working Thank you

I got to launch in english...but the launched in Russian...also option menu is not active in game...could you please help...thank…

Hi..I am installing it right now...thank you for replay One question...is it available with English language Thank you

Maybe, easier than Windows XP. IE is spread across the whole OS it seems with the Active Desktop crap. But…

M	T	W	T	F	S	S
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31