Python OCR System Raise characters from images to improve work efficiency

1. Background of creating Python OCR System

Students use various documents such as books and the Internet when writing graduation studies and reports.

I think that creating a report is a heavy burden for students because there are occasions when they are quoted.

So, take a picture or take a screenshot and take a text from the image

I think that if I can wake it up, it will lead to work efficiency, so this time I will use the OCR System for PC

I decided to create it.

2. ___ development environment ___

3. Releases on GitHub

File is released in rtInamoriRyusei / RT-Inamori.

We want many people to use it, so we will publish it for free.

I will put up a link below so please take advantage of it.

https://github.com/rtInamoriRyusei/RT-Inamori/releases/tag/v1.0

4. How to use

① From the above link

・dist.zip ・Tesseract-OCR.zip

Please download the two files and unzip both of them.

(2) Save the Tesseract-OCR File in C: \ Program Files.

③ After that, you can use it by double-clicking main.exe in distFile to start it.

④ From here, I will show you the actual screen and explain it. When you start it, you will see this screen.   1.PNG

⑤ Next, drag and drop the image (.png / .jpg) you want to transcribe into the box above. Then it looks like this.    3.PNG

⑥ Then press the Start Button to wake up the characters.    4.PNG

⑦ All you have to do is copy and paste the characters in the box below.

5. Reference

Python script exe file Convert Python Script to executable format (exe format) Character recognition with Python and Tesseract OCR

6. Finally

How was that.

This time it was a short sentence, so it has good accuracy, but it depends on the size of the long sentence and the photo.

It is impossible to make characters with 100% accuracy due to character differences.

However, I think it would be nice if you could devise it while using the cutting method and size of the image.

There are still many parts that are not practical, but I hope it will lead to more efficient work.

If you like it, feel free to use it.

Recommended Posts

Python OCR System Raise characters from images to improve work efficiency
Post images from Python to Tumblr
I tried to improve the efficiency of daily work with Python
[Python] Try to recognize characters from images with OpenCV and pyocr
A real way for people using python 3.8.0-2 from windows to work with multibyte characters
Changes from Python 3.0 to Python 3.5
Changes from Python 2 to Python 3.0
[Python] Conversion from WGS84 to plane orthogonal coordinate system
[Work efficiency] How to change file names in Python
I want to improve efficiency with Python even in an experimental system (2) RS232C and pySerial
I want to improve efficiency with Python even in an experimental system (1) Install Anaconda with Chocolatey
(Translation) Native connection from Python to Hadoop file system (HDFS)
Post from Python to Slack
Cheating from PHP to Python
OCR from PDF in Python
Anaconda updated from 4.2.0 to 4.3.0 (python3.5 updated to python3.6)
Switch from python2.7 to python3.6 (centos7)
Connect to sqlite from python
[Python] Try to graph from the image of Ring Fit [OCR]
git / python> git log analysis (v0.1, v0.2)> Implementation to estimate work time from git log
I want to improve efficiency with Python even in an experimental system (3) I want to do something like Excel with Pandas
Call Matlab from Python to optimize
How to collect images in Python
Create folders from '01' to '12' with python
Post from python to facebook timeline
[Lambda] [Python] Post to Twitter from Lambda!
5 Easy-to-Use Python Tools | Increase Work Efficiency
Extract text from images in Python
Connect to utf8mb4 database from python
Python (from first time to execution)
How to access wikipedia from python
Python to switch from another language
[Introduction to Python3 Day 21] Chapter 10 System (10.1 to 10.5)
Did not change from Python 2 to 3
Update Python on Mac from 2 to 3
Detect Japanese characters from images using Google's Cloud Vision API in Python