In my previous study post I've mentioned about how you can turn your commuting time into a productive one by using a text to speech online reader.
And some have asked me whether if it is possible to read an image file or pdf document.
The answer is Yes~
But! In order for the reader to perform text-to-speech translation you'll first have to convert your pdf/image file into a flat text file using an Optical Character Recognition (OCR) software~
See it works for Japanese Characters as well~ I am able to get a 100% accuracy from this Japanese characters image.
Things to note:
And some have asked me whether if it is possible to read an image file or pdf document.
The answer is Yes~
But! In order for the reader to perform text-to-speech translation you'll first have to convert your pdf/image file into a flat text file using an Optical Character Recognition (OCR) software~
So what is Optical Character Recognition and its uses?
OCR is a type of character recognition technology that is used for translating images or pdf files into pure text document. The algorithm will scan, detect, differentiate and convert characters to plain text at an efficient speed.
The extracted text can then be used for various applications:
How to extract text using Google Drive OCR software
- Login to Google drive
- Click on the gear icon on the right hand side of your window
- Go to upload settings and check these two settings
☑Convert text from uploaded PDF and image files
☑Confirm settings before each upload - Now when you upload a file a prompt window will appear and ask you to confirm your settings
- Leave the setting as it is if you are converting English Document
- Once the upload is done, open up your new file
- The extracted plain text will be shown below your image
Google Drive OCR settings |
I am going to convert a short Japanese passage |
Raw Image Top, Words Below |
Things to note:
Google Drive file size limitation for OCRed is 2MB. (.jpg, .gif, .png and PDF)
It will only look at the first 10 pages of PDF when searching for your text to extract.
Always do your checks OCRed document may not always be accurate.
For best accuracy:
Use high resolution well lighted image, good contrast between characters and background
Orientate your text in the correct direction. Typically the text will be scanned from left to right.
Multiple columns document dont work well with this OCR
Select the correct language (there are 38)
Use a top down view image like what I did.