- GitHub for the API
- Murtaza Hassan for the tutorial in OpenCV
- tesseract-ocr for the OCR engine
- andrewdcampbell for the idea of using the OpenCV library to do the image processing
- leftthomas for the SRGAN model
→ Combined, these people have made this project possible, I appreciate their work and contribution and I happy to do this project.
Ideal Part: Image processing to convert image text into Vietnamese language, centralized system into handwriting recognition, raw image processing, and model quality improvement SRGAN and extract the data using OCR. The results obtained from the group were successful in the conversion and have successfully used the application results for the processing sound
- Process capture and scan images for handwriting recognition and process raw images captured by image transformation.
- Pre-processing of raw images by enabling image enhancement with SRGAN to improve image quality without losing image detail.
- Extract text using OCR – specifically the Tesseract engine through the available training data.
[1] Capture and scan images for handwriting recognition or printed text and process raw images captured by image transformation.
[2] Pre-processing of raw images by enabling image enhancement with SRGAN to improve image quality without losing image detail.
Notice That image not 100% improved because the Type of image is not the same as the training data (So Carefully) and one thing is that model using much CPU time and Memory so it need much time for processing.
[4] The Final Step is sending the result in step 3 for server to process and return the result in style URL and get this URL and play this text we converted.
Have Good Team to do this other part i want to say thanks for colap with them congtoan, trian and thanhtue. The detail of the step convert text to audio can visit github and the step send and receive the result from server can visit github for details
- So I have to say good experience with this AI project.
python ScanDocV02.py -lsd opencv