Biz Card Data Extraction

https://bizcard-text-extraction-a7jduyfnvvesxdckg8rhy6.streamlit.app/

Introduction

The Biz Card Data project focuses on extracting and processing data from business card images using the EasyOCR library. By leveraging optical character recognition (OCR) techniques, we can automatically extract text from business card images and convert it into structured data for further analysis or storage. This documentation outlines the steps involved in extracting business card data and demonstrates how to use the Streamlit framework to create a user-friendly interface for the data extraction process.

Setup and Dependencies

To get started with the Biz Card Data project, follow these steps:

Install the EasyOCR library using pip: pip install easyocr.
Import the EasyOCR module and create a reader for English: import easyocr, then reader = easyocr.Reader(['en']).
Import the PIL (Python Imaging Library) module for image handling: from PIL import Image, ImageDraw. PIL is provided by the Pillow package (pip install pillow).
Open an image using PIL's Image.open function: image = Image.open('business_card.jpg').
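Use the reader.readtext function to extract text from the image. readtext accepts a file path, raw bytes, or a NumPy array, so convert the PIL image first (import numpy as np): result = reader.readtext(np.array(image)). Each entry in result is a (bounding box, text, confidence) tuple.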
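The setup steps above can be sketched as a few small functions. This is a minimal illustration, not the project's actual code: the function names load_reader, read_image, and confident_text, and the 0.3 confidence threshold, are assumptions made for this example. Third-party imports are placed inside the functions so that the pure helper can be used without EasyOCR installed.

```python
from typing import List, Sequence, Tuple

# EasyOCR returns one (bounding_box, text, confidence) tuple per detected region.
OcrHit = Tuple[Sequence, str, float]


def load_reader(languages=("en",)):
    """Create the EasyOCR reader (model weights are downloaded on first use)."""
    import easyocr  # imported lazily because loading it is slow

    return easyocr.Reader(list(languages))


def read_image(reader, image_path: str) -> List[OcrHit]:
    """Open an image with PIL and run OCR on it."""
    import numpy as np
    from PIL import Image

    image = Image.open(image_path).convert("RGB")
    # readtext takes a file path, bytes, or a NumPy array, so convert the PIL image.
    return reader.readtext(np.array(image))


def confident_text(result: Sequence[OcrHit], min_confidence: float = 0.3) -> List[str]:
    """Keep the text of hits at or above a threshold (0.3 is an arbitrary default)."""
    return [text for _box, text, conf in result if conf >= min_confidence]
```

A typical call sequence would be reader = load_reader(), then result = read_image(reader, 'business_card.jpg'), then confident_text(result) to get the recognized strings in reading order.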

Data Extraction

To extract data from business card images, we will perform the following steps:

Create a function to read text from a business card image: def extract_biz_card_data(image_path): ....
Inside the function, open the image using Image.open, convert it to a NumPy array, and pass it to reader.readtext to obtain the text results.
Process the extracted text to identify relevant information such as name, phone number, email address, etc.
Return the extracted data in a structured format, such as a dictionary or a list of key-value pairs.
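One possible shape for extract_biz_card_data is sketched below. The regular expressions, the field names in the returned dictionary, and the helper parse_card_fields are all assumptions made for illustration. In particular, treating the first digit-free line as the name is a naive heuristic that will misfire on many real cards.

```python
import re
from typing import Dict, List, Optional

# Simple patterns for illustration; real cards will need more forgiving rules.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
WEBSITE_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{6,}\d")


def parse_card_fields(lines: List[str]) -> Dict[str, object]:
    """Sort OCR text lines into name, emails, phones, websites, and leftovers."""
    name: Optional[str] = None
    emails: List[str] = []
    websites: List[str] = []
    phones: List[str] = []
    other: List[str] = []

    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if EMAIL_RE.search(line):
            emails.extend(EMAIL_RE.findall(line))
        elif WEBSITE_RE.search(line):
            websites.extend(WEBSITE_RE.findall(line))
        elif PHONE_RE.search(line):
            phones.extend(PHONE_RE.findall(line))
        elif name is None and not any(ch.isdigit() for ch in line):
            # Naive assumption: the first plain-text line is the person's name.
            name = line
        else:
            other.append(line)

    return {
        "name": name,
        "emails": emails,
        "phones": phones,
        "websites": websites,
        "other": other,
    }


def extract_biz_card_data(image_path: str, reader) -> Dict[str, object]:
    """Run OCR on one card image and return its fields as a dictionary."""
    import numpy as np
    from PIL import Image

    image = np.array(Image.open(image_path).convert("RGB"))
    # detail=0 makes readtext return plain strings instead of (box, text, confidence).
    lines = reader.readtext(image, detail=0)
    return parse_card_fields(lines)
```

Keeping the parsing separate from the OCR call means the field rules can be tried on plain lists of strings without loading a model.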

Data Conversion and Storage

To convert the extracted data into a DataFrame and store it for further analysis, follow these steps:

Install the Pandas library: pip install pandas.
Import the Pandas module: import pandas as pd.
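Convert the extracted data into a DataFrame: df = pd.DataFrame([extracted_data]). Wrapping a single card's dictionary in a list produces one row; for several cards, pass the list of dictionaries directly.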
Perform any necessary data cleaning or manipulation on the DataFrame.
Store the DataFrame in a suitable format, such as a CSV file or a database, using the DataFrame's to_csv or to_sql methods.
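The conversion and storage steps might look like the sketch below. It assumes each card is a dictionary whose values may be strings, lists, or None; the helper names flatten_card, cards_to_dataframe, and save_cards, and the default file name biz_cards.csv, are hypothetical. pandas is imported inside the functions so the flattening helper works on its own.

```python
from typing import Dict, Iterable


def flatten_card(card: Dict[str, object], sep: str = "; ") -> Dict[str, str]:
    """Turn list-valued fields into single strings so each card is one flat row."""
    row: Dict[str, str] = {}
    for key, value in card.items():
        if isinstance(value, (list, tuple)):
            row[key] = sep.join(str(v) for v in value)
        elif value is None:
            row[key] = ""
        else:
            row[key] = str(value).strip()
    return row


def cards_to_dataframe(cards: Iterable[Dict[str, object]]):
    """Build a DataFrame with one row per card."""
    import pandas as pd

    # A list of dicts becomes one row per dict; keys missing from a card become NaN.
    return pd.DataFrame([flatten_card(card) for card in cards])


def save_cards(cards: Iterable[Dict[str, object]], csv_path: str = "biz_cards.csv"):
    """Clean the rows lightly and write them to a CSV file."""
    df = cards_to_dataframe(cards).drop_duplicates()
    df.to_csv(csv_path, index=False)  # index=False leaves out the row numbers
    return df
```

To store the rows in a database instead, the same DataFrame can be passed to to_sql with a table name and a database connection.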

Streamlit Integration

To create a user-friendly interface for the Biz Card Data project, we will utilize the Streamlit framework. Follow these steps to integrate Streamlit:

Install Streamlit: pip install streamlit.
Import the Streamlit module: import streamlit as st.
Add a header to the Streamlit app: st.header('Biz Card Data Extraction').
Create a file uploader using Streamlit's file_uploader function.
Once a file has been uploaded (file_uploader returns None until then), open the uploaded image using PIL's Image.open.
Pass the image to the OCR function to extract the data.
Display the extracted data using st.write or st.dataframe.
Customize the Streamlit app layout and appearance as desired.
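Put together, the integration steps could be sketched as follows. This is an illustrative outline rather than the project's own app: the names result_to_rows and main, the widget labels, and the accepted file types are assumptions, and it assumes a Streamlit release that provides st.cache_resource. The third-party imports sit inside main so the row-building helper stays usable by itself.

```python
from typing import Dict, List, Sequence


def result_to_rows(result: Sequence) -> List[Dict[str, object]]:
    """Convert EasyOCR (box, text, confidence) tuples into table rows."""
    return [
        {"text": text, "confidence": round(float(conf), 3)}
        for _box, text, conf in result
    ]


def main() -> None:
    import easyocr
    import numpy as np
    import pandas as pd
    import streamlit as st
    from PIL import Image

    st.header("Biz Card Data Extraction")

    # Cache the reader so the OCR model is loaded once, not on every rerun.
    @st.cache_resource
    def load_reader():
        return easyocr.Reader(["en"])

    uploaded = st.file_uploader("Upload a business card", type=["png", "jpg", "jpeg"])
    if uploaded is None:
        # file_uploader returns None until the user picks a file.
        st.info("Upload an image to begin.")
        return

    image = Image.open(uploaded).convert("RGB")
    st.image(image, caption="Uploaded card")

    with st.spinner("Reading text..."):
        result = load_reader().readtext(np.array(image))

    st.dataframe(pd.DataFrame(result_to_rows(result)))
```

To try it, add a call to main() at the bottom of the script and launch it with streamlit run followed by the script's file name.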

Conclusion

The Biz Card Data project provides a convenient way to extract and process data from business card images. EasyOCR automates the text extraction, and Streamlit supplies an interface where users can upload a card image and view the extracted data. This documentation is a guide to setting up and using the project.

iambitttu/Bizcard-Text-Extraction
