Docubertify

This repository contains the source code from my college thesis entitled "Design and Development of an Android-Based Document Classification and Digitization Application Using ML Kit SDKs and BERT Algorithm".

Project Overview

This project automates the classification of Indonesian identity documents (KTP and SIM) and digitizes their content using machine learning. It uses Google's ML Kit SDKs for text recognition and a BERT model for text classification. The application is built for Android devices and is backed by a server that handles document classification API requests and hosts the fine-tuned BERT model.

Folder Structure

The repository is organized into the following main directories:

.
├── Backend Server API/   # Contains FastAPI code and API endpoints
├── Frontend Android/     # Android app code (Kotlin) with ML Kit integration
└── ML BERT Model/        # Pre-trained BERT model, training scripts, and model fine-tuning

Backend Server

This folder contains the source code for the FastAPI server. The API handles requests from the Android app, processes document text, and performs document classification using the BERT model.
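
As a rough illustration of the request flow described above, the framework-free sketch below parses a JSON body, validates it, and returns a label. The `text` request field, the `label` response key, and the function names `handle_classify` and `keyword_classifier` are assumptions made for this example and are not taken from the repository. The keyword rule is only a placeholder standing in for the BERT model, and it assumes the words "Surat Izin Mengemudi" (the full form of SIM) appear in the recognized text.

```python
import json
from typing import Callable


def keyword_classifier(text: str) -> str:
    """Placeholder for the BERT model (NOT the thesis method)."""
    # Assumption: the full form of "SIM" shows up in the OCR text of a license.
    return "SIM" if "SURAT IZIN MENGEMUDI" in text.upper() else "KTP"


def handle_classify(
    body: bytes,
    classify: Callable[[str], str] = keyword_classifier,
) -> tuple[int, str]:
    """Return (HTTP status code, JSON response body) for one request."""
    try:
        payload = json.loads(body)
    except ValueError:  # covers malformed JSON and undecodable bytes
        return 400, json.dumps({"error": "body must be valid JSON"})

    # Hypothetical contract: {"text": "<recognized document text>"}
    text = payload.get("text") if isinstance(payload, dict) else None
    if not isinstance(text, str) or not text.strip():
        return 400, json.dumps({"error": "field 'text' is required"})

    return 200, json.dumps({"label": classify(text)})
```

In a real server, `classify` would wrap the fine-tuned model's prediction call, and the function body would live inside a FastAPI route instead of being called directly.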

Frontend Android

The Android application, built with Kotlin, allows users to capture or select documents (KTP/SIM), perform text recognition using ML Kit SDKs, and send extracted text to the backend API for classification.

ML BERT Model

This folder includes:

Datasets: CSV files containing text data for document classification (e.g., KTP and SIM).
JSON Dict: A dictionary for filtering and fixing typos in text extracted from the documents.
Jupyter Notebook (.ipynb): The notebook used for training and fine-tuning the BERT model.
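
To make the role of the JSON dictionary more concrete, here is a minimal sketch of dictionary-based typo fixing for OCR output. The dictionary entries, the `fix_token` and `fix_text` names, and the fuzzy-matching fallback via `difflib` are all assumptions for illustration. The actual JSON file in this folder may use a different structure and a different correction strategy.

```python
import difflib
import json

# Hypothetical contents: known OCR misreads mapped to the intended word.
TYPO_DICT = json.loads('{"N1K": "NIK", "Narna": "Nama", "A1amat": "Alamat"}')

# Correct words, used as candidates for fuzzy matching.
VOCAB = sorted(set(TYPO_DICT.values()))


def fix_token(token: str, cutoff: float = 0.8) -> str:
    """Fix one token: exact dictionary lookup first, then a fuzzy fallback."""
    if token in TYPO_DICT:
        return TYPO_DICT[token]
    close = difflib.get_close_matches(token, VOCAB, n=1, cutoff=cutoff)
    return close[0] if close else token


def fix_text(text: str) -> str:
    """Fix every whitespace-separated token (line breaks are collapsed)."""
    return " ".join(fix_token(token) for token in text.split())
```

For example, `fix_text("N1K 3171 Narna BUDI")` leaves the number and the name untouched and only rewrites the two misread field labels.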

Requirements

Python 3.11.0+
PyTorch
Google Colaboratory
Visual Studio Code
FastAPI
Ngrok (Static Domain)
Gemini API
Android Studio Iguana | 2023.2.1+
Kotlin 232-1.9.0-release-358-AS10227.8.2321.11479570+ (Java 11.0.20)
ML Kit SDKs (Text Recognition v2)

Datasets for Fine-Tuned BERT Model

Real images of KTP (ID card) and SIM (driving license) documents of the Republic of Indonesia are sensitive, so the fine-tuned BERT model was built from dummy KTP and SIM document images made in Figma. The dataset consists of 1,000 dummy images, 500 for each document type. The dataset used can be seen here.

⚠️ Disclaimer ⚠️

The dataset is intended for research purposes only. Although the document images are dummies (randomly generated, not based on data from real people), please use the KTP and SIM image dataset wisely. The author is not responsible for any violations arising from misuse of the KTP and SIM document images.

How To Run Backend Server API

Before using the app, start the API server by following these steps:

Run the API locally using Uvicorn
Start the FastAPI server on your local machine:
```
uvicorn app:app --host 0.0.0.0 --port 8000
```
Run Ngrok to expose the server using a static domain
Generate a static domain from the Ngrok dashboard, then expose your local server to the internet with the following command:
```
ngrok http --domain=[static-domain] 8000
```
For more information on how to generate static domains for Ngrok, you can visit this Ngrok blog post.
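
Once both commands are running, a quick smoke test from a script can confirm that the server is reachable. The sketch below only builds and sends a JSON POST request with the standard library. The `/classify` route and the `text` field are hypothetical placeholders, so replace them with the route and request body the server actually defines (FastAPI lists these at `/docs` by default).

```python
import json
import urllib.request


def build_classify_request(base_url: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for the API."""
    payload = json.dumps({"text": text}).encode("utf-8")  # hypothetical body
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/classify",  # hypothetical route
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 10.0) -> str:
    """Send the request and return the raw response body as text."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `send(build_classify_request("http://localhost:8000", "NIK 3171 Nama BUDI"))` checks the local server. Passing the Ngrok static domain as `base_url` instead checks the public tunnel.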

Android App Preview

Screens shown in the preview: MainActivity, Select Image (Camera Option), Select Image (Gallery Option), Crop Image, ResultActivity, ResultActivity (Loading State), ResultActivity (Bottom Sheet Fragment Show Half), KtpBottomSheetFragment, and SimBottomSheetFragment.
