Skip to content

synthetic-society/corehr-pdf-split

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

CoreHR Application Pack PDF Splitter

Python License

A Python script to extract individual applications from a combined PDF file, such as for Oxford HR application packs.

Requirements

  • Python 3.7+
  • PyPDF2
  • click

Installation

  1. Clone this repository:

    git clone https://github.com/synthetic-society/corehr-pdf-split.git
    cd corehr-pdf-split
    
  2. Install dependencies with pixi:

    pixi install
    

You can also install the two small dependencies using pip or your preferred Python package manager.

Usage

Run the script from the command line:

pixi run python main.py --input-pdf <path_to_input_pdf> --output-dir <path_to_output_directory>

For example:

pixi run python main.py --input-pdf applicationspack.pdf --output-dir output

This will process the applicationspack.pdf file and save individual applications in the output directory. The output folder will be created if it does not exist yet. Each applicant's PDF is saved with a filename format: LastName,FirstName [ApplicantID].pdf.

License

This project is available under the MIT License.

Contributing

Contributions, issues, and feature requests are welcome.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages