CoreHR Application Pack PDF Splitter

A Python script to extract individual applications from a combined PDF file, such as for Oxford HR application packs.

Requirements

Python 3.7+
PyPDF2
click

Installation

Clone this repository:

git clone https://github.com/synthetic-society/corehr-pdf-split.git
cd corehr-pdf-split

Install dependencies with pixi:
```
pixi install
```

You can also install the two small dependencies using pip or your preferred Python package manager.

Usage

Run the script from the command line:

pixi run python main.py --input-pdf <path_to_input_pdf> --output-dir <path_to_output_directory>

For example:

pixi run python main.py --input-pdf applicationspack.pdf --output-dir output

This will process the applicationspack.pdf file and save individual applications in the output directory. The output folder will be created if it does not exist yet. Each applicant's PDF is saved with a filename format: LastName,FirstName [ApplicantID].pdf.

License

This project is available under the MIT License.

Contributing

Contributions, issues, and feature requests are welcome.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CoreHR Application Pack PDF Splitter

Requirements

Installation

Usage

License

Contributing

About

Releases

Packages

Languages

synthetic-society/corehr-pdf-split

Folders and files

Latest commit

History

Repository files navigation

CoreHR Application Pack PDF Splitter

Requirements

Installation

Usage

License

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages