Skip to content

benjakson/PDF_Excel_Converter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 

Repository files navigation

PDF_Excel_Converter

This project aims to develop a PDF to Excel conversion tool for requirements management at Kiewit. The tool allows the company to efficiently extract and organize information from PDF documents into Excel sheets, making it easier for the requirements management team to analyze, allocate, and track project requirements. The tool is specifically designed to handle a variety of PDF formats, making it highly adaptable to the diverse range of documents encountered in real-world projects.

Note: The source code for this project is internal to Kiewit and is not included in this repository due to company policies.

PDFExcelerator PHoto

Features

  • Conversion of PDF to Excel: The tool takes a PDF file as input and converts it into an Excel file. Each chunk of text in the PDF is divided into separate rows in the Excel sheet for easy organization and analysis.

  • Section Identifier: The tool identifies sections in the PDF based on the indentation or level of the text. This allows for logical grouping of content in the resulting Excel sheet, making it easier to understand the document's structure.

  • Context Generation: For each piece of text in the PDF, the tool generates context information to provide additional clarity to the requirements management team. This context helps the team better comprehend the content and make informed decisions.

  • Scalability: The program has been optimized to work efficiently with over 1000 different PDF files. It can handle a large volume of documents without compromising on speed or accuracy.

  • Stylistic Adaptation: The tool adjusts to the stylistic differences found in various PDFs. Whether it's contracts or code, the program can intelligently process and extract relevant information without any loss of data.

Use Cases

This PDF to Excel conversion tool has been specifically tailored for the requirements management team at Kiewit to streamline their processes for large projects. Some of the key use cases include:

  1. Contract Analysis: The tool can be used to extract important contractual information from lengthy and complex legal documents. This allows the team to quickly identify crucial obligations and requirements.

  2. Code Requirements Allocation: For projects involving software development, the tool assists in breaking down the codebase into manageable sections. This helps in allocating requirements and tracking progress more effectively.

  3. Rapid Review and Verification: The Excel format allows the requirements management team to rapidly review and verify information, significantly reducing manual effort and ensuring accuracy.

  4. Collaboration and Reporting: The Excel files can be easily shared and collaborated on by team members. Additionally, the structured data simplifies reporting for stakeholders and project managers.

Note: As mentioned earlier, the actual source code for this project is not publicly available in this GitHub repository. For inquiries or more information about the tool, please reach out to the Kiewit team.

For more information about Kiewit, visit their website.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published