Skip to content

JacYuan1/Parsing-Common-Log-Format-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Parsing Common Log Format

Table of Contents

Introduction

The premise of this project is to parse Common Log Format logs into a readable format which will then output the desired fields by the user. The initial regular expression implentation seperates the information into 8 different categories/fields, IP_of_requesting_host, Remote_user, Timestamp, Request_from_client, HTTP_response_code, Size_of_bytes_returned, Http_referer, and Http_user_agent. To further output the desired data, another script was ran which will take the .json file that was created in the previous step and output the information as per the users choice. The final report will be linked in the References section.

Tools Used

The tools used here are the following:

  1. Python
  2. log2json

Approach to Problem

  1. Used log2json to parse the initial Common Log Format.
  2. Used Python to create another script that will take the .json file that was created in the previous step and output the information as per the users choice (8 different catagories/fields, IP_of_requesting_host, Remote_user, Timestamp, Request_from_client, HTTP_response_code, Size_of_bytes_returned, Http_referer, and Http_user_agent).

Learning Outcomes

  1. Learned how to pull Github repositories.
  2. Understood the concept of Common Log Format and how it relates to cybersecurity.
  3. Learned how to read Common Log Format files.

References

  1. Written report linked here
  2. Code linked here

Releases

No releases published

Packages

No packages published

Languages