Skip to content

Latest commit

 

History

History
147 lines (114 loc) · 5.59 KB

README.md

File metadata and controls

147 lines (114 loc) · 5.59 KB

Proofpoint Safe/Block List Analyzer

This tool was created to answer a common question from Proofpoint on Demand administrators, "Which global block or safelist entry fired the most or never?" This normally happens after years of these lists being poorly maintained. These global lists can grow massive and ultimately map back to "blocked" or "safe" classifiers associated with the core filtering module.

Requirements:

  • Linux or Windows + WSL
  • pthreads
  • libre2-dev
  • cmake
  • g++ (C++20 is required)

The global safe and block entries have the following match fields.

### MatchFields
$hfrom - Header From: email address only.  
$from  - Envelope sender address.  
$helo  - SMTP Helo string presented at time of connction.  
$host  - Hostname associated with the connecting IP.  
$ip    - Connecting IP address.  
$rcpt  - Recipient email address.  

Every match field has an associated match type, these types can be partial matches, literal matches, and pattern matches. The match types are shown below:

### MatchTypes
equal            - String matches the entire match field.  
not_equal        - String matches match fields that don't match the entire field.  
match            - String matches the match field partially or completely.  
not_match        - String matches the match fields that don't match partially or completely.  
regex            - Regular expression matches the match fields.  
not_regex        - Regular expression matches the match fields that don't match.  
ip_in_net        - Matches a CIDR block for $ip match field only.  
ip_not_in_net    - Matches ip that are not in the CIDR block, for $ip match field only.
if_in_domain_set - Matches addresses that match entries contained in a domainset. (Not Yet Implemented)

Getting Started

# Install necessary packages for the build 
sudo apt update
sudo apt install cmake
sudo apt install g++
sudo apt install libre2-dev
sudo apt-get install gdb

# Ubuntu 20.04 (requires GCC 10.1 or greater) you will need to install the following
sudo apt update
sudo apt install -y gcc-10 g++-10 cpp-10
sudo update-alternatives --install /usr/bin/gcc gcc /usr/bin/gcc-10 100 --slave /usr/bin/g++ g++ /usr/bin/g++-10 --slave /usr/bin/gcov gcov /usr/bin/gcov-10

# Download slparser via Git or Zip
git clone https://github.com/pfptcommunity/slanalyzer

# Change directory into slanalyzer source directory
cd slanalyzer

# Create the Release build profile
cmake -S . -B build/ -D CMAKE_BUILD_TYPE=Release

# Compile the code
cmake --build build/

image

Processing Global Block / Safe Lists

You may need to export multiple 1M record chunks from smart search for a 30 day window. You can filter safelisted messages based on the direction of the flow.
image

Export the block / safe list you want to compare.
image

Export the results to a CSV(s) and pass them to the tool as the last parameter.
image

Reviewing the Data

The below output is limited, but ever match conditon is annotated with the number of messages that triggered the blocked or safe list. This is provides a good heatmap to see which safelist entry is excessive or inactive.

"FieldType","MatchType","Pattern","Comment","Inbound","Outbound"
"$from","match","@indeed.com","","155","0"
"$from","match","@pluralsight.com","","82","0"
"$hfrom","match","trystoryboard.com","","2","0"
"$from","match","em8330.trystoryboard.com","","2","1"
"$ip","equal","192.89.112.126","","2","0"

Processing User Block / Safe Lists

Basic Output

To process user safe and blocklists you must export the user data via Export CSV in the Proofpoint Protection admin user interface. image

The basic user safe / block information has the following format.

"givenName","sn","mail","mailLocalAddress","safelist","blocklist","safe_list_count","block_list_count"
"Ludvik","Jerabek","ljerabek@domains.com","ljerabekalias@domains.com","block@domain.com;domain2.com;@domain3.com","safe@domain.com","56","2"

Extended Output

It's possible to get extended output via -x

"givenName","sn","mail","mailLocalAddress","safe","safe_sender","safe_hfrom","block","block_sender","block_hfrom"

Performance

During testing analyzer was able to process 10,000(10K) safelist entries and 10,000,000(10M) row smart search in ~74 seconds that would be 10,000,000,000(10B) permutations.

### Global List Load Completed ###
              Load Time: 0.003212s
             List Count: 10000
            List Errors: 0
              List File: /home/ljerabek/megasafe.csv

### Preprocessing Completed ###
              Load Time: 0.029909s
         Pattern Errors: 0

### Analysis Completed ###
          Analysis Time: 74.477194s
      Smart Search File: /home/ljerabek/mega_smart_search.csv

### Analysis Summary ###
          Total Inbound: 823500
         Total Outbound: 0

### Global List Save Completed ###
              Save Time: 0.004267s

### Processing Completed ###
  Total Processing Time: 74.524358s

Limitations

Safe and Block list entries using domain sets are not currently supported. RE2 doesn't support all PCRE expressions, future release will implement hybrid method to process expressions.