SamuelePilleri/facebook-schemas

This project is an effort to cover all files contained within a Facebook profile dump with appropriate schemas. It started as a side project of Plaso's Facebook plugin.

Purpose

In 2018 our Digital Forensics team observed an increase in the number of cases involving social networks, especially Facebook, due to its popularity and misuse.

To simplify the analysis and correlation of evidence and to help investigators solve this kind of case, we developed a plugin for the well-known digital forensics tool Plaso that automatically extracts evidence from Facebook profile dumps.

Since schemas can serve multiple purposes, we decided to release them as a separate and independent project. This repository contains the schemas used by our plugin along with tools that can help developers improve the coverage of such dumps.

Rationale

Art. 20 GDPR endows users with

the right to receive the personal data concerning him or her, which he or she has provided to a controller, in a structured, commonly used and machine-readable format

While Facebook complies with this principle, it does not document how to interpret the information contained in the profile dump that any user can download. Those who want to process the data automatically are left without a description of its structure.

Hence this project aims to reconstruct the likely structure of such files, covering each one with an appropriate JSON schema.
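As an illustration of what reconstructing a structure can look like, here is a minimal sketch that infers a rough JSON schema from a sample document. It is not the method used to produce the schemas in this repository. The function name `infer_schema` and the sample field names are hypothetical, and the sketch assumes that arrays are homogeneous.

```python
import json


def infer_schema(value):
    """Return a rough JSON Schema fragment describing `value`."""
    if isinstance(value, dict):
        return {
            "type": "object",
            "properties": {k: infer_schema(v) for k, v in value.items()},
            "additionalProperties": False,
        }
    if isinstance(value, list):
        schema = {"type": "array"}
        if value:
            # Assumption: arrays are homogeneous, so the first item is representative.
            schema["items"] = infer_schema(value[0])
        return schema
    if isinstance(value, bool):  # checked before int because bool is a subclass of int
        return {"type": "boolean"}
    if isinstance(value, int):
        return {"type": "integer"}
    if isinstance(value, float):
        return {"type": "number"}
    if value is None:
        return {"type": "null"}
    return {"type": "string"}


# Hypothetical sample; real dump files use their own field names.
sample = json.loads('{"friends": [{"name": "Alice", "timestamp": 1530000000}]}')
schema = infer_schema(sample)
print(json.dumps(schema, indent=2, sort_keys=True))
```

A schema inferred from a single profile only describes what that profile happens to contain, so a draft like this would still need manual review and comparison against other dumps.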

Contributing

Given the purpose of this project, the main contribution the community can make is better coverage of dump files. To achieve this, users can download their profile and inspect it with reporter.py, a tool created to spot and outline uncovered files and fields. The tool is compatible with both Python 2 and 3 and only requires the jsonschema library, available from PyPI or as a standalone package in Debian and most derivatives.
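To give an idea of what spotting uncovered fields means, the following sketch walks a JSON document alongside a schema and lists the fields the schema does not declare. It is not the implementation of reporter.py. The function name `uncovered_fields`, the path notation and the sample data are hypothetical. The sketch uses only the standard library, and it assumes that `items` is a single schema rather than a list of schemas.

```python
def uncovered_fields(instance, schema, path="$"):
    """Yield paths of fields present in `instance` but not declared in `schema`."""
    if isinstance(instance, dict):
        properties = schema.get("properties", {})
        for key, value in instance.items():
            child = "{}.{}".format(path, key)
            if key not in properties:
                # The schema does not mention this field at all.
                yield child
            else:
                # The field is declared, so descend into its sub-schema.
                for found in uncovered_fields(value, properties[key], child):
                    yield found
    elif isinstance(instance, list):
        # Assumption: "items" is one schema applied to every element.
        items = schema.get("items", {})
        for index, value in enumerate(instance):
            child = "{}[{}]".format(path, index)
            for found in uncovered_fields(value, items, child):
                yield found


# Hypothetical schema and data, for illustration only.
example_schema = {
    "type": "object",
    "properties": {
        "friends": {
            "type": "array",
            "items": {
                "type": "object",
                "properties": {"name": {"type": "string"}},
            },
        }
    },
}
example_instance = {
    "friends": [{"name": "Alice", "timestamp": 1530000000}],
    "extra": 1,
}

for field in sorted(uncovered_fields(example_instance, example_schema)):
    print(field)
```

The sketch only reports undeclared fields and does not check types or required properties. Full validation is what the jsonschema library is for.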

Pull requests are welcome.