PRIDA: PRIvacy-preserving data aggregation with multiple Data Analysers

By Beyza Bozdemir¹, Federica Germinario¹, Alessandro Pisani¹, Jakub Klemsa², and Melek Önen¹

(¹EURECOM; ²Czech Technical University).

The code is written using MK-TFHE library.

This implementation is used to benchmark an extended version of the MK-TFHE library for the symmetric version of MK-FHE.

The proposed scenario works for three Data Owners and two Data Analysers. Between them two Aggregators are required: Aggregator1 and Aggregator2 .

Quick installation of the proposed Protocol

git clone -b only_three_keys https://github.com/federicagerminario31/MK-TFHE/

After cloning the repository, do:

cd MK-TFHE 
git submodule init
git submodule update 
mkdir build
cd build
cmake ../src -DENABLE_TESTS=on -DENABLE_FFTW=off -DENABLE_NAYUKI_PORTABLE=on -DENABLE_NAYUKI_AVX=on -DCMAKE_BUILD_TYPE=debug
make

To test (from build):

./test/PPanalytics-spqlios-fma

Proposed Protocol: PRIDA based on MK-TFHE

The proposed protocol is a PRIDA implementation using a symmetric multi-key FHE. To deploy the protocol MK-TFHE (a symmetric multi-key FHE) is used.
As far as keys are concerned each Data Analyser DAj shares one common key k_j with all DOs and each DO establishes one pairwise key with each Aggregator;
When Agg and Agg2 receive individual ciphertexts encrypted with different sets of keys, they partially decrypt them with the keys that they know and re-encrypt them with their unique key only known by themselves.

PRIDA involves the following three parties which are represented in Fig. 1:

Data Owner DO owns some confidential input data and outsources this confidential data to the Aggregators encrypted and secretly shared. DO also secretly defines which Data Analysers can access the aggregate result involving its input.
Data Analyser DA obtains the data aggregation result over the inputs from many DOs in cleartext if authorised.
The two Aggregators (Aggregator1 and Aggregator2) are two non-colluding cloud servers which collect the encrypted data from many DOs, perform the data aggregation requested by Data Analysers, and send the results to the authorised Data Analyser. It is important to notice that the Agg1 is only in touch with the Data Owners and the Agg2 while the Agg2 can communicate only with the Aggregator and the Data Analysers.

The details of multiplication over protected data are provided below:

The index i ∈ {1, . . . , n} represents a Data Owner (DO) whereas index j ∈ {1, . . . , m} represents a Data Analyser, DAj (n and m are two natural numbers). An MK-FHE ciphertext is denoted with [.].

Phase 0: Key Generation. Setup part executed by DO_i, DA_j, Agg₁ and Agg₂.

Generation of symmetric shared key between data owner DO_i with Agg₁ and DO_i with Agg₂ : DO_i1 and DO_i2.
Generation of Data Analyser DA_j, Agg₁ and Agg₂ specific keys.

Phase 1: Deciding the authorised Data Analysers. At the end of Phase 1, two Aggregators can have the information of chosen Data Analysers who are able to obtain the data aggregation result. Basically, Aggregators who get the shared choice vectors, compute simple additions over these shared choice vectors and finally find the number of DOs who chose DA_j. Therefore, both know who is authorised to receive the data aggregation result.
In more details, Phase 1 consists of the following steps:
Each Data Owner

generates secret shares of its choice vector c_i consisting choices of 0 or 1 for Data Analysers.
sends c_i1 to Agg1 and c_i2 to Agg2.

Each Aggregator

After receiving shares of the choice vectors for *DA_j*s, adds these shares without any interaction with the other Aggregator.
This partial sum is sent to the other Aggregator.
both Aggregators bring together the partial sums to observe whether Data Analyser DA_j is authorised. If the total sum is bigger than or equal to threshold t, Aggregators operates the data aggregation over the private data for Data Analyser DA_j. Otherwise, the two non-colluding Aggregators stop processing for DA_j.

Phase 2: Computing the data aggregation result. Phase 2 is simply the componentwise multiplication of the choice vector and the shared-encrypted input data vector using Beaver’s triplets. Then, the multiplication result vectors of DOs are componentwisely added to see the data aggregation result vector. The details of Phase 2 are represented as follows:
Each DO

generates secret shares of its private data vector d_i for Data Analysers.
After secret shares’ generation, each component of shared data vectors is encrypted with the public keys of DA_j and the keys shared with the two Aggregators (keys DO_i1 and DO_i2).
To be used for Beaver’s triplets, DO generates secret shares of random numbers α_i and β_i where γ_i = α_i ∗ β_i. (α_i = α_i1 + α_i2 and β_i = β_i1 + β_i2).
sends the shares of [d_i], c_i, γ_i, α_i, and β_i to Agg1 and Agg2.

After receiving all data from DOs, Aggregator1 and Aggregator2 perform the following steps to obtain thedata aggregation result over protected DOs’ private data.

Unmask the known secret keys of DO_ik and Agg_k to change the part of [d_ik] related to the DO_ik’s secret key to the Agg_k’s secret key. Agg1 and Agg2 end up with samples encrypted with the keys of DA_j, Agg₁, Agg₂.
Each Aggregator computes [ϵ_ik] with the addition of [d_ik] and α_ik, and δ_ik with the addition of c_ik and β_ik, where k = 1 or 2.
Each Aggregator sends [ϵ_ik] and δ_ik to the other Aggregator.
Each Aggregator obtains [ϵ_i] and δ_i making use of [ϵ_i1] and [ϵ_i2], and δ_i1 and δ_i2, respectively.
Each Aggregator computes γ_ik + [d_ik] ∗ δ_i + c_ik ∗ [ϵ_i], k = 1 or 2.
After this step, Aggregator1 sends 4 to Aggregator2.
After receiving 5, Aggregator2 finds γi + [d_i] ∗ δ_i + c_i ∗ ϵ_i.
Then, Aggregator2 subtracts [ϵ_i] ∗ δ_i from the previous result.
Aggregator2 obtains [d_i] ∗ c_i for only one DO.
Aggregator2 adds all multiplication result vectors to find the data aggregation results for Data Analysers.
Finally, Aggregator2 has the data aggregation vector [s_j] which is equal to the summation (for i which goes from 1 to n) of [d_i] ∗ c_i.

Phase 3: Decryption of the data aggregation result. The final steps of the data aggregation is the decryption as follows:

Agg2 sends the data aggregation vector [s_j] to Aggregator1.
Agg2 partially decrypts [s_j] for the authorised Data Analysers.
Like Agg2, Agg1 partially decrypts [s_j] for the authorised Data Analysers.
Agg1 sends this partially decrypted data aggregation vector [s_j] to Agg2.
Agg2 sends the data from 2 and 3 to authorised Data Analyser DA_j.
Finally, authorised DA_j firstly partially decrypts its part, merges all partial decryptions of [s_j ], and receives the data aggregation result.

MK-TFHE Improvements

First of all to implement our protocol using MK-TFHE library we had many issues to target. Firstly, MK-TFHE differently from the TFHE library is only a proof-of-concept and as such only the basic NAND gate was implemented, tested and benchmarked. Since our protocol needs as basic operations additions and multiplications we had to implement and test all the missing gates.
These gates were the building blocks for more complex logic circuits like the full_adder, full_subtracter, shift_left and many more which are needed to achieve our goals.

Another challenging issue is that while in the MK-TFHE proposed paper it seemed that it was possible to perform partial encryption and partial decryption in the proposed implementation only two functions were provided which allowed to encrypt and decrypt with the Multi-key data structure for all the involved parties. To this extent a modification to the existing implementation was needed in order to achieve a particular masking and unmasking operation for a party p.

Here are listed in more detail the Four main improvements to the existing library:

PRELIMINARY. We have implemented some useful gates:
- All basic gates present in TFHE library which were not present in MK-TFHE (OR, AND, XOR, NOT, COPY, CONSTANT, ...)
- Complex ones like Subtractor, XOR3, 2OF3, Shift_Left, ...
FULL ADDER. We have implemented a full_adder_v2 using the gates XOR3 and 2OF3
MULTIPLICATION. We have implemented a multiplier_v2 using shift left operations and additions
MK STRUCTURE. Improved the masking and unmasking operation to use data structures of maximum three keys.
- MKlweFirstPartyEncrypt: at p-th block of mask, generate a random mask and add to the masked message at other blocks of mask, fill zeros
- MKlweNthPartyEncrypt: Add p-th party mask
- MKlweNthPartyUnmask: remove p-th party mask
- MKlweLastPartyDecrypt: decrypt by removing the last party's mask

The final improvement is aimed at reducing as maximum as possible the number of keys:

Each Aggregator needs to store 4 keys 1 for each DO and 1 which will be replacing the others
Each data will need (at most) 3 layers of encryption only

The code for these new functions can be found in MK-TFHE/src/libtfhe/mkTFHEfunctions.cpp. The associated prototyped instead can be found in MK-TFHE/src/include/mkTFHEfunctions.h

Benchmark Results (and discussion)

As a result of the improvements done on the MK-TFHE library we can see how the benchmark took benefit from the various improvements listed before.

In the table below it can be found the Performance results for each player of PRIDA (computation time displayed in seconds).

Protocols	Data Owner	Receiver	Aggregator	Decryptor
Protocol 2 with MK-TFHE	4.452	2.275	2632.251	3160.709

It is good to highlight that the difference in performances (even if only slightly different) between the two Aggregators are due to the higher number of steps performed by the Agg2 with respect to the Agg1 in the Beaver's triplets phase.

It can be noticed that when we apply improvements 2, 3 and 4 listed above, we obtain a better result. In fact in all cases we do have an improvement of at least 50% with respect to the plain MK-TFHE implementation.
However the protocol is still very slow compared with the other versions of PRIDA using other cryptographic techniques like:

asymmetric multi-key fully homomorphic encryption (MK-FHE),
threshold fully homomorphic encryption (Th-FHE).

It is important to notice however that the version of the protocol using as a cryptographic technique symmetric multi-key fully homomorphic encryption (i.e., MK-TFHE) is inherently slow due to the usage of low level implementation of the Homomorphic encryption.

Please notice that in terms of FFT (Fast Fourier Transform) libraries it is not casual the choice of SPQLIOS_FMA for compilation because it is the one that provides better results in terms of timing. Below we have listed a ranking of the FFT libraries from the faster to the slowest one:

SPQLIOS_FMA: compiles libtfhe-spqlios-fma.a, using tfhe's dedicated fma assembly version for FFT computations
SPQLIOS_AVX: compiles libtfhe-spqlios-avx.a, using tfhe's dedicated avx assembly version for FFT computations
NAYUKI_AVX: compiles libtfhe-nayuki-avx.a, using the avx assembly version of nayuki for FFT computations
NAYUKI_PORTABLE: compiles libtfhe-nayuki-portable.a, using the fast C version of nayuki for FFT computations

MK-TFHE

Multi-Key Homomophic Encryption from TFHE

MK-TFHE is a proof-of-concept (please do not expect maintenance or support for this code) implementation of a multi-key version of TFHE. The code is written on top of the TFHE library (https://tfhe.github.io/tfhe/).

MK-TFHE is described in the paper "Multi-Key Homomophic Encryption from TFHE" by Hao Chen, Ilaria Chillotti, Yongsoo Song (https://eprint.iacr.org/2019/116).

We report below the TFHE readme, giving details on the original library and instructions for the dependencies and installation. We refer to their webpage for more details.

Quick installation of MK-TFHE

After cloning the repository, do:

cd MK-TFHE 
git submodule init
git submodule update 
mkdir build
cd build
cmake ../src -DENABLE_TESTS=on -DENABLE_FFTW=off -DENABLE_NAYUKI_PORTABLE=off -DENABLE_NAYUKI_AVX=off -DCMAKE_BUILD_TYPE=release
make

To test (from build):

./test/testMKbootNAND_FFT_v2-spqlios-fma

tfhe

Fast Fully Homomorphic Encryption Library over the Torus

version 1.0 -- first release date: 2017.05.02

version 1.0-rc1 -- first pre-release date: 2017.04.05

version 0.1 -- Proof of concept release date: 2016.08.18

TFHE is open-source software distributed under the terms of the Apache 2.0 license. The scheme is described in the paper "Faster fully homomorphic encryption: Bootstrapping in less than 0.1 seconds" presented at the IACR conference Asiacrypt 2016 by Ilaria Chillotti, Nicolas Gama, Mariya Georgieva, Malika Izabachène.

Description

The TFHE library implements a very fast gate-by-gate bootstrapping, based on [CGGI16]. Namely, any binary gate is evaluated homomorphically in about 13 milliseconds on a single core which improves [DM15] by a factor 50, and the mux gate takes about 26 CPU-ms (or 13ms on 2 cores).

The library implements a Ring-variant of the GSW [GSW13] cryptosystem and makes many optimizations described in [DM15] and [CGGI16].

It also implements a dedicated Fast Fourier Transformation for the anticyclic ring R[X]/(X^N+1), and uses AVX, AVX2 and FMA assembly vectorization instructions. The default parameter set achieves at least 110-bit of cryptographic security, based on ideal lattice assumptions.

From the user point of view, the library can evaluate a net-list of binary gates homomorphically at a rate of about 50 gates per second per core, without decrypting its input. It suffices to provide the sequence of gates, as well as ciphertexts of the input bits. And the library computes ciphertexts of the output bits.

Unlike other libraries, TFHE has no restriction on the number of gates or on their composition. This makes the library usable with either manually crafted circuits, or with the output of automated circuit generation tools. For TFHE, optimal circuits have the smallest possible number of gates, and to a lesser extent, the possibility to evaluate them in parallel.

Dependencies

The library interface can be used in a regular C code. However, to compile the core of the library you will need a standard C++11 compiler. Currently, the project has been tested with the g++ >= 5.2 compiler and clang >=3.8 under Linux, as well as clang under MacOS. In the future, we plan to extend the compatibility to other compilers, platforms and operating systems.

At least one FFT processor is needed to run the project:

The default processor comes from Project Nayuki, who proposes two implementations of the fast Fourier transform - one in portable C, and the other using the AVX assembly instructions. This component is licensed under the MIT license, and we added the code of the reverse FFT (both in C and in assembly). Original source: https://www.nayuki.io/page/fast-fourier-transform-in-x86-assembly
we provide another processor, named the spqlios processor, which is written in AVX and FMA assembly in the style of the nayuki processor, and which is dedicated to the ring R[X]/(X^N+1) for N a power of 2.
We also provide a connector for the FFTW3 library: http://www.fftw.org. With this library, the performance of the FFT is between 2 and 3 times faster than the default Nayuki implementation. However, you should keep in mind that the library FFTW is published under the GPL License. If you choose to use this library in a final product, this product may have to be released under GPL License as well (other commercial licenses are available on their web site)
We plan to add other connectors in the future (for instance the Intel’s IPP Fourier Transform, which should be 1.5× faster than FFTW for 1D real data)

Installation

To build the library with the default options, run make and make install from the top level directory of the TFHE project. This assumes that the standard tool cmake is already installed on the system, and an up-to-date c++ compiler (i.e. g++ >=5.2 or clang >= 3.8) as well. It will compile the shared library in optimized mode, and install it to the /usr/local/lib folder.

If you want to choose additional compile options (i.e. other installation folder, debug mode, tests, fftw), you need to run cmake manually and pass the desired options:

mkdir build
cd build
cmake ../src -DENABLE_TESTS=on -DENABLE_FFTW=on -DCMAKE_BUILD_TYPE=debug
make

The available options are the following:

Variable Name	values
CMAKE_INSTALL_PREFIX	/usr/local installation folder (libs go in lib/ and headers in include/)
CMAKE_BUILD_TYPE	optim enables compiler's optimization flags, including native architecture specific optimizations debug disables any optimization and include all debugging info (-g3 -O0)
ENABLE_TESTS	on/off compiles the library's unit tests and sample applications in the test/ folder. To enable this target, you first need to download google test sources: `git submodule init; git submodule update` (then, use `ctest` to run all unittests)
ENABLE_FFTW	on/off compiles libtfhe-fftw.a, using FFTW3 (GPL licence) for fast FFT computations
ENABLE_NAYUKI_PORTABLE	on/off compiles libtfhe-nayuki-portable.a, using the fast C version of nayuki for FFT computations
ENABLE_NAYUKI_AVX	on/off compiles libtfhe-nayuki-avx.a, using the avx assembly version of nayuki for FFT computations
ENABLE_SPQLIOS_AVX	on/off compiles libtfhe-spqlios-avx.a, using tfhe's dedicated avx assembly version for FFT computations
ENABLE_SPQLIOS_FMA	on/off compiles libtfhe-spqlios-fma.a, using tfhe's dedicated fma assembly version for FFT computations

References

[CGGI16]: I. Chillotti, N. Gama, M. Georgieva, and M. Izabachène. Faster fully homomorphic encryption: Bootstrapping in less than 0.1 seconds. In Asiacrypt 2016, pages 3-33.

[DM15]: L. Ducas and D. Micciancio. FHEW: Bootstrapping homomorphic encryption in less than a second. In Eurocrypt 2015, pages 617-640.

[GSW13]: C. Gentry, A. Sahai, and B. Waters. Homomorphic encryption from learning with errors: Conceptually-simpler, asymptotically-faster, attribute-based. In Crypto 2013, pages 75-92

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
src		src
FinalPresentation.pptx		FinalPresentation.pptx
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
benchmark.png		benchmark.png
players.png		players.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PRIDA: PRIvacy-preserving data aggregation with multiple Data Analysers

By Beyza Bozdemir¹, Federica Germinario¹, Alessandro Pisani¹, Jakub Klemsa², and Melek Önen¹

(¹EURECOM; ²Czech Technical University).

Quick installation of the proposed Protocol

Proposed Protocol: PRIDA based on MK-TFHE

MK-TFHE Improvements

Benchmark Results (and discussion)

MK-TFHE

Quick installation of MK-TFHE

tfhe

Description

Dependencies

Installation

References

About

Releases

Packages

Languages

License

A-Pisani/MK-TFHE

Folders and files

Latest commit

History

Repository files navigation

PRIDA: PRIvacy-preserving data aggregation with multiple Data Analysers

By Beyza Bozdemir¹, Federica Germinario¹, Alessandro Pisani¹, Jakub Klemsa², and Melek Önen¹

(¹EURECOM; ²Czech Technical University).

Quick installation of the proposed Protocol

Proposed Protocol: PRIDA based on MK-TFHE

MK-TFHE Improvements

Benchmark Results (and discussion)

MK-TFHE

Quick installation of MK-TFHE

tfhe

Description

Dependencies

Installation

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages