While trying to use the QR trick, I noticed that the implementation differs from what the original paper describes. In the paper, a token's embedding is obtained by applying an operation such as 'add', 'mult', or 'concat' to the rows of two separate embedding tables, and sum or mean pooling is applied afterwards. The implementation here, however, first pools the embeddings from each table separately and then applies 'add', 'mult', or 'concat' to produce the embedding feature. Is this difference by design, or are the two methods equivalent?
Hi @fangleigit, good question! Since Criteo's categorical features are one-hot (a single index per lookup), we implemented QR this way here. In that case, the two approaches are equivalent.
However, you are correct that ideally the operation should be applied before the sum or mean pooling. That would require a more involved change to the underlying embedding bag implementation, which we have not done here.
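To illustrate the equivalence claim, here is a minimal NumPy sketch (not the repo's code; the table shapes, names, and the choice of 'mult' with sum pooling are illustrative assumptions). With a one-hot bag (a single index), pooling is a no-op, so the order of pooling and combining cannot matter; with a multi-hot bag under 'mult', the two orders generally differ.

```python
import numpy as np

rng = np.random.default_rng(0)
num_collisions, dim = 4, 8  # hypothetical QR collision factor and embedding dim

# Two hypothetical QR tables: the quotient table indexed by i // num_collisions,
# the remainder table indexed by i % num_collisions.
q_table = rng.normal(size=(32, dim))
r_table = rng.normal(size=(num_collisions, dim))

def qr_embed(indices, pool_first):
    """Embed one bag of indices with the 'mult' operation and sum pooling."""
    q = q_table[indices // num_collisions]  # shape (bag_size, dim)
    r = r_table[indices % num_collisions]   # shape (bag_size, dim)
    if pool_first:
        return q.sum(axis=0) * r.sum(axis=0)  # pool each table, then combine
    return (q * r).sum(axis=0)                # combine per token, then pool

one_hot = np.array([42])            # bag of size 1 (one-hot feature)
multi_hot = np.array([42, 7, 13])   # bag of size 3 (multi-hot feature)

# Equivalent for one-hot bags:
print(np.allclose(qr_embed(one_hot, True), qr_embed(one_hot, False)))      # True
# Generally NOT equivalent for multi-hot bags under 'mult':
print(np.allclose(qr_embed(multi_hot, True), qr_embed(multi_hot, False)))  # False
```

Note that 'add' with sum pooling commutes even for multi-hot bags (a sum of sums), so the ordering question mainly affects 'mult' and 'concat' when bags contain more than one index.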
dlrm/tricks/qr_embedding_bag.py
Line 189 in c848e83