Replies: 3 comments
-
Hi! Apologies for the late reply; we were off due to the summer break.
If you have any further questions, please don't hesitate to reach out! Cheers, |
Beta Was this translation helpful? Give feedback.
-
Hi @Nikoletos-K , no worries. I appreciate your reply nonetheless.
So if I understand correctly - with Clustering methods like CenterClustering it's possible to achieve an output where one product in dataset B has multiple candidates from dataset A ? And in case I wanted 1-to-1 I could use something like UniqueMappingClustering?
I experimented with BLAST to achieve this - Generate blocks of similar products. Now, if I understand correctly BLAST will be building blocks of similar products within the full dataset regardless of it's source? So it can happen that a block will contain products from dataset A ? Example if expected output
|
Beta Was this translation helpful? Give feedback.
-
Yeah, Unique Mapping is the best for 1-1 matches, and yes by using CenterClustering this could happen. BLAST is a comparison cleaning method among others. I strongly suggest you to test and experiment with the nearest neighbor workflow, as we have seen that yields really good results. |
Beta Was this translation helpful? Give feedback.
-
Hi,
I am new to entity resolution and I found your library and examples super useful.
I have 2 questions though:
How to get 1-to-many matches?
For context: I am trying to match multiple products in dataset A to 1 product in dataset B. Where dataset A has 2000 products and dataset B has 500 products.
q1: Should I stop at the entity matching step and try to export the pairs from this step?
q2: What would be the best clustering method for this requirement?
Exporting the results
How can I export the predicted results for each of the products in dataset B i.e. get multiple products from dataset B matched to 1 product in dataset A.
Thank you.
Beta Was this translation helpful? Give feedback.
All reactions