Model won't train using few records, e.g. 20 records #44

Open
tinashebutsa opened this issue Aug 16, 2021 · 1 comment

Comments

@tinashebutsa

I get the following error when I use only a few records. For example, with more than 37 records it works fine, but with fewer than 37 I get an error. What could be the problem?

BlockingError: No records have been blocked together. Is the data you are trying to match like the data you trained on?

@ieriii
Collaborator

ieriii commented Mar 27, 2022

@tinashebutsa,
you should probably increase the sample_size argument when running dedupe_dataframe.
You have a very small number of records, so I would set sample_size=1.
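
For illustration, here is a minimal sketch of that suggestion. It assumes sample_size is read as a fraction of the input records, so that 1 means every record is available for training. The helper training_sample_count and the column names first_name and last_name are hypothetical and only there to show the arithmetic and the call shape; the only library call used is dedupe_dataframe with its sample_size argument.

```python
def training_sample_count(n_records, sample_size):
    """Hypothetical helper: how many records a fractional sample_size keeps.

    Assumption: the fraction is multiplied by the record count and
    truncated to an integer. This is an illustration, not library code.
    """
    return int(n_records * sample_size)


def dedupe_small_frame(df, fields):
    """Run dedupe_dataframe on a small frame using all records for training.

    pandas_dedupe is imported lazily so this sketch can be loaded even
    where the package is not installed. The call starts the usual
    interactive labelling session.
    """
    import pandas_dedupe

    # sample_size=1 asks for the whole (small) dataset to be sampled
    return pandas_dedupe.dedupe_dataframe(df, fields, sample_size=1)


if __name__ == "__main__":
    # With 20 records, a fraction of 0.5 leaves very few records to build
    # training pairs from; a fraction of 1 keeps all of them.
    print(training_sample_count(20, 0.5))
    print(training_sample_count(20, 1))
    # Hypothetical usage, with df being your own DataFrame:
    # result = dedupe_small_frame(df, ["first_name", "last_name"])
```

With only a handful of records, a fractional sample may leave too few candidate pairs for blocking, which would be consistent with the BlockingError reported above.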

Let me know if anything is unclear or if you need further guidance.
