Missing feature names in Wisconsin dataset #20
Comments
@lacava Any idea? Should we update this dataset based on OpenML?
Similar issue for the tic-tac-toe dataset. OpenML ref: https://www.openml.org/d/50
sure, we just need to make sure they match.
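One way to check that they match could look like the sketch below. It compares the values of two CSV texts cell by cell and optionally skips a header row, since the local copy has no feature names. The function name `rows_match` and its parameters are hypothetical, and it assumes both copies list the rows and columns in the same order.

```python
import csv
import io


def rows_match(csv_a, csv_b, skip_header_a=False, skip_header_b=False, tol=1e-9):
    """Return True if two CSV texts hold the same values, cell by cell.

    Hypothetical helper: assumes identical row and column order in both copies.
    """

    def load(text, skip_header):
        rows = list(csv.reader(io.StringIO(text)))
        return rows[1:] if skip_header else rows

    def same(x, y):
        # Compare numerically when both cells parse as numbers,
        # otherwise fall back to a plain string comparison.
        try:
            return abs(float(x) - float(y)) <= tol
        except ValueError:
            return x.strip() == y.strip()

    a = load(csv_a, skip_header_a)
    b = load(csv_b, skip_header_b)
    if len(a) != len(b):
        return False
    return all(
        len(ra) == len(rb) and all(same(x, y) for x, y in zip(ra, rb))
        for ra, rb in zip(a, b)
    )
```

If the check passes, the header of the named copy can be reused for the unnamed one. If the two sources order their rows differently, the rows would need to be sorted first.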
agreed! if you have bandwidth to submit a PR please do
Unfortunately we are all in that situation with this project. Fortunately, the source of most of these datasets is pretty obvious. If everyone tackled a few datasets and verified their origin (e.g. through a checksum as in here) we could quickly have origin information attached to most of the datasets. The only realistic way I see it happening is if everyone does a few and submits PRs.
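For anyone picking up a few datasets, a checksum comparison might be sketched as follows. This is only an illustration: the helper names `sha256_of_file` and `same_file` are made up here, and SHA-256 is an assumed choice of hash. It only confirms origin when the reference file is byte-for-byte identical to the local one, so a copy that differs in header or delimiter would not match.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large datasets do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_file(local_path, reference_path):
    """True if both files have identical contents according to SHA-256."""
    return sha256_of_file(local_path) == sha256_of_file(reference_path)
```

The resulting digest could then be recorded alongside the source URL in the dataset's metadata.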
Agreed; that's discussed in issue #13. At the moment, metadata properties for the datasets are extracted for the readme files since PR #11.
Currently, the features in the Wisconsin Prognostic Breast Cancer dataset do not have names.
The (I think) corresponding dataset on OpenML, or even Kaggle, seems to have this information. It would be helpful if these feature names were added.