Releases · srlearn/datasets

drug_interactions: Devendra Singh Dhami's relational version of the drug-drug interactions dataset (#12)
toy_machines: Toy multiclass-classification dataset based on one distributed with the ACE data mining system (#15)

Fixes and Other Changes

Add "Julia" section to README
Fix link to GitHub tags in README
Fix typo in README paage -> page (#13)
Move boston_housing background to correct location, previously it was incorrectly added to the boston_housing/ instead of boston_housing/boston_housing (#14)

Assets 14

06 Aug 18:28

hayesall

v0.0.4

b74bda1

Data Standardization and Validation

Standardized Data Formatting

All datasets are now validated with the grammar defined in srlearn/linter

Datasets

Four more datasets are included in this release:

financial_nlp_small
nell_sports
boston_housing
icml

Other Changes

RELEASE_VERSION is now appended to the end of zipfiles. So instead of releasing toy_cancer.zip, this and future versions will have a version (e.g. toy_cancer_v0.0.4.zip) as part of the file name.
Add general usage instructions to main project README.md
Add a hash_datasets.sh script. This is not used at the moment, but can be used to get a hash value for all files in a dataset. This could be helpful for tracking whether two versions of a dataset are exactly the same, even when the zipped contents are different.
Add lint_datasets.sh script for testing dataset content
CI build: on pull requests and pushes to the main branch, the lint_datasets.sh script runs on all datasets under srlearn/

Assets 12

13 Jul 20:08

hayesall

v0.0.3

83d21f6

4 more datasets

Datasets:

✨ Add uwcse
✨ Add cora
✨ Add webkb
✨ Add citeseer

Other Changes

📄 Add MIT License for code in this repository
✨ Add Makefile to assist with builds
🔥 Delete ~13.8 Megabytes of unnecessary comments
📝 Add overview to README and srlearn/README
🔥 Drop Gifs/ and Images/ directories

Assets 8

12 Jul 21:27

hayesall

v0.0.2

1952389

Hotfix patch for deploying artifacts

Fix typo users -> uses

Assets 4

12 Jul 21:24

hayesall

v0.0.1

f9af187

Release Test with Two Datasets

Add toy_cancer benchmark dataset
Add toy_father benchmark dataset

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What's Changed

Contributors

Datasets

Fixes and Other Changes

Standardized Data Formatting

Datasets

Other Changes

Releases: srlearn/datasets

v0.0.6 - California Housing, RoofWorld20, Deprecate Boston Housing

What's Changed

Contributors

Drug Interactions and Toy Machines

Datasets

Fixes and Other Changes

Data Standardization and Validation

Standardized Data Formatting

Datasets

Other Changes

4 more datasets

Hotfix patch for deploying artifacts

Release Test with Two Datasets