Skip to content

Recommendation for packaging a package using standalone CUDA programs and large datafiles #417

Answered by rgommers
tloredo asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @tloredo, thanks for your question and interest.

To answer this one first:

My current plan is to mimic how some AstroPy packages handle such cases: The data are not included in the project's repo, but rather stored in a permanent, citable location [...]

This seems very reasonable. I'll note that scikit-learn, scikit-image and SciPy all have data loaders that work along similar lines. I believe scikit-learn has custom code for data set downloading, while scikit-image and SciPy both use https://github.com/fatiando/pooch as an optional dependency.

If the data is optional, I'd not add the option to do the data retrieval in the package build files. Rather, just let the user do import mypk…

Replies: 1 comment 6 replies

Comment options

You must be logged in to vote
6 replies
@tloredo
Comment options

@rgommers
Comment options

@eli-schwartz
Comment options

@tloredo
Comment options

@tloredo
Comment options

Answer selected by rgommers
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants