Turning this into a Discussion
Description
Related to the overall plug-in epic (#583), I've been thinking about both the Kedro team's own maintenance burden and the user friction I see around dataset contributions today.
Context
At a high level the following points contribute to this status quo:
fsspec boilerplate overhead in every single file-based class.
Possible Implementation
I suggest Kedro introduce a set of CLI commands focused on this dataset workflow. We have a history of these ideas in the micropackaging journey as well.
They would all follow the kedro dataset <command> pattern:
pull: kedro-datasets name as per the catalog, e.g. polars.GenericDataSet. It would pull the source code, add the dependencies and provide an example catalog entry. Longer term we could think about how 3rd party polyrepos could work e.g. (1) (2)
create
install: pip command, adding the dependency to your project and providing an example catalog entry.
contribute: pulls/creates back into the open source project
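To make the proposed command shape concrete, here is a minimal sketch of a kedro dataset <command> parser. It is not Kedro's implementation: it uses the standard library's argparse so it runs on its own (a real plugin would more likely hook into Kedro's click-based CLI), and the argument names, the helper example_catalog_entry, and the placeholder filepath are all assumptions made for illustration. The sketch only parses arguments and returns an example catalog entry; it does not fetch source code or call pip.

```python
import argparse


def example_catalog_entry(dataset_type: str, name: str = "my_dataset") -> str:
    """Return an illustrative catalog entry for a dataset type.

    Hypothetical helper: the filepath is a placeholder, and a real dataset
    may require further keys that this sketch does not know about.
    """
    return (
        f"{name}:\n"
        f"  type: {dataset_type}\n"
        f"  filepath: data/01_raw/{name}.csv\n"
    )


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the proposed `kedro dataset <command>` pattern."""
    parser = argparse.ArgumentParser(prog="kedro")
    groups = parser.add_subparsers(dest="group", required=True)
    dataset = groups.add_parser("dataset", help="dataset workflow commands")
    commands = dataset.add_subparsers(dest="command", required=True)

    # pull/install take a dataset type named as in the catalog,
    # e.g. polars.GenericDataSet (argument name is an assumption).
    for name in ("pull", "install"):
        commands.add_parser(name).add_argument("dataset_type")

    # create/contribute take a local dataset name (also an assumption).
    for name in ("create", "contribute"):
        commands.add_parser(name).add_argument("name")
    return parser


def main(argv=None) -> str:
    """Parse the arguments and return the text the command would print."""
    args = build_parser().parse_args(argv)
    if args.command in ("pull", "install"):
        # A real command would pull source code or add the dependency here.
        return example_catalog_entry(args.dataset_type)
    return f"{args.command}: not implemented in this sketch"
```

Calling main(["dataset", "pull", "polars.GenericDataSet"]) returns a catalog entry whose type is polars.GenericDataSet, which is the "provide an example catalog entry" step described for pull and install.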