Marcin Gierdalski
06/10/2024, 10:20 PMploomber build
until it works as intended. How should I go about running it in isolation as a proper notebook in VSC or jupyter, and still having benefits of the information in upstream = [ 'get', ...]
preamble and perhaps in pipeline.yaml
and env.yaml
? I could add some shim code to inject desired inputs into upstream
, only to excise it later, when not needed anymore, but I'm hoping you guys could suggest a tried and proved pattern...Eduardo
ploomber build
has a --partially
argument, which will build your pipeline up until the task name that you pass. e.g. ploomber build --partially train-model
this will execute in the same way as ploomber build
it'll just stop once it reaches that task, does this work?Marcin Gierdalski
06/11/2024, 6:24 PMupstream['get']
obviously breaks because it's defined in a stub at the top as upstream = ['get']
, not as a dictionary with path(s) to my inputs (that's a job for papermill at pipeline run, I believe ). Once I'm happy with NB, I would reintegrate it and run entire pipeline.Eduardo
Marcin Gierdalski
06/11/2024, 6:28 PM