Slackbot
03/19/2022, 1:33 AMEduardo
Ido (Ploomber)
MrFiat124Spider
03/19/2022, 1:39 AMMrFiat124Spider
03/19/2022, 1:40 AMIdo (Ploomber)
Eduardo
product
section are meant to be read-only, if you wish to edit the source code it is better to edit the .py
file in the source:
section. you can open it as a notebook by right clicking. https://ploomber.io/images/doc/lab-open-with-notebook.pngβΎ
Eduardo
ploomber nb -i
to inject it manually in the source
filesIdo (Ploomber)
nb: output/get.ipynb
so you have the context of your execution.MrFiat124Spider
03/19/2022, 1:44 AMtimeSeries=(pd.read_csv(upstream['createTimeSeries']['dotsTimeSeries']))
needs to become
timeSeries=pd.read_csv('actual/path/data.csv')
in order to run the notebook outside of the ploomber pipeline and actually write the code.
then once I'm done writing, I uncomment the top line and comment out the bottom line?Eduardo
ploomber nb -i
and ploomber will inject the upstream variable πEduardo
MrFiat124Spider
03/19/2022, 1:47 AMEduardo
ploomber build
, it'll override the upstream if it exists, so it doesn't matter. you can run ploomber nb -i
and whenever you want to run the full thing, run ploomber build
- but if you change anything in your pipeline.yaml, then you'll need to run ploomber nb -i
againMrFiat124Spider
03/19/2022, 1:49 AMEduardo
MrFiat124Spider
03/19/2022, 1:50 AMMrFiat124Spider
03/19/2022, 1:50 AMMrFiat124Spider
03/19/2022, 1:50 AMEduardo
ploomber nb -i
should do the trick, but feel free to post another question if it doesn't solve your issueMrFiat124Spider
03/19/2022, 1:51 AMMrFiat124Spider
03/19/2022, 1:52 AMMrFiat124Spider
03/19/2022, 1:52 AMEduardo
Eduardo
Eduardo
pipeline.yaml
MrFiat124Spider
03/19/2022, 1:55 AMEduardo
MrFiat124Spider
03/19/2022, 2:00 AMdel(largeDF)
but it would also be nice to make individual files interactive like:
ploomber nb -i -specificFile.py
Eduardo
ploomber nb -i
command. all it's doing is creating and extra cell with the upstream
variable, but it isn't loading anything. so you're goodEduardo
ploomber task {task-name}
or do a partial build with ploomber build -p {some-task}
MrFiat124Spider
03/19/2022, 2:08 AMMrFiat124Spider
03/19/2022, 2:09 AMEduardo
ctrl + c
will stop executionMrFiat124Spider
03/19/2022, 2:10 AM- source: 01-timeSeries/xgboostWindow.py
product:
nb: 01-timeSeries/xgboost.ipynb
# resultsDict: 00-data/model_output/xgboostResults.pickle
# pcaResultsDict: 00-data/model_output/xgboostPCAResults.pickle
Ido (Ploomber)
xgboostWindow
You can also give it a custom name, in case you want something shorter, with this key: -name: task_name
Also, for VSCode, if you wanna run this command automatically you can configure a file watcher.
Click here for a VSCode extension
And hereβs the docs link: https://docs.ploomber.io/en/latest/user-guide/editors.htmlMrFiat124Spider
03/19/2022, 2:20 AMEduardo
xgboostWindow
it's always the filename without the extensionMrFiat124Spider
03/19/2022, 2:26 AMMrFiat124Spider
03/19/2022, 2:29 AMlist(dag)
Out[1]:
['downLoadMetaData',
'dataCollect',
'createTimeSeries',
'calculateNetworkStats',
'xgboostWindow']
Eduardo
xgboostWindow
doesn't work?MrFiat124Spider
03/19/2022, 2:30 AMEduardo
MrFiat124Spider
03/19/2022, 5:52 PMMrFiat124Spider
03/19/2022, 5:53 PM