Afaque Ahmad
06/12/2023, 8:14 AMkedro-pluginsrequirements.txtAbhishek Bhatia
06/12/2023, 1:06 PMMemoryDataSetcopy_modeassignJose Nuñez
06/12/2023, 3:32 PMDataSetError: Failed while saving data to data set ParquetDataSet(filepath=/Users/jose_darnott/PycharmProjects/planta-litio/data/01_raw/data_sql.parquet, load_args={'engine': pyarrow}, protocol=file, save_args={'engine': pyarrow}). Duplicate column names found:  ['timestamp', 'lims_BFIL CO3', 'lims_BFIL Ca %', ...]data_sql:
  type: pandas.ParquetDataSet
  filepath: data/01_raw/data_sql.parquet
  load_args:
    engine: pyarrow
  save_args:
    engine: pyarrow
  layer: IIITrevor
06/12/2023, 4:52 PMMemoryDataSetmainJared T
06/12/2023, 4:54 PMValueError: Duplicate keys found in 
<project repo>/conf/base/parameters/pr
epare.yml and:
- 
<project repo>/conf/base/parameters/in
gest.yml: train_pipelinetrain_pipeline# The following is a list of parameters for ingest pipeline for each namespace (train, inference)
# Parameters for train namespace
train_pipeline:
  ingestion_options:
    #Portfolio to use
    portfolio_name: has_meds_portfolio.HasMedsPortfolio
    # Feature store sub-pipes, only one for now.
    feature_store_subpipe_name: BasicFeaturePipeline
    # Expected output columns
    expected_columns:
      datetime: datetime64[ns]
      patient_id: int64
      age_days: int64
      Male: int64
      binary_smoking_status: object
      overall_censorship_time: datetime64[ns]
      months_until_overall_censorship: int64
      death_date: datetime64[ns]
# Parameters for inference namespace
# currently same as train but this will change
# first updated to Nightly Porrtfolio then to
# an api call to the valuation queue.
inference_pipeline:
  ingestion_options:
    #Portfolio to use
    portfolio_name: has_meds_portfolio.HasMedsPortfolio
    # Feature store sub-pipes, only one for now.
    feature_store_subpipe_name: BasicFeaturePipeline
    # Expected output columns
    expected_columns:
      datetime: datetime64[ns]
      patient_id: int64
      age_days: int64
      Male: int64
      binary_smoking_status: object
      overall_censorship_time: datetime64[ns]
      months_until_overall_censorship: int64
      death_date: datetime64[ns]# all parameters for prepare pipeline are in train_pipeline namespace
train_pipeline:
  preparation_options:
    # target params
    target_death_buffer_months: 2
    
    # split params 
    splitter: TimeSeriesSplit
    holdout_size: 0.3CHIRAG WADHWA
06/13/2023, 4:34 AMkedro-datasets 1.4.0 does not provide the extra 'pickle.pickledataset'kedro.extrasAbhishek Bhatia
06/13/2023, 10:21 AMPartitionedDataSetPartionedDataSetJose Nuñez
06/13/2023, 1:39 PMJeremi DeBlois-Beaucage
06/13/2023, 4:32 PMAndreas_Kokolantonakis
06/14/2023, 12:19 PMkedro run --env=devValueError: Failed to format pattern '${s3_root_path}': no config value found, no default providedRafał Nowak
06/14/2023, 4:49 PMgtosemver >= 3kedro-vizkedro-viz 6.3.0semver < 3kedro-vizsemver < 3semver3.0.1Alexandre Ouellet
06/14/2023, 7:07 PMKhangjrakpam Arjun
06/15/2023, 12:08 PMtype: kedro.extras.datasets.pandas.HTMLDataSetkedro.io.core.DataSetError: An exception occurred when parsing config for DataSet 'boxplot_figures_cfa':
Class 'kedro.extras.datasets.pandas.HTMLDataSet' not found or one of its dependencies has not been installed.Javier del Villar
06/15/2023, 6:51 PMGeorgi Iliev
06/16/2023, 7:56 AMONNXPicklepipelineONNXpublishPickeONNXconvert_sklearn.onnxKhangjrakpam Arjun
06/16/2023, 8:23 AMkedro.extras.datasets.matplotlib.MatplotlibWriter'Figure' object has no attribute 'save'sav_figsaveCamilo López
06/16/2023, 12:18 PMGuilherme Parreira
06/16/2023, 12:28 PMauto-sklearn3.9python 3.9.16pyenvPipfile3.9Vici
06/16/2023, 1:08 PMkedro vizplots:
  type: PartitionedDataSet
  path: data/08_reporting/plots
  dataset:
    type: plotly.JSONDataSet
  filename_suffix: '.json'fig = plotly.io.read_json(file); fig.show()Sebastian Cardona Lozano
06/16/2023, 2:14 PMCircularDependencyError: Circular dependencies exist among these items: [node1 ...., node2]Nok Lam Chan
06/17/2023, 10:33 AMAbhishek Bhatia
06/17/2023, 1:15 PMPartitionedDataSetAbhishek Bhatia
06/19/2023, 7:46 AMPartitionedDataSetscenario_x/
├── iter_1/
│   ├── run_1.csv
│   ├── run_2.csv
│   └── run_3.csv
└── iter_2/
    ├── run_1.csv
    ├── run_2.csv
    └── run_3.csv
scenario_y/
├── iter_1/
│   ├── run_1.csv
│   ├── run_2.csv
│   └── run_3.csv
└── iter_2/
    ├── run_1.csv
    ├── run_2.csv
    └── run_3.csv_partitioned_csvs: &_partitioned_csvs
  type: PartitionedDataSet
  dataset:
    type: pandas.CSVDataSet
    load_args:
      index_col: 0
    save_args:
      index: true
  overwrite: true
  filename_suffix: ".csv"
_partitioned_jsons: &_partitioned_jsons
  type: PartitionedDataSet
  dataset:
    type: json.JSONDataSet
  filename_suffix: ".json"
my_csv_part_ds:
  path: data/07_model_output/my_csv_part_ds
  <<: *_partitioned_csvs
my_json_part_ds:
  path: data/07_model_output/my_json_part_ds
  <<: *_partitioned_jsonsdef node_that_generates_part_ds(scenario, **kwargs):
  res = {'scenario_x/iter_1/run_1': df1, 'scenario_x/iter_1/run_2': df2,  .... and so on}}
  return resresmarrrcin
06/19/2023, 7:49 AMboolstr{%- if <http://cookiecutter.my|cookiecutter.my>_flag != "False" %}Juan Luis
06/20/2023, 10:42 AM127.0.0.1localhostPranav Khurana
06/20/2023, 11:32 AMKevin Mills
06/20/2023, 7:32 PMIdris Benkhelil
06/21/2023, 6:02 AM[etape 1] > [etape 2] > [if score_etape2 < X ] > [etape4]
				      > [if score_etape2 >= X ] > [etape5]Marc Gris
06/21/2023, 7:10 AM@singledispatchmethodfunctoolsfit()predict()ValueError: Invalid Node definition: first argument must be a function, not 'singledispatchmethod'.if not callable(func):
            raise ValueError(
                _node_error_message(
                    f"first argument must be a function, not '{type(func).__name__}'."
                )
            )Marc Gris
06/21/2023, 9:31 AMrandom_state: 42random_state: ${random_state}kedro run 
>>> [...] 
TypeError: Cannot cast scalar from dtype('<U15') to dtype('int64') according to the rule 'safe'random_state"42"42int(params['random_state'])