This message was deleted Hamilton Open Source #hamilton-help

Join Slack

This message was deleted.

# hamilton-help

Slackbot

10/25/2022, 8:24 PM

This message was deleted.

👀 1

Stefan Krawczyk

10/25/2022, 8:28 PM

Thanks for the question! I have a meeting and will get back to you shortly.

Stefan Krawczyk

10/25/2022, 9:13 PM

You have a few options.

Stefan Krawczyk

10/25/2022, 9:15 PM

You parameterize the DAG based on the config — so that we construct a DAG for each fixed parameterization.

Stefan Krawczyk

10/25/2022, 9:17 PM

So you can have

Copy code

@config.when(mean_type='A')
def moving_average__A(input1: ... ) -> ...:
    # logic for A version
    ...

@config.when(mean_type='B')
def moving_average__B(input1: ... ) -> ...:
    # logic for B version
    ...

That will switch out and run different logic — you can hard code values here if you like and it’ll be close to the transforms.

Stefan Krawczyk

10/25/2022, 9:20 PM

Since Hamilton is python object agnostic — if you’re happy coupling your code to that class object, declare it as input and use it directly in the function:

Copy code

@config.when(mean_type='special')
def moving_average__special(custom_object: ITS_TYPE, input1: ...) -> ...:
   # logic to use the custom object... e.g.
   return custom_object.compute(input1)

Stefan Krawczyk

10/25/2022, 9:20 PM

Then at driver time you just need to pass in the right “DAG config”, and then any inputs that are required to run the DAG.

Stefan Krawczyk

10/25/2022, 9:25 PM

In terms of

I’d like to keep track of all of these parameters in some python (data)class/attrs object.

you have options — depends on how you want to manage that code. i.e. with the transforms themselves (as alluded to above), with the driver code, or outside of hamilton code entirely.

Stefan Krawczyk

10/25/2022, 9:31 PM

So the other options are something like:

Copy code

def moving_average(config_required: dict) -> ...:
   # use a dict of things to parameterize this function's logic 
   ...

and the driver code would be

Copy code

# for first DF
config = {
  'config_required': {...} # or load from some other place
  'input': df_1
}
dr = driver.Driver(config, modules)
df = dr.execute([...])

# for second DF
config = {
  'config_required': {...} # or load from some other place
  'input': df_2
}
dr = driver.Driver(config, modules)
df = dr.execute([...])

Stefan Krawczyk

10/25/2022, 9:32 PM

does that make sense @Zouhair Mahboubi?

Zouhair Mahboubi

10/25/2022, 10:32 PM

thanks - I think I get it. What I was missing is that config can contain all of the set of “inputs” Couple of followups: 1. Do the objects in the Driver config object (

config_required

in this example) need to be json serialazable (dicts/etc.) or can they be any python object (e.g. a callback, a class, etc.) 2. can these other parameters/objects be in inputs vs in config? a. and what’s the difference between putting inputs in the config vs passing it to driver.execute as shown for example here ?

Elijah Ben Izzy

10/25/2022, 11:15 PM

Yep — its really open-ended. 1. Any python object is fine (although serializable will probably make your life easier at some point down the line…). If you’re passing in a callback or class I’d highly recommend using the type of config-based delegatation @Stefan Krawczyk recommended and baking it in the dataflow itself. E.G. if instead of passing in a function you embed that into the DAG and choose it depending on a parameter, your life will get a lot easier. 2. Yep! Either one is fine. The difference between config and input is a little subtle. config allows you to use it at DAG construction (E.G. with

config.when

). So it can actually shape the DAG. Inputs (as well as config) are passed in during runtime — so you can actually declare a dependency on an it. If it’s not passed in, execution will error out!

Elijah Ben Izzy

10/25/2022, 11:18 PM

So:

Copy code

# can be run with bar supplied in inputs *or* config
def foo(bar: int) -> int:
    ...

# bar has to be supplied in "config"
@config.when(bar=1)
def foo__1() -> int:
    ...

Note there’s the notion of overrides (not

config

inputs

but a third, more mysterious thing 😆) that enables you to short-circuit node execution and replace a node. But I digress…

Zouhair Mahboubi

10/26/2022, 1:35 AM

awesome - thanks. I think that’s likely enough to get me started with a prototype and see what I break 😛

Elijah Ben Izzy

10/26/2022, 1:36 AM

Awesome! Happy to answer more qs as they come up :)

Open in Slack

Previous Next