# ingestion
s
Hello guys, we have `redshift-usage` and `redshift`. Can we use both when ingesting, in order to have both usage stats and table metadata? Or is it only one or the other? Thank you in advance!
d
You have to use both
`redshift-usage` adds usage stats on top of the `redshift` ingestion
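A minimal sketch of what "use both" could look like as two separate recipes, assuming the usual DataHub recipe layout. The host, database, user, password variable, server URL, and `email_domain` value are placeholders that do not come from this thread, and the two YAML documents are meant to be saved as two separate files and each run with `datahub ingest -c <file>`.

```yaml
# Recipe 1 (save as its own file): table metadata, with profiling enabled here.
source:
  type: redshift
  config:
    host_port: "example-cluster.example.com:5439"  # placeholder
    database: dev                                   # placeholder
    username: datahub_user                          # placeholder
    password: "${REDSHIFT_PASSWORD}"                # placeholder environment variable
    profiling:
      enabled: true
sink:
  type: datahub-rest
  config:
    server: "http://localhost:8080"                 # placeholder
---
# Recipe 2 (save as its own file): usage stats layered on top of recipe 1.
source:
  type: redshift-usage
  config:
    host_port: "example-cluster.example.com:5439"  # placeholder
    database: dev                                   # placeholder
    username: datahub_user                          # placeholder
    password: "${REDSHIFT_PASSWORD}"                # placeholder environment variable
    email_domain: example.com                       # assumption: check the source docs for required keys
sink:
  type: datahub-rest
  config:
    server: "http://localhost:8080"                 # placeholder
```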
s
Perfect !
Thanks for the quick answer!
@dazzling-judge-80093 If `redshift-usage` adds usage stats on top of the `redshift` ingestion, what is the difference between setting `profiling.enabled: true` in `redshift-usage` and in `redshift`? Should I therefore keep all the transformers and profiling options only on `redshift`, and keep the `redshift-usage` YAML as simple as possible, just to add usage stats?
d
With `redshift-usage` you can’t do profiling
s
@dazzling-judge-80093 But we have that option in the docs: https://datahubproject.io/docs/generated/ingestion/sources/redshift#config-details-1
d
Thanks, I think this is something we should clean up in the docs
s
@dazzling-judge-80093 You are very welcome ! 😎
@dazzling-judge-80093 And what about all the other options? `stateful_ingestion`, `profile_pattern`, `schema_pattern`, `transformers`, `options.connect_args`... I've used them and the CLI accepts these keys in the YAML, but I don't know if they really work. Do they work, or is it just another "doc cleaning" problem? Thank you a lot for all your help. :)
This is related to `redshift-usage`.
d
I’m only unsure about `stateful_ingestion`; the others should work, I think
s
@dazzling-judge-80093 But since `redshift-usage` simply adds usage stats on top of the `redshift` ingestion, what's the point of using all those other YAML options?
d
They can filter out some usage info if you want, for example to not collect usage for a specific schema
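As a rough sketch of that filtering, a `redshift-usage` recipe might deny one schema as below. This assumes `schema_pattern` takes the usual allow/deny lists of regular expressions; the thread only says the option "should work", so treat it as unverified. The schema name `scratch` and all connection values are hypothetical, and whether the pattern is matched against the bare schema name or a qualified name is also an assumption to check against the source docs.

```yaml
source:
  type: redshift-usage
  config:
    host_port: "example-cluster.example.com:5439"  # placeholder
    database: dev                                   # placeholder
    username: datahub_user                          # placeholder
    password: "${REDSHIFT_PASSWORD}"                # placeholder environment variable
    email_domain: example.com                       # assumption: check the source docs for required keys
    schema_pattern:
      deny:
        - "^scratch$"                               # hypothetical schema to exclude from usage stats
sink:
  type: datahub-rest
  config:
    server: "http://localhost:8080"                 # placeholder
```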