wide-ghost-47822
05/14/2023, 8:33 PMlog_ingestion_stats
in pipeline object. And I wondered if I can get some metrics about the pipeline which is runned.
I saw some code block inside this method which sends some statistics data using telemetry object. It is like this:
telemetry.telemetry_instance.ping(
"ingest_stats",
{
"source_type": self.config.source.type,
"sink_type": self.config.sink.type,
"records_written": stats.discretize(
self.sink.get_report().total_records_written
),
"source_failures": stats.discretize(source_failures),
"source_warnings": stats.discretize(source_warnings),
"sink_failures": stats.discretize(sink_failures),
"sink_warnings": stats.discretize(sink_warnings),
"global_warnings": global_warnings,
"failures": stats.discretize(source_failures + sink_failures),
"warnings": stats.discretize(
source_warnings + sink_warnings + global_warnings
),
},
Inside the ping method, the code sends this data to an external api called Mixpanel. It seems you are collecting data about the pipeline from my machine.
I don’t like this way of collecting data. Why are you collecting this data?lively-cat-88289
05/14/2023, 8:33 PMastonishing-answer-96712
05/16/2023, 8:46 PM