okay, also we want to schedule this job periodically to ingest data and #no. of such schedule could be high (hourly and assume 100s of different batch pipelines) so I thought of standard spark+airflow to do the job.
I felt like - in minion framework, we are using pinot-controller as a scheduler which might have some impact on the overall cluster throughput if no. of jobs are high. Am I right to have that assumption?