Hi :) I'm a volunteer working on vaccine surveillance and modeling for one of the counties in New York.
Each day we have to retrieve an extract from the New York State Immunization Information System (NYSIIS) covering the previous day's vaccinations. The files are named with the date they were cut (this was yesterday's file):
20211202_nysiis.csv.zip
The plan is to dump CSVs into Postgres, do some transforms, then shift the data into Elasticsearch for analysis.
I'm hoping someone would be willing to help me understand how best to use the Files source given that it does not yet support multiple files.
I'm looking for guidance on how to handle the initial load + the ongoing daily incrementals. I could do preprocessing before the Airbyte process - but I was hoping to use this as an opportunity to learn best practices inside the Airbyte ecosystem.
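For context, here is roughly what the preprocessing fallback could look like: a minimal Python sketch that merges the daily zips into one CSV that a single-file source could read. It assumes each archive holds exactly one UTF-8 CSV and that every extract has the same columns. The names `cut_date` and `combine_extracts` and the added `extract_date` column are made up for illustration, not anything from NYSIIS or Airbyte.

```python
import csv
import io
import re
import zipfile
from datetime import date, datetime
from pathlib import Path
from typing import List, Optional

# Matches names like 20211202_nysiis.csv.zip and captures the cut date.
FILENAME_RE = re.compile(r"^(\d{8})_nysiis\.csv\.zip$")


def cut_date(path: Path) -> Optional[date]:
    """Return the date encoded in the file name, or None if it does not match."""
    match = FILENAME_RE.match(path.name)
    if match is None:
        return None
    try:
        return datetime.strptime(match.group(1), "%Y%m%d").date()
    except ValueError:  # eight digits that are not a real calendar date
        return None


def combine_extracts(src_dir: Path, out_csv: Path,
                     since: Optional[date] = None) -> List[date]:
    """Concatenate extracts cut after `since` (or all of them) into one CSV.

    Appends a hypothetical `extract_date` column so each row keeps track of
    which daily file it came from. Returns the cut dates that were included.
    """
    dated = []
    for path in src_dir.iterdir():
        cut = cut_date(path)
        if cut is not None and (since is None or cut > since):
            dated.append((cut, path))
    dated.sort()  # oldest extract first

    header_written = False
    with out_csv.open("w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out)
        for cut, path in dated:
            with zipfile.ZipFile(path) as zf:
                # Assumption: one CSV per archive, so take the first .csv member.
                member = next(n for n in zf.namelist() if n.lower().endswith(".csv"))
                with zf.open(member) as raw:
                    text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                    reader = csv.reader(text)
                    header = next(reader, None)
                    if header is None:  # empty file, nothing to copy
                        continue
                    if not header_written:
                        writer.writerow(header + ["extract_date"])
                        header_written = True
                    for row in reader:
                        writer.writerow(row + [cut.isoformat()])
    return [cut for cut, _ in dated]
```

The initial load would call `combine_extracts(src, out)` with no `since`, and each daily run would pass the last date already loaded so that only newer files are picked up.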
Thanks! :)