Ashwini Mali
05/23/2022, 6:56 AMCorey Canestrare
05/23/2022, 11:36 AMAkhtar Bhat
05/24/2022, 5:50 AMAshwini Mali
05/24/2022, 6:23 AMEngineering Team
05/24/2022, 8:55 AMio.airbyte.workers.DefaultReplicationWorker$SourceException: Source cannot be stopped!
------------------------------------------------------------------
[0m > 09:19:17.259653 [error] [MainThread]: 404 Not found: Table xxxx-realm-122345:stripe_data._airbyte_raw_disputes was not found in location US
Only some data is syncedAkhtar Bhat
05/24/2022, 10:08 AMClémence Jullian
05/24/2022, 11:57 AMAnton Podviaznikov
05/24/2022, 1:26 PM0.38.4-alpha
on k8s and trying to sync one table from PG to Snowflake.
Table has 32 mln records.
It takes airbyte anywhere from 2h30m to 3h30 min to do initial sync on this table.
Pipelinewise takes 37min.
I'm not sure how to get the same numbers.
Another thing that confuses me that after sync is done I see that both tables in snowflake have 32 mln records.
But the size of the table created by pipelinewise is 2.6GB and the one created by airbyte is 5GB (and on top of that why does airbye UI shows that 49.25 GB worth of data were processed - those numbers don't match).
Why is that? Any ideas.Santiago Stachuk
05/24/2022, 2:16 PMSlackbot
05/24/2022, 4:05 PMCameron Whitehead
05/24/2022, 4:17 PMview_id, ga_date
so a composite one (which I hope is right, it seems to be that the ga_date
shouldn't be replicated in a given view_id
? But I'm not sure what to use for Google Ads. Just the segments.date
, right? Or is that dumb for some reason I'm not seeing?
Thanks for any pointers!David Dalmaso
05/24/2022, 4:48 PMCarlos Marques
05/24/2022, 6:01 PMFrank Bardelli
05/24/2022, 7:52 PMJenny Brown
05/24/2022, 9:30 PMEddard
05/25/2022, 12:33 AMZak Keener
05/25/2022, 2:17 AMairbyte_workspace
volume /data
. According to the docs this is where logs, configs, and I assume other things are stored. Is it safe to delete these after a period of time? I need to cleanup logs on a very large Airbyte instance (millions of directories in /data
) and would like to remove all directories older than 2 weeks, but do not want to remove data that will not be regenerated.Devon Solomon
05/25/2022, 10:29 AMHarvey Marshall
05/25/2022, 11:27 AMWei Mei
05/25/2022, 1:42 PMRahul Patel
05/25/2022, 3:16 PMEngineering Team
05/26/2022, 4:51 AMSimon Thelin
05/26/2022, 8:03 AMHarvey Marshall
05/26/2022, 10:14 AMVytautas Bartkevičius
05/26/2022, 1:08 PMWARNING! Updating the schema will delete all the data for this connection in your destination and start syncing from scratch
. Why is that? Why the data is deleted from all streams, not only from new one, but also from currently existing. So after this I need to collect all data from from scratch? Or how I could prevent from this?Lluís Gassó
05/26/2022, 3:08 PMDhruv Satish
05/26/2022, 3:52 PMAndrew Ferreira
05/26/2022, 4:33 PMRahul Patel
05/26/2022, 5:20 PMDarragh
05/26/2022, 7:09 PM