Isaac
11/03/2021, 4:24 AMJeff Crooks
11/03/2021, 1:10 PMChristian Roy
11/03/2021, 10:02 PMIf you want to schedule a 20-min call with our team to help you get set up, please select some time directly here.But the link (https://calendly.com/nataliekwong/airbyte-onboarding) ends up on a page that reads
Natalie Kwong. This Calendly URL is not valid.
Manav
11/05/2021, 8:14 PMVinod Varma
11/06/2021, 3:02 PMJeff Crooks
11/08/2021, 4:40 PMBaatch
11/09/2021, 6:42 AMPrasad Chalasani
11/10/2021, 2:26 AMGergely Lendvai
11/10/2021, 11:52 AMJonas Bolin
11/10/2021, 12:51 PMsegment.date
to Date
, metrics.cost_micros
to Cost
and casts them to the proper type
I can see how this would reduce the setup time for a source from 5+ hours for Google Ads (for an analyst like myself) to 5min
This may not be the market you're primarily going after, but having sources like this would mean that using Airbyte would suddenly be within reach (from a UI standpoint) of the hundreds of thousands marketers our there who are getting ripped off by Funnel.io, Supermetrics etc (in Europe)gunu
11/11/2021, 11:04 AMSimbazz
11/15/2021, 8:32 AMJoshua Dawson
11/15/2021, 3:02 PMDerek Wene
11/16/2021, 4:02 PMVijay
11/16/2021, 11:02 PMRytis Zolubas
11/18/2021, 10:54 AMJeff Crooks
11/18/2021, 12:54 PMNicolas Smith
11/18/2021, 3:46 PMAnatole Callies
11/18/2021, 4:46 PMCraig Bunyan
11/18/2021, 6:11 PMNeethika Singh
11/19/2021, 12:15 PMNeethika Singh
11/19/2021, 1:03 PMZach Brak
11/19/2021, 5:33 PM"*DEPROVISIONED"*
status when not filtered.
This corrupts the output, as de-provisioned users arenāt returned, and you will always overstate your active users in the dataset.
The "DEPROVISIONED"
status users need to be returned within the stream in order for a complete user picture out of okta.
This heavily devalues the use of this connector for seeing a full user lifecycle.
Proposed solution
The "DEPROVISIONED"
status users can be returned through a filtered call - Iām not sure the best way this can be resolved, but I believe if the stream can be filtered initially - it would return all de-provisioned users along with it.
A couple ideas - not sure which one makes most sense:
⢠The users stream makes a secondary call setting the filter to status eq "DEPROVISIONED"
⢠An initial āSync data from dateā parameter is passed, facilitating an initial filter of: lastUpdated gt "yyyy-MM-dd'T'HH:mm:ss.SSSZ"
(this may be most accessible to the existing stream)
⢠A secondary users stream, called deprovisioned_users
is copied from the initial with a fixed filter of status eq "DEPROVISIONED"
Thank you for your consideration, Again this is extremely important to having us able to effectively account for user lifecycle out of the Okta tenant.Anatole Callies
11/22/2021, 11:09 AMYoram
11/23/2021, 9:05 AMJeff Crooks
11/23/2021, 7:50 PMThomas
11/24/2021, 1:25 PMClovis Masson
11/25/2021, 10:58 AMnode A : 1 worker
, node B : 1 worker
and node C : 2 workers
). When syncing, if my source-worker
and destination-worker
are correctly distributed, Iām able to process about 25M rows in one hour (with a well distributed CPU load). However, if by any chance source-worker
and destination-worker
are both started in node C
, then the nodeās CPU goes up to 190% (against 10% and 10% for the two others) and time processing is much more slower as Iām only able to process about 15M rows within an hour.Ā
Not sure if it's an actual request as I don't know if there is an existing strategy to avoid this situation but is there a way to force the parallelization of workers on different nodes to maximize performance ?yu
11/25/2021, 12:17 PMJonas Bolin
11/25/2021, 1:44 PM