# ask-ai
  • v

    Vasiliy Knizhnikov

    07/26/2021, 1:46 PM
    Hey guys! Need some help 🙏 I’m developing a custom source connector and have a source data structure represented as JSON:
    {
       "visitId":"xxx",
       "visitStartTime":"xxx",
       "date":"xxx",
       "geoNetwork":{
          "country":"xxx"
       },
       "hits":[
          {
             "page":{
                "pagePath":"xxx",
                "hostname":"xxx",
                "pageTitle":"xxx"
             }
          }
       ]
    }
    Corresponding catalog for the schema:
    {
      "streams": [
        {
          "sync_mode": "full_refresh",
          "destination_sync_mode": "overwrite",
          "stream": {
            "name": "data",
            "json_schema": {
              "type": "object",
              "properties": {
                "visitId": {
                  "type": "number"
                },
                "visitStartTime": {
                  "type": "number"
                },
                "date": {
                  "type": "string"
                },
                "geoNetwork": {
                  "type": "object",
                  "properties": {
                    "country": {
                      "type": "string"
                    }
                  }
                },
                "hits": {
                  "type": "array",
                  "properties": {
                    "page": {
                      "type": "object",
                      "properties": {
                        "pagePath": {
                          "type": "string"
                        },
                        "hostname": {
                          "type": "string"
                        },
                        "pageTitle": {
                          "type": "string"
                        }
                      }
                    }
                  }
                }
              }
            }
          }
        }
      ]
    }
    So effectively, the hits property is an array where one of the properties is an object. I’m struggling with normalization here. Basic normalization works fine for flat nested objects like geoNetwork, but the hits table is not even being created ❓ Do I need to create a custom dbt transform in order to normalize records like these, or is there another way to do this? Thank you!
    ✅ 1
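One thing that may be worth checking in the catalog above: in JSON Schema, the shape of an array's elements is declared under "items", while "properties" only applies to objects, so the "properties" placed directly under hits is likely ignored. Below is a minimal sketch of how the hits entry could be declared, written as a Python dict. The helper `array_element_schema` and the constant names are hypothetical, for illustration only, and whether basic normalization then creates a nested hits table is an assumption to confirm against the Airbyte docs.

```python
import json

# Hypothetical rewrite of the "hits" property from the catalog above.
# JSON Schema describes array elements under "items"; "properties" only
# applies when the instance is an object.
HITS_SCHEMA = {
    "type": "array",
    "items": {
        "type": "object",
        "properties": {
            "page": {
                "type": "object",
                "properties": {
                    "pagePath": {"type": "string"},
                    "hostname": {"type": "string"},
                    "pageTitle": {"type": "string"},
                },
            }
        },
    },
}

# Shape of the entry as originally posted: "properties" directly under the array.
ORIGINAL_HITS_SCHEMA = {
    "type": "array",
    "properties": {"page": {"type": "object"}},
}


def array_element_schema(schema):
    """Return the element schema of an array schema, or None if "items" is absent."""
    if schema.get("type") != "array":
        return None
    return schema.get("items")


if __name__ == "__main__":
    # Print the sketch as JSON so it can be pasted into a catalog file.
    print(json.dumps(HITS_SCHEMA, indent=2))
```

With this shape, `array_element_schema(HITS_SCHEMA)` returns the object schema for each hit, while the originally posted shape yields `None` because it declares no element schema.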
  • c

    charles

    07/26/2021, 6:19 PM
    How big is the airbyte-integrations dir supposed to be? Mine is 5.6G 🤣 My guess is that most of the weight here is the multiple pyenvs.
  • s

    s

    07/27/2021, 10:53 PM
    I’ve been using Slapdash for a few months and it’s been an amazing tool. It’s basically Mac Spotlight, but across all of your apps. Thought it might be interesting to people here
    ✨ 1
    👍🏽 1
    🌟 1
    ⭐ 1
    🤩 1
    👍 5
    🌠 1
  • h

    Howard Lince III

    08/02/2021, 1:50 PM
    tfw you wake up on a Monday morning and 8 connections are down
    😭 3
  • n

    Norman Rosner

    08/06/2021, 8:28 AM
    Can I rant here to reaffirm your work and what you're up to? 😊
    👍 6
  • l

    Lars Kamp

    08/11/2021, 7:21 AM
    I just tried subscribing to the Airbyte newsletter with a new email. Looks like the form field isn't working - I tried this both on Safari and Chrome on a Macbook.
  • c

    charles

    08/16/2021, 11:04 PM
    We are the second result that comes up in google when you search "Failed to exec spawn helper m1". This feels like a form of greatness. But also, we will fix the M1 issues this month!
    😂 15
  • s

    s

    08/19/2021, 6:13 PM
    Really interesting data tool I saw on HN yesterday. It provides an abstraction over any external data source directly in SQL: https://steampipe.io/ demo video:

    https://www.youtube.com/watch?v=2BNzIU5SFaw&t=183s

    👍 11
  • g

    gunu

    08/23/2021, 11:15 AM
    anyone have a show & tell of insights from the github connector? i know there was a recent demo with superset showcasing this. but possibly a public airbyte-dbt package (similar to fivetran dbt package)
  • h

    Hicham Rahj

    08/25/2021, 8:08 AM
    Hello, I have a question. I've seen that Airbyte always creates a destination with the name "_airbyte_raw". Is there any scenario where it doesn't create it? And is it possible to delete them after the jobs are run, or will this create a problem during future syncs?
  • j

    John (Airbyte)

    08/26/2021, 4:40 AM
    for our users, don't hesitate to vote on that poll 🙂 https://www.reddit.com/r/dataengineering/comments/pbaw2f/what_etl_tool_do_you_use/
  • s

    Steve

    08/26/2021, 9:43 AM
    Hi, Does anyone have any information on the "Webhook connector" mentioned at https://docs.airbyte.io/project-overview/roadmap - either on its progress/ETA or perhaps what it might provide? Thanks
    ✅ 1
  • j

    Jarrod Parkes

    09/03/2021, 3:55 PM
    who made all the Airbyte/Octavia emoji? they are great!
    octavia loves 11
  • n

    Noel Gomez

    09/11/2021, 2:22 PM
    Poll: What’s your opinion on DataOps?
    1️⃣ Don’t care, most people are not ready
    2️⃣ Only needed at large companies over 600 employees
    3️⃣ I wish I could do it, but don’t have the skills
    4️⃣ I do it, but set up is difficult
    5️⃣ What’s DataOps?
    3️⃣ 1
    5️⃣ 4
    1️⃣ 1
  • s

    s

    09/13/2021, 11:05 PM
    https://www.investors.intuit.com/news/news-details/2021/Intuit-to-Acquire-Mailchimp/default.aspx
    👀 1
  • f

    Farman Pirzada

    09/14/2021, 11:18 PM
    anyone here make a transition from API development to data engineering? I am going to be doing that and wanted to hear about other people's backgrounds, unless there is a more suitable channel for that
  • m

    Michel

    09/15/2021, 9:33 PM
    The Airbyte team is a big fan of Among Us, we play it together at least once a week. Great to see that we will be able to do it in real life :) https://www.theverge.com/tldr/2021/9/15/22676173/among-us-official-costume-impostor-crewmate
    venting 3
  • j

    John (Airbyte)

    09/15/2021, 10:49 PM
    Screen Shot 2021-09-16 at 9.48.35 AM.png
    airbyte heart 2
    🤣 6
    this 3
  • m

    Marwan

    09/22/2021, 8:54 PM
    I just found out that Airbyte owns the .com domain too. Strange, why do you prefer the .io domain @Michel?
  • e

    Elaaf

    09/27/2021, 11:21 AM
    Hi everyone, I had a question! This may have been already asked but... How does Airbyte scale data migration? Is there an in-house developed MPP system being used to distribute and manage a HUGE ingestion job across multiple workers? Regards, Elaaf
    ✅ 2
  • m

    Mayank Sardana

    09/30/2021, 4:45 AM
    Hi everyone, I am planning to use Airbyte in our team for replication. We are currently using Redshift and are evaluating Snowflake as a potential warehouse alternative. Does anyone know if it's easier to move the ingestion task destination from redshift to snowflake (assuming the existing data is moved)?
    ✅ 1
  • c

    chad cole

    09/30/2021, 6:02 PM
    is this an addon which keeps your old files without deleting them on a free plan?
  • m

    Matheus de Freitas Andrade

    09/30/2021, 9:52 PM
    Good evening, I have Airbyte running on AWS and working perfectly. Has anyone used Auth0 to add a user authentication layer, or some other user authentication tool? (I know it is possible to allow specific IPs in the Airbyte VPC, but I would like to do it with authentication, so I would not always need to be adding IPs.)
  • d

    Davin Chia (Airbyte)

    10/04/2021, 4:24 PM
    👀 https://www.independent.co.uk/life-style/gadgets-and-tech/facebook-down-instagram-whatsapp-today-b1932059.html
  • a

    Aldo M

    10/08/2021, 8:07 AM
    Hello everyone 🙂 I am curious to know when you plan to integrate Prefect?
    🔥 5
  • m

    Michel Ebner

    10/12/2021, 9:14 AM
    Hello community, I got a question about the mysql integration and log based replication. Does the integration handle changing log files? When the log file reaches its size limit, mysql creates a new log file with a higher index (ex: bin-log.00001 -> bin-log.00002). The stitch integration does not handle this...
    ✅ 1
  • g

    Gergely Lendvai

    10/15/2021, 8:47 AM
    Hi all, I’m curious, what’s the recommended way to run airbyte on kubernetes? I found a helm chart but I also saw some kustomize related files here and I was wondering which one should I choose.
    ✅ 1
    👀 1
  • s

    Sheshan

    10/19/2021, 10:10 AM
    Hi people, I am using Airbyte along with other services to play with the data. I was curious if I can use airbyte-db (postgres) to store data specific to other services? 🤔 Just create another database for other services to store their data. Will this affect Airbyte in any way?
    ✅ 1
  • g

    Gergely Lendvai

    10/20/2021, 1:24 PM
    Hi all, I’m trying to check with the REST API whether a sync has finished, with any status, but I couldn’t find the corresponding API call. Is there a way to achieve this with the API?
    ✅ 1
  • g

    George Claireaux (Airbyte)

    10/27/2021, 3:49 PM
    Cool mini-game to find your favourite code font for IDEs: https://www.codingfont.com/
    🔥 2
    ✅ 1