# good-reads
  • a

    Ari Bajo

    10/14/2021, 7:53 PM
    @Tuan Nguyen published his first Airbyte recipe! 🎉 As you know, Airbyte uses dbt to normalize the extracted data. Tuan shares how you can modify the dbt code generated by Airbyte to partition and cluster BigQuery tables. This has been asked before on Slack! https://airbyte.io/recipes/bigquery-partition-cluster
    💯 1
    👀 2
    🔥 8
  • j

    John (Airbyte)

    10/26/2021, 12:04 PM
    A case study on Airbyte!! https://blog.threado.com/p/airbytecasestudy
    octavia loves 3
    airbyte rocket 7
    👍 11
  • j

    John (Airbyte)

    10/27/2021, 4:49 AM
    We just published an article explaining why we started by open-sourcing ELT rather than any other type of data integration. We also give a glimpse of our vision to become the new standard for all data movement in the future. https://airbyte.io/blog/airbyte-strategy-to-commoditize-all-data-integration
    👀 4
    airbyte rocket 7
  • m

    MChorfa

    10/31/2021, 11:26 PM
    https://www.castordoc.com/blog/etl-benchmark-for-mid-market-companies
  • j

    John (Airbyte)

    11/02/2021, 12:33 PM
    report on state of community tools from orbit.love https://cdn.glitch.me/9f234bd6-bd99-421b-997d-8c15b9676ce8%2FConstellation-Report-State-of-Community-Tools-2021.pdf?v=1635444413419
  • g

    George Claireaux (Airbyte)

    11/03/2021, 10:42 AM
    Databricks proving lakehouse pattern by breaking warehouse performance record: https://databricks.com/blog/2021/11/02/databricks-sets-official-data-warehousing-performance-record.html
  • e

    Enrico Tuvera Jr.

    11/18/2021, 5:40 AM
    are there any good books on airbyte out there? or any introductory material that isn't the documentation
  • j

    JP (Veezoo)

    11/29/2021, 2:03 PM
    I just recently read the classic papers from Codd and Boyce/Chamberlin and felt very inspired by their sense of purpose. So I decided to write a tribute to these pioneers from our field. https://jpmonteiro.substack.com/p/codds-dream
    ❤️ 2
  • a

    Ari Bajo

    12/08/2021, 9:54 PM
    Airbyte's engineering blog is out! The first article comes from @Jared Rhizor (Airbyte) https://airbyte.io/blog/extending-docker-images-on-kubernetes "When we first started orchestrating third-party containers in Kubernetes, we found out that we needed to extend container entrypoints to be able to perform syncs. Since Kubernetes does not allow inspecting Docker entrypoints, we needed a strategy to identify this entrypoint when launching the container." More articles coming soon! Excited that our engineers are sharing not only the code on our GitHub repo but also what we learn along the journey :)
    👌 14
    👍 1
    👌🏼 1
  • j

    John (Airbyte)

    12/14/2021, 1:45 PM
    thanks @Dániel Molnár! https://www.dataengineering.academy/pipeline-data-engineering-academy-blog/the-pipeline-academy-awards-2021-pipies
    👏🏽 1
    👏 4
  • a

    Ari Bajo

    12/14/2021, 10:22 PM
    For those who couldn't attend the last Community Call, here is the recipe sharing how to integrate Prefect, Airbyte and dbt 🙂 https://airbyte.io/recipes/elt-pipeline-prefect-airbyte-dbt
    🔥 6
  • a

    abhi

    12/16/2021, 9:55 PM

https://www.youtube.com/watch?v=kyGiUEWhOKQ&ab_channel=Airbyte

    For those that missed our Demo Hours this week, here's a demo on how you can easily deploy Airbyte on K8s with the open-source project, plural.sh!
    👍 3
    😎 6
    🔥 6
  • a

    Ari Bajo

    01/03/2022, 9:41 PM
    Featuring Airbyte, dbt, Superset, and OpenMetadata among other cool projects 🎉 https://towardsdatascience.com/building-an-end-to-end-open-source-modern-data-platform-c906be2f31bd
  • d

    Davin Chia (Airbyte)

    01/11/2022, 9:04 AM
    We wrote a blog post explaining how Airbyte works on Kubernetes - those that run Kube might want to give this a read to better understand what goes on behind the scenes!
    👏 1
    👍 12
  • u

    [DEPRECATED] Marcos Marx

    01/12/2022, 12:34 AM
    A nice article about building a data platform: https://towardsdatascience.com/building-an-end-to-end-open-source-modern-data-platform-c906be2f31bd
    👏 4
  • a

    Ari Bajo (Airbyte)

    01/20/2022, 3:41 PM
    How do you collect behavioral data? @Arpit (ask.astorik.com) published his first article on our blog sharing his experience using CDI, CDP and ELT tools to collect behavioral data from what he likes to call primary and secondary sources. https://airbyte.com/blog/collect-behavioral-data-guide
    👍 9
  • m

    MayowaPelemo

    02/06/2022, 5:39 PM
    Pls how can I be a writer on Airbyte??
  • a

    Ari Bajo (Airbyte)

    02/16/2022, 11:40 AM
    @Madison Mae shared her best practices for creating a style guide for your dbt projects. It starts with storing a copy of your raw data and creating base models with dbt, then defining which kinds of transformations to apply at each stage of your data pipeline. Curious to know what others include in a style guide for data modelling? https://airbyte.com/blog/best-practices-dbt-style-guide
    👏 6
  • j

    John (Airbyte)

    02/17/2022, 2:41 PM
    interesting video "Build your open source data warehouse with Restack, Airbyte, Metabase & BigQuery"

https://www.youtube.com/watch?v=RIwXIgD4AS0

    💜 1
    🙏 1
    octavia loves 8
  • a

    Ari Bajo (Airbyte)

    03/15/2022, 10:08 AM
    Kudos to @Dunith, who shared how to build a data pipeline to feed a user-facing analytics dashboard. I found it interesting how Apache Pinot is able to parse the Airbyte raw data going through Kafka and handle updated data coming from the MySQL CDC log with upserts! https://airbyte.com/tutorials/real-time-data-analytics-pipeline
    👍 6
  • c

    Connor Lough

    03/17/2022, 4:54 PM
    I'm trying to read more about connecting to APIs, FTP servers, and the like, in Python... do y'all know any good books for that? All things connections?
    ➕ 2
  • a

    Amanda Robson

    03/21/2022, 5:27 PM
    Hi all - more of a listen than a read - but the open-source startup podcast just released a new episode with Airbyte CEO Michel Tricot! Check it out⚡ https://anchor.fm/ossstartuppodcast/episodes/E21-Airbyte--Open-Source-Data-Integration-e1g1n7t/a-a7k6d4g
    👍🏼 1
    👍 6
  • c

    Chris Sean

    03/24/2022, 1:09 AM
    In this week's community call, Saif Abid teaches us how to build Go Based Connectors with Bitstrapped! Here's the video in case you missed the live stream. 😎👇

https://youtu.be/4I_JIbAHDkg

    🙌 5
  • j

    Johannes Judt

    04/04/2022, 12:56 PM
    In case you are interested in Natural Language Query (NLQ) in BI: it all started with Alan Turing writing about “Computing Machinery and Intelligence” in 1950. https://www.veezoo.com/blog/natural-language-query-nlq-in-business-intelligence-history-and-comparison/
  • a

    Ari Bajo (Airbyte)

    04/11/2022, 5:56 PM
    Hey! @Sherif Nada shared how we test connector correctness and best practices with Connector Acceptance Tests (CATs)! https://airbyte.com/blog/black-box-testing-data-connectors?
    🐱 2
    sherif 2
    airbyte rocket 1
    🙌 6
  • j

    Jorge Lucas

    04/13/2022, 2:32 PM
    Good morning team. Has anyone managed to deploy Airbyte on Amazon EKS with Fargate and can help me? I have a volume-related problem.
  • a

    Ari Bajo (Airbyte)

    04/19/2022, 3:54 PM
    Airbyte engineering blog is on fire! If you are curious about how we orchestrate ELT jobs internally, @Benoit Moriceau (Airbyte) shared how we write Temporal workflows in Java 🙂 https://airbyte.com/blog/scale-workflow-orchestration-with-temporal
    octavia loves 8
    ❤️ 3
  • c

    Chris Sean

    04/28/2022, 7:49 PM
    This isn't necessarily a good read but more of a good watch! Our latest community call with Faros is now live on YouTube! 😄

https://www.youtube.com/watch?v=2jCWRwDGQYc

    octavia rocket 2
    airbyte rocket 4
  • s

    Sonal Goyal

    05/06/2022, 4:22 AM
    https://crossroads-cx.medium.com/building-open-access-to-nc-campaign-finance-data-the-plan-ff80c275d4d7
    👍 4
  • a

    Ari Bajo (Airbyte)

    05/09/2022, 12:43 PM
    @Michael Louis and his team shared the process they used to create an open-source dbt package to clean and compute metrics coming from the Airbyte GitHub connector! 👏 Do you think that dbt packages for Airbyte sources should be shared between companies, or are those too specific to each use case? https://airbyte.com/tutorials/dbt-package-to-analyze-github-data
    👏🏻 3
    👏 3