https://linen.dev logo
Join Slack
Powered by
# good-reads
  • s

    Simon Späti

    08/25/2022, 4:51 PM
    A new Data Insight on DataLake and Lakehouse includes: • Differences between Lakehouse & Warehouse • Components of Data Lake ◦ Storage Layer (AWS S3, Azure Blob Storage, Google Cloud Storage) ◦ File Format (Apache Parquet, Avro, ORC) ◦ Table Format (Delta Lake, Apache Hudi, and Iceberg) • Trends in the Market • How to turn it into a Lakehouse https://airbyte.com/blog/data-lake-lakehouse-guide-powered-by-table-formats-delta-lake-iceberg-hudi
    airbyte heart 5
  • t

    Thalia Barrera (Airbyte)

    08/26/2022, 1:52 PM
    We just published a new blog post! Reverse ETL is the process of syncing data from a source like a data warehouse to a business application so it can be used by marketing, sales, support, and other teams in the tools they use. It is rapidly becoming a standard component of data stacks.
    💡 2
    🔥 6
  • s

    Shawn Wang (Airbyte)

    08/29/2022, 2:54 AM
    reading Barr’s OG post on data observability today - old but really gold https://www.montecarlodata.com/blog-what-is-data-observability/
    👍 2
    • 1
    • 1
  • s

    Shawn Wang (Airbyte)

    08/30/2022, 1:04 AM
    Snowflake had a very good quarter https://softwarestackinvesting.com/snowflake-snow-q2-fy2023-earnings-report/
    👍 2
  • a

    Ari Bajo (Airbyte)

    09/02/2022, 5:29 PM
    Hey! I shared some thoughts about why I feel data quality is more complex than code quality, and how to make data quality an easier problem to solve. Curious to hear how others think of a data architecture and organization that reduces the overall data quality issues you face? https://airbyte.com/blog/data-quality-issues
    ✅ 2
    👀 3
    👏 3
    👍 3
  • a

    Alex Marquardt (Airbyte)

    09/07/2022, 3:13 PM
    A new tutorial has been published! Learn how to build a Data Ingestion Pipeline from MSSQL Server to Snowflake as part of your data integration strategy. https://airbyte.com/tutorials/replicate-microsoft-sql-server-to-snowflake
    😍 2
    airbyte rocket 4
  • a

    Alex Marquardt (Airbyte)

    09/08/2022, 6:30 PM
    The second tutorial in Airbyte’s series on synchronization modes is now live. This article gives an overview of Airbyte’s ELT implementation, explores the SQL code used under-the-hood for incremental data synchronizations, and shows how your replicated data will look. https://airbyte.com/tutorials/incremental-data-synchronization
    clapping 6
    airbyte heart 4
    a
    l
    d
    • 4
    • 3
  • m

    Mariya Bouraima

    09/09/2022, 3:58 PM
    Check out these data security insights from our Head of Data Policy here at Airbyte, Patsy Bailin: https://airbyte.com/blog/4-questions-data-security-experts-ask-before-moving-data
    👍 1
    👍🏻 1
  • a

    Alex Marquardt (Airbyte)

    09/13/2022, 1:36 PM
    A new tutorial has been published! Learn how to build a Data Ingestion Pipeline to send CSV data from S3 to Snowflake as part of your data integration strategy. https://airbyte.com/tutorials/copy-s3-csv-to-snowflake
    gratitude thank you 1
    clapping 1
    💯 3
    ✅ 1
    t
    • 2
    • 1
  • s

    Simon Späti

    09/13/2022, 3:37 PM
    A new blog post is live. It's the start of a series where we bring you on a journey on how we build our internal data stack at Airbyte!
    😍 1
  • a

    Alex Marquardt (Airbyte)

    09/14/2022, 3:13 PM
    A new blog post that shows how Airbyte can be used to move data from Snowflake to Postgres is now live! https://airbyte.com/tutorials/replicate-data-from-snowflake-to-postgres
    👍 4
  • s

    Shawn Wang (Airbyte)

    09/14/2022, 4:30 PM
    ooh, a Dagster vs Airflow article on HN! https://news.ycombinator.com/item?id=32839147
  • j

    Jaime Oliveira

    09/15/2022, 4:50 PM
    Hello everyone🙂. I'm preparing a presentation about Airbyte for my internal data team; anyone knows any good resource to help me with that? The idea is to be something simple. To complement, I prepared an Airbyte Demo replicating Postgres Data (CDC) to Snowflake. Thanks 🙂
    s
    s
    • 3
    • 9
  • j

    Joey Taleño

    09/16/2022, 4:20 AM
    https://taleno.digital/setup-a-modern-data-stack-in-3-easy-steps/
    octavia loves 4
  • a

    Anatole Callies

    09/16/2022, 10:06 PM
    https://cloud.google.com/datastream-for-bigquery That's a huge game changer
    s
    • 2
    • 1
  • j

    Joey Taleño

    09/26/2022, 2:48 PM
    https://taleno.digital/easiest-way-to-extract-and-load-data-using-airbyte-plural/
    fiesta parrot 2
  • s

    Simon Späti

    09/29/2022, 6:05 PM
    🎙️ A Semantic Layer is a topic that fascinated me from the beginning. As a BI engineer, I have used the initial concept since the early SAP BusinessObjects. Now with the open-source trend, there are new exciting tools out there. I explored the history, trends, tooling, and difference between existing concepts we all know. I am answering questions about whether it is just an OLAP cube, a new way of virtualization, another term for lakehouse, the difference between a Metrics Layer or Headless BI tool, and much more. Hopefully, this is helpful to you or sparked some new ideas. Please let me know what you think or if you have any questions—looking forward to it 🤗.
    airbyte heart 8
    octavia loves 2
    fiesta parrot 2
    💯 2
    🔥 1
    clapping 2
    airbyte code 1
    🙂 2
    airbyte growth 1
    airbyte rocket 1
    octavia muscle 1
    aussiecongaparrot 7
    h
    • 2
    • 4
  • a

    Alex Marquardt (Airbyte)

    09/29/2022, 8:32 PM
    The third tutorial in Airbyte’s series on synchronization modes is now live. This article gives an overview of Airbyte’s Change Data Capture (CDC) data synchronization. https://airbyte.com/tutorials/incremental-change-data-capture-cdc-replication
    airbyte code 2
    airbyte growth 1
    🚀 3
    👍 12
    🔥 8
    🙂 3
    eyes shaking 2
    aussiecongaparrot 4
    octavia loves 1
    fiesta parrot 1
    airbyte heart 4
    airbyte rocket 2
    👍🏻 1
  • s

    Shawn Wang (Airbyte)

    10/04/2022, 8:42 PM
    https://seekingalpha.com/article/4538291-snowflake-inc-snow-deutsche-bank-technology-conference-transcript snowflake with some eye popping numbers - 25% of the largest 5000 companies in the world, avg spend 1.2m each. Capital One went from spending $29m/yr to $50m/yr in 2 years. “Apache Iceberg is emerging as the de facto standard for table formats for our largest customers” they bought Applica for 200m, despite their previous valuation being 600m. a lot of companies for sale at deep discounts
    🤩 1
  • d

    Dario Forti

    10/04/2022, 9:51 PM
    Hi everybody, I'm Dario from Fortisoft. I would like to share with the community an article we wrote on how to develop Airbyte connectors. It is intended for people who have completed the PokeAPI tutorial, and are looking to learn more about connector development through a harder challenge. How to create your own Airbyte connector and pull data from Discord API
    😍 1
    m
    • 2
    • 1
  • s

    Shawn Wang (Airbyte)

    10/04/2022, 10:53 PM
    big /r/dataengineering data warehouse poll going on right now with 2.2k votes https://twitter.com/felipehoffa/status/1577413199875387393?s=46&t=enuXKuJ_dYz27lwVk3Tldg
    👀 1
  • s

    Shawn Wang (Airbyte)

    10/06/2022, 11:31 AM
    just heard about Canvas for BI: https://techcrunch.com/2022/01/28/canvas-gives-non-technical-teams-data-exploration-knowledge-without-needing-a-degree-in-sql/ Spreadsheets > SQL, and can easily reuse dbt logic
    👀 2
    • 1
    • 1
  • j

    Jordan Fox

    10/07/2022, 3:03 AM
    There's a series of posts here that are anti FiveTran, dbt, Airbyte. Good read for the alternative perspective, if not just for the laughs: https://medium.com/@laurengreerbalik/how-fivetran-dbt-actually-fail-3a20083b2506
    👍 1
    j
    h
    • 3
    • 3
  • a

    Alex Marquardt (Airbyte)

    10/07/2022, 1:08 PM
    The final article in Airbyte’s series on synchronization modes is now live. This article gives an overview and comparison of the different replication options that are available. https://airbyte.com/blog/understanding-data-replication-modes
    fiesta parrot 1
    airbyte rocket 4
  • a

    Ari Bajo (Airbyte)

    10/07/2022, 1:49 PM
    Great read, shows how you can visualize data dependencies across tools! https://medium.com/starschema-blog/dagster-airbyte-dbt-how-software-defigned-assets-change-the-way-we-orchestrate-ac70bb29d640
    airbyte rocket 2
    m
    s
    k
    • 4
    • 4
  • s

    Shawn Wang (Airbyte)

    10/08/2022, 12:52 AM
    this interview of George Fraser gets really interesting when he talks about the possible danger to the mpp analytical datawarehouse because of advancements in hardware

    https://youtu.be/P_jwP_2_SFU?t=1802▾

    👍 1
  • s

    Shawn Wang (Airbyte)

    10/08/2022, 6:43 PM
    good new memo on dbt https://research.contrary.com/reports/dbt-labs
    • 1
    • 1
  • d

    Developer Crunchy

    10/11/2022, 5:11 AM
    need help to sync data from mysql to postgres. anyone ?
    a
    v
    • 3
    • 2
  • s

    Shawn Wang (Airbyte)

    10/18/2022, 9:12 PM
    good new metrics definition template from Mode, from the dbt conference today https://docs.google.com/document/d/1GOKQQRzC4bfj0-xviDD4L5YLmKHrVOVK8s9hHWRTp3A/edit
    🔥 2
  • s

    Simon Späti

    10/20/2022, 3:57 PM
    Learning Rust? I gave it the first try, compared it to Python, and checked it for data engineering tasks. Let me know if you disagree and what your experiences with Rust are so far.
    airbyte heart 3
    party parrot 1
    🎉 1
    airbyte rocket 3
    yay 1
    p
    • 2
    • 2
1...567...12Latest