# good-reads
  • s

    Shawn Wang (Airbyte)

    07/22/2022, 2:00 AM
    goodread from Claire Carroll (from dbt, now hex) on the problems with CTEs https://hex.tech/blog/stop-using-so-many-ctes
  • s

    Shawn Wang (Airbyte)

    07/25/2022, 5:29 AM
    new oreilly book out on making data stack choices https://www.amazon.com/Fundamentals-Data-Engineering-Robust-Systems/dp/1098108302?&linkCode=sl1&tag=dataeng-20&linkId=3588b2ff2124df343a079088fb7e6c4e&language=en_US&ref_=as_li_ss_tl
    👍 1
  • a

    Alex Marquardt (Airbyte)

    07/25/2022, 1:04 PM
    We have just published a new tutorial! Learn how to extract data from the Intercom API and replicate it into S3 where it can be analyzed by ML/NLP jobs for generating customer insights and sentiment analysis. Additionally, replicating Intercom data into S3 can provide you with backups, help you to meet regulatory compliance requirements, and/or provide you with access to historical data. See: https://airbyte.com/tutorials/intercom-api-s3
    airbyte rocket 5
    octavia muscle 1
  • s

    Shawn Wang (Airbyte)

    07/30/2022, 1:40 AM
    today i am marveling at this ~1yr old post that has coined a name for analytics/data engs that has really stuck with me https://www.getdbt.com/blog/we-the-purple-people/
  • s

    Shawn Wang (Airbyte)

    07/31/2022, 1:21 PM
    suuper well written piece on the BI paradigms from Maxime Beauchemin https://preset.io/blog/dataset-centric-visualization/
  • s

    Shawn Wang (Airbyte)

    08/01/2022, 1:48 PM
    good thread on reddit with people sharing their data stacks! https://reddit.com/r/dataengineering/comments/wcw0nt/_/iifb0i3/?context=1
    👀 2
  • a

    Alex Marquardt (Airbyte)

    08/02/2022, 10:25 AM
    Hi all, we have published a new tutorial that shows you how to extract data from Stripe’s REST API and send it into Snowflake. Once your Stripe data is in Snowflake tables it can be easily combined with data from other internal systems, which will provide you with enhanced insights into your business. https://airbyte.com/tutorials/stripe-rest-api-to-snowflake
    airbyte heart 4
  • a

    Alex Izydorczyk

    08/03/2022, 2:28 AM
    https://magis.substack.com/p/an-observation-on-dashboard-speed
  • s

    Shawn Wang (Airbyte)

    08/03/2022, 2:46 AM
    oh wow, Airflow being attacked again on HN https://news.ycombinator.com/item?id=32317558
  • s

    Shawn Wang (Airbyte)

    08/03/2022, 4:00 AM
    “the modern data stack is dead” - https://www.linkedin.com/posts/ethanaaron_snowflakesummit-data-analytics-activity-694[…]vGT/?utm_source=linkedin_share&utm_medium=member_desktop_web
  • a

    Alex Marquardt (Airbyte)

    08/03/2022, 11:25 AM
    The first tutorial in Airbyte’s series on synchronization modes is now live. This article gives an overview of Airbyte’s ELT implementation, explores the SQL code used under-the-hood for full refresh synchronizations, and shows how your replicated data will look. https://airbyte.com/tutorials/full-data-synchronization
    🔥 4
    fiesta parrot 2
    🙏 1
    😂 1
  • s

    Shawn Wang (Airbyte)

    08/04/2022, 11:46 PM
    new BI tool just got funding! https://twitter.com/medriscoll/status/1555303693938962433?s=21&t=FevPcBw2cGsGMDoCCgqxaQ
    👌🏻 1
  • s

    Shawn Wang (Airbyte)

    08/08/2022, 6:23 AM
    TIL of “Potemkin data science” - performative data work that is a lot of work but not actually useful https://mcorrell.medium.com/potemkin-data-science-fba2b5ba5cc6 kind of an old article now but great term for a definitely real phenomenon
  • a

    Alex Marquardt (Airbyte)

    08/08/2022, 1:05 PM
    A new tutorial has been published! Learn how to build a Data Ingestion Pipeline from HubSpot to Snowflake as part of your data integration strategy. https://airbyte.com/tutorials/copy-data-from-hubspot-to-snowflake
    🔥 1
    👍 1
    👍🏻 1
  • s

    Shawn Wang (Airbyte)

    08/09/2022, 5:54 PM
    anyone else excited about Dagster 1.0 and Dagster Day?

    https://youtu.be/70c84LDZuzQ

    pretty high production quality
    😄 1
    fiesta parrot 4
  • s

    Shawn Wang (Airbyte)

    08/10/2022, 7:08 PM
    a 2012 post I reshared on HN did pretty well today https://news.ycombinator.com/item?id=32407873 - brings up some interesting discussion on schema design (star schema, entity-attribute-value, etc)
    👍 3
  • k

    Karen (Airbyte)

    08/11/2022, 7:03 PM
    A good listen from Ceora Ford: "How to Make Your First Open Source Contribution" (or read the included transcript).
    👍 1
    👍🏻 1
  • t

    Thalia Barrera (Airbyte)

    08/12/2022, 12:11 PM
    We recently published a tutorial where you can learn how we created an ELT pipeline to sync data from Postgres to BigQuery at Airbyte, using Airbyte Cloud! You can follow these steps to create your own. https://airbyte.com/tutorials/postgres-to-bigquery
    😍 1
  • s

    Shawn Wang (Airbyte)

    08/15/2022, 2:06 PM
    on the topic of Dagster and orchestration, Benn’s post goes into what’s wrong with DAGs https://benn.substack.com/p/down-with-the-dag (i also just posted on HN)
    👍 1
  • s

    Simon Späti

    08/16/2022, 7:15 AM
    Interesting read, including the data stack it uses (he calls it ngods, "new generation open-source data stack", instead of Modern Data Stack). https://blog.devgenius.io/modern-data-stack-demo-5d75dcdfba50
    👍 3
  • s

    Shawn Wang (Airbyte)

    08/16/2022, 1:50 PM
    starschema founder writing about us! https://twitter.com/tfoldi/status/1559223629522542592
    👍 2
  • s

    Shawn Wang (Airbyte)

    08/16/2022, 5:07 PM
    interesting new BI tool of the day - Omni announcing seed + Series A to compete with Looker https://techcrunch.com/2022/08/16/omni-looks-to-take-on-looker-with-its-cloud-powered-bi-platform/
    Omni is comparable to existing BI tools like the aforementioned Looker and Tableau, Zima says. But the platform can also take raw SQL — the language used to communicate with databases — and break it into modeled components. Omni’s built-in tools generate data models and components from SQL, creating a “sandbox” data model and allowing users to promote metrics to the official, shared model that the whole organization can use. Beyond this, Omni runs “automated aggregates” in-database to accelerate queries and manage costs for users (and their employers).
    👀 3
  • s

    Shawn Wang (Airbyte)

    08/17/2022, 12:17 PM
    this is the first meme ive ever seen that in itself is a goodread https://twitter.com/largedatabank/status/1559651463919452161?s=21&t=zsoIYmluQHBReWSYaIKD1Q
    👍🏻 1
    👍 1
    😂 2
  • a

    Ari Bajo (Airbyte)

    08/17/2022, 3:04 PM
    Dremio wrote a blog post about consolidating all your data into a data lake with Airbyte, and how to turn the data lake into a lakehouse with Dremio! Interesting how Dremio adds a SQL engine and data management to your data lake. Does anyone here have experience with Dremio? https://airbyte.com/tutorials/build-an-open-data-lakehouse-with-dremio
    fiesta parrot 3
    👀 1
    💡 1
  • s

    Shawn Wang (Airbyte)

    08/17/2022, 6:39 PM
    postgres in the browser, with tutorial! https://www.crunchydata.com/blog/learn-postgres-at-the-playground
  • s

    Simon Späti

    08/19/2022, 8:57 AM
    Awesome write-up about trendy Table Formats (Delta Lake, #ApacheIceberg #ApacheHudi) and their features explained well with super illustrations: https://twitter.com/sspaeti/status/1560549706459414529
    👍 4
    👍🏼 1
    airbyte heart 2
  • s

    swyx (Airbyte)

    08/22/2022, 7:39 PM
    recently discovered https://pgstats.dev/ which visually explains postgres functionality + architecture BY VERSION which is just mindblowing
    👍 2
  • s

    Shawn Wang (Airbyte)

    08/22/2022, 8:30 PM
    interesting Snowflake discussion on HN today: https://news.ycombinator.com/item?id=32551212
    👍 1
  • c

    Charles Giardina (Airbyte)

    08/23/2022, 5:21 PM
    I really enjoyed this blog post by @Abi Noda (co-founder of DX--a tool we use at Airbyte). Great synthesis of how to think about waste in the development process.
    fiesta parrot 1
    👍 3
  • s

    Shawn Wang (Airbyte)

    08/25/2022, 3:40 PM
    Bill Inmon on Data Mesh vs Big Data (Cloudera) vs Data Warehousing is just a fantastic listen https://overcast.fm/+wcMo2qvqg/36:00