https://datahubproject.io logo
Join Slack
Powered by
# integrate-abfs-support
  • b

    bland-application-65186

    09/26/2023, 10:34 AM
    šŸ‘‹
    šŸ‘ 3
  • p

    powerful-engine-44741

    09/27/2023, 7:06 AM
    @billions-rose-75566 here is the abfs channel
  • b

    billions-rose-75566

    09/27/2023, 7:22 AM
    Hi all! blob wave
  • d

    dazzling-judge-80093

    09/28/2023, 7:18 AM
    Hi!
    blob wave 1
  • b

    bulky-shoe-65107

    10/16/2023, 12:35 AM
    has renamed the channel from "integration-abfs-support" to "integrate-abfs-support"
  • b

    bland-application-65186

    10/16/2023, 1:19 PM
    Hi @dazzling-judge-80093, we would like to schedule another meeting, 30 to 45 minutes, to go over a few of our findings, better understand the scope as a driver for the RFC, and a few other topics. When are you available? We would propose any time from the 24/10.
  • d

    dazzling-judge-80093

    10/18/2023, 3:13 PM
    I’m available during daytime CET
  • b

    bland-application-65186

    10/24/2023, 1:11 PM
    Untitled.md
    Untitled.md
  • b

    bland-application-65186

    10/24/2023, 1:11 PM
    Hi guys, following todays meeting, our initial proposal for the RFC. I reckon its quite empty still. Please share your comments.
  • h

    handsome-mechanic-255

    10/27/2023, 5:12 AM
    Hi @dazzling-judge-80093, I succeeded in setting up the local dev environment, creating a new source called
    abs
    and ingesting a s3 source as abs; so no need for support but tnx for the offer ...
    d
    • 2
    • 1
  • b

    bland-application-65186

    11/28/2023, 8:24 AM
    Morning @dazzling-judge-80093, could we have a brief meeting this week to cover the path_spec implementation on top of Azure? We have a look at it but could not find a easy way to "break" it into smaller developments, maybe you can help us.
    d
    • 2
    • 1
  • h

    handsome-mechanic-255

    12/08/2023, 12:48 PM
    Hi Tamas, When generating the local dev datahub, when executing a command, we get:
    from datahub.metadata.urns import Urn  # noqa: F401
    a
    ModuleNotFoundError: No module named 'datahub.metadata.urns'
    If we revert to commit
    feat(sdk): autogenerate urn types (#9257)
    the problem no longer occurs. Are there additional steps we need to take to make the module available? (or any other way to resolve the issue?)
    d
    • 2
    • 3
  • b

    bland-application-65186

    02/12/2024, 2:25 PM
    Hey @dazzling-judge-80093, long time since we spoke. We have been dedicating some extra time to blob ingestion and, following your advice, implemented a first solution to ingest paths without a "path_spec". Furthermore, we are now able to ingest not only the current folder but also subfolders, that said, we have been commented a check around the
    allowed
    and
    globmatch
    methods: metadata-ingestion/venv/lib/python3.9/site-packages/wcmatch/pathlib.py:136. In this context, could you tell us whats purpose of the
    _wcparse.py
    file?
    d
    • 2
    • 10
  • b

    bland-application-65186

    02/13/2024, 9:48 AM
    After discussing the above behaviour with some colleagues i understand that Datahub does not "support" recursive ingestion, is that correct? As in, urls ending with
    \*.*
    only ingest the current folder. If thats the case the above point is a non-issue.
    d
    • 2
    • 1