https://datahubproject.io logo
Join Slack
Powered by
# design-business-glossary
  • l

    little-megabyte-1074

    02/16/2022, 9:08 PM
    set the channel description: Central channel to improve managing Business Glossaries via the UI
  • l

    little-megabyte-1074

    02/16/2022, 9:10 PM
    Hi Folks! @big-carpet-38439 has been picking up some work on Business Glossaries; since we have @many-glass-61317 picking up work as well, wanted to make sure we all had visibility into what is in flight!
  • g

    green-football-43791

    02/16/2022, 9:11 PM
    hihi
  • m

    many-glass-61317

    02/16/2022, 9:12 PM
    hi friends, 👋
  • m

    many-glass-61317

    02/16/2022, 9:13 PM
    of course, good idea
  • m

    many-glass-61317

    02/16/2022, 11:19 PM
    @green-football-43791 what’s ideal or practical way to run things in locally. I may have doing it very long and hard way. 😄 1. Running react app on
    3000
    port using
    yarn start
    2. Running docker compose from
    docker
    dir using
    ./dev.sh
    3. If I change .pdl or graphql files then I run
    ./gradlew build
    4. then run
    ./nuke.sh
    again running
    ./dev.sh
    5. if I change graphql queries in frontent then just restarting react app Does it makes sense to you? or is there any other quicker way to do things? I think I have tried running Play java servers from GMS project on Intellij before
    g
    • 2
    • 10
  • s

    silly-beach-19296

    02/23/2022, 3:20 PM
    Hi!! I have a question, how can I ingest from the API or from the emitter?
    b
    • 2
    • 1
  • l

    little-megabyte-1074

    05/12/2022, 1:34 AM
    Hey @bulky-soccer-26729 & @many-glass-61317! wanted to put the two of you in touch — Tengis has done some work recently (& had a handful of conversations with @green-football-43791) to rethink our Business Glossary functionality & mentioned to me this evening that he was considering focusing on that for the Hackathon. Chris is picking up work this week to address the Q2 Roadmap item to support UI-based creation/management of glossary terms + looking into markdown support for YAML-defined terms.
    👍 2
  • l

    little-megabyte-1074

    05/12/2022, 1:35 AM
    I’ll leave it up to the two of you to determine if there are opportunities for collaboration (we all know how much I love some good teamwork teamwork ); I’m more than happy to help facilitate a scoping/requirements session — just let me know how I can help!
    teamwork 3
  • c

    chilly-potato-57465

    06/07/2022, 11:13 AM
    Hello, Trying to understand how to add a glossary. I have installed the datahub-business-glossary plugin installed and I can see the Add Glossary Term button in the UI on the right side of a dataset page. However, I can't see where to ingest/preload and existing glossary or create a custom one. Could you please point me out to the relevant docs - it does not seem to be covered here https://datahubproject.io/docs/generated/ingestion/sources/business-glossary/ . Many thanks!
    l
    b
    c
    • 4
    • 7
  • c

    curved-rose-25481

    06/24/2022, 7:52 PM
    Was wondering what folks thought about adding synonyms (as well as other types of "related terms" that are not going up or down a level) to the business glossary. Last mention of such things seems to have happened a while ago: https://datahubspace.slack.com/archives/CUMUWQU66/p1635760216153400. The two most important things I'm interested in representing are: 1. "This concept has multiple string representations but they all mean the same thing", and 2. "This is a closely related term that this term neither contains nor inherits, and just putting them in a group together adds an unnecessary level"
    g
    • 2
    • 1
  • f

    famous-fireman-41042

    08/14/2022, 7:22 AM
    I want to ingest a business glossary, so I looked at the docs on how to design the yaml. The link to the example from the docs is broken: https://datahubproject.io/docs/generated/ingestion/sources/business-glossary Is there a new link so I can understand how to design the glossary yaml file?
    b
    • 2
    • 1
  • c

    cuddly-butcher-39945

    09/21/2022, 9:16 PM
    Hey gang, I may have stumbled upon a potential bug opportunity 🙂 My sample business glossary is properly formatted and looks like this:
    Copy code
    version: 1
    source: DataHub
    owners:
      users:
        - joshua.garza
    nodes:
      - name: Classification
        description: A set of terms related to Data Classification
        terms:
          - name: Sensitive
            description: Sensitive Data
            custom_properties:
              is_confidential: false
          - name: Confidential
            description: Confidential Data
            custom_properties:
              is_confidential: true
          - name: HighlyConfidential
            description: Highly Confidential Data
            custom_properties:
              is_confidential: true
      - name: ClientsAndAccounts
        description: Provides basic concepts such as account, account holder, account provider, relationship manager that are commonly used by financial services providers to describe customers and to determine counterparty identities
        owners:
          groups:
            - POWER_USERS
        terms:
          - name: Accounts Payable
            description: Amounts owed to suppliers
            term_source: "INTERNAL"
            source_ref: SHARP
            source_url: "<https://www.sharp.com/>"
            inherits:
              - Classification.HighlyConfidential
            contains:
              - ClientsAndAccounts.ActiveIndicatorFlag
          - name: ActiveIndicatorFlag
            description: This is an internal identifier used in the data warehouse to flag records no longer active in the source system(s). Inactive records are suppressed from Merlin queries.
            term_source: "INTERNAL"
            source_ref: SHARP
            source_url: "<https://www.sharp.com/>"
            inherits:
              - Classification.HighlyConfidential
          - name: AccountsReceivable
            description: Amounts owed
            term_source: "WorkDay"
            source_ref: SHARP
            source_url: "<https://www.sharp.com/>"
            inherits: 
              - Classification.Sensitive
          - name: WriteOff
            description: Amount which will not be collected or associated with revenue stream
            term_source: INTERNAL
            source_ref: SHARP
            source_url: "<https://www.sharp.com/>"
            inherits: 
              - Classification.Confidential
            contains:
              - ClientsAndAccounts.AccountsReceivable
              - ClientsAndAccounts.Accounts Payable
    I am able to successfully load it
    Copy code
    datahub ingest -c ingest_business_glossary_recipe_complete.yaml
    This is my ingestion yaml file
    Copy code
    source:
      type: datahub-business-glossary
      config:
        # Coordinates
        file: /home/joshua.garza/.datahub/quickstart/finance_terms_sample.yml
    However, when I review the changes in the UI, I don't see the
    term_source
    values make it in...
  • c

    cuddly-butcher-39945

    09/21/2022, 9:27 PM
    image.png
  • l

    little-megabyte-1074

    09/22/2022, 10:26 PM
    Hi @cuddly-butcher-39945! Thanks so much for surfacing this — would you mind opening a GH Issue with this detail so we can get it prioritized?
  • l

    little-megabyte-1074

    09/22/2022, 10:26 PM
    Also curious - what is the level of severity/urgency to get this resolved?
  • c

    cuddly-butcher-39945

    09/23/2022, 1:33 AM
    Hey @little-megabyte-1074, absolutely! It's not a super urgent issue right now, as I am still scripting out all existing terms. Thanks for checking on this. I'll get the issue created real soon.
    teamwork 1
  • f

    faint-tiger-13525

    11/09/2022, 4:48 PM
    Hello! Could you please help with a clear understanding of the Business Glossary terms connection? We can connect our terms as "Contains" and "Inherits". For me, this means that if, for example, Term 1 contains Term 2, then Term 2 inherits Term 1. However, when I add a relation in term 1, I don't automatically get a reverse link in term 2. Is it a bug or feature? In the thread, I'll try to give an example. Thank you in advance!
    a
    • 2
    • 5
  • r

    rhythmic-translator-30453

    11/22/2022, 7:12 PM
    👋 I have a general question about the business glossary. It looks entity association works only up to one level of inheritance. So for glossary terms with an inheritance relation of
    g1 -> g2 -> g3
    entities associated with g3 will show up for searches of its parent term g2 but won’t show up for searches of its grandparent term g1. Is this the correct view of glossary term inheritance and searchability? I might be misunderstanding how to use inheritance with glossary terms. I also made a quick example inheritance structure in the DataHub demo instance. In the example
    lvl3-term
    is associated with an entity. That entity shows up in the “Related Entities” tab for
    lvl3-term
    and its parent
    lvl2-term
    but not for its grandparent
    lvl1-term
    .
    a
    • 2
    • 1
  • b

    best-umbrella-88325

    01/18/2023, 5:33 PM
    Hi all. I've been able to create glossary terms using the Java emitter successfully. Now as a second step, I need to identify if I can clean up the already created terms using the Java emitter too. Is there any way we can delete the terms, just like we create them? Any help appreciated. Thanks in advance
    a
    • 2
    • 5
  • m

    miniature-policeman-55414

    05/04/2023, 8:02 AM
    Hey All, I am trying to ingest glossary into datahub like having multiple YAML files.
    general.yaml
    ,
    commercial.yaml
    and
    operations.yaml
    . I am trying used related terms functionality and in our case we have few terms from
    general
    dependent on
    commercial
    (Like
    inherits
    and
    contains
    mapping). It seems that ingestion works fine, however the mapped term details are not properly shown under related terms. Will related terms works by ingesting multiple YAML files or should I have only one YAML file to ingest?
  • f

    future-analyst-98466

    07/25/2023, 9:39 AM
    Hey all, how to export/checking time to create/edit/delete one or more business term? like log activity via web UI or CLI
  • f

    fresh-rain-44904

    10/11/2023, 9:18 AM
    Hi All, question on the business glossary; Is there anyway to export an existing business glossary from DataHub to a .yaml file?
  • h

    hundreds-wire-55258

    01/05/2024, 1:10 PM
    #design-business-glossary how can i extract urn for a particular glossary term in Datahub using datahub cli (edited)