https://flyte.org logo
Join Slack
Powered by
# feature-discussions
  • f

    freezing-airport-6809

    01/04/2022, 4:44 PM
    Folks, I am working on a new design that would allow users to run low latency workflows on Flyte and run existing simple python tasks and reduce / ammortize the overhead of running a pod. Would folks here be interested in helping me refine the design and the UX model? Goal: Users should be able to run sub second tasks and yet get the same level of visibility, fault tolerance and simplicity for their code. We might lose some isolation properties that running an individual pod offers, but we will continue to offer isolated environments, versioning and dynamic infrastructure.
    🚀 1
    ❤️ 4
    b
    • 2
    • 6
  • e

    eager-jordan-33561

    02/08/2022, 11:03 AM
    Metaflow, which is a comparable framework (though more limited), recently added functionality for “Google cards for machine learning models”: https://towardsdatascience.com/dag-card-is-the-new-model-card-70754847a111. Does there exist anything similar for Flyte today, or is there a plan to implement something like this? If not, would it be of interest?
    f
    • 2
    • 2
  • d

    damp-lifeguard-95868

    02/16/2022, 1:35 AM
    Feature request : Separate tutorial repo with jupyter notebooks Not sure if its already being considered or worked on, having the below tutorials in jupyter notebook format could be very useful for new users https://docs.flyte.org/projects/cookbook/en/latest/tutorials.html
    f
    • 2
    • 1
  • s

    straight-ghost-77338

    02/22/2022, 3:36 PM
    Hey everyone, the graph UI in Flyte console is looking great ❤️ , do you think Dynamic Tasks will be supported as well in the future, I mean like expanding the nodes inside as it's possible for subworkflows? 🤔 🙏
    👍 1
    h
    f
    c
    • 4
    • 7
  • s

    swift-animal-75798

    02/25/2022, 6:29 PM
    We had a streaming pipeline that read the flyte events and from the execution we used to fetch the inputUri and from the closure the Outputuri, then fetch the files from gcs. Something like below:
    Copy code
    final String inputUri = getNodeExecution().getInputUri();
        final String outputUri = getNodeExecution().getClosure().getOutputUri();
    
        gcsFileReader.readToLiteralMap(inputUri)
        gcsFileReader.readToLiteralMap(outputUri)
    Also we had some special handling for the scenarios where the files are not there meaning that either the task does not have inputs/outputs. Now since the outputUri is deprecated, we are trying to use the NodeExecutionGetData rpc, but we notice some nil pointers here and there. How should we determine when the execution data exists, so we can make the rpc and succeed.
    f
    h
    • 3
    • 17
  • f

    flaky-action-19778

    04/01/2022, 6:03 PM
    hey i have a request that would serve the Forecasting platform we’re building at Spotify on top of Flyte. We currently subscribe to a topic containing execution events to populate our UI with visibility on what forecasts have been run/where their outputs are located etc. That’s worked quite well! What Im wondering if a similar topic exists to capture the *Workflow* information created at registration time For context, our platform works as such: 1. A model developer registers to Flyte a workflow that represents a forecast model that takes in several assumptions as CSVs at runtime 2. A separate financial analyst writes those assumptions and uploads them to our platform 3. the model can be launched from the platform with the appropriate assumptions by anyone My thought is that if we have an understanding of workflow dependencies at registration time, we can have the platform enforce at assumption upload time that all the correct assumptions are uploaded for each model (with the correct schemas) according to the Flyte version selected. Im sure there are other ways to do this (create a hook at registration time that queries the API for all workflows belonging to the registered version), but through a topic is ideal for us so just want to know feasibility here. Thanks!
    l
    f
    • 3
    • 19
  • r

    rough-monitor-45729

    04/22/2022, 6:29 PM
    Based on the openapi documentation, there doesn't seem to be an offset parameter I can pass to utilize pagination on my site. Is that something you plan to implement? Secondly, when using pagination it would be nice to get a total count of all the entries so I can display the total number in a dashboard card.
    t
    • 2
    • 8
  • f

    freezing-boots-56761

    04/27/2022, 3:21 PM
    is there any interest in platform-level override for default service account to use for k8s pods? right now if there is no k8s service account explicitly set in the launch plan spec, it uses ‘default’. likely constitutes a simple change here: https://github.com/flyteorg/flyteplugins/blob/d2f64cb8c49adccadb01f46960031df3219dce32/go/tasks/pluginmachinery/flytek8s/pod_helper.go#L72 @hallowed-mouse-14616: it seems likely that the pod template is respected here. does that sound right?
    👍 3
    h
    f
    a
    • 4
    • 9
  • i

    incalculable-zebra-90365

    06/27/2022, 5:18 PM
    We have just submitted a RFC for the eviction of cached task outputs (and proposing the addition of a basic
    flyteadmin
    API for clearing the cache of completed executions): https://github.com/flyteorg/flyte/pull/2633 Thank you for the initial feedback to @hallowed-mouse-14616. As this change could have a slightly larger impact on Flyte, we'd be happy about some extra thoughts or ideas some of you might have - any comments would be appreciated!
    ❤️ 9
    h
    • 2
    • 3
  • a

    adamant-zoo-87864

    08/10/2022, 4:13 AM
    Hi everyone. I don't know if I can use flyte-pod-webhook to send workflow status notifications to discord app?. Thanks all
    f
    g
    • 3
    • 17
  • a

    adamant-zoo-87864

    08/16/2022, 3:37 AM
    Hi everyone. I was wondering if there is a way to submit the spark job to the amazon EMR cluster from a Flyte workflow. (My Flyte deployed on AWS cloud environment). Thanks all
    f
    • 2
    • 2
  • t

    thankful-dress-89577

    08/30/2022, 8:24 PM
    Hello, curious if anyone else has had a need for a FlyteFile that has an associated lifetime - whereby it would be automatically cleaned up after a user-specified timeframe. This could be along the lines of compliance with data retention policies.
    f
    • 2
    • 18
  • s

    sticky-art-97180

    09/06/2022, 5:53 PM
    Question about flyte workflows roadmap/features: If i have a pipeline of 10 steps, could I simply click on the 5th step and just re-run that from the UI/cmd line ?
    h
    f
    • 3
    • 16
  • b

    boundless-twilight-37555

    09/15/2022, 8:18 PM
    Within Spotify, we were discussing human in the loop feature and could see that some work has been done on this: https://github.com/lyft/flyte/issues/208 What is the best way to check it and share feedback?
    h
    t
    • 3
    • 6
  • s

    steep-jackal-21573

    09/19/2022, 3:22 PM
    Hi. What do you think of this feature request? https://github.com/flyteorg/flyte/issues/2883
    h
    • 2
    • 5
  • h

    helpful-crowd-74546

    09/20/2022, 11:29 AM
    Hey, I was thinking about adding
    nodeSelector
    to the Pod template propeller uses to create pods. Currently I am only using tolerations to allow my pods to schedule on certain nodes. However, let’s say I have multiple node pools for e.g., CPU/GPU tasks and I want to be verbose about which node I want to run my workloads on. Is tolerations sufficient or would I benefit from using
    nodeSelectors
    ? It would be a fun contribution to work on, if you also think it would add value. This could e.g., be added in the
    @task
    decorator by the same means you are adjusting
    requests/limits
    . Let me know what you think! 🙂
    f
    • 2
    • 7
  • t

    thankful-dress-89577

    10/21/2022, 7:12 PM
    Has the topic of creating a terraform provider for flyte ever been raised? I'm thinking in terms of being able to create projects, register workflows, configure launchplans, etc. not the flyte cluster itself.
    v
    f
    • 3
    • 4
  • h

    hallowed-mouse-14616

    11/07/2022, 11:27 PM
    Hey All, as discussed in our OSS Sync last week we're working on improving performance observability of Flyte components. I drafted an RFC on adding runtime metrics, offering overhead estimates, and orchestration metrics, using opentelemetry traces, to improve performance analysis in both production environments and during feature benchmarking. Please take a look at the RFC draft (with a formatted markdown version here) and leave any comments / questions / concerns. We are very excited to hear from the greater community and incorporate feedback! Also, trying a simple github PR formatted in markdown for this, we are always refining our RFC process - thoughts also welcome regarding this.
    b
    • 2
    • 1
  • t

    tall-lock-23197

    02/17/2023, 12:00 PM
    set the channel topic: If you're interested in discussing adding new features, please let us know! We'd love to hear your ideas and work together to make Flyte even better.
  • f

    freezing-airport-6809

    05/14/2023, 9:24 PM
    moving everything to the contribute channel
  • f

    freezing-airport-6809

    05/14/2023, 9:24 PM
    archived the channel