# general
k
For llm_generate is it possible to run a local huggingface model? Perhaps by directly putting the local model repo path in the params instead of an open huggingface repo name?
k
We currently do not have this ability. How are you using the model outside of Daft?
k
Currently just via transformers or vLLM, but I'd like to test this out for batch inference with small local models (e.g. for evals or synthesis)
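Roughly something like this today, just a sketch with a placeholder path standing in for an actual local checkpoint:
```python
# Minimal sketch of how I'm running the model outside of Daft today
# (the local checkpoint path below is just a placeholder).
from transformers import pipeline

pipe = pipeline("text-generation", model="/path/to/local/checkpoint")

prompts = ["Summarize what Daft is.", "Summarize what vLLM is."]
outputs = pipe(prompts, max_new_tokens=64)
for out in outputs:
    print(out[0]["generated_text"])
```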
s
We have this capability for our new embed API, but we need to extend it to generate! cc: @Desmond Cheong @Robert Howell
e
@Kyle I’d also recommend LM Studio if you’re on a Mac. There are usually MLX variants for most models, and LM Studio spins up an OpenAI-compatible server which you can use with llm_generate.
d
Yep, if you're using LM Studio you can use a local model like so:
```python
import daft
from daft import col
from daft.functions import llm_generate, format

df = daft.from_pydict({"city": ["Paris", "Tokyo", "New York"]})
df = df.with_column(
    "description",
    llm_generate(
        format(
            "Describe the main attractions and unique features of this city: {}.",
            col("city"),
        ),
        model="openai/gpt-oss-20b",
        base_url="http://127.0.0.1:1234/v1",
        api_key="this-is-not-needed",
        provider="openai",
    ),
)
df.collect()
```
You can replace the `model` parameter with any local model you've loaded into LM Studio
k
Cool thanks! And does this also work on Ray?
d
That becomes a little harder 😅 We're in the process of beefing up/creating new APIs for embed/extract/prompt in the next few weeks, which would then let you run open models in distributed mode
e
@Kyle Are you dead set on offline serving?
k
@Everett Kleven I'm thinking of running things on fine-tuned models via their various checkpoints as I go along, so I was thinking offline serving may be easier, but I'm just checking out the feasibility as I plan things out!
And actually, the local model path did end up working for me in vLLM mode with llm_generate, but somehow it stopped working and started giving a weird ExceptionGroup error since yesterday...
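For reference, this is roughly what I had working (just a sketch, with a placeholder path in place of my actual checkpoint directory):
```python
import daft
from daft import col
from daft.functions import llm_generate, format

# Sketch of the vLLM-provider call that was working for me; the model
# path below is a placeholder for a local fine-tuned checkpoint directory.
df = daft.from_pydict({"question": ["What is Daft?", "What is vLLM?"]})
df = df.with_column(
    "answer",
    llm_generate(
        format("Answer briefly: {}", col("question")),
        model="/path/to/local/checkpoint",
        provider="vllm",
    ),
)
df.collect()
```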
e
@Kyle mind creating an issue and tagging us? It would be best to track the debugging there.
k
A bit hesitant because I think it might just be my own environment flaking on me 🫠
e
Totally understandable. It's still a pretty rough time getting a new model up and running for the first time.