
feat: start a recipe with llama-stack backend #3016


Open · wants to merge 7 commits into main from feat-2624/llama-stack-recipes

Conversation

@feloy (Contributor) commented May 15, 2025

Signed-off-by: Philippe Martin <phmartin@redhat.com>

What does this PR do?

This PR is a first step towards being able to run a recipe using Llama Stack as a backend.

This PR covers:

  • AI Lab supports recipes with backend="llama-stack"
  • for such recipes, no model is requested in the "Start Recipe" form
  • when AI Lab deploys a recipe with the llama-stack backend, it does not deploy any inference server
  • the MODEL_ENDPOINT passed to the recipe is the endpoint of Llama Stack
  • all models served by Llama Stack are accessible to the app; it is the responsibility of the app to choose which model to work with (see the sketch below)
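
For illustration, here is a minimal sketch of how a recipe app could pick a model through MODEL_ENDPOINT. Only MODEL_ENDPOINT comes from this PR; the `/v1/models` route and the response shape are assumptions about the Llama Stack API:

```typescript
// Minimal sketch: MODEL_ENDPOINT is injected by AI Lab and points at Llama
// Stack; the /v1/models route and the response shape are assumptions.
const endpoint = process.env.MODEL_ENDPOINT;

async function firstModelId(): Promise<string> {
  const response = await fetch(`${endpoint}/v1/models`);
  if (!response.ok) {
    throw new Error(`Llama Stack unreachable: ${response.status}`);
  }
  const body = (await response.json()) as { data: { identifier: string }[] };
  // The app is responsible for choosing a model; here we take the first one.
  return body.data[0].identifier;
}
```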

Screenshot / video of UI

llama-stack-recipe-2.mp4

What issues does this PR fix or reference?

Fixes #2625

How to test this PR?

  1. Add this recipe to your ~/.local/share/containers/podman-desktop/extensions-storage/redhat.ai-lab/user-catalog.json file:
```json
{
  "version": "1.0",
  "recipes": [
    {
      "id": "chatbot-llama-stack",
      "description": "This recipe provides a blueprint for developers to create their own AI-powered chat applications using Streamlit and llama-stack.",
      "name": "ChatBot using Llama Stack",
      "repository": "https://github.com/feloy/chatbot-llama-stack-recipe",
      "ref": "main",
      "icon": "natural-language-processing",
      "categories": ["natural-language-processing"],
      "basedir": "/",
      "readme": "# Chat Application using Llama Stack\n",
      "backend": "llama-stack",
      "languages": ["python"],
      "frameworks": ["streamlit", "llama-stack"]
    }
  ],
  "models": [],
  "categories": []
}
```
  2. The app will work with the first model listed by Llama Stack, so make sure that only chat-compatible models are served by Llama Stack
  3. Go to the Recipes catalog and start the "ChatBot using Llama Stack" recipe
  4. Go to the running application details and open the app URL

@feloy feloy requested review from benoitf, jeffmaury and a team as code owners May 15, 2025 11:58
@feloy feloy requested review from cdrage and SoniaSandler May 15, 2025 11:58
@jeffmaury jeffmaury requested a review from slemeur May 15, 2025 12:20
@feloy feloy force-pushed the feat-2624/llama-stack-recipes branch from 6954ab1 to a5f9985 Compare May 16, 2025 07:54
Comment on lines 207 to 209
```typescript
if (model) {
  // upload model to podman machine if user system is supported
  if (!recipeComponents.inferenceServer) {
```
Contributor
suggestion: use a single `if (model && !recipeComponents.inferenceServer)`
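
A sketch of the collapsed guard (body elided, as in the excerpt above):

```typescript
// Single combined guard, per the suggestion above (body elided):
if (model && !recipeComponents.inferenceServer) {
  // upload model to podman machine if user system is supported
}
```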

```typescript
  labels: Record<string, string> = {},
): Promise<PodInfo> {
  const modelId = model ? model.id : '0';
```
@axel7083 (Contributor) May 16, 2025

I don't really like that; use something more understandable like `<none>` or `none`

@feloy (Author)

Sure, me neither:

> Note: I'm not very happy with the way the applicationManager functions accept an optional model, depending on whether we want to use llama-stack or not. Let's first discuss the high-level architecture of this solution; I can change these implementation details when we agree on the architecture.

Contributor

Sorry I went to code too fast 🙏

> Let's first discuss the high-level architecture of this solution

What depends on the `modelId`? We use it for operations like `hasApplicationPod` and `removeApplication`. Maybe we should only use the `recipeId` as key?

@feloy (Author) May 19, 2025

I'm not very comfortable changing the labels, as the model label is also used for the tasks (the user can start the same recipe with different models, and we want to differentiate the tasks for them).

I have pushed a new commit changing how the parameters are passed: there are now explicit `inferModel` / `startLlamaStack` options, which constrain the other parameters (only the `model` parameter for the moment). I also changed the arbitrary `'0'` model id to `<none>`.

@axel7083 WDYT?
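
(For illustration, a minimal sketch of how such mutually-exclusive options could be typed; the option names `inferModel` / `startLlamaStack` come from the comment above, while the `Recipe` and `ModelInfo` shapes are placeholder assumptions:)

```typescript
// Sketch: mutually-exclusive options, so a model is required exactly when
// we infer one. Recipe/ModelInfo shapes are placeholder assumptions.
interface Recipe { id: string }
interface ModelInfo { id: string }

type ApplicationOptions =
  | { recipe: Recipe; inferModel: true; model: ModelInfo } // classic recipe: model required
  | { recipe: Recipe; startLlamaStack: true };             // llama-stack recipe: no model

function modelIdLabel(options: ApplicationOptions): string {
  // '<none>' replaces the former arbitrary '0' for model-less recipes
  return 'inferModel' in options ? options.model.id : '<none>';
}
```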

feloy added 4 commits May 19, 2025 09:38
Signed-off-by: Philippe Martin <phmartin@redhat.com>
Signed-off-by: Philippe Martin <phmartin@redhat.com>
Signed-off-by: Philippe Martin <phmartin@redhat.com>
Signed-off-by: Philippe Martin <phmartin@redhat.com>
@feloy feloy force-pushed the feat-2624/llama-stack-recipes branch from a6227e1 to 49e4639 Compare May 19, 2025 08:38
@feloy feloy requested a review from axel7083 May 19, 2025 08:44
@gastoner (Contributor)

Please add "llama-stack" here https://github.com/containers/podman-desktop-extension-ai-lab/blob/main/packages/frontend/src/lib/RecipeCardTags.ts#L25 so it works with other tags (I think this should be only change to make it work)

```typescript
  labels?: { [key: string]: string },
): Promise<RecipeComponents> {
  const localFolder = path.join(this.appUserDirectory, recipe.id);
public async buildRecipe(options: ApplicationOptions, labels?: { [key: string]: string }): Promise<RecipeComponents> {
```
Contributor

why not include the labels in the options? It feels weird to move everything inside `ApplicationOptions` except the labels

@feloy (Author)

The first reason is that labels is not part of the parameters of `requestPullApplication`, and I would like to keep the same options structure everywhere
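
(A sketch of that rationale; the `requestPullApplication` signature and the shapes below are assumptions for illustration:)

```typescript
// The same ApplicationOptions shape is reused by both entry points, while
// labels remain a separate, optional concern of buildRecipe only.
// (All shapes and signatures here are assumptions for illustration.)
interface Recipe { id: string }
interface ModelInfo { id: string }
interface ApplicationOptions { recipe: Recipe; model?: ModelInfo }
interface RecipeComponents { images: string[] }

declare function requestPullApplication(options: ApplicationOptions): Promise<string>;
declare function buildRecipe(
  options: ApplicationOptions,
  labels?: { [key: string]: string },
): Promise<RecipeComponents>;
```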

Signed-off-by: Philippe Martin <phmartin@redhat.com>
@feloy feloy force-pushed the feat-2624/llama-stack-recipes branch from 8608fe7 to ce79877 Compare May 19, 2025 12:03
@feloy feloy marked this pull request as draft May 19, 2025 12:15
feloy added 2 commits May 19, 2025 14:27
Signed-off-by: Philippe Martin <phmartin@redhat.com>
Signed-off-by: Philippe Martin <phmartin@redhat.com>
@feloy feloy force-pushed the feat-2624/llama-stack-recipes branch from c22c5d9 to 7db6fe5 Compare May 19, 2025 13:54
@feloy feloy requested a review from axel7083 May 19, 2025 13:59
@feloy feloy marked this pull request as ready for review May 19, 2025 13:59
Development

Successfully merging this pull request may close these issues.

Provide a llama-stacked version of the Summarizer recipe
3 participants