Deep Dive Into Fine-Tuning An LM Using KAITO on AKS – Part 3: Deploying the FT Model

Now that I have fine-tuned a model in Part 2, next is to deploy the fine tuned model into a new Kaito workspace. This blog post is part of a series.Part 1: Intro and overview of the KAITO fine-tuning workspace yamlPart 2: Executing the Training Kubernetes Training JobPart 3: Deploying the Fine-Tuned ModelPart 4: Evaluating …

Continue reading Deep Dive Into Fine-Tuning An LM Using KAITO on AKS – Part 3: Deploying the FT Model

Deep Dive Into Fine-Tuning An LM Using KAITO on AKS – Part 2: Execution

I will continue from the Part 1 to execute the deployment of the fine-tuning workspace job. This blog post is part of a series.Part 1: Intro and overview of the KAITO fine-tuning workspace yamlPart 2: Executing the Training Kubernetes Training JobPart 3: Deploying the Fine-Tuned ModelPart 4: Evaluating the Fine-Tuned Model Let' start the fine …

Continue reading Deep Dive Into Fine-Tuning An LM Using KAITO on AKS – Part 2: Execution

Effortlessly Setup Kaito v0.3.1 on Azure Kubernetes Service To Deploy A Large Language Model

KAITO simplifies the deployment of large language models (LLMs) in Azure Kubernetes Service (AKS) environments with preset GPU configurations. This tool automates the setup process, including node provisioning and identity management, essential for data experiments while ensuring security compliance. It enhances efficiency, allowing engineers to focus on AI/ML model experimentation. #azure #kubernetes #AI #genAI #mvpbuzz

Did My Apps Go Down During AKS Upgrade?

I did an experiment of running a load test against 4 applications and while executing a Azure Kubernetes Cluster Upgrade to see the affects and any downtime. The applications hosted in my AKS cluster are AKS Helloworld - super simple application Voting App - simple application with Redis backend Bookinfo Istio demo - microservices architecture …

Continue reading Did My Apps Go Down During AKS Upgrade?

Fixing A Failed State AKS Cluster

If you ever encounter the following error, I'll show you one way that helped me resolve my AKS cluster. The cluster is in a failed state. If you didn't do an operation, AKS may resolve the provisioning status automatically if your cluster applications continue to run. To further troubleshoot, you can refer to https://github.com/MicrosoftDocs/SupportArticles-docs/blob/main/support/azure/azure-kubernetes/cluster-node-virtual-machine-failed-state.md For …

Continue reading Fixing A Failed State AKS Cluster

Installing Kubelogin in WSL Ubuntu 20.04

I like to share in hoping to have helped the small minority that couldn't get kubelogin setup properly with the struggles that I have been having. My desktop environment WSL on Windows 10 Ubuntu 20.04 Here's my error ~/linux-scripts $ kubelogin convert-kubeconfigerror: unknown command "convert-kubeconfig" for "kubelogin" I should have paid attention that the kubelogin …

Continue reading Installing Kubelogin in WSL Ubuntu 20.04

Deploying Azure Kubernetes Service Demo Store App with Azure Open AI – Part 2

This blog post is a continuation of Deploying Azure Kubernetes Service Demo Store App with Azure Open AI – Part 1. It dives into the Python code that calls the Azure Open AI Service and highlights the use of Semantic Kernel. The AKS Demo Store uses the AI functionality to generate product descriptions. The article emphasizes the benefits of using Azure Kubernetes Service (AKS) to support AI Ops platform.

Azure Managed Grafana Setup with AKS

The following is a walk through of a quick setup of Azure Managed Grafana Instance and viewing performance graphs on applications running in Azure Kubernetes Service. This is a relatively new Azure platform service and the general documentation can be found at https://learn.microsoft.com/en-us/azure/managed-grafana/how-to-data-source-plugins-managed-identity?tabs=azure-portal I have had experience to build, configure and deploy Grafana into an …

Continue reading Azure Managed Grafana Setup with AKS