AI app templates

This section of the documentation introduces you to the AI app templates and related articles that use these templates to demonstrate how to perform key developer tasks. AI app templates provide you with well-maintained, easy to deploy reference implementations that help to ensure a high-quality starting point for your AI apps.

There are two categories of AI app templates, building blocks and end-to-end solutions. The following sections introduce some of the key templates in each category for the programming language you have selected at the top of this article. To browse a more comprehensive list including these and other templates, see the AI app templates on the AI App Template gallery.

Building blocks

Building blocks are smaller-scale samples that focus on specific scenarios and tasks. Most building blocks demonstrate functionality that leverages the end-to-end solution for a chat app that uses your own data.

Building block Description
Load balance with Azure Container Apps Learn how to add load balancing to your application to extend the chat app beyond the Azure OpenAI token and model quota limits. This approach uses Azure Container Apps to create three Azure OpenAI endpoints, as well as a primary container to direct incoming traffic to one of the three endpoints.
Building block Description
Configure document security for the chat app When you build a chat application using the RAG pattern with your own data, make sure that each user receives an answer based on their permissions. An authorized user should have access to answers contained within the documents of the chat app. An unauthorized user shouldn't have access to answers from secured documents they don't have authorization to see.
Evaluate chat app answers Learn how to evaluate a chat app's answers against a set of correct or ideal answers (known as ground truth). Whenever you change your chat application in a way which affects the answers, run an evaluation to compare the changes. This demo application offers tools you can use today to make it easier to run evaluations.
Load balance with Azure Container Apps Learn how to add load balancing to your application to extend the chat app beyond the Azure OpenAI token and model quota limits. This approach uses Azure Container Apps to create three Azure OpenAI endpoints, as well as a primary container to direct incoming traffic to one of the three endpoints.
Load balance with API Management Learn how to add load balancing to your application to extend the chat app beyond the Azure OpenAI token and model quota limits. This approach uses Azure API Management to create three Azure OpenAI endpoints, as well as a primary container to direct incoming traffic to one of the three endpoints.
Load test the Python chat app with Locust Learn the process to perform load testing on a Python chat application using the RAG pattern with Locust, a popular open-source load testing tool. The primary objective of load testing is to ensure that the expected load on your chat application does not exceed the current Azure OpenAI Transactions Per Minute (TPM) quota. By simulating user behavior under heavy load, you can identify potential bottlenecks and scalability issues in your application.
Secure your AI App with keyless authentication Learn the process to secure your Python Azure OpenAI chat application with keyless authentication. Application requests to most Azure services should be authenticated with keyless or passwordless connections. Keyless authentication offers improved management and security benefits over the account key because there's no key (or connection string) to store.
Building block Description
Load balance with Azure Container Apps Learn how to add load balancing to your application to extend the chat app beyond the Azure OpenAI token and model quota limits. This approach uses Azure Container Apps to create three Azure OpenAI endpoints, as well as a primary container to direct incoming traffic to one of the three endpoints.
Building block Description
Evaluate chat app answers Learn how to evaluate a chat app's answers against a set of correct or ideal answers (known as ground truth). Whenever you change your chat application in a way which affects the answers, run an evaluation to compare the changes. This demo application offers tools you can use today to make it easier to run evaluations.
Load balance with Azure Container Apps Learn how to add load balancing to your application to extend the chat app beyond the Azure OpenAI token and model quota limits. This approach uses Azure Container Apps to create three Azure OpenAI endpoints, as well as a primary container to direct incoming traffic to one of the three endpoints.
Load balance with API Management Learn how to add load balancing to your application to extend the chat app beyond the Azure OpenAI token and model quota limits. This approach uses Azure API Management to create three Azure OpenAI endpoints, as well as a primary container to direct incoming traffic to one of the three endpoints.

End-to-end solutions

End-to-end solutions are comprehensive reference samples including documentation, source code, and deployment to allow you to take and extend for your own purposes.

Chat with your data using Azure OpenAI and Azure AI Search with .NET

This template is a complete end-to-end solution demonstrating the Retrieval-Augmented Generation (RAG) pattern running in Azure. It uses Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

To get started with this template, see Get started with the chat using your own data sample for .NET. To access the source code and read in-depth details about the template, see the azure-search-openai-demo-csharp GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps
Azure Functions
Azure OpenAI
Azure Computer Vision
Azure Form Recognizer
Azure AI Search
Azure Storage
GPT 3.5 Turbo
GPT 4.0

Contoso chat retail Copilot with .NET and Semantic Kernel

This template implements Contoso Outdoors, a conceptual store specializing in outdoor gear for hiking and camping enthusiasts. This virtual store enhances customer engagement and sales support through an intelligent chat agent. This agent is powered by the Retrieval Augmented Generation (RAG) pattern within the Microsoft Azure AI Stack, enriched with Semantic Kernel and Prompty support.

To access the source code and read in-depth details about the template, see the contoso-chat-csharp-prompty GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps
Azure OpenAI
Microsoft Entra ID
Azure Managed Identity
Azure Monitor
Azure AI Search
Azure AI Foundry
Azure SQL
Azure Storage
GPT 3.5 Turbo
GPT 4.0

Process automation with speech to text and summarization with .NET and GPT 3.5 Turbo

This template is a process automation solution that receives issues reported by field and shop floor workers at a company called Contoso Manufacturing, a manufacturing company that makes car batteries. The issues are shared by the workers either live through microphone input or pre-recorded as audio files. The solution translates audio input from speech to text and then uses an LLM and Prompty or Promptflow to summarize the issue and return the results in a format specified by the solution.

To access the source code and read in-depth details about the template, see the summarization-openai-csharp-prompty GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps Speech to Text
Summarization
Azure OpenAI
GPT 3.5 Turbo

Chat with your data using Azure OpenAI and Azure AI Search with Python

This template is a complete end-to-end solution demonstrating the Retrieval-Augmented Generation (RAG) pattern running in Azure. It uses Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Question and Answer (Q&A) experiences.

To get started with this template, see Get started with the chat using your own data sample for Python. To access the source code and read in-depth details about the template, see the azure-search-openai-demo GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps Azure OpenAI
Azure AI Search
Azure Blob Storage
Azure Monitor
Azure Document Intelligence
GPT 3.5 Turbo
GPT 4
GPT 4o
GPT 4o-mini

Multi-Modal Creative Writing Copilot with DALL-E

This template is a creative writing multi-agent solution to help users write articles. It demonstrates how to create and work with AI agents driven by Azure OpenAI.

It includes:

  1. A Flask app that takes an article and instruction from a user.
  2. A research agent that uses the Bing Search API to research the article.
  3. A product agent that uses Azure AI Search to do a semantic similarity search for related products from a vector store.
  4. A writer agent to combine the research and product information into a helpful article.
  5. An editor agent to refine the article presented to the user.

To access the source code and read in-depth details about the template, see the agent-openai-python-prompty GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Registry
Azure Kubernetes
Azure OpenAI
Bing Search
Azure Managed Identity
Azure Monitor
Azure AI Search
Azure AI Foundry
GPT 3.5 Turbo
GPT 4.0
DALL-E

Contoso Chat Retail Copilot with Azure AI Foundry

This template implements Contoso Chat - a retail copilot solution for Contoso Outdoor that uses a retrieval augmented generation design pattern to ground chatbot responses in the retailer's product and customer data. Customers can ask questions from the website in natural language, and get relevant responses with potential recommendations based on their purchase history - with responsible AI practices to ensure response quality and safety.

This template illustrates the end-to-end workflow (GenAIOps) for building a RAG-based copilot code-first with Azure AI and Prompty. By exploring and deploying this sample, learn to:

  1. Ideate and iterate rapidly on app prototypes using Prompty
  2. Deploy and use Azure OpenAI models for chat, embeddings, and evaluation
  3. Use Azure AI Search (indexes) and Azure Cosmos DB (databases) for your data
  4. Evaluate chat responses for quality using AI-assisted evaluation flows
  5. Host the application as a FastAPI endpoint deployed to Azure Container Apps
  6. Provision and deploy the solution using the Azure Developer CLI
  7. Support Responsible AI practices with content safety & assessments

To access the source code and read in-depth details about the template, see the contoso-chat GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps
Azure OpenAI
Azure AI Search
Azure AI Foundry
Prompty
Azure Cosmos DB
GPT 3.5 Turbo
GPT 4.0
Managed Integration Runtime (MIR)

Process automation with speech to text and summarization with Azure AI Foundry

This template creates a web-based app that allows workers at a company called Contoso Manufacturing to report issues via text or speech. Audio input is translated to text and then summarized to highlight important information and the report is sent to the appropriate department.

To access the source code and read in-depth details about the template, see the summarization-openai-python-promptflow GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps Azure AI Foundry
Speech to Text Service
Prompty
Managed Integration Runtime (MIR)
GPT 3.5 Turbo

Assistant API Analytics Copilot with Python and Azure AI Foundry

This template is an Assistant API to chat with tabular data and perform analytics in natural language.

To access the source code and read in-depth details about the template, see the assistant-data-openai-python-promptflow GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Machine Learning service Azure AI Search
Azure AI Foundry
Managed Integration Runtime (MIR)
Azure OpenAI
GPT 3.5 Turbo
GPT 4

Chat with your data using Azure OpenAI and Azure AI Search with Java

This template is a complete end-to-end solution that demonstrates the Retrieval-Augmented Generation (RAG) pattern running in Azure. It uses Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

To get started with this template, see Get started with the chat using your own data sample for Java. To access the source code and read in-depth details about the template, see the azure-search-openai-demo-java GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure App Service
Azure Container Apps
Azure Kubernetes Service
Azure OpenAI
Azure AI Search
Azure Document Intelligence
Azure Storage
Azure App Insights
Azure Service Bus
Azure Event Grid
gpt-35-turbo

Multi Agents Banking Assistant with Java and Semantic Kernel

This project is designed as a Proof of Concept (PoC) to explore the innovative realm of generative AI within the context of multi-agent architectures. By leveraging Java and Microsoft Semantic Kernel AI orchestration framework, our aim is to build a chat web app to demonstrate the feasibility and reliability of using generative AI agents to transform user experience from web clicks to natural language conversations while maximizing reuse of the existing workload data and APIs.

The core use case revolves around a banking personal assistant designed to revolutionize the way users interact with their bank account information, transaction history, and payment functionalities. Utilizing the power of generative AI within a multi-agent architecture, this assistant aims to provide a seamless, conversational interface through which users can effortlessly access and manage their financial data.

Invoices samples are included in the data folder to make it easy to explore payments feature. The payment agent equipped with optical character recognition (OCR) tools (Azure Document Intelligence) leads the conversation with the user to extract the invoice data and initiate the payment process. Other account fake data - such as transactions, payment methods, and account balance - are also available to be queried by the user. All data and services are exposed as external REST APIs and consumed by the agents to provide the user with the requested information.

To access the source code and read in-depth details about the template, see the agent-openai-java-banking-assistant GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps Azure OpenAI
Azure Document Intelligence
Azure Storage
Azure Monitor
gpt-4o
gpt-4o-mini

Chat with your data using Azure OpenAI and Azure AI Search with JavaScript

This template is a complete end-to-end solution demonstrating the Retrieval-Augmented Generation (RAG) pattern running in Azure. It uses Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

To get started with this template, see Get started with the chat using your own data sample for JavaScript. To access the source code and read in-depth details about the template, see the azure-search-openai-javascript GitHub repo.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Container Apps
Azure Static Web Apps
Azure OpenAI
Azure AI Search
Azure Storage
Azure Monitor
text-embedding-ada-002

Azure OpenAI chat frontend

This template is a minimal OpenAI chat web component that can be hooked to any backend implementation as a client.

To access the source code and read in-depth details about the template, see the azure-openai-chat-frontend GitHub repo.

Video demonstrating JavaScript chat frontend application.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Static Web Apps Azure AI Search
Azure OpenAI
GPT 3.5 Turbo
GPT4

Serverless AI chat with RAG using LangChain.js

The template is a serverless AI chatbot with Retrieval Augmented Generation using LangChain.js and Azure that uses a set of enterprise documents to generate responses to user queries. It uses a fictitious company called Contoso Real Estate, and the experience allows its customers to ask support questions about the usage of its products. The sample data includes a set of documents that describes its terms of service, privacy policy and a support guide.

To learn how to deploy and run this template, see Get started with Serverless AI Chat with RAG using LangChain.js. To access the source code and read in-depth details about the template, see the serverless-chat-langchainjs GitHub repo.

Learn how to deploy and run this JavaScript reference template.

This template demonstrates the use of these features.

Azure hosting solution Technologies AI models
Azure Static Web Apps
Azure Functions
Azure AI Search
Azure OpenAI
Azure Cosmos DB
Azure Storage
Azure Managed Identity
GPT4
Mistral
Ollama