Commit 5b7339b

Merge pull request #5798 from MicrosoftDocs/main
Merged by Learn.Build PR Management system
2 parents e483fba + b3c5821 commit 5b7339b

66 files changed

+1689
-419
lines changed

articles/ai-foundry/agents/faq.yml

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ metadata:
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: faq
-ms.date: 01/15/2025
+ms.date: 06/30/2025
 ms.author: aahi
 author: aahill
 title: Azure AI Foundry Agent Service frequently asked questions

articles/ai-foundry/agents/how-to/tools/azure-ai-search-samples.md

Lines changed: 1 addition & 1 deletion
@@ -6,7 +6,7 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: how-to
-ms.date: 04/11/2025
+ms.date: 06/30/2025
 author: aahill
 ms.author: aahi
 ms.custom: azure-ai-agents

articles/ai-foundry/agents/how-to/tools/azure-ai-search.md

Lines changed: 1 addition & 1 deletion
@@ -6,7 +6,7 @@ services: azure-ai-agent-service
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: how-to
-ms.date: 04/11/2025
+ms.date: 06/30/2025
 author: aahill
 ms.author: aahi
 ms.custom: azure-ai-agents

articles/ai-foundry/agents/how-to/tools/code-interpreter-samples.md

Lines changed: 1 addition & 1 deletion
@@ -5,7 +5,7 @@ description: Find code samples to enable code interpreter for Azure AI Agents.
 author: aahill
 ms.author: aahi
 manager: nitinme
-ms.date: 04/09/2025
+ms.date: 06/30/2025
 ms.service: azure-ai-agent-service
 ms.topic: how-to
 ms.custom:

articles/ai-foundry/agents/how-to/tools/code-interpreter.md

Lines changed: 1 addition & 1 deletion
@@ -6,7 +6,7 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: how-to
-ms.date: 12/11/2024
+ms.date: 06/30/2025
 author: aahill
 ms.author: aahi
 ms.custom: azure-ai-agents

articles/ai-foundry/agents/how-to/tools/function-calling.md

Lines changed: 2 additions & 2 deletions
@@ -6,7 +6,7 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: how-to
-ms.date: 01/30/2025
+ms.date: 06/30/2025
 author: aahill
 ms.author: aahi
 zone_pivot_groups: selection-function-calling
@@ -18,7 +18,7 @@ ms.custom: azure-ai-agents
 Azure AI Agents supports function calling, which allows you to describe the structure of functions to an agent and then return the functions that need to be called along with their arguments.

 > [!NOTE]
-> Runs expire ten minutes after creation. Be sure to submit your tool outputs before the expiration.
+> Runs expire 10 minutes after creation. Be sure to submit your tool outputs before the expiration.

 ### Usage support
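
Reviewer note on the run-expiry wording changed in this file: the 10-minute window is the only figure taken from the diff. The sketch below is a hypothetical client-side helper (the name `seconds_until_expiry` and the `RUN_TTL` constant are assumptions, not part of any Azure SDK) showing one way a caller might check how long it has left to submit tool outputs, assuming it records the run's creation time itself.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional

# From the note in the diff: runs expire 10 minutes after creation.
RUN_TTL = timedelta(minutes=10)


def seconds_until_expiry(created_at: datetime, now: Optional[datetime] = None) -> float:
    """Return the seconds left before a run created at `created_at` expires.

    Hypothetical helper: the caller supplies the creation time it recorded.
    The result is clamped at zero once the window has passed.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    remaining = (created_at + RUN_TTL) - now
    return max(remaining.total_seconds(), 0.0)


# Usage: four minutes into the window, six minutes (360 seconds) remain.
created = datetime(2025, 6, 30, 12, 0, tzinfo=timezone.utc)
print(seconds_until_expiry(created, now=created + timedelta(minutes=4)))
```

A caller could compare the returned value against the expected duration of its tool call and skip the submission when the window has already closed.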

articles/ai-foundry/agents/how-to/tools/openapi-spec.md

Lines changed: 3 additions & 3 deletions
@@ -6,7 +6,7 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: how-to
-ms.date: 03/12/2025
+ms.date: 06/30/2025
 author: aahill
 ms.author: aahi
 ms.custom: azure-ai-agents
@@ -67,7 +67,7 @@ With API key authentication, you can authenticate your OpenAPI spec using variou

 1. Create a `custom keys` connection to store your API key.

-1. Go to the [Azure AI Foundry portal](https://ai.azure.com/?cid=learnDocs) and select the AI Project. Click **connected resources**.
+1. Go to the [Azure AI Foundry portal](https://ai.azure.com/?cid=learnDocs) and select the AI Project. Select **connected resources**.
 :::image type="content" source="../../media\tools\bing\project-settings-button.png" alt-text="A screenshot of the settings button for an AI project." lightbox="../../media\tools\bing\project-settings-button.png":::

 1. Select **+ new connection** in the settings page.
@@ -124,4 +124,4 @@ To set up authenticating with Managed Identity:

 1. Click **Finish**.

-1. Once the setup is done, you can continue by using the tool through the Foundry Portal, SDK, or REST API. Use the tabs at the top of this article to see code samples.
+1. Once the setup is done, you can continue by using the tool through the Azure AI Foundry portal, SDK, or REST API. Use the tabs at the top of this article to see code samples.

articles/ai-foundry/agents/how-to/triggers.md

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@
 ---
 title: Trigger an Azure AI Foundry agent using Logic Apps
 description: Use this article to learn how to trigger an AI agent when an event occurs.
-ms.date: 03/20/2025
+ms.date: 06/30/2025
 ms.topic: how-to
 author: aahill
 ms.author: aahi

articles/ai-foundry/agents/index.yml

Lines changed: 1 addition & 1 deletion
@@ -11,7 +11,7 @@ metadata:
 ms.topic: landing-page
 author: aahill
 ms.author: aahi
-ms.date: 12/20/2024
+ms.date: 06/30/2025

 # linkListType: architecture | concept | deploy | download | get-started | how-to-guide | learn | overview | quickstart | reference | tutorial | video | whats-new
 # Limits: https://review.learn.microsoft.com/help/contribute/contribute-how-to-write-landing-page?branch=main#limits

articles/ai-foundry/agents/quickstart.md

Lines changed: 1 addition & 1 deletion
@@ -7,7 +7,7 @@ ms.author: aahi
 manager: nitinme
 ms.service: azure-ai-agent-service
 ms.topic: quickstart
-ms.date: 05/27/2025
+ms.date: 06/30/2025
 ms.custom:
 - azure-ai-agents
 - build-2025
Lines changed: 64 additions & 38 deletions
@@ -1,70 +1,96 @@
 ---
-title: Deploy models in Azure AI Foundry portal
+title: Deployment options for Azure AI Foundry Models
 titleSuffix: Azure AI Foundry
-description: Learn about deploying models in Azure AI Foundry portal.
+description: Learn about deployment options for Azure AI Foundry Models.
 manager: scottpolly
 ms.service: azure-ai-foundry
 ms.topic: concept-article
-ms.date: 03/24/2025
+ms.date: 06/30/2025
 ms.reviewer: fasantia
 ms.author: mopeakande
 author: msakande
 ---

-# Overview: Deploy AI models in Azure AI Foundry portal
+# Deployment overview for Azure AI Foundry Models

-The model catalog in Azure AI Foundry portal is the hub to discover and use a wide range of models for building generative AI applications. Models need to be deployed to make them available for receiving inference requests. Azure AI Foundry offers a comprehensive suite of deployment options for models, depending on your needs and model requirements.
+The model catalog in Azure AI Foundry is the hub to discover and use a wide range of Foundry Models for building generative AI applications. Models need to be deployed to make them available for receiving inference requests. Azure AI Foundry offers a comprehensive suite of deployment options for Foundry Models, depending on your needs and model requirements.

-## Deploying models
+## Deployment options

-Deployment options vary depending on the model offering:
+Azure AI Foundry provides several deployment options depending on the type of models and resources you need to provision. The following deployment options are available:

-* **Azure OpenAI in Azure AI Foundry Models:** The latest OpenAI models that have enterprise features from Azure with flexible billing options.
-* **Serverless API deployment:** These models don't require compute quota from your subscription and are billed per token in a serverless API deployment.
-* **Open and custom models:** The model catalog offers access to a large variety of models across modalities, including models of open access. You can host open models in your own subscription with a managed infrastructure, virtual machines, and the number of instances for capacity management.
+- Standard deployment in Azure AI Foundry resources
+- Deployment to serverless API endpoints
+- Deployment to managed computes

-Azure AI Foundry offers four different deployment options:
+### Standard deployment in Azure AI Foundry resources

-|Name | Azure OpenAI | Azure AI Foundry Models | Serverless API deployment | Managed compute |
-|-------------------------------|----------------------|-------------------|----------------|-----------------|
-| Which models can be deployed? | [Azure OpenAI models](../../ai-services/openai/concepts/models.md) | [Azure OpenAI models and serverless API deployment](../../ai-foundry/model-inference/concepts/models.md) | [serverless API deployment](../how-to/model-catalog-overview.md) | [Open and custom models](../how-to/model-catalog-overview.md#availability-of-models-for-deployment-as-managed-compute) |
-| Deployment resource | Azure OpenAI resource | Azure AI services resource | AI project resource | AI project resource |
-| Requires Hubs/Projects | No | No | Yes | Yes |
-| Data processing options | Regional <br /> Data-zone <br /> Global | Global | Regional | Regional |
-| Private networking | Yes | Yes | Yes | Yes |
-| Content filtering | Yes | Yes | Yes | No |
-| Custom content filtering | Yes | Yes | No | No |
-| Key-less authentication | Yes | Yes | No | No |
-| Best suited when | You're planning to use only OpenAI models | You're planning to take advantage of the flagship models in Azure AI catalog, including OpenAI. | You're planning to use a single model from a specific provider (excluding OpenAI). | If you plan to use open models and you have enough compute quota available in your subscription. |
-| Billing bases | Token usage & [provisioned throughput units](../../ai-services/openai/concepts/provisioned-throughput.md) | Token usage | Token usage<sup>1</sup> | Compute core hours<sup>2</sup> |
-| Deployment instructions | [Deploy to Azure OpenAI](../how-to/deploy-models-openai.md) | [Deploy to Foundry Models](../model-inference/how-to/create-model-deployments.md) | [Deploy to serverless API deployment](../how-to/deploy-models-serverless.md) | [Deploy to Managed compute](../how-to/deploy-models-managed.md) |
+Azure AI Foundry resources (formerly referred to as Azure AI Services resources), is **the preferred deployment option** in Azure AI Foundry. It offers the widest range of capabilities, including regional, data zone, or global processing, and it offers standard and [provisioned throughput (PTU)](../../ai-services/openai/concepts/provisioned-throughput.md) options. Flagship models in Azure AI Foundry Models support this deployment option.

-<sup>1</sup> A minimal endpoint infrastructure is billed per minute. You aren't billed for the infrastructure that hosts the model in serverless API deployment. After you delete the endpoint, no further charges accrue.
+This deployment option is available in:

-<sup>2</sup> Billing is on a per-minute basis, depending on the product tier and the number of instances used in the deployment since the moment of creation. After you delete the endpoint, no further charges accrue.
+* Azure AI Foundry resources
+* Azure OpenAI resources<sup>1</sup>
+* Azure AI hub, when connected to an Azure AI Foundry resource (requires the [Deploy models to Azure AI Foundry resources](#configure-azure-ai-foundry-portal-for-deployment-options) feature to be turned on).
+
+<sup>1</sup>If you're using Azure OpenAI resources, the model catalog shows only Azure OpenAI in Foundry Models for deployment. You can get the full list of Foundry Models by upgrading to an Azure AI Foundry resource.
+
+To get started with standard deployment in Azure AI Foundry resources, see [How-to: Deploy models to Azure AI Foundry Models](../foundry-models/how-to/create-model-deployments.md).
+
+### Serverless API endpoint
+
+This deployment option is available **only in** [Azure AI hub resources](ai-resources.md) and it allows the creation of dedicated endpoints to host the model, accessible via API. Azure AI Foundry Models support serverless API endpoints with pay-as-you-go billing.
+
+Only regional deployments can be created for serverless API endpoints, and to use it, you _must_ **turn off** the "Deploy models to Azure AI Foundry resources" option.
+
+To get started with deployment to a serverless API endpoint, see [Deploy models as serverless API deployments](../how-to/deploy-models-serverless.md).

-> [!TIP]
-> To learn more about how to track costs, see [Monitor costs for models offered through Azure Marketplace](../how-to/costs-plan-manage.md#monitor-costs-for-models-offered-through-the-azure-marketplace).
+### Managed compute

-### How should I think about deployment options?
+This deployment option is available **only in** [Azure AI hub resources](ai-resources.md) and it allows the creation of a dedicated endpoint to host the model in a **dedicated compute**. You need to have compute quota in your subscription to host the model, and you're billed per compute uptime.

-Azure AI Foundry encourages you to explore various deployment options and choose the one that best suites your business and technical needs. In general, Consider using the following approach to select a deployment option:
+Managed compute deployment is required for model collections that include:

-* Start with [Foundry Models](../../ai-foundry/model-inference/overview.md), which is the option with the largest scope. This option allows you to iterate and prototype faster in your application without having to rebuild your architecture each time you decide to change something. If you're using Azure AI Foundry hubs or projects, enable this option by [turning on the Foundry Models feature](../model-inference/how-to/quickstart-ai-project.md#configure-the-project-to-use-foundry-models).
+* Hugging Face
+* NVIDIA inference microservices (NIMs)
+* Industry models (Saifr, Rockwell, Bayer, Cerence, Sight Machine, Page AI, SDAIA)
+* Databricks
+* Custom models

-* When you're looking to use a specific model:
+To get started, see [How to deploy and inference a managed compute deployment](../how-to/deploy-models-managed.md) and [Deploy Azure AI Foundry Models to managed compute with pay-as-you-go billing](../how-to/deploy-models-managed-pay-go.md).
+
+## Capabilities for the deployment options
+
+We recommend using [Standard deployments in Azure AI Foundry resources](#standard-deployment-in-azure-ai-foundry-resources) whenever possible, as it offers the largest set of capabilities among the available deployment options. The following table lists details about specific capabilities available for each deployment option:
+
+| Capability | Standard deployment in Azure AI Foundry resources | Serverless API Endpoint | Managed compute |
+|-------------------------------|--------------------------------------------------|------------------------|-----------------|
+| Which models can be deployed? | [Foundry Models](../../ai-foundry/foundry-models/concepts/models.md) | [Foundry Models with pay-as-you-go billing](../how-to/model-catalog-overview.md) | [Open and custom models](../how-to/model-catalog-overview.md#availability-of-models-for-deployment-as-managed-compute) |
+| Deployment resource | Azure AI Foundry resource | AI project (in AI hub resource) | AI project (in AI hub resource) |
+| Requires AI Hubs | No | Yes | Yes |
+| Data processing options | Regional <br /> Data-zone <br /> Global | Regional | Regional |
+| Private networking | Yes | Yes | Yes |
+| Content filtering | Yes | Yes | No |
+| Custom content filtering | Yes | No | No |
+| Key-less authentication | Yes | No | No |
+| Billing bases | Token usage & [provisioned throughput units](../../ai-services/openai/concepts/provisioned-throughput.md) | Token usage<sup>1</sup> | Compute core hours<sup>2</sup> |
+
+<sup>1</sup> A minimal endpoint infrastructure is billed per minute. You aren't billed for the infrastructure that hosts the model in standard deployment. After you delete the endpoint, no further charges accrue.
+
+<sup>2</sup> Billing is on a per-minute basis, depending on the product tier and the number of instances used in the deployment since the moment of creation. After you delete the endpoint, no further charges accrue.

-* If you're interested in Azure OpenAI models, use Azure OpenAI in Foundry Models. This option is designed for Azure OpenAI models and offers a wide range of capabilities for them.
+## Configure Azure AI Foundry portal for deployment options

-* If you're interested in a particular model from serverless pay per token offer, and you don't expect to use any other type of model, use [serverless API deployment](../how-to/deploy-models-serverless.md). serverless API deployments allow deployment of a single model under a unique set of endpoint URL and keys.
+Azure AI Foundry portal might automatically pick up a deployment option based on your environment and configuration. We recommend using Azure AI Foundry resources for deployment whenever possible. To do that, ensure that the **Deploy models to Azure AI Foundry resources** feature is **turned on**.

-* When your model isn't available in serverless API deployment and you have compute quota available in your subscription, use [Managed Compute](../how-to/deploy-models-managed.md), which supports deployment of open and custom models. It also allows a high level of customization of the deployment inference server, protocols, and detailed configuration.
+:::image type="content" source="../media/concepts/deployments-overview/docs-flag-enable-foundry.png" alt-text="A screenshot showing the steps to enable deployment to Azure AI Foundry resources in the Azure AI Foundry portal." lightbox="../media/concepts/deployments-overview/docs-flag-enable-foundry.png":::

+Once the **Deploy models to Azure AI Foundry resources** feature is enabled, models that support multiple deployment options default to deploy to Azure AI Foundry resources for deployment. To access other deployment options, either disable the feature or use the Azure CLI or Azure Machine Learning SDK for deployment. You can disable and enable the feature as many times as needed without affecting existing deployments.

 ## Related content

-* [Configure your AI project to use Foundry Models](../../ai-foundry/model-inference/how-to/quickstart-ai-project.md)
-* [Add and configure models to Foundry Models](../model-inference/how-to/create-model-deployments.md)
+* [Configure your AI project to use Foundry Models](../../ai-foundry/foundry-models/how-to/quickstart-ai-project.md)
+* [Add and configure models to Foundry Models](../foundry-models/how-to/create-model-deployments.md)
 * [Deploy Azure OpenAI models with Azure AI Foundry](../how-to/deploy-models-openai.md)
 * [Deploy open models with Azure AI Foundry](../how-to/deploy-models-managed.md)
-* [Model catalog and collections in Azure AI Foundry portal](../how-to/model-catalog-overview.md)
+* [Explore Azure AI Foundry Models](../how-to/model-catalog-overview.md)
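
Reviewer note on the capabilities table added in this file: the table can be read as a small lookup from requirements to deployment options. The sketch below transcribes the Yes/No rows and the data processing row from the added table into plain Python. The `DeploymentOption` type and the `matching_options` function are hypothetical names used only for illustration, and nothing here calls an Azure API.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class DeploymentOption:
    """One column of the capabilities table (hypothetical type, for illustration)."""
    name: str
    requires_ai_hub: bool
    data_processing: Tuple[str, ...]
    content_filtering: bool
    custom_content_filtering: bool
    keyless_auth: bool
    billing: str


# Values transcribed from the table added in the diff.
OPTIONS = [
    DeploymentOption("Standard deployment in Azure AI Foundry resources", False,
                     ("Regional", "Data-zone", "Global"), True, True, True,
                     "Token usage and provisioned throughput units"),
    DeploymentOption("Serverless API endpoint", True,
                     ("Regional",), True, False, False, "Token usage"),
    DeploymentOption("Managed compute", True,
                     ("Regional",), False, False, False, "Compute core hours"),
]


def matching_options(data_processing: Optional[str] = None,
                     custom_content_filtering: bool = False,
                     keyless_auth: bool = False) -> List[str]:
    """Return the names of options that satisfy every requested capability."""
    result = []
    for opt in OPTIONS:
        if data_processing is not None and data_processing not in opt.data_processing:
            continue
        if custom_content_filtering and not opt.custom_content_filtering:
            continue
        if keyless_auth and not opt.keyless_auth:
            continue
        result.append(opt.name)
    return result


# Usage: only the standard deployment offers global processing,
# while all three options support regional processing.
print(matching_options(data_processing="Global"))
print(matching_options(data_processing="Regional"))
```

Under these assumptions, any requirement beyond regional processing and basic content filtering narrows the result to the standard deployment, which is consistent with the recommendation stated in the added text.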

articles/ai-foundry/concepts/safety-evaluations-transparency-note.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.service: azure-ai-foundry
77
ms.custom:
88
- build-2024
99
ms.topic: article
10-
ms.date: 01/10/2025
10+
ms.date: 06/30/2025
1111
ms.reviewer: mithigpe
1212
ms.author: lagayhar
1313
author: lgayhardt
