Azure OpenAI monitoring data reference

Article
08/28/2024

This article contains all the monitoring reference information for this service.

See Monitor Azure OpenAI for details on the data you can collect for Azure OpenAI Service and how to use it.

Metrics

This section lists all the automatically collected platform metrics for this service. These metrics are also part of the global list of all platform metrics supported in Azure Monitor.

For information on metric retention, see Azure Monitor Metrics overview.

Supported metrics for Microsoft.CognitiveServices/accounts

Here are the most important metrics we think you should monitor for Azure OpenAI. Later in this article is a longer list of all available Azure AI services metrics which contains more details on metrics in this shorter list.

Azure OpenAI Requests
Active Tokens
Generated Completion Tokens
Processed FineTuned Training Hours
Processed Inference Tokens
Processed Prompt Tokens
Provisioned-managed Utilization V2
Prompt Token Cache Match Rate
Time to Response
Time Between Tokens
Time to Last Byte
Normalized Time to First Byte

You can also monitor Content Safety metrics that are used by other Azure AI services.

Blocked Volume
Harmful Volume Detected
Potential Abusive User Count
Safety System Event
Total Volume Sent for Safety Check

Note

The Provisioned-managed Utilization metric is now deprecated and is no longer recommended. This metric has been replaced by the Provisioned-managed Utilization V2 metric.

Cognitive Services metrics have the category Cognitive Services - HTTP Requests in the following table. These metrics are legacy metrics that are common to all Azure AI Services resources. Microsoft no longer recommends that you use these metrics with Azure OpenAI.

The following table lists the metrics available for the Microsoft.CognitiveServices/accounts resource type.

All columns might not be present in every table.
Some columns might be beyond the viewing area of the page. Select Expand table to view all available columns.

Table headings

Category - The metrics group or classification.
Metric - The metric display name as it appears in the Azure portal.
Name in REST API - The metric name as referred to in the REST API.
Unit - Unit of measure.
Aggregation - The default aggregation type. Valid values: Average (Avg), Minimum (Min), Maximum (Max), Total (Sum), Count.
Dimensions - Dimensions available for the metric.
Time Grains - Intervals at which the metric is sampled. For example, PT1M indicates that the metric is sampled every minute, PT30M every 30 minutes, PT1H every hour, and so on.
DS Export- Whether the metric is exportable to Azure Monitor Logs via diagnostic settings. For information on exporting metrics, see Create diagnostic settings in Azure Monitor.

Category	Metric	Name in REST API	Unit	Aggregation	Dimensions	Time Grains	DS Export
Feature Occurences	Action Feature Occurrences Number of times each action feature appears.	`ActionFeatureIdOccurrences`	Count	Total (Sum)	`FeatureId`, `Mode`, `RunId`	PT1M	Yes
Features Per Event	Action Features Per Event Average number of action features per event.	`ActionFeaturesPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Actions	Action Occurences Number of times each action appears.	`ActionIdOccurrences`	Count	Total (Sum)	`ActionId`, `Mode`, `RunId`	PT1M	Yes
Namespaces Per Event	Action Namespaces Per Event Average number of action namespaces per event.	`ActionNamespacesPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Actions	Actions Per Event Number of actions per event.	`ActionsPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Azure OpenAI - Usage	Active Tokens Total tokens minus cached tokens over a period of time. Applies to PTU and PTU-managed deployments. Use this metric to understand your TPS or TPM based utilization for PTUs and compare to your benchmarks for target TPS or TPM for your scenarios. To breakdown API requests, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName, and ModelVersion.	`ActiveTokens`	Count	Minimum, Maximum, Average, Total (Sum)	`Region`, `ModelDeploymentName`, `ModelName`, `ModelVersion`	PT1M	Yes
SpeechServices - Usage	Audio Seconds Transcribed Number of seconds transcribed	`AudioSecondsTranscribed`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
SpeechServices - Usage	Audio Seconds Translated Number of seconds translated	`AudioSecondsTranslated`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
SpeechServices - Usage	Avatar Model Hosting Seconds Number of Seconds.	`AvatarModelHostingSeconds`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Azure OpenAI - HTTP Requests	Prompt Token Cache Match Rate Percentage of the prompt tokens hit the cache, avaiable for PTU-managed.	`AzureOpenAIContextTokensCacheMatchRate`	Percent	Minimum, Maximum, Average	`Region`, `ModelDeploymentName`, `ModelName`, `ModelVersion`	PT1M	No
Azure OpenAI - Usage	Provisioned-managed Utilization Utilization % for a provisoned-managed deployment, calculated as (PTUs consumed / PTUs deployed) x 100. When utilization is greater than or equal to 100%, calls are throttled and error code 429 returned. To breakdown this metric, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName, ModelVersion and StreamType (Streaming vs non-streaming requests)	`AzureOpenAIProvisionedManagedUtilization`	Percent	Minimum, Maximum, Average	`Region`, `StreamType`, `ModelDeploymentName`, `ModelName`, `ModelVersion`	PT1M	No
Azure OpenAI - Usage	Provisioned-managed Utilization V2 Utilization % for a provisoned-managed deployment, calculated as (PTUs consumed / PTUs deployed) x 100. When utilization is greater than or equal to 100%, calls are throttled and error code 429 returned. To breakdown this metric, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName, ModelVersion and StreamType (Streaming vs non-streaming requests)	`AzureOpenAIProvisionedManagedUtilizationV2`	Percent	Minimum, Maximum, Average	`Region`, `StreamType`, `ModelDeploymentName`, `ModelName`, `ModelVersion`	PT1M	No
Azure OpenAI - HTTP Requests	Azure OpenAI Requests Number of calls made to the Azure OpenAI API over a period of time. Applies to PTU, PTU-Managed and Pay-as-you-go deployments. To breakdown API requests, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName, ModelVersion, StatusCode (successful, clienterrors, server errors), StreamType (Streaming vs non-streaming requests) and operation.	`AzureOpenAIRequests`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `StreamType`, `ModelDeploymentName`, `ModelName`, `ModelVersion`, `StatusCode`	PT1M	Yes
Azure OpenAI - HTTP Requests	Time to Response Recommended latency (responsiveness) measure for streaming requests. Applies to PTU and PTU-managed deployments. Calculated as time taken for the first response to appear after a user sends a prompt, as measured by the API gateway. This number increases as the prompt size increases and/or cache hit size reduces. To breakdown time to response metric, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName, and ModelVersion. Note: this metric is an approximation as measured latency is heavily dependent on multiple factors, including concurrent calls and overall workload pattern. In addition, it does not account for any client-side latency that may exist between your client and the API endpoint. Please refer to your own logging for optimal latency tracking.	`AzureOpenAITimeToResponse`	MilliSeconds	Minimum, Maximum, Average	`ApiName`, `OperationName`, `Region`, `StreamType`, `ModelDeploymentName`, `ModelName`, `ModelVersion`, `StatusCode`	PT1M	Yes
Slots	Baseline Estimator Overall Reward Baseline Estimator Overall Reward.	`BaselineEstimatorOverallReward`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Slots	Baseline Estimator Slot Reward Baseline Estimator Reward by slot.	`BaselineEstimatorSlotReward`	Count	Average	`SlotId`, `SlotIndex`, `Mode`, `RunId`	PT1M	Yes
Slots	Baseline Random Estimator Overall Reward Baseline Random Estimator Overall Reward.	`BaselineRandomEstimatorOverallReward`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Slots	Baseline Random Estimator Slot Reward Baseline Random Estimator Reward by slot.	`BaselineRandomEstimatorSlotReward`	Count	Average	`SlotId`, `SlotIndex`, `Mode`, `RunId`	PT1M	Yes
Estimations	Baseline Random Event count Estimation for baseline random event count.	`BaselineRandomEventCount`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Estimations	Baseline Random Reward Estimation for baseline random reward.	`BaselineRandomReward`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Cognitive Services - HTTP Requests	Blocked Calls Number of calls that exceeded rate or quota limit. Do not use for Azure OpenAI service.	`BlockedCalls`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Usage	Inference Count Inference Count of Carnegie Frontdoor Service	`CarnegieInferenceCount`	Count	Total (Sum)	`Region`, `Modality`, `Category`, `Language`, `SeverityLevel`, `UseCustomList`	PT1M	Yes
Usage	Characters Trained (Deprecated) Total number of characters trained.	`CharactersTrained`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`	PT1M	Yes
Usage	Characters Translated (Deprecated) Total number of characters in incoming text request.	`CharactersTranslated`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`	PT1M	Yes
Cognitive Services - HTTP Requests	Client Errors Number of calls with client side error (HTTP response code 4xx). Do not use for Azure OpenAI service.	`ClientErrors`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Usage	Computer Vision Transactions Number of Computer Vision Transactions	`ComputerVisionTransactions`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
ContentSafety - Usage	Call Count for Image Moderation Number of calls for image moderation.	`ContentSafetyImageAnalyzeRequestCount`	Count	Total (Sum)	`ApiVersion`	PT1M	Yes
ContentSafety - Usage	Call Count for Text Moderation Number of calls for text moderation.	`ContentSafetyTextAnalyzeRequestCount`	Count	Total (Sum)	`ApiVersion`	PT1M	Yes
Feature Occurences	Context Feature Occurrences Number of times each context feature appears.	`ContextFeatureIdOccurrences`	Count	Total (Sum)	`FeatureId`, `Mode`, `RunId`	PT1M	Yes
Features Per Event	Context Features Per Event Number of context features per event.	`ContextFeaturesPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Namespaces Per Event	Context Namespaces Per Event Number of context namespaces per event.	`ContextNamespacesPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Usage	Custom Vision Training Time Custom Vision training time	`CustomVisionTrainingTime`	Seconds	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Custom Vision Transactions Number of Custom Vision prediction transactions	`CustomVisionTransactions`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Cognitive Services - HTTP Requests	Data In Size of incoming data in bytes. Do not use for Azure OpenAI service.	`DataIn`	Bytes	Total (Sum)	`ApiName`, `OperationName`, `Region`	PT1M	Yes
Cognitive Services - HTTP Requests	Data Out Size of outgoing data in bytes. Do not use for Azure OpenAI service.	`DataOut`	Bytes	Total (Sum)	`ApiName`, `OperationName`, `Region`	PT1M	Yes
Usage	Document Characters Translated Number of characters in document translation request.	`DocumentCharactersTranslated`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Document Custom Characters Translated Number of characters in custom document translation request.	`DocumentCustomCharactersTranslated`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Face Images Trained Number of images trained. 1,000 images trained per transaction.	`FaceImagesTrained`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Faces Stored Number of faces stored, prorated daily. The number of faces stored is reported daily.	`FacesStored`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Face Transactions Number of API calls made to Face service	`FaceTransactions`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
FeatureCardinality	Feature Cardinality by Action Feature Cardinality based on Action.	`FeatureCardinality_Action`	Count	Average	`FeatureId`, `Mode`, `RunId`	PT1M	Yes
FeatureCardinality	Feature Cardinality by Context Feature Cardinality based on Context.	`FeatureCardinality_Context`	Count	Average	`FeatureId`, `Mode`, `RunId`	PT1M	Yes
FeatureCardinality	Feature Cardinality by Slot Feature Cardinality based on Slot.	`FeatureCardinality_Slot`	Count	Average	`FeatureId`, `Mode`, `RunId`	PT1M	Yes
Azure OpenAI - Usage	Processed FineTuned Training Hours Number of Training Hours Processed on an OpenAI FineTuned Model	`FineTunedTrainingHours`	Count	Total (Sum)	`ApiName`, `ModelDeploymentName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Azure OpenAI - Usage	Generated Completion Tokens Number of tokens generated (output) from an OpenAI model. Applies to PTU, PTU-Managed and Pay-as-you-go deployments. To breakdown this metric, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName and ModelName.	`GeneratedTokens`	Count	Total (Sum)	`ApiName`, `ModelDeploymentName`, `FeatureName`, `UsageChannel`, `Region`, `ModelVersion`	PT1M	Yes
Usage	Images Stored Number of Custom Vision images stored.	`ImagesStored`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Cognitive Services - HTTP Requests	Latency Latency in milliseconds. Do not use for Azure OpenAI service.	`Latency`	MilliSeconds	Average	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Usage	Learned Events Number of Learned Events.	`LearnedEvents`	Count	Total (Sum)	`IsMatchBaseline`, `Mode`, `RunId`	PT1M	Yes
Usage	LUIS Speech Requests Number of LUIS speech to intent understanding requests	`LUISSpeechRequests`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	LUIS Text Requests Number of LUIS text requests	`LUISTextRequests`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Matched Rewards Number of Matched Rewards.	`MatchedRewards`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Usage	Non Activated Events Number of skipped events.	`NonActivatedEvents`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Slots	Slots Number of slots per event.	`NumberOfSlots`	Count	Average	`Mode`, `RunId`	PT1M	Yes
SpeechServices - Usage	Number of Speaker Profiles Number of speaker profiles enrolled. Prorated hourly.	`NumberofSpeakerProfiles`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Observed Rewards Number of Observed Rewards.	`ObservedRewards`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Usage	Document Sync Characters Translated Number of characters in document translation (synchronous) request.	`OneDocumentCharactersTranslated`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Slots	Online Estimator Overall Reward Online Estimator Overall Reward.	`OnlineEstimatorOverallReward`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Slots	Online Estimator Slot Reward Online Estimator Reward by slot.	`OnlineEstimatorSlotReward`	Count	Average	`SlotId`, `SlotIndex`, `Mode`, `RunId`	PT1M	Yes
Estimations	Online Event Count Estimation for online event count.	`OnlineEventCount`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Estimations	Online Reward Estimation for online reward.	`OnlineReward`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Usage	Processed Characters Number of Characters processed by Immersive Reader.	`ProcessedCharacters`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Processed Health Text Records Number of health text records processed	`ProcessedHealthTextRecords`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Processed Images Number of images processed	`ProcessedImages`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Processed Pages Number of pages processed	`ProcessedPages`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Azure OpenAI - Usage	Processed Prompt Tokens Number of prompt tokens processed (input) on an OpenAI model. Applies to PTU, PTU-Managed and Pay-as-you-go deployments. To breakdown this metric, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName and ModelName.	`ProcessedPromptTokens`	Count	Total (Sum)	`ApiName`, `ModelDeploymentName`, `FeatureName`, `UsageChannel`, `Region`, `ModelVersion`	PT1M	Yes
Usage	Processed Text Records Count of Text Records.	`ProcessedTextRecords`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	QA Text Records Number of text records processed	`QuestionAnsweringTextRecords`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
ContentSafety - Risks&Safety	Potentially Abusive User Count Number of potentially abusive user that detected over a period of time. You can add a filter or apply splitting by the following dimension: ModelDeploymentName.	`RAIAbusiveUsersCount`	Count	Total (Sum)	`Region`, `ModelDeploymentName`	PT1M	Yes
ContentSafety - Risks&Safety	Harmful Volume Detected Number of calls made to Azure OpenAI API and detected as harmful(both block model and annotate mode) by content filter applied over a period of time. You can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName and TextType.	`RAIHarmfulRequests`	Count	Total (Sum)	`Region`, `ModelDeploymentName`, `ModelName`, `ModelVersion`, `ApiName`, `TextType`, `Category`, `Severity`	PT1M	Yes
ContentSafety - Risks&Safety	Blocked Volume Number of calls made to Azure OpenAI API and rejected by content filter applied over a period of time. You can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName and TextType.	`RAIRejectedRequests`	Count	Total (Sum)	`Region`, `ModelDeploymentName`, `ModelName`, `ModelVersion`, `ApiName`, `TextType`, `Category`	PT1M	Yes
ContentSafety - Risks&Safety	Safety System Event System event for risks & safety monitoring. You can add a filter or apply splitting by the following dimension: EventType.	`RAISystemEvent`	Count	Average	`Region`, `EventType`	PT1M	Yes
ContentSafety - Risks&Safety	Total Volume Sent For Safety Check Number of calls made to Azure OpenAI API and detected by content filter applied over a period of time. You can add a filter or apply splitting by the following dimensions: ModelDeploymentName, ModelName.	`RAITotalRequests`	Count	Total (Sum)	`Region`, `ModelDeploymentName`, `ModelName`, `ModelVersion`, `ApiName`	PT1M	Yes
Cognitive Services - HTTP Requests	Ratelimit The current ratelimit of the ratelimit key. Do not use for Azure OpenAI service.	`Ratelimit`	Count	Total (Sum)	`Region`, `RatelimitKey`	PT1M	Yes
Rewards	Average Reward Per Event Average reward per event.	`Reward`	Count	Average	`BaselineAction`, `ChosenActionId`, `MatchesBaseline`, `NonDefaultReward`, `Mode`, `RunId`	PT1M	Yes
Cognitive Services - HTTP Requests	Server Errors Number of calls with service internal error (HTTP response code 5xx). Do not use for Azure OpenAI service.	`ServerErrors`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Feature Occurences	Slot Feature Occurrences Number of times each slot feature appears.	`SlotFeatureIdOccurrences`	Count	Total (Sum)	`FeatureId`, `Mode`, `RunId`	PT1M	Yes
Features Per Event	Slot Features Per Event Average number of slot features per event.	`SlotFeaturesPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Slots	Slot Occurrences Number of times each slot appears.	`SlotIdOccurrences`	Count	Total (Sum)	`SlotId`, `SlotIndex`, `Mode`, `RunId`	PT1M	Yes
Namespaces Per Event	Slot Namespaces Per Event Average number of slot namespaces per event.	`SlotNamespacesPerEvent`	Count	Average	`Mode`, `RunId`	PT1M	Yes
Rewards	Slot Reward Reward per slot.	`SlotReward`	Count	Average	`BaselineActionId`, `ChosenActionId`, `MatchesBaseline`, `NonDefaultReward`, `SlotId`, `SlotIndex`, `Mode`, `RunId`	PT1M	Yes
SpeechServices - Usage	Speaker Recognition Transactions Number of speaker recognition transactions	`SpeakerRecognitionTransactions`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
SpeechServices - Usage	Speech Model Hosting Hours Number of speech model hosting hours	`SpeechModelHostingHours`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Speech Session Duration (Deprecated) Total duration of speech session in seconds.	`SpeechSessionDuration`	Seconds	Total (Sum)	`ApiName`, `OperationName`, `Region`	PT1M	Yes
Cognitive Services - HTTP Requests	Successful Calls Number of successful calls. Do not use for Azure OpenAI service.	`SuccessfulCalls`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Cognitive Services - SLI	AvailabilityRate Availability percentage with the following calculation: (Total Calls - Server Errors)/Total Calls. Server Errors include any HTTP responses >=500. Do not use for Azure OpenAI service.	`SuccessRate`	Percent	Minimum, Maximum, Average	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	No
SpeechServices - Usage	Synthesized Characters Number of Characters.	`SynthesizedCharacters`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Text Characters Translated Number of characters in incoming text translation request.	`TextCharactersTranslated`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Text Custom Characters Translated Number of characters in incoming custom text translation request.	`TextCustomCharactersTranslated`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Usage	Text Trained Characters Number of characters trained using text translation.	`TextTrainedCharacters`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
Azure OpenAI - Usage	Processed Inference Tokens Number of inference tokens processed on an OpenAI model. Calculated as prompt tokens (input) plus generated tokens (output). Applies to PTU, PTU-Managed and Pay-as-you-go deployments. To breakdown this metric, you can add a filter or apply splitting by the following dimensions: ModelDeploymentName and ModelName.	`TokenTransaction`	Count	Total (Sum)	`ApiName`, `ModelDeploymentName`, `FeatureName`, `UsageChannel`, `Region`, `ModelVersion`	PT1M	Yes
Cognitive Services - HTTP Requests	Total Calls Total number of calls. Do not use for Azure OpenAI service.	`TotalCalls`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Cognitive Services - HTTP Requests	Total Errors Total number of calls with error response (HTTP response code 4xx or 5xx). Do not use for Azure OpenAI service.	`TotalErrors`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`, `RatelimitKey`	PT1M	Yes
Usage	Total Events Number of events.	`TotalEvents`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Cognitive Services - HTTP Requests	Total Token Calls Total number of token calls.	`TotalTokenCalls`	Count	Total (Sum)	`ApiName`, `OperationName`, `Region`	PT1M	Yes
Usage	Total Transactions (Deprecated) Total number of transactions.	`TotalTransactions`	Count	Total (Sum)	<none>	PT1M	Yes
Estimations	User Baseline Event Count Estimation for user defined baseline event count.	`UserBaselineEventCount`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
Estimations	User Baseline Reward Estimation for user defined baseline reward.	`UserBaselineReward`	Count	Total (Sum)	`Mode`, `RunId`	PT1M	Yes
SpeechServices - Usage	Video Seconds Synthesized Number of seconds synthesized	`VideoSecondsSynthesized`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
SpeechServices - Usage	Voice Model Hosting Hours Number of Hours.	`VoiceModelHostingHours`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes
SpeechServices - Usage	Voice Model Training Minutes Number of Minutes.	`VoiceModelTrainingMinutes`	Count	Total (Sum)	`ApiName`, `FeatureName`, `UsageChannel`, `Region`	PT1M	Yes

Metric dimensions

For information about what metric dimensions are, see Multi-dimensional metrics.

This service has the following dimensions associated with its metrics.

ApiName
FeatureName
ModelDeploymentName
ModelName
ModelVersion
OperationName
Region
StatusCode
StreamType
UsageChannel

Resource logs

This section lists the types of resource logs you can collect for this service. The section pulls from the list of all resource logs category types supported in Azure Monitor.

Supported resource logs for Microsoft.CognitiveServices/accounts

Category	Category display name	Log table	Supports basic log plan	Supports ingestion-time transformation	Costs to export
`Audit`	Audit Logs	AzureDiagnostics Logs from multiple Azure resources.	No	No	No
`RequestResponse`	Request and Response Logs	AzureDiagnostics Logs from multiple Azure resources.	No	No	No
`Trace`	Trace Logs	AzureDiagnostics Logs from multiple Azure resources.	No	No	No

Azure Monitor Logs tables

This section lists the Azure Monitor Logs tables relevant to this service, which are available for query by Log Analytics using Kusto queries. The tables contain resource log data and possibly more depending on what is collected and routed to them.

Azure OpenAI microsoft.cognitiveservices/accounts

Activity log

The linked table lists the operations that can be recorded in the activity log for this service. These operations are a subset of all the possible resource provider operations in the activity log.

For more information on the schema of activity log entries, see Activity Log schema.

AI + machine learning resource provider operations

See Monitor Azure OpenAI for a description of monitoring Azure OpenAI.
See Monitor Azure resources with Azure Monitor for details on monitoring Azure resources.

Share via

Azure OpenAI monitoring data reference

Metrics

Supported metrics for Microsoft.CognitiveServices/accounts

Metric dimensions

Resource logs

Supported resource logs for Microsoft.CognitiveServices/accounts

Azure Monitor Logs tables

Azure OpenAI microsoft.cognitiveservices/accounts

Activity log

Feedback

Additional resources

Share via

Azure OpenAI monitoring data reference

Metrics

Supported metrics for Microsoft.CognitiveServices/accounts

Metric dimensions

Resource logs

Supported resource logs for Microsoft.CognitiveServices/accounts

Azure Monitor Logs tables

Azure OpenAI microsoft.cognitiveservices/accounts

Activity log

Related content

Feedback

Additional resources