SQL Hyperscale performance troubleshooting diagnostics
Applies to: Azure SQL Database
To troubleshoot performance problems in a Hyperscale database, the general SQL performance tuning methodologies is the starting point of any performance investigation. However, given the distributed architecture of Hyperscale, additional diagnostic data might need to be considered. This article describes Hyperscale-specific diagnostic data.
Reduced log rate waits
Every database and elastic pool in Azure SQL Database manages log generation rate via log rate governance. In Hyperscale, the log rate governance limit is set to 105 MB/s, regardless of the compute size. This value is exposed in the primary_max_log_rate
column in sys.dm_user_db_resource_governance.
At times, log generation rate on the primary compute replica must be reduced to maintain recoverability SLAs. For example, this can happen when a page server or another compute replica is significantly behind applying new log records from the log service. If no Hyperscale components are behind, the log rate governance mechanism allows log generation rate to reach 100 MB/s. This is the effective maximum log generation rate in all Hyperscale compute sizes.
Note
Log generation rate of 150 MiB/s is available as an opt-in preview feature for premium-series and premium-series memory optimized. For more information and to opt in to 150 MiB/s, see Blog: November 2024 Hyperscale enhancements.
The following wait types appear in sys.dm_os_wait_stats when the log rate is reduced:
Wait type | Reason |
---|---|
RBIO_RG_STORAGE |
Delayed log consumption by a page server |
RBIO_RG_DESTAGE |
Delayed log consumption by the long-term log storage |
RBIO_RG_REPLICA |
Delayed log consumption by an HA secondary replica or a named replica |
RBIO_RG_GEOREPLICA |
Delayed log consumption by a geo-secondary replica |
RBIO_RG_DESTAGE |
Delayed log consumption by the log service |
RBIO_RG_LOCALDESTAGE |
Delayed log consumption by the log service |
RBIO_RG_STORAGE_CHECKPOINT |
Delayed log consumption on by a page server due to slow database checkpoint |
RBIO_RG_MIGRATION_TARGET |
Delayed log consumption by the non-Hyperscale database during reverse migration |
The sys.dm_hs_database_log_rate() dynamic management function (DMF) provides additional details to help you understand log rate reduction, if any. For example, it can tell you which specific secondary replica is behind applying log records, and what is the total size of the not yet applied transaction log.
Page server reads
The compute replicas do not cache a full copy of the database locally. The data local to the compute replica is stored in the buffer pool (in memory) and in the local resilient buffer pool extension (RBPEX) cache that contains a subset of the most frequently accessed data pages. This local SSD cache is sized proportionally to the compute size. Each page server, on the other hand, has a complete SSD cache for the portion of the database it maintains.
When a read IO is issued on a compute replica, if the data doesn't exist in the buffer pool or in the local SSD cache, the page at the requested Log Sequence Number (LSN) is fetched from the corresponding page server. Reads from page servers are remote and are slower than reads from the local SSD cache. When troubleshooting I/O-related performance problems, we need to be able to tell how many IOs were done via the relatively slower page server reads.
Several dynamic managed views (DMVs) and extended events have columns and fields that specify the number of remote reads from a page server, which can be compared against the total reads. Query Store also captures page server reads in query runtime statistics.
- Columns to report page server reads are available in execution DMVs and catalog views:
- Page server reads fields are present in the following extended events:
sql_statement_completed
sp_statement_completed
sql_batch_completed
rpc_completed
scan_stopped
query_store_begin_persist_runtime_stat
query_store_execution_runtime_info
ActualPageServerReads
/ActualPageServerReadAheads
attributes are present in the query plan XML for plans that include runtime statistics. For example:<RunTimeCountersPerThread Thread="8" ActualRows="90466461" [...] ActualPageServerReads="0" ActualPageServerReadAheads="5687297" ActualLobPageServerReads="0" ActualLobPageServerReadAheads="0" />
Tip
To view these attributes in the query plan properties window, SSMS 18.3 or later is required.
Virtual file stats and IO accounting
In Azure SQL Database, the sys.dm_io_virtual_file_stats() DMF is one way to monitor database I/O statistics such as IOPS, throughput, and latency. I/O characteristics in Hyperscale are different due to its distributed architecture. In this section, we focus on read and write I/O as seen in this DMF. In Hyperscale, each data file visible in this DMF corresponds to a page server. The DMF also provides I/O statistics for the local SSD cache on the compute replica and for the transaction log.
Local SSD cache usage
Because the local SSD cache exists on the same compute replica where the database engine is processing queries, I/O against this cache is faster than I/O against page servers. In a Hyperscale database or elastic pool, sys.dm_io_virtual_file_stats()
has a special row reporting I/O statistics for the local SSD cache. This row has the value of 0
for both database_id
and file_id
columns. For example, the query below returns the local SSD cache I/O statistics since database startup.
SELECT *
FROM sys.dm_io_virtual_file_stats(0, NULL);
A ratio of reads from the local SSD cache to the aggregated reads from all other data files is the local SSD cache hit ratio. This metric is provided by the RBPEX cache hit ratio
and RBPEX cache hit ratio base
performance counters, available in the sys.dm_os_performance_counters DMV.
Data reads
- When reads are issued by the database engine on a compute replica, they might be served either by the local SSD cache, or by page servers, or by a combination of the two if reading multiple pages.
- When the compute replica reads some pages from a specific data file, for example the file with
file_id
1, if this data resides solely in the local SSD cache, all IO for this read is accounted againstfile_id
0. If some part of that data is in the local SSD cache, and some part is on page servers, then IO is accounted towardsfile_id
0 for the part served from the local SSD cache, and the part served from page servers is accounted towards their corresponding files. - When a compute replica requests a page at a particular LSN from a page server, if the page server has not yet caught up to the LSN requested, the read on the compute replica waits until the page server catches up before the page is returned. For any read from a page server on the compute replica, you see a
PAGEIOLATCH_*
wait type if it is waiting on that IO. In Hyperscale, this wait time includes both the time to catch up the requested page on the page server to the LSN required, and the time needed to transfer the page from the page server to the compute replica. - Large reads such as read-aheads are often done using scatter-gather reads. This allows reading up to 4 MB as a single read IO. However, when the data being read is in the local SSD cache, these reads are accounted as multiple individual 8-KB reads, since the buffer pool and the local SSD cache always use 8-KB pages. As the result, the number of read IOs seen against the local SSD cache might be larger than the actual number of IOs performed by the engine.
Data writes
- The primary compute replica does not write directly to page servers. Instead, log records from the log service are replayed on the corresponding page servers.
- Writes on the compute replica are predominantly writes to the local SSD cache (
file_id
0). For writes that are larger than 8 KB, in other words those done using gather-write, each write operation is translated into multiple 8-KB individual writes to the local SSD cache since the buffer pool and the local SSD cache always use 8-KB pages. As the result, the number of write IOs seen against the local SSD cache might be larger than the actual number of IOs performed by the engine. - Data files other than
file_id
0 that correspond to page servers might also show writes. In Hyperscale, these writes are simulated, because compute replicas never write directly to page servers. I/O statistics are accounted as they occur on the compute replica. IOPS, throughput, and latency seen on a compute replica for data files other thanfile_id
0 do not reflect the actual I/O statistics of writes that occur on page servers.
Log writes
- On the primary compute replica, a log write is accounted in
sys.dm_io_virtual_file_stats()
underfile_id
2. - Unlike in AlwaysOn Availability Groups, when a transaction commits on the primary compute replica, log records are not hardened on the secondary replica. In Hyperscale, log is hardened in the log service, and applied to the secondary replicas asynchronously. Because log writes don't actually occur on secondary replicas, any accounting of log IOs in
sys.dm_io_virtual_file_stats()
on the secondary replicas shouldn't be used as transaction log I/O statistics.
Data IO in resource utilization statistics
In a non-Hyperscale database, combined read and write IOPS against data files, relative to the resource governance data IOPS limit, are reported in sys.dm_db_resource_stats and sys.resource_stats views, in the avg_data_io_percent
column. The corresponding DMVs for elastic pools are sys.dm_elastic_pool_resource_stats and sys.elastic_pool_resource_stats. The same values are reported as the Data IO Percentage Azure Monitor metrics for databases and elastic pools.
In a Hyperscale database, these columns and metrics report on the data IOPS utilization relative to the limit for local SSD storage on compute replica only, which includes I/O against the local SSD cache and in the tempdb
database. A 100% value in this column indicates that resource governance is limiting local storage IOPS. If this is correlated with a performance problem, tune the workload to generate less IO, or increase the compute size to increase the resource governance Max Data IOPS limit. For resource governance of local SSD cache reads and writes, the system counts individual 8-KB IOs, rather than larger IOs that may be issued by the database engine.
Data IO against page servers is not reported in resource utilization views or via Azure Monitor metrics, but is reported in sys.dm_io_virtual_file_stats()
as described earlier.
Related content
- For vCore resource limits for a Hyperscale single database see Hyperscale service tier vCore Limits
- For monitoring Azure SQL Databases, enable database watcher
- For Azure SQL Database performance tuning, see Query performance in Azure SQL Database
- For performance tuning using Query Store, see Performance monitoring using Query store
- For DMV monitoring scripts, see Monitoring performance Azure SQL Database using dynamic management views