Chapter 14
Performance tuning SQL Server

In this chapter, we review the database concepts and objects most associated with tuning the performance of queries and coded objects within Microsoft SQL Server and Azure SQL DB. We will not be looking at performance tuning queries that make use of the PolyBase feature, as tuning these queries will go beyond tuning SQL Server into tuning the external systems that they access.

In the first two sections of the chapter we will look at isolation levels and durability, which will touch on three of the ACID properties of an RDBMS (Relational Database Management System). These correspond to settings and configurations that let your code affect performance. The ACID properties are:

  • Atomicity. Transactions, which make multiple-step processes behave as one operation.

  • Isolation. Dealing with how multiple connections interact with one another.

  • Durability. When a transaction is committed, we have certain expectations about what happens if the server stops working.

  • Consistency. Making sure data meets business rules in the database at the end of a transaction.

Of these, we will cover the first three, but not consistency, as this is more of a database design task, which is covered in Chapter 7, “Understanding table features.”

Then, we explore how SQL Server executes queries, including the execution plans that the query processor creates to execute your query and how those plans are captured by the Query Store feature. We discuss execution plans in some detail, including what to look for when performance tuning, and how to control when plans go parallel, meaning SQL Server can use multiple processors to execute your query without the code changing at all.

The examples in this chapter behave identically on SQL Server instances and in Azure SQL Database unless otherwise noted. All sample scripts in this book are available for download at https://MicrosoftPressStore.com/SQLServer2019InsideOut/downloads.

Understanding isolation levels and concurrency

The fundamental problem when working on a multiuser computing system is that users frequently need access to the same resources. If there is a row, say Row X, and User 1 and User 2 both want to do something with it, what is the effect? If they both want to read the row, one set of concerns exists; if one wishes to read it and the other to write it, there is another set of issues. Finally, if both want to write to the row, still another set of concerns arises. This is where the concept of isolation comes in: how to isolate one connection from the other.

This is all related to the concept of atomicity, and as such to transactions containing one or more statements, because we need to isolate logical atomic operations from one another. Even a single statement in a declarative programming language like Transact-SQL translates into hundreds or thousands of steps behind the scenes.

Isolation isn't only a matter of physical access to resources (a disk drive is fetching data for one reader, so the next reader must wait for that to complete); that is a problem for the hardware. Instead, it is a matter of keeping other transactions as isolated as necessary from the data one transaction is using while it does its work. The performance implications are large: the more isolated operations need to be, the slower processing can become, but the less isolated transactions are, the greater the chance of losing data.

The types of things that can occur between two transactions are generally referred to as phenomena. They are:

  • Dirty Read. Reading data that another connection is in the process of changing. The problem is much like trying to read a post-it note that your boss is writing your new salary on. Read too early, and it looks like you are getting a $1 raise annually, or $100000, because the decimal point has not been added.

  • Non-Repeatable Read. Reading the same data over again that has changed or gone away. This problem is like when you check the box of doughnuts and see there is one left. While you are standing there, in control of the box, no one can take that last doughnut. But step away to get coffee, and when you come back it has a bite taken out of it. A repeatable read will always give you back rows with the same content as you first read (but may include more rows that did not exist when you first read the set of rows).

  • Phantom Read. A phantom read is when you read a set of data, but then come back and read it again and get new rows that did not previously exist. In the previous doughnut example, this is the happiest day of your life since there are now more doughnuts. However, this can be bad if your query needs to get back the same answer every time you ask the same question.

  • Reading a Previous Committed Version of Data. In some cases, you may be able to eliminate blocking by allowing connections to read a previously committed version of data that another connection is in the process of changing after your transaction started. A real-world example of this regularly happens in personal banking. You and your partner see you have $N in your account, and you both withdraw $N, not realizing the intentions of the other connection. Your ATM may even say you have $0 after both transactions, using stale information. This does not change the fees you will be receiving, but often it is perfectly acceptable for the task at hand to see how the data was at the start of the transaction.

Where this gets complicated is that many operations in a database system will be bundled into multi-step operations that need to be treated as one atomic operation. Reading data and getting back different results when executing the same query again, during what you expect to be an atomic operation, greatly increases the likelihood of returning incorrect results.

You can't choose which individual phenomena your transaction will be affected by; instead, they are bundled into isolation levels that allow certain effects to occur. For example, the default isolation level, READ COMMITTED, is subject to nonrepeatable reads and phantom rows, but not dirty reads. This provides adequate protection and performance in most situations, but definitely not all.

It is important to have a fundamental understanding of these effects because these aren’t just arcane keywords you study only when it is certification time; they can have a profound effect on application performance, stability, and absolutely the most important thing for any RDBMS: data integrity.

For example, consider you are writing software to control the trains that are using track T. Two trains wish to use track T, and both conductors ask if the track is vacant, so both see it is vacant and put their trains on the track heading toward each other. Not good.

Understanding the differing impact of isolation levels on locking and blocking, and therefore on concurrency, is the key to understanding when you should use an isolation level different from the default of READ COMMITTED. Table 14-1 presents the isolation levels available in SQL Server along with the phenomena that are allowed.

Table 14-1 Isolation levels and phenomena that can be incurred

Transaction isolation level       Dirty reads   Nonrepeatable reads   Phantom rows   Reading a previous committed version of data
READ UNCOMMITTED                  X             X                     X
READ COMMITTED                                  X                     X
REPEATABLE READ                                                       X
SERIALIZABLE
READ COMMITTED SNAPSHOT (RCSI)                  X                     X              X
SNAPSHOT                                                                             X

When you are choosing an isolation level for a transaction in an application, you should consider primarily the transactional safety and business requirements of the transaction in a highly concurrent multiuser environment. The performance of the transaction should be a distant second priority (yet still a priority) when choosing an isolation level.

Locking, which SQL Server uses for normal isolation of processes, is not a bad thing; it is the way every transaction in SQL Server cooperates with others when dealing with disk-based tables.

The default isolation level of READ COMMITTED is generally a safe isolation level because it only allows connections to access data that has been committed by other transactions. Dirty reads are the one phenomenon that is almost universally bad. With READ COMMITTED, modifications to a row block reads of that same row from other connections. This is especially important during multi-statement transactions, such as when parent and child rows in a foreign key relationship must be created in the same transaction. In that scenario, reads should not access either row in either table until both changes are committed.

Because the READ COMMITTED isolation level allows nonrepeatable reads and phantom rows, it does not ensure that row data and row counts won't change between two SELECT queries on the same data in a transaction. READ COMMITTED allows SQL Server to release shared locks on objects as soon as it has read them, letting other users access those objects, and holding locks only on resources that it has changed.

For some application scenarios, this might be acceptable or desired, but not for others. To avoid these two problematic scenarios (which we talk more about soon), you need to choose the proper, more stringent isolation level for the transaction.

For scenarios in which transactions must have a higher degree of isolation from other transactions, escalating the isolation level of a transaction is appropriate. For example, a transaction might write multiple rows, even across multiple tables and statements, and cannot allow other transactions to change data it has read during the transaction. Here are two examples.

In the first example, the REPEATABLE READ isolation level blocks other transactions from changing or deleting rows needed during a multistep transaction. Unlike READ COMMITTED, REPEATABLE READ holds locks on the resources it has read, preventing any other connection from changing them until it has completed, thus avoiding nonrepeatable reads.

In the second example, if the transaction needs to ensure that the exact same rows in a result set are returned throughout a multistep transaction, the SERIALIZABLE isolation level is necessary. It is the only isolation level that prevents other transactions from inserting new rows inside a range of rows it has read. It prevents other connections from adding new rows by locking not only the rows it has accessed, but also the ranges of rows it would have accessed had they existed. For example, say you queried for rows LIKE 'A%' in a SERIALIZABLE transaction and got back Apple and Annie. If another user tries to insert Aardvark, that insert is blocked until the LIKE 'A%' transaction completes.
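
The following is a minimal two-session sketch of that range protection, assuming a hypothetical dbo.Fruit table with a Name column; the INSERT in the second session waits until the first transaction commits or rolls back.

-- Session 1: read a range under SERIALIZABLE; key-range locks are taken
SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
BEGIN TRANSACTION;
SELECT Name
FROM   dbo.Fruit
WHERE  Name LIKE 'A%';   -- returns Apple and Annie

-- Session 2: this INSERT is blocked until Session 1 completes
INSERT INTO dbo.Fruit (Name)
VALUES ('Aardvark');

-- Session 1: releasing the range locks unblocks Session 2
COMMIT TRANSACTION;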

Lastly, it is essential to understand that every statement is a transaction. UPDATE TableName SET SomeColumn = 1; operates in a transaction, as does a statement like SELECT 1;. When you do not manually start a transaction, each statement runs and commits on its own; this autocommit behavior is often loosely referred to as an implicit transaction. An explicit transaction is one that you start with BEGIN TRANSACTION and end with COMMIT TRANSACTION or ROLLBACK TRANSACTION. The REPEATABLE READ and SERIALIZABLE isolation levels can gather a lot of locks, more so in explicit transactions of multiple statements if they are not quickly closed. The more locks that are held, the more likely your connection is to end up stuck waiting on another.
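
As a minimal illustration of the difference, using a hypothetical dbo.TableName table and the @@TRANCOUNT function to show whether an explicit transaction is open:

-- Autocommit: this statement starts and commits its own transaction
UPDATE dbo.TableName SET SomeColumn = 1 WHERE SomeKey = 1;
SELECT @@TRANCOUNT;   -- returns 0; no explicit transaction is open

-- Explicit transaction: the statements commit or roll back as a unit
BEGIN TRANSACTION;
    UPDATE dbo.TableName SET SomeColumn = 1 WHERE SomeKey = 1;
    SELECT @@TRANCOUNT;   -- returns 1 while the explicit transaction is open
COMMIT TRANSACTION;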

The most complex of the phenomena concerns reading data that is not the committed version that was initially accessed. There are two main scenarios where this becomes a concern.

  • Reading previous versions of data. Using SNAPSHOT or READ COMMITTED SNAPSHOT (RCSI), your query sees data as it looked at the start of the statement (RCSI) or as of the first time the transaction accesses data in the database (SNAPSHOT). This means that the data you have may not match the data as it currently exists in the database.

    A side effect of this is that in SNAPSHOT isolation level if two transactions try to modify or delete the same row, you will get an update conflict, requiring you to restart the transaction.

  • Reading new versions of data. In any isolation level that allows phantoms and non-repeatable reads, running the same statement twice can return entirely different results. It is incumbent on the programmer to recognize whether this matters. For example, if you try to implement a foreign key-style existence check in the READ COMMITTED isolation level, another transaction could delete the row after you check that it exists; a sketch of this race, and one way to guard against it, follows this list.
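
This is a minimal sketch of that race, assuming hypothetical dbo.Payment and dbo.SalesOrder tables; the UPDLOCK and HOLDLOCK hints on the existence check are one way to keep the checked row stable until the transaction commits.

DECLARE @PaymentId int = 1;

-- Risky pattern in READ COMMITTED: the payment row can be deleted by another
-- transaction between the existence check and the INSERT
BEGIN TRANSACTION;
IF EXISTS (SELECT 1 FROM dbo.Payment WHERE PaymentId = @PaymentId)
    INSERT INTO dbo.SalesOrder (PaymentId) VALUES (@PaymentId);
COMMIT TRANSACTION;

-- Safer variant: hold a lock on the checked row for the duration of the transaction
BEGIN TRANSACTION;
IF EXISTS (SELECT 1 FROM dbo.Payment WITH (UPDLOCK, HOLDLOCK)
           WHERE PaymentId = @PaymentId)
    INSERT INTO dbo.SalesOrder (PaymentId) VALUES (@PaymentId);
COMMIT TRANSACTION;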

All the topics in this introduction will be covered in greater detail in the following sections. Isolation levels are very important and can be difficult to get right. This is mostly because it is very hard to test your code to see what happens when two connections try to make incompatible reads and modifications to data simultaneously.

Understanding how concurrent sessions become blocked

In this section, we review a series of examples of how concurrency works in a multiuser application interacting with SQL Server tables. First, let’s discuss how to diagnose whether a request is being blocked or if it is blocking another request. Note that in these initial examples, we will be assuming that SQL Server has been configured in the default manner for concurrency. We will be adjusting that later in the chapter to give you more ways to tune performance.

What causes blocking?

We have alluded to it already, and the answer is that when you use resources, they are locked. These locks can be on several different levels and types of resources, as seen in Table 14-2.

Table 14-2 Lockable Resources (Not every type of resource)

Type of Lock                  Granularity
Row or row identifier (RID)   A single row in a heap table
Key                           A single value in an index (a clustered table is represented as an index in all physical structures)
Key Range                     A range of key values (for example, to lock rows with values from A through M, even if no rows currently exist); used by the SERIALIZABLE isolation level
Extent                        A contiguous group of eight 8-KB pages
Page                          An 8-KB index or data page
HoBT                          An entire heap or B-tree structure
Object                        An entire table (including all rows and indexes), view, stored procedure, and so on
Application                   A special type of lock that is user defined
Metadata                      Metadata about the schema, such as catalog objects
Database                      An entire database
Allocation unit               A set of related pages that are used as a unit
File                          A data or log file in the database

Locks on a given resource have a mode. Table 14-3 lists the modes that a lock may take. Two of the most important are shared (indicating that a resource is only being read) and exclusive (indicating that a resource should not be accessible to any other connection).

Table 14-3 Lock Modes

Shared: Grants access for reads only. It's generally used when users are looking at but not editing the data. It's called "shared" because multiple processes can hold a shared lock on the same resource at once, allowing read-only access. However, shared locks prevent other processes from modifying the resource.

Exclusive: Gives exclusive access to a resource and is used during modification of data. Only one process may have an active exclusive lock on a resource.

Update: Informs other processes that you're planning to modify the data. Other connections may also issue shared locks, but not update or exclusive locks, while you're still preparing to do the modification. Update locks are used to prevent deadlocks (covered later in this section) by marking rows that a statement will possibly update, rather than upgrading directly from a shared lock to an exclusive one.

Intent: Communicates to other processes that taking one of the previously listed modes might be necessary. It establishes a lock hierarchy with the locks already taken, informing processes that try to take a lock on a higher-level resource (like a table) that other connections hold locks at a lower level, such as a page. You might see this mode as intent shared, intent exclusive, or shared with intent exclusive.

Schema: Locks the structure of a resource while it's in use, so that you cannot alter a structure (like a table) while a user is reading data from it. (Note that schema locks show up as part of the mode in many views.)

As queries perform different operations (reading data, modifying data, or changing objects), resources are locked in a given mode. Blocking occurs when one connection has a resource locked in a certain mode, and another connection needs to lock the same resource in an incompatible mode. Table 14-4 presents the compatibility of the different modes.

Table 14-4 Lock mode compatibility

Mode                            IS    S     U     IX    SIX   X
Intent shared (IS)              X     X     X     X     X
Shared (S)                      X     X     X
Update (U)                      X     X
Intent exclusive (IX)           X                 X
Shared Intent Exclusive (SIX)   X
Exclusive (X)

To read this table, pick a lock mode on one axis; an X appears under each mode on the other axis with which it is compatible. For example, an update lock is compatible with an intent shared lock and a shared lock, but not with another update lock or any of the exclusive variants.

If a connection is reading data, it takes a shared lock, allowing other readers to also take shared locks, which does not cause blocking. However, if another connection is modifying data, it gets an exclusive lock, which prevents the reading connection (and any other connections) from accessing the exclusively locked resources in any manner (other than ignoring the locks, which is discussed later in this section).

How to observe locks and blocking

It's easy to find out live whether a request is being blocked. The dynamic management objects (DMOs) sys.dm_exec_requests and sys.dm_exec_sessions, joined on the session_id column, provide data about blocking and the state of sessions on the server. This provides much more information than the legacy sp_who or sp_who2 commands, as you can see from this query:

--this query will return a plethora of information in addition to just the session that
--is blocked
SELECT r.session_id, r.blocking_session_id, *
FROM sys.dm_exec_sessions s
LEFT OUTER JOIN sys.dm_exec_requests r ON r.session_id = s.session_id;
--note: requests represent actions that are executing, sessions are connections, hence
--LEFT OUTER JOIN

You can see details of what objects are locked by using the sys.dm_tran_locks DMO, or what locks are involved in blocking using this query:

SELECT t1.resource_type,
       t1.resource_database_id,
       t1.resource_associated_entity_id,
       t1.request_mode,
       t1.request_session_id,
       t2.blocking_session_id
FROM sys.dm_tran_locks as t1
INNER JOIN sys.dm_os_waiting_tasks as t2
    ON t1.lock_owner_address = t2.resource_address;

In the output of this query, you can see the type of resource that is locked (the resources listed in Table 14-2) and the request mode (the modes listed in Table 14-3), with a few exceptions. Schema locks combine the type of schema lock with the mode; for example, while a stored procedure is executing, the object is locked with a request mode of Sch-S (schema stability), which is a shared mode indicating the object is in use. This prevents the schema of the procedure from being changed while it executes, but more than one connection can still execute it.

Now, let’s review some example scenarios to detail exactly why and how requests can block one another in the real world when using disk-based tables. This is the foundation of concurrency in SQL Server and helps you understand the reason why the NOLOCK query hint appears to make queries perform faster.

Changing the isolation level

As mentioned, by default, connections use the READ COMMITTED isolation level. If you need to change that for a session, there are two methods: using the SET TRANSACTION ISOLATION LEVEL statement, and using table hints. In this manner, the isolation level can be changed for an entire transaction, one statement, or one object in a statement.

Using the SET TRANSACTION ISOLATION LEVEL statement

You can change the isolation level of a connection at any time, even when already executing in the context of an uncommitted transaction. The exception is that you are not allowed to swap to and from the SNAPSHOT isolation level because, as we will discuss later in the chapter, this isolation level works very differently.

For example, the following code snippet is technically valid; it changes from READ COMMITTED to SERIALIZABLE and back. If one statement in your batch requires the protection of SERIALIZABLE, but the others do not, you can execute:

SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
BEGIN TRAN;
SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
SELECT...;
SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
COMMIT TRAN;

This code snippet is trying to change from READ COMMITTED to the SNAPSHOT isolation level:

SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
BEGIN TRAN;
SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
SELECT...

Attempting this results in the following error:

Msg 3951, Level 16, State 1, Line 4
Transaction failed in database 'databasename' because the statement was run under snapshot
isolation but the transaction did not start in snapshot isolation. You cannot change the
isolation level of the transaction after the transaction has started.

In .NET applications, you should set the isolation level explicitly when each transaction is created, because the default is not necessarily READ COMMITTED (for example, the System.Transactions infrastructure defaults to SERIALIZABLE); choosing an appropriately less restrictive isolation level is far better for performance.
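
To confirm which isolation level a session is actually using, one option is to query sys.dm_exec_sessions; a minimal sketch:

SELECT session_id,
       CASE transaction_isolation_level
            WHEN 0 THEN 'Unspecified'
            WHEN 1 THEN 'READ UNCOMMITTED'
            WHEN 2 THEN 'READ COMMITTED'   -- also reported when RCSI is in effect
            WHEN 3 THEN 'REPEATABLE READ'
            WHEN 4 THEN 'SERIALIZABLE'
            WHEN 5 THEN 'SNAPSHOT'
       END AS isolation_level_name
FROM   sys.dm_exec_sessions
WHERE  session_id = @@SPID;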

Using table hints to change isolation

You also can use isolation level hints to change the isolation level at the individual object level. This is an advanced type of coding that you shouldn't use frequently, because it generally increases the complexity of maintenance and muddies architectural decisions with respect to enterprise concurrency. Just as in the previous section, however, we may wish to hold locks at a SERIALIZABLE level for one table, but not for the others in the query.

For example, you might have seen developers use NOLOCK at the end of a table, effectively (and dangerously) dropping access to that table into the READ UNCOMMITTED isolation level:

SELECT col1 FROM dbo.TableName (NOLOCK);

Note

Aside from the inadvisable use of NOLOCK in the preceding example, using a table hint without WITH is deprecated syntax (since SQL Server 2008). It should be written like this, if you need to ignore locks:

SELECT col1 FROM dbo.TableName WITH (READUNCOMMITTED);

Aside from the generally undesirable use of the NOLOCK query hint, there are 20-plus other table hints that can be useful, including the ability to force a query to use a certain index, to force a seek or scan on an index, or to override the query optimizer's locking strategy. For example, the HOLDLOCK hint applies SERIALIZABLE-style locking to a single table, and we discuss how to use UPDLOCK later in this chapter.
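
As a minimal sketch of per-object hinting, with hypothetical table names, the HOLDLOCK hint holds SERIALIZABLE-style range locks on one table only, while the other table in the query is still read at the session's isolation level; the held locks matter most when the statement runs inside an explicit transaction:

BEGIN TRANSACTION;

SELECT i.InvoiceId, p.PaymentAmount
FROM   dbo.Invoice AS i WITH (HOLDLOCK)   -- SERIALIZABLE semantics for this table only
       INNER JOIN dbo.Payment AS p        -- read at the session's isolation level
           ON p.InvoiceId = i.InvoiceId
WHERE  i.InvoiceId = 1;

-- ...do other work that relies on the Invoice row range not changing...

COMMIT TRANSACTION;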

In almost every case, table hints should be considered for temporary and/or highly situational purposes. They can make maintenance of these queries problematic in the future. For example, using the INDEX or FORCESEEK table hints could result in poor query performance or even cause the query to fail if the table’s indexes are changed.

Understanding and handling common concurrency scenarios

Here we look at some common concurrency scenarios and discuss/demonstrate how SQL Server will process the rows affected by the scenario.

Note

In Chapter 7, the concepts around memory-optimized tables are covered; their concurrency model is very different from that of disk-based tables, though similar to how row-versioned concurrency is implemented, particularly the SNAPSHOT isolation level. Later in this chapter, the differences for memory-optimized tables are discussed.

Understanding concurrency: two requests updating the same rows

Two users attempting to modify the same resource is possibly the most obvious concurrency issue. As an example, one user wants to add $100 to a total, and another wants to add $50. If both processes happen simultaneously without coordination, one of the modifications could be lost (or, if we take it to the absurd level, data corruption could occur in the physical structures holding the data if pointers are mixed up by simultaneous modifications).

Consider the following steps involving two writes, with each transaction coming from a different session. The transactions are explicitly declared using the BEGIN/COMMIT TRANSACTION syntax. In this example, the transactions are not overriding the default isolation level of READ COMMITTED.

All examples will include these two rows simply so the table contains more than a single row, though we will only manipulate the row where Type = 1. When testing more complex concurrency scenarios, it is best to have large quantities of data to work with, as indexing, server resources, and so on come into play. These examples illustrate simple, fundamental concurrency situations.

  1. A table contains only two rows with a column Type containing values of 0 and 1.

    CREATE SCHEMA Demo;
    
    GO
    
    CREATE TABLE Demo.RC_Test (Type int);
    
    INSERT INTO Demo.RC_Test VALUES (0),(1);
  2. Transaction 1 begins and updates all rows from Type = 1 to Type = 2.

    --Transaction 1
    
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    
    BEGIN TRANSACTION;
    
    UPDATE Demo.RC_Test SET Type = 2
    
    WHERE  Type = 1;
  3. Before Transaction 1 commits, Transaction 2 begins and issues a statement to update Type = 2 to Type = 3. Transaction 2 is blocked and will wait for Transaction 1 to commit.

    --Transaction 2
    
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    
    BEGIN TRANSACTION;
    
    UPDATE Demo.RC_Test SET Type = 3
    
    WHERE  Type = 2;
  4. Transaction 1 commits.

    --Transaction 1
    
    COMMIT;
  5. Transaction 2 is no longer blocked and processes its update statement. Transaction 2 then commits.

The resulting table will contain one row of Type = 3 and one of Type = 0, because the second transaction updated the row after the block ended. When Transaction 2 started, it waited for Transaction 1's exclusive lock to be released; once Transaction 1 committed, Transaction 2's WHERE clause matched the newly committed value of Type = 2 and changed it to 3.

Understanding concurrency: a write blocks a read

One of the most painful parts of blocking comes when users writing data block other users from reading that data. What can be even more problematic is that some modification statements may touch and lock rows in the table even if they don't change them (typically due to poorly written WHERE clauses or a lack of indexing causing full table scans).

Next, consider the following steps involving a write and a read, with each transaction coming from a different session. In this scenario, an uncommitted write in transaction 1 blocks a read in Transaction 2. The transactions are explicitly started using the BEGIN/COMMIT TRANSACTION syntax. In this example, the transactions are not overriding the default isolation level of READ COMMITTED:

  1. A table contains only two rows with a column Type value of 0 and 1.

    CREATE TABLE Demo.RC_Test_Write_V_Read (Type int);
    
    INSERT INTO Demo.RC_Test_Write_V_Read VALUES (0),(1);
  2. Transaction 1 begins and updates all rows from Type = 1 to Type = 2.

    --Transaction 1
    
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    
    BEGIN TRANSACTION;
    
    UPDATE Demo.RC_Test_Write_V_Read SET Type = 2
    
    WHERE  Type = 1;
  3. Before Transaction 1 commits, Transaction 2 begins and issues a SELECT statement for rows of Type = 2. Transaction 2 is blocked and waits for Transaction 1 to commit.

    --Transaction 2
    
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
    
    SELECT Type
    
    FROM   Demo.RC_Test_Write_V_Read
    
    WHERE  Type = 2;
  4. Transaction 1 commits.

    --Transaction 1
    
    COMMIT;
  5. Transaction 2 is no longer blocked and processes its SELECT statement.

Transaction 2 returns one row where Type = 2. This is because Transaction 2 waited for committed data, and did not read the row until after Transaction 1 committed.

Understanding concurrency: nonrepeatable reads

There are certain scenarios where you need to have the same row values returned every time you issue a SELECT statement (or read data in any modification DML). A prime example is when you check for the existence of some data before allowing another action to occur, such as inserting an order row only if a payment exists. If that payment is changed or deleted while you are creating the order, free products may be shipped.

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this example, the transactions are not overriding the default isolation level of READ COMMITTED. In this scenario, Transaction 1 will suffer a nonrepeatable read when rows it has read are changed by a different connection, because READ COMMITTED does not offer any protection against phantom or nonrepeatable reads. The transactions are explicitly declared using the BEGIN/COMMIT TRANSACTION syntax.

  1. A table contains only two rows with a column Type value of 0 and 1.

    CREATE TABLE Demo.RR_Test (Type int);
    
    INSERT INTO Demo.RR_Test VALUES (0),(1);
  2. Transaction 1 starts and retrieves rows where Type = 1. One row is returned for Type = 1.

    --Transaction 1
    
    SET TRANSACTION ISOLATION LEVEL READ COMMITTED
    
    BEGIN TRANSACTION
    
    SELECT Type
    
    FROM   Demo.RR_Test
    
    WHERE  Type = 1;
  3. Before Transaction 1 commits, Transaction 2 starts and issues an UPDATE statement, setting rows of Type = 1 to Type = 2. Transaction 2 is not blocked and is immediately processed.

    --Transaction 2
    
    BEGIN TRANSACTION;
    
    UPDATE Demo.RR_Test
    
    SET  Type = 2
    
    WHERE Type = 1;
  4. Transaction 1 again selects rows where Type = 1 and is blocked.

    --Transaction 1
    
    SELECT Type
    
    FROM   Demo.RR_Test
    
    WHERE  Type = 1;
  5. Transaction 2 commits.

    --Transaction 2
    
    COMMIT;
  6. Transaction 1 is immediately unblocked. No rows are returned, which is different than the one row returned earlier, since no committed rows now exist where Type=1. Transaction 1 commits.

    --Transaction 1
    
    COMMIT;

The resulting table now contains a row where Type = 2, because the second transaction has modified the row. When Transaction 2 started, Transaction 1 had not placed any locks on the data, allowing for writes to happen. Because it is doing only reads, Transaction 1 would never have placed any exclusive locks on the data. Transaction 1 suffered from a nonrepeatable read: the same SELECT statement returned different data during the same multistep transaction.

Understanding concurrency: preventing a nonrepeatable read

Consider the following steps involving a read and a write, with each transaction coming from a different session. This time, we protect Transaction 1 from dirty reads and nonrepeatable reads by using the REPEATABLE READ isolation level. A read in the REPEATABLE READ isolation level will block a write. The transactions are explicitly declared by using the BEGIN/COMMIT TRANSACTION syntax:

  1. A table contains only rows with a column Type value of 0 and 1.

    CREATE TABLE Demo.RR_Test_Prevent (Type int);
    
    INSERT INTO Demo.RR_Test_Prevent VALUES (0),(1);
  2. Transaction 1 starts and selects rows where Type = 1 in the REPEATABLE READ isolation level. 1 row with Type = 1 is returned.

    --Transaction 1
    
    SET TRANSACTION ISOLATION LEVEL REPEATABLE READ;
    
    BEGIN TRANSACTION;
    
    SELECT Type
    
    FROM   Demo.RR_Test_Prevent
    
    WHERE  TYPE = 1;
  3. Before Transaction 1 commits, Transaction 2 starts and issues an UPDATE statement, setting rows of Type = 1 to Type = 2. Transaction 2 is blocked by Transaction 1.

    --Transaction 2
    
    BEGIN TRANSACTION;
    
    UPDATE Demo.RR_Test_Prevent
    
    SET  Type = 2
    
    WHERE Type = 1;
  4. Transaction 1 again selects rows where Type = 1. The same rows are returned as in step 2.

  5. Transaction 1 commits.

    --Transaction 1
    
    COMMIT TRANSACTION;
  6. Transaction 2 is immediately unblocked and processes its update. Transaction 2 commits.

    --Transaction 2
    
    COMMIT TRANSACTION;

The resulting table will now contain 2 rows, one where Type = 2 and the original row of Type = 0. This is because when Transaction 2 started, Transaction 1 had placed read locks on the data it was selecting, blocking writes from happening until it had committed. Transaction 1 returned the same rows each time and did not suffer a nonrepeatable read. Transaction 2 processed its update only when it could place exclusive locks on the rows it needed.

Understanding concurrency: experiencing phantom rows

Phantom rows cause issues for transactions when you expect the exact same result back from a query. For example, say you were writing a value to a table that sums up 100 other values (flouting the normalization rules that are fundamental to database design), such as a financial transaction ledger table that calculates the current balance. You sum the 100 rows, then write the value. If it is important that the total of the 100 rows matches perfectly, you cannot allow nonrepeatable reads, nor phantom rows.

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this scenario, we describe a phantom read:

  1. A table contains only two rows, with Type values 0 and 1

    CREATE TABLE Demo.PR_Test (Type int);
    
    INSERT INTO Demo.PR_Test VALUES (0),(1);
  2. Transaction 1 starts and selects rows where Type = 1 in the REPEATABLE READ isolation level. The one row where Type = 1 is returned.

    --Transaction 1
    
    SET TRANSACTION ISOLATION LEVEL REPEATABLE READ;
    
    BEGIN TRANSACTION;
    
    SELECT Type
    
    FROM   Demo.PR_Test
    
    WHERE  Type = 1;
  3. Before Transaction 1 commits, Transaction 2 starts and issues an INSERT statement, adding another row where Type = 1. Transaction 2 is not blocked by Transaction 1.

    --Transaction 2
    
    INSERT INTO Demo.PR_Test(Type)
    
    VALUES(1);
  4. Transaction 1 again selects rows where Type = 1. An additional row is returned compared to the first time the select was run in Transaction 1.

    --Transaction 1
    
    SELECT Type
    
    FROM   Demo.PR_Test
    
    WHERE  Type = 1;
  5. Transaction 1 commits.

    --Transaction 1
    
    COMMIT TRANSACTION;

Transaction 1 experienced a phantom read when it returned a different number of rows the second time it selected from the table inside the same transaction. Transaction 1 had not placed any locks on the range of data it needed, allowing writes in another transaction to happen within the same dataset. The phantom read would have occurred to Transaction 1 in any locking isolation level except SERIALIZABLE (or the row-versioned SNAPSHOT level, covered later). Let's look at SERIALIZABLE next.

It is important to understand that in some scenarios, a phantom row can have the same effect as a nonrepeatable read. For example, consider a query summing the payment amounts for a customer's invoice. In order to ship a product, the invoice needs to be paid in full. An initial read of the data could show the invoice to be paid. But before the order is processed, the same query could indicate the invoice had not been paid in full, because a new row arrived recording a reversal with a negative amount.
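
A minimal sketch of that payment check, assuming a hypothetical dbo.InvoicePayment table and invoice total; SERIALIZABLE locks the range of payment rows being summed, so a reversal row cannot sneak in between the check and the shipment decision:

SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
BEGIN TRANSACTION;

DECLARE @AmountPaid money;

SELECT @AmountPaid = SUM(Amount)
FROM   dbo.InvoicePayment
WHERE  InvoiceId = 1;   -- the range is locked; no phantom reversal rows can appear

IF @AmountPaid >= 100.00   -- hypothetical invoice total
BEGIN
    -- ship the product, record the fulfillment, and so on
    PRINT 'Invoice paid in full; releasing shipment.';
END;

COMMIT TRANSACTION;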

Understanding concurrency: preventing phantom rows

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this scenario, we protect Transaction 1 from a phantom read.

  1. A table contains two rows with Type values 0 and 1

    CREATE TABLE Demo.PR_Test_Prevent (Type int);
    
    INSERT INTO Demo.PR_Test_Prevent VALUES (0),(1);
  2. Transaction 1 starts and selects rows where Type = 1 in the SERIALIZABLE isolation level. The one row where Type = 1 is returned.

    --Transaction 1
    
    SET TRANSACTION ISOLATION LEVEL SERIALIZABLE;
    
    BEGIN TRANSACTION;
    
    SELECT Type
    
    FROM   Demo.PR_Test_Prevent
    
    WHERE  Type = 1;
  3. Before Transaction 1 commits, Transaction 2 starts and issues an INSERT statement, adding a row of Type = 1. Transaction 2 is blocked by Transaction 1.

    --Transaction 2
    
    INSERT INTO Demo.PR_Test_Prevent(Type)
    
    VALUES(1);
  4. Transaction 1 again selects rows where Type = 1. The same result set is returned as it was in step 2, the one row where Type = 1.

  5. Transaction 1 executes COMMIT TRANSACTION.

  6. Transaction 2 is immediately unblocked and processes its insert. Transaction 2 commits. Query the table again and there are now 2 rows where Type = 1.

Transaction 1 did not suffer from a phantom read the second time it selected from the table, because it had placed a lock on the range of rows it needed. The table now contains additional rows for Type = 1, but they were not inserted until after Transaction 1 had committed.

The case against READ UNCOMMITTED isolation level

This also pertains to using the NOLOCK hint on your queries.

If locks take time, ignoring those locks will make things go faster. This is true, but this logic is like saying: if the weight of airbags, seat belts, bumpers, and collapsible steering wheels brings down gas mileage, remove them…what could happen? Mix a little lie with the truth, it is easier to take. It is true that these things have weight, but they may save your life one day.

Locks are similar in nature. Locks coordinate access to resources, allowing multiple users to do multiple things in the database without crushing other users' changes. The READ COMMITTED isolation level (and an extension called READ COMMITTED SNAPSHOT, which we discuss in the section on the SNAPSHOT isolation level) strikes the best balance between locking and performance. Exclusive locks on dirty resources (data the transaction has changed) are still held until the transaction completes, but shared locks taken for reads are held only long enough to read a row and are then released. The basic process for a read is:

  • Grab a lock on a resource

  • Read that resource

  • Release the lock on the resource

  • Repeat until you are out of resources to read

No one can dirty (modify) the row we are reading because of the lock, but when we are done, we move on and release the lock. Locks on modifications to on-disk tables work the same way in all isolation levels, even READ UNCOMMITTED, and are held until the transaction is committed.

The effect of the table hint NOLOCK or the READ UNCOMMITTED isolation level is that no locks are taken inside the database for reads, save for schema stability locks. (A query using NOLOCK could still be blocked by Data Definition Language [DDL] commands, such as an offline indexing operation.) The basic thinking is that you turn on the READ UNCOMMITTED isolation level for your connection and things will go faster.

This is a strategy that many DBAs and programmers have tried before: "We had performance problems, but we've been putting NOLOCK in all our stored procedures to fix it." It will improve performance, but it can easily be detrimental to the integrity of your data.

The biggest issue is that a query may read data that doesn't even meet the constraints of the system. A transaction for $1,000,000 may be seen by a query, and then be rolled back later (perhaps because the payment details don't meet a constraint or trigger requirement); who knows what celebratory alarms may have gone off thinking we have a million dollars in sales today.
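
A minimal two-session sketch of that scenario, assuming a hypothetical dbo.SalesTotal table; the NOLOCK reader sees the $1,000,000 row even though it is later rolled back:

-- Session 1: the modification is ultimately rolled back
BEGIN TRANSACTION;
INSERT INTO dbo.SalesTotal (SaleAmount) VALUES (1000000.00);
-- ...a constraint or trigger failure is discovered later...
ROLLBACK TRANSACTION;

-- Session 2: run between the INSERT and the ROLLBACK, this query
-- counts the uncommitted $1,000,000 row in its total
SELECT SUM(SaleAmount) AS TotalSales
FROM   dbo.SalesTotal WITH (NOLOCK);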

The case against using the READ UNCOMMITTED isolation level is deeper than performance and goes beyond simply reading dirty data. The counterargument a developer might make is that data is rarely ever rolled back, or that the data is for reporting only. In production environments, these arguments are not enough to justify the potential problems.

A query in READ UNCOMMITTED isolation level could return invalid data in the following real-world, provable ways:

  • Read uncommitted data (dirty reads)

  • Read committed data twice

  • Skip committed data

  • Return corrupted data

  • Or the query could fail altogether with the error "Could not continue scan with NOLOCK due to data movement." This happens because, since you ignored locks, the structure that was to be scanned is no longer there when you reach it, such as when a page split occurs as you read the page. The solution to this problem is the solution to a lot of concurrency issues: be prepared to re-execute your batch on this failure.

One final caveat: in SQL Server, you cannot apply NOLOCK to the target table of a modification statement, and SQL Server ignores the READ UNCOMMITTED isolation level for the target of modification statements; for example:

INSERT INTO dbo.testnolock1 WITH (NOLOCK)

SELECT * FROM dbo.testnolock2;

SQL Server knows that it will use locks for the INSERT, and makes it clear by way of the following error being thrown:

Msg 1065, Level 15, State 1, Line 17

The NOLOCK and READUNCOMMITTED lock hints are not allowed for target tables of INSERT,

UPDATE, DELETE or MERGE statements.

This protection doesn't apply to the source of any writes, hence yet another danger. The following code is allowed and is dangerous because it could write invalid data:

INSERT INTO testnolock1

SELECT * FROM testnolock2 WITH (NOLOCK);

The bottom line is to not use the READ UNCOMMITTED isolation level unless you really understand the implications of reading dirty data and have an ironclad reason for doing so. (It is, for example, an invaluable tool for a DBA to be able to see the changes to data being made in another connection: SELECT COUNT(*) FROM dbo.TableName WITH (NOLOCK); can show you the count of rows being inserted into dbo.TableName by an in-flight process.) Using it for performance gains is rather short-sighted. Continue reading for the real way to increase performance without the chance of seeing dirty data, as we introduce version-based concurrency techniques in the next section.

Understanding the enterprise solution to concurrency: row version-based concurrency

In the interest of performance, application developers too often seek to solve concurrency concerns (reduce blocking, limit waiting on locked objects) by trying to avoid the problem with the tantalizingly named NOLOCK. At first, and at scale, the performance gains seem too large to consider other alternatives, and because the problems we have mentioned only happen "occasionally," they can consume 30 hours of meetings, coding, and testing to track down, since they seem random and non-repeatable. There is a far safer option, without the significant drawbacks and potential for invalid data and errors, which allows you to read a previously committed version of data. Using row versioning techniques that give the user a view of a version of the data that was properly committed, you can get tremendous gains without ever letting the user see dirty data.

Version-based concurrency is available in the SNAPSHOT isolation level, or by altering the implementation of READ COMMITTED. (The latter is often referred to as READ COMMITTED SNAPSHOT isolation (RCSI) as a shortcut, even in this book, but it is not a separate isolation level; rather, it is a setting at the database level.)

Row versioning allows queries to read from rows that might be locked by other queries, by storing previous version(s) of a row and reading those when the current row is locked. The SQL Server instance's TempDB keeps a copy of committed data, and this data can be served to concurrent requests. In this way, row versioning allows access only to committed data, without blocking on data locked by writes. At the cost of increased utilization and workload in TempDB for disk-based tables, performance is dramatically improved because concurrency increases without the dangers of accessing uncommitted data.

The SNAPSHOT isolation level works at the transaction level. Once you start a transaction, and access any data in the transaction, such as the third statement in the following snippet:

SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
BEGIN TRANSACTION;
SELECT * FROM dbo.Table1;
--Note: COMMIT or ROLLBACK this transaction if you execute this code, with a real table

Your queries will see a transactionally consistent view of the database. No matter what someone does to the data in dbo.Table1 (or any other table in the same database), you will always see the data as it looked at the start of the first statement that accessed the database in your transaction (in this case, the SELECT statement). This is great for some purposes, such as reporting. SNAPSHOT gives you the same level of consistency for the data you are reading as SERIALIZABLE does, except that other connections' work can continue and data can keep changing. It is not susceptible to nonrepeatable reads or phantom rows; the tradeoff is that you may be viewing expired data.

The SNAPSHOT isolation level can be problematic for certain types of code because if you need to check if a row exists to do some action, you can’t see if the row was created or deleted after you started your transaction context. And as discussed earlier, you can’t switch out of SNAPSHOT temporarily, then apply locks to prevent non-repeatable reads, and go back to seeing a consistent, yet possibly expired view of the database.

While SNAPSHOT isolation level works at the transaction level, READ COMMITTED SNAPSHOT works at the statement level. This means that while data is blocked, you will see the previous version of those rows as you read from the table. But if you execute the same statement again, you may get nonrepeatable reads and phantom rows. So, execute a batch such as this:

SET TRANSACTION ISOLATION LEVEL READ COMMITTED;
BEGIN TRANSACTION;
SELECT * FROM dbo.Table1;
SELECT * FROM dbo.Table1;
SELECT * FROM dbo.Table1;

You may get back 1 row the first time, 1,000,000 the second (including a freshly modified version of the first row), and 0 the third, all depending on the committed rows at the moment each statement started.

Understanding concurrency: accessing data in SNAPSHOT isolation level

The beauty of the SNAPSHOT isolation level is the effect on readers of the data. If you want to go in and query the database, you generally want to see it in a consistent state. And you don’t want to block others. A typical problem comes in when you are writing an operational report, and you query a child table, and you get back 100 rows with distinct parentId values, but querying the parent table indicates there are only 50 parentId values (because between queries, another process deleted the other 50). When using SNAPSHOT isolation for read write processes, it is important to consider the techniques mentioned for optimistic locking in the Inside OUT sidebar in the “Understanding concurrency: two requests updating the same rows” section above.

Consider the following steps involving a read and a write, with each transaction coming from a different session. In this scenario, we see that Transaction 2 has access to previously committed row data, even though those rows are being updated concurrently.

  1. A table contains only rows with a column Type value of 0 and 1.

    CREATE TABLE Demo.SS_Test (Type int);
    
    INSERT INTO Demo.SS_Test VALUES (0),(1);
  2. Transaction 1 starts and updates rows where Type = 1 to Type = 2.

    --Transaction 1
    
    BEGIN TRANSACTION;
    
    UPDATE Demo.SS_Test
    
    SET  Type = 2
    
    WHERE Type = 1;
  3. Before Transaction 1 commits, Transaction 2 sets its session isolation level to SNAPSHOT and executes BEGIN TRANSACTION.

    --Transaction 2
    
    SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
    
    BEGIN TRANSACTION;
  4. Transaction 2 issues a SELECT statement WHERE Type = 1. Transaction 2 is not blocked by Transaction 1. A row where Type = 1 is returned.

    --Transaction 2
    
    SELECT Type
    
    FROM   Demo.SS_Test
    
    WHERE  Type = 1;
  5. Transaction 1 executes a COMMIT TRANSACTION.

  6. Transaction 2 again issues a SELECT statement WHERE Type = 1. The same row from step 4 is returned. Even if all of the table's data is deleted, the results will always be the same for the same query while the SNAPSHOT level transaction is open. Once Transaction 2 is committed or rolled back, queries on that connection will see the changes that have occurred since the transaction started.

Transaction 2 was not blocked when it attempted to query rows that Transaction 1 was updating. It had access to previously committed data, thanks to row versioning.

Implementing row-versioned concurrency

You can implement row versioned isolation levels in a database in two different ways. Turning on SNAPSHOT isolation simply allows for the use of SNAPSHOT isolation and begins the process of row versioning. Alternatively, turning on RCSI changes the default isolation level to READ COMMITTED SNAPSHOT. You can implement both or either. It’s important to understand the differences between these two settings, because they are not the same:

  • READ COMMITTED SNAPSHOT configures optimistic concurrency for reads by overriding the default isolation level of the database. When turned on, all queries will use RCSI unless overridden.

  • SNAPSHOT isolation mode configures optimistic concurrency for reads and writes. You must then specify the SNAPSHOT isolation level for any transaction to use SNAPSHOT isolation level. It is possible to have update conflicts with SNAPSHOT isolation mode that will not occur with READ COMMITTED SNAPSHOT. The concept of an update conflict will be covered in the next section.

The statements to implement SNAPSHOT isolation in the database are simple enough, but they are not without consequence. Even if no transactions or statements use the SNAPSHOT isolation level, behind the scenes, TempDB begins storing row version data for disk-based tables, minimally for the length of the transaction that modifies the row. This way, if a row-versioned transaction starts while rows are being modified, the previous versions are available.

Note

Memory-optimized tables share properties with the SNAPSHOT isolation level but are implemented in an entirely different manner that is based completely on row versioning and does not use TempDB. They are covered in Chapter 15.

Here’s how to allow transactions in a database to start transactions in the SNAPSHOT isolation level:

ALTER DATABASE databasename SET ALLOW_SNAPSHOT_ISOLATION ON;

After executing only the above statement, all transactions will continue to use the default READ COMMITTED isolation level, but you now can specify the use of SNAPSHOT isolation level at the session level or in table hints, as shown in the following example:

SET TRANSACTION ISOLATION LEVEL SNAPSHOT;

Using the SNAPSHOT isolation level on an existing database can be a lot of work, and as we cover in the next section, it changes some very important ways we handle executing queries, since what was once blocking on writes becomes an error message telling you to try again. Alternatively, if you want to apply the "go faster" solution that mostly works with existing code, you can alter the meaning of READ COMMITTED to read row versions instead of waiting for locks to clear.

You can use READ_COMMITTED_SNAPSHOT independently of ALLOW_SNAPSHOT_ISOLATION. Similarly, these settings are not tied to the MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT database setting to promote memory-optimized table access to SNAPSHOT isolation.

Here’s how to turn on RCSI:

ALTER DATABASE databasename SET READ_COMMITTED_SNAPSHOT ON;

Note

Changing the READ_COMMITTED_SNAPSHOT database option on a live database where you have memory-optimized tables set to DURABILITY = SCHEMA_ONLY will empty those tables. You need to move the contents of the table to a more durable table before changing READ_COMMITTED_SNAPSHOT to ON or OFF.

For either of the previous ALTER DATABASE statements to succeed, no other transactions can be open in the database. It might be necessary to close other connections manually or to put the database in SINGLE_USER mode. Either way, we do not recommend that you perform this change during production activity.
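
To confirm how a database is currently configured for row versioning, you can query sys.databases; a minimal sketch:

SELECT name,
       snapshot_isolation_state_desc,    -- ON when ALLOW_SNAPSHOT_ISOLATION is enabled
       is_read_committed_snapshot_on     -- 1 when READ_COMMITTED_SNAPSHOT is enabled
FROM   sys.databases
WHERE  name = DB_NAME();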

It is essential to be aware of and prepared for the increased utilization of TempDB, both in activity and in space requirements. Although you should try to avoid autogrowth events by sizing the TempDB data and log files yourself and monitoring their size, you should also verify that your TempDB file autogrowth settings are appropriate in case things grow larger than expected.

  • Images For more information on file autogrowth settings, see Chapter 8.

Should TempDB exhaust all available space on its volume, SQL Server will be unable to create row versions for transactions and will terminate them with SQL Server error 3958. SQL Server will also issue errors 3967 and 3966 as the oldest row versions are removed from TempDB to make room for new row versions needed by newer transactions.
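
To keep an eye on how much space the version store is consuming in TempDB, one option in recent versions of SQL Server is the sys.dm_tran_version_store_space_usage DMO; a minimal sketch:

SELECT DB_NAME(database_id) AS database_name,
       reserved_page_count,
       reserved_space_kb
FROM   sys.dm_tran_version_store_space_usage
ORDER BY reserved_space_kb DESC;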

Note

Prior to SQL Server 2016, READ COMMITTED SNAPSHOT and SNAPSHOT isolation levels were not supported with columnstore indexes. Beginning with SQL Server 2016, SNAPSHOT isolation and columnstore indexes are fully compatible.

Understanding update operations in SNAPSHOT isolation level

Transactions that read data in SNAPSHOT isolation or RCSI will have access to previously committed data, instead of being blocked, when the data they need is being changed. This is important to understand because it can result in an UPDATE statement experiencing a concurrency error when you start to change data. Update conflicts change how your system behaves; you need to understand them before deciding to implement your code in the SNAPSHOT isolation level.

One of the keys to understanding how modifying data in the SNAPSHOT isolation level behaves is to understand that you can only have one "dirty" version of a physical resource. So, if another connection modifies a row and you only read the row, you see previous versions. But if you change a row that another connection has also modified, your update was based on out-of-date information and will be rolled back.

For example, consider the following steps, with each transaction coming from a different session. In this example, Transaction 2 fails due to a concurrency conflict or “write-write error:”

  1. A table contains multiple rows, each with a unique Type value.

    CREATE TABLE Demo.SS_Update_Test
    
    (Type int CONSTRAINT PKSS_Update_Test PRIMARY KEY,
    
     Value nvarchar(10));
    
    INSERT INTO Demo.SS_Update_Test VALUES (0,'Zero'),(1,'One'),(2,'Two'),(3,'Three');
  2. Transaction 1 begins a transaction in the READ COMMITTED isolation level and performs an update on the row where Type = 1.

    --Transaction 1
    
    BEGIN TRANSACTION ;
    
    UPDATE Demo.SS_Update_Test
    
    SET  Value = 'Won'
    
    WHERE Type = 1;
  3. Transaction 2 sets its session isolation level to SNAPSHOT and issues a statement to update the row where Type = 1. This connection is blocked, waiting for the modification locks to clear.

    --Transaction 2
    
    SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
    
    BEGIN TRANSACTION
    
    UPDATE Demo.SS_Update_Test
    
    SET  Value = 'Wun'
    
    WHERE Type = 1;
  4. Transaction 1 commits, using a COMMIT TRANSACTION statement. The update succeeds.

  5. Transaction 2 then immediately fails with SQL error 3960.

    Msg 3960, Level 16, State 2, Line 8
    
    Snapshot isolation transaction aborted due to update conflict. You cannot use
    snapshot isolation to access table 'dbo.AnyTable' directly or indirectly in database
    'DatabaseName' to update, delete, or insert the row that has been modified or deleted by
    another transaction. Retry the transaction or change the isolation level for the update/
    delete statement.

Note

SNAPSHOT isolation level with disk-based tables in SQL Server is not pure row-versioned concurrency, which is why in the previous example, Transaction 2 was blocked by Transaction 1. Using memory optimized tables, which are based on pure row-versioned concurrency, the transaction would have failed immediately, rather than having been blocked.

Transaction 2 was rolled back. Let’s try to understand why this error occurred, what to do, and how to prevent it.

In SQL Server, writes in SNAPSHOT isolation still take locks on disk-based tables and can be blocked (which is why Transaction 2 waited), but the isolation level does not prevent updates from colliding. It is possible for a statement to fail when committing changes from an UPDATE statement if another transaction has changed the data needed for that update during a transaction in the SNAPSHOT isolation level.

For disk-based tables, the update conflict error will look like the Msg 3960 that we saw a moment ago. For queries on memory-optimized tables, the update conflict error will look like this:

Msg 41302, Level 16, State 110, Line 8

The current transaction attempted to update a record that has been updated since this
transaction started. The transaction was aborted.

What this means for you is that if you decide to use SNAPSHOT isolation level for your modification queries, you must be ready to handle an error that isn’t really an error. It is just a warning to re-execute your statements, after checking to see whether anything has changed since you started your query. This is exactly the same approach as handling deadlock conditions, and it will be the same approach for handling all modification conflicts with memory-optimized tables.
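
As an illustration, the following is a minimal retry sketch, reusing the Demo.SS_Update_Test table from the earlier example; the retry count and the list of retryable error numbers are assumptions you would tune for your own workload.

SET TRANSACTION ISOLATION LEVEL SNAPSHOT;

DECLARE @Retry int = 1;
WHILE @Retry <= 3
BEGIN
    BEGIN TRY
        BEGIN TRANSACTION;

        UPDATE Demo.SS_Update_Test
        SET    Value = 'Wun'
        WHERE  Type = 1;

        COMMIT TRANSACTION;
        BREAK; --success, so stop retrying
    END TRY
    BEGIN CATCH
        IF XACT_STATE() <> 0
            ROLLBACK TRANSACTION;

        --3960 = snapshot update conflict; 41302 = memory-optimized write conflict;
        --1205 = deadlock victim. Anything else is re-raised to the caller.
        IF ERROR_NUMBER() IN (3960, 41302, 1205)
            SET @Retry += 1;
        ELSE
            THROW;
    END CATCH;
END;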

Even though the optimistic concurrency of SNAPSHOT isolation level increases the potential for update conflicts, you can mitigate them by doing the following if you specifically need to avoid update conflicts:

  • Minimize the length of transactions that modify data. While it seems like this would be less of an issue because readers aren’t blocked, longer transactions do increase the likelihood of modification conflicts.

  • When running a modification in SNAPSHOT isolation level, try to avoid using any statements that place update locks on disk-based tables inside multistep explicit transactions.

  • Specifying the UPDLOCK table hint can have utility in preventing update conflict errors for long-running SELECT statements. The UPDLOCK table hint places locks on the rows needed for the multistep transaction to complete. The use of UPDLOCK on SELECT statements with SNAPSHOT isolation level is not a panacea for update conflicts, and it could in fact create them: frequent SELECT statements with UPDLOCK could increase the number of update conflicts with updates. Regardless, your application should handle errors and initiate retries when appropriate (see the sketch following this list).

    If two concurrent statements use UPDLOCK, with one updating and one reading the same data, even in implicit transactions, an update conflict failure is possible if not likely.

  • Consider avoiding writes altogether while in SNAPSHOT isolation mode. Use it only to do reads where you do not plan to write the data in the same transaction you have fetched it in.

Specifying table granularity hints such as ROWLOCK or TABLOCK can prevent update conflicts, although at the cost of concurrency. The second update transaction must be blocked while the first update transaction is running—essentially bypassing SNAPSHOT isolation for the write. If two concurrent statements are both updating the same data in SNAPSHOT isolation level, an update conflict failure is likely for the statement that started second.
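
Returning to the UPDLOCK suggestion in the list above, here is a minimal sketch, again assuming the Demo.SS_Update_Test table from earlier: the UPDLOCK hint on the initial read takes update locks on the rows, so the UPDATE later in the same transaction cannot hit an update conflict on those rows.

SET TRANSACTION ISOLATION LEVEL SNAPSHOT;
BEGIN TRANSACTION;

--The UPDLOCK hint takes update locks on the rows read, blocking other writers
--to those rows until this transaction completes
SELECT Value
FROM   Demo.SS_Update_Test WITH (UPDLOCK)
WHERE  Type = 1;

UPDATE Demo.SS_Update_Test
SET    Value = 'Uno'
WHERE  Type = 1;

COMMIT TRANSACTION;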

Understanding on-disk versus memory-optimized concurrency

Queries using memory-optimized tables (also referred to as in-memory OLTP tables) can perform significantly faster than queries based on the same data in disk-based tables. Memory-optimized tables can improve the performance of frequently written-to tables by up to 40 times over disk-based tables.

However, this almost magical performance improvement comes at a price, not just in the need for extra memory, but also in that the way they implement concurrency controls is rather different from disk-based tables. In the concurrency scenarios previously introduced, all of the concurrency protections provided were based on locking; in other words, waiting until the other connection has completed, and then applying the changes. However, locking applies only to on-disk tables, not memory-optimized tables.

In the case of memory-optimized tables, locking isn’t the mechanism that ensures isolation. Instead, the in-memory engine uses pure row versioning to provide row content to each transaction. In pure row versioning, an UPDATE statement is an insert of a new row, and an update to the effective ending timestamp on the previous row. A DELETE statement only updates the effective ending timestamp on the current row. If you are familiar with the data warehousing concept of a Slowly Changing Dimension (SCD), this is similar to an SCD Type II. It is equally similar to how temporal tables work, though both the current and historical data are in the same physical structure.

  • For more explanation of temporal tables, see Chapter 7.

Previous versions hang around as long as they are needed by transactions and are then cleaned up as they can be (data is also hardened to what is referred to as the delta file for durability purposes, as well as the transaction log.)

If two transactions attempt to modify the same physical data resource at the same time, one transaction will immediately fail due to a concurrency error, rather than being blocked and waiting; only one transaction at a time can be in the process of modifying or removing a given row. The other will fail with a concurrency conflict (SQL error 41302). However, if two transactions insert the same value for the primary key, the failure will not occur until a transaction is committed, because it is not the same physical resource.

This is the key difference between the behavior of pessimistic and optimistic concurrency. Pessimistic concurrency uses locks to prevent write conflict errors, whereas optimistic concurrency uses row versions with acceptable risk of write conflict errors. On-disk tables offer isolation levels that use pessimistic concurrency to block conflicting transactions, forcing them to wait. Memory-optimized tables offer optimistic concurrency that will cause a conflicting transaction to fail.

Memory optimized tables allow you to use SNAPSHOT, REPEATABLE READ, and SERIALIZABLE isolation levels and provide the same types of protections by definition. In the case of a nonrepeatable read, SQL error 41305 will be raised. In the case of a phantom read, a SQL error 41325 will be raised. Because of these errors, applications that write to memory-optimized tables must include logic that gracefully handles and automatically retries transactions. They should already handle and retry in the case of deadlocks or other fatal database errors.

  • For more information on configuring memory-optimized tables, see Chapter 7.

  • We discuss more about indexes for memory-optimized tables in Chapter 15.

Understanding reading memory optimized data in other than SNAPSHOT isolation level

All data that is read by a statement on memory-optimized tables behaves as if in SNAPSHOT isolation level. Once your transaction starts and you access any memory-optimized data in the database, further reads from memory-optimized tables will be from a consistent view of those objects. (Note that memory-optimized and on-disk tables are in different “containers,” so your consistent view of the memory-optimized data doesn’t start if you have read only on-disk tables.)

However, what makes your work more interesting is that when in REPEATABLE READ or SERIALIZABLE isolation levels, the scan for phantom and non-repeatable read rows is done during commit rather than as they occur.

For example, consider the following steps, with each transaction coming from a different session. A typical scenario might be running a report of some data. You read in a set of data, perform some operation on it, and then read another set of rows, and your process requires the data to all stay the same.

  1. A table contains many rows, each with a unique ID. Transaction 1 begins a transaction in the SERIALIZABLE isolation level and reads all the rows in the table.

  2. Transaction 2 updates the row where ID = 1. This transaction commits successfully.

  3. Transaction 1 again reads rows in this same table. No error is raised, and rows are returned as normal.

  4. Transaction 1 commits. An error is raised (41305), alerting you to a non-repeatable read. (Even though this was in SERIALIZABLE isolation level, the check for a non-repeatable read is done first, since this is a reasonably easy operation, whereas the scan for phantom rows requires the engine to run a query on the data to see if extra rows are returned.)

Generally speaking, most use of the isolation levels other than SNAPSHOT with memory-optimized data should be limited to operations like data integrity checks, where you want to make sure that one row exists before you insert the next row.

Specifying isolation level for memory-optimized tables in queries

Isolation level is specified in a few, mildly confusing ways for memory-optimized tables. The first applies to an ad hoc query that is not in an existing transaction context: you can simply query the table as you always have, and it will be accessed in SNAPSHOT isolation level.

If you are in the context of a transaction, it will not automatically default to SNAPSHOT isolation level. Rather you need to specify the isolation level as a hint, such as:

BEGIN TRANSACTION;

SELECT *
FROM   dbo.MemoryOptimizedTable WITH (SNAPSHOT);

You can make it default to SNAPSHOT isolation level by turning on the MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT database option. This promotes access to all memory-optimized tables in the database up to SNAPSHOT isolation level if the current isolation level is not REPEATABLE READ or SERIALIZABLE. It will promote the isolation level to SNAPSHOT from isolation levels such as READ UNCOMMITTED and READ COMMITTED. This option is off by default, but you should consider it because you otherwise cannot use the READ UNCOMMITTED or SNAPSHOT isolation levels for a session including memory-optimized tables.
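
For example, to turn the option on, here is a sketch using the WideWorldImporters sample database name (substitute your own database):

ALTER DATABASE WideWorldImporters
    SET MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT = ON;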

If you need to use REPEATABLE READ or SERIALIZABLE, or your scenario will not meet the criteria for automatically choosing SNAPSHOT isolation level, you can specify the isolation level using table concurrency hints. (See the section “Using table hints to change isolation” earlier in this chapter). Note that only memory-optimized tables can use this SNAPSHOT table hint, not disk-based tables.

Finally, note that you cannot mix disk-based SNAPSHOT isolation level with memory-optimized SNAPSHOT isolation level. For example, you cannot include memory-optimized tables in a session that begins with SET TRANSACTION ISOLATION LEVEL SNAPSHOT, even if MEMORY_OPTIMIZED_ELEVATE_TO_SNAPSHOT = ON or you specify the SNAPSHOT table hint.

  • For more information on configuring memory-optimized tables, see Chapter 7.

  • We discuss more about indexes for memory-optimized tables in Chapter 15.

Understanding durability settings for performance

One of the base requirements for a relational database management system is that data saved is durable, meaning that once you believe it is saved in the database, it is permanent and cannot be lost (short of losing the entire server and your backups). This is a very important requirement of relational databases, but it is certainly a detriment to performance. When data is saved to a table via a DML statement, by default, three steps are required, both for on-disk and in-memory tables. Data is synchronously saved to memory and to the transaction log, and then control is returned to the user. The third step is done asynchronously: the data is saved to a disk file, hardening it and releasing the corresponding transaction log records to be deleted.

For performance’s sake, there are two configurations that we can use to increase performance at the detriment of durability. The first is to use memory-optimized tables, setting the durability setting on the table to SCHEMA_ONLY. This creates a logless memory-based object that will be empty, with only the schema remaining when you restart the server. This is a tool that can be useful in certain scenarios and will provide amazing performance, even hundreds of times faster than on-disk tables. It is not, however, a universally applicable tool to increase application performance because it makes the table completely non-durable.
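
As an illustration, the following sketch creates a non-durable memory-optimized table; the table name and columns are hypothetical, and the database must already be configured with a memory-optimized filegroup (see Chapter 7).

CREATE TABLE dbo.SessionCache
(
    SessionId  int           NOT NULL
        PRIMARY KEY NONCLUSTERED HASH WITH (BUCKET_COUNT = 100000),
    CacheValue nvarchar(100) NOT NULL
)
--SCHEMA_ONLY: rows are never logged or written to disk, so the table is empty after a restart
WITH (MEMORY_OPTIMIZED = ON, DURABILITY = SCHEMA_ONLY);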

Note

The process of writing data to disk and memory is changing as new technologies arrive. In SQL Server 2019, a new feature called Hybrid Buffer Pool can use persistent memory modules (PMEM) to write data to disk and then access it directly from that device instead of copying it into RAM. For more details on Hybrid Buffer Pool, check Chapter 2, in the Hybrid Buffer Pool section, as well as: https://docs.microsoft.com/sql/database-engine/configure-windows/hybrid-buffer-pool.

In this section, we look at a second way you can alter how durability is handled in SQL Server, in a manner that can be useful on very small departmental servers as well as enterprise servers. That tool is delayed durability, which alters the durability of your data slightly, but enough to possibly make a tremendous difference in how long it takes to return control to your server’s clients, when it makes sense to use it.

Delayed durability database options

Delayed durability allows transactions to avoid synchronously committing to disk; instead, they commit synchronously only to memory, while the write to the transaction log on disk happens asynchronously. This opens the possibility of losing data in the event of a server shutdown before the log has been written, so it does have dangers. This engine feature was first introduced in SQL Server 2014 and works basically the same today.

While this sounds concerning, it is good to note that unless your SQL Server instance’s databases are running in a synchronous availability group, data loss is technically possible in some cases anyway (and even then, a chance exists for the databases to drop into asynchronous mode under pressure), so you already face the possibility of losing recently written rows in the event of a sudden server or drive failure in which you lose the log and data files.

Databases in Azure SQL Database also support delayed durability transactions, with the same caveat and expectations for data recovery. Some data loss is possible, and you should only use this feature if you can recreate important transactions on a server crash.

Note

Distributed (DTC) and cross-database transactions are always durable.

At the database level, you can set the DELAYED_DURABILITY option to DISABLED (default), ALLOWED, or FORCED.
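
For example, to allow (but not force) delayed durability transactions in a database, here is a sketch using the WideWorldImporters sample database name:

ALTER DATABASE WideWorldImporters
    SET DELAYED_DURABILITY = ALLOWED;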

ALLOWED permits any explicit transaction to optionally opt in to delayed durability by committing with:

COMMIT TRANSACTION WITH ( DELAYED_DURABILITY = ON );

Additionally, for natively compiled procedures, you can specify DELAYED_DURABILITY in the BEGIN ATOMIC block. Take, for example, this header of a procedure in the WideWorldImporters database:

CREATE PROCEDURE [Website].[RecordColdRoomTemperatures_DD]
@SensorReadings Website.SensorDataList READONLY
WITH NATIVE_COMPILATION, SCHEMABINDING, EXECUTE AS OWNER
AS
BEGIN ATOMIC WITH
(
    TRANSACTION ISOLATION LEVEL = SNAPSHOT,
    LANGUAGE = N'English',
    DELAYED_DURABILITY = ON
)
    BEGIN TRY
     …

The FORCED option means that every transaction, regardless of what the person writing the COMMIT statement wishes, will have asynchronous log writes. This obviously has implications for every user of the database, and you should consider it carefully with existing applications.

The delayed durability options, implemented either at the database level or at the transaction level, are useful in very-high-performance workloads for which the bottleneck to write performance is the transaction log itself. By accepting the possibility that new rows may be written only to memory and lost in the event of a shutdown, you can gain a significant performance increase, especially with write-heavy workloads.

Even if you cannot employ delayed durability in your normal day-to-day operations, it can be a very useful setting when loading a database, particularly for test data. Because log writes are done asynchronously, instead of every transaction waiting for small log writes to complete, log writes are batched together in an efficient manner.

Note

While delayed durability does apply to memory-optimized tables, the DELAYED_DURABILITY database option is not related to the DURABILITY option used when creating memory-optimized tables.

A transaction that changes data under delayed durability will be flushed to the disk as soon as possible: whenever any other durable transaction commits in the same database, or whenever a threshold of delayed durability transactions builds up. You also can force a flush of the transaction log with the system stored procedure sp_flush_log. Otherwise, the transactions are written to a buffer in memory until a log flush event. SQL Server manages the buffer but makes no guarantees as to the amount of time a transaction can remain in the buffer.
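
For example, a maintenance job or an application checkpoint could force any pending delayed durability transactions to be hardened to the log with:

EXEC sys.sp_flush_log;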

It’s important to note that delayed durability is simply about reducing the I/O bottleneck of committing a massive quantity of writes to the transaction log. This has no effect on isolation (locking, blocking) or access to any data in the database that must be read to perform the write. Otherwise, delayed durability transactions follow the same rules as other transactions.

Note

Any SQL Server instance service shutdown, whether it be a planned restart or sudden failure, could result in delayed durability transactions being lost. This also applies to the failover of a failover cluster instance (FCI), availability group, or database mirror. Transaction log backups and log shipping will similarly contain only transactions made durable. You must be aware of this potential when implementing delayed durability.

Understanding how SQL Server executes your query

The internal process to execute a query in SQL Server is very complex. It is, however, something that the engineers at Microsoft are working very hard for you to be able to ignore one day. This day has not yet arrived, but in SQL Server 2019, this future is inching ever closer with new and smarter query optimization features. In the next major section, entitled “Understanding advanced engine features for tuning queries,” we will spend time looking at the kind of strides they are making to help the DBA deal with more important things.

However, as a DBA, it will remain essential to understand how queries are executed for many years to come, as this informs how you write queries, and how you know when SQL Server’s optimizer has made an incorrect choice. The reality is that they will likely never reach the place where every query runs perfectly regardless of how it is written, but the number of cases that need manual intervention decreases over time. What it also means is that instead of a lot of easy tuning issues and several complex ones, you will instead have only complex issues to work on.

Understanding the overall query execution process

A user writes a query, which could be 1 line of code like SELECT * FROM dbo.TableName; or it could be a statement that contains 1,000 lines. This query could be a batch of multiple statements, use temporary objects, other coded objects such as user defined functions, not to mention table variables, and perhaps a cursor thrown in for good measure. After writing a query, the user tries to execute it.

First, the code is parsed and translated into structures in the native language of the query processor. If the query is technically correct SQL syntax (well or poorly written, either way), it will pass this phase. If there are syntax errors, like SLECT * FROM dbo.TableName;, then it fails with a syntax error. If you reference objects that don’t exist, like SELECT * FROM dbr.TableName; (should have been dbo.), that will also fail.

Once the code is parsed and prepared for execution, the query optimizer tries to calculate the best way to run your code, and saves that choice, referred to as the query plan (or the execution plan/explain plan, depending on which tool you are using), so the code can be run again using the same process. Getting the right query plan for different parameters or server load is where the real rocket science comes in for the engineers. That engine work is why a person with no understanding of how computers work can write a query in less than 30 seconds that will access 100 million rows, aggregating the values of one or more columns and filtering out values that they don’t want to see, and get back results in just a few seconds.

There are three kinds of execution plans you will deal with: estimated, actual, and live. The estimated plan is something you can ask for, to show you what the execution plan will probably look like. The actual query plan has more details about what actually occurred, which can vary from the estimated plan for multiple reasons, including SQL Server’s Intelligent Query Processing features, which we will discuss more later in the chapter. There is another version of a plan you can see as well, referred to as the live execution plan, which basically lets you see the query plan that was chosen, with the rows of data flowing through as it is executing.

The execution plan that is created is a detailed blueprint of the query optimizer’s plan for processing any statement. Each time you run a statement, including batches with multiple statements, an execution plan is generated. Query plans share with you the plan of execution for queries, inform you of the steps the Database Engine will take to retrieve data, through the various transformation steps to sort, join, and filter data, and finally return or affect data. All statements you execute will create an execution plan, including Data Manipulation Language (DML) and Data Definition Language (DDL).

The plan will contain the estimated costs and other metadata of each piece that it takes to process a query and finally the DML or DDL operation itself. This data can be invaluable to developers and database administrators for tuning query performance. When you look at the query plan, you will be able to see some of the estimates that were made, compared to the actual values.

Execution plans are placed in the procedure cache, which is stored in active memory, that SQL Server uses when a statement is executed again. The procedure cache can be cleared manually, and it is reset when you restart a server. Plans from the procedure cache are reused for a query when that exact same query text is called again. Queries will reuse the same plan only if every character of the query statement matches, including capitalization, whitespace, line breaks, and text in comments. There is one exception to this rule of query reuse, and that is when SQL Server parameterizes a query or stored procedure statement. Parameterization will allow some values, like literals or variables to be treated as having a different value on each execution, a concept we will see later in the chapter.

Retrieving execution plans in SQL Server Management Studio

Let’s now look in some detail at how you can retrieve and view the different types of execution plans in SQL Server Management Studio.

Displaying the estimated execution plan

You can generate the estimated execution plan quickly and view it graphically from within SQL Server Management Studio by choosing the Display Estimated Execution Plan option in the Query menu, or by pressing Ctrl+L. An estimated execution plan will return for the highlighted region, or for the entire file if no text is selected.

You can also retrieve an estimated execution plan in T-SQL code by running the following statement. It will be presented in an XML format:

SET SHOWPLAN_XML ON;

In SSMS, in Grid mode, the results are displayed as a link as for any XML output, but SSMS knows that this is a plan, so click the link to open the plan graphically in SSMS. You can save the execution plan as a .sqlplan file by right-clicking in the neutral space of the plan window.

You can also configure the estimated text execution plan in code by running one of the following statements, which return the execution plan in one result set or two, respectively:

SET SHOWPLAN_ALL ON;

SET SHOWPLAN_TEXT ON;

The text plan of the query using one of these two statements can be useful if you want to send the plan to someone in an email in a compact manner.

Note

Be aware that when any of the aforementioned SET options are turned on for a connection, SQL Server will not run statements, it only returns estimated execution plans. Remember to turn off the SET SHOWPLAN_ option before you reuse the same session for other queries.

As expected, the estimated execution plan is not guaranteed to match the actual plan used when you run the statement, but it is usually a very reliable approximation you can use to see if a query looks like it will execute well enough. The query optimizer uses the same information for the estimate as it does for the actual plan when you run it.

To display information for individual steps, hover over a step in the execution plan. You can also click an object, and then open the Properties window by pressing F4 or, in the View menu, by clicking Properties Window. After you have a bit of experience with plans, you’ll notice the estimated execution plan is missing some information that the actual plan returns. The missing fields are generally self-explanatory in that they are values you would not have until the query has actually executed; for example, Actual Number Of Rows, Actual Number Of Batches, and Number of Executions.

As an example, consider the following query in the WideWorldImporters database:

SELECT *
FROM Sales.Invoices
     JOIN Sales.Customers
         ON Customers.CustomerId = Invoices.CustomerId
WHERE Invoices.InvoiceID like '1%';

In Figure 14-1, we have part of the plan for this query, which looks pretty simple, but the full query plan takes up a good bit more space, due to a security policy (Application.FilterCustomersBySalesTerritoryRole) that implements row-level security that you wouldn’t even notice without looking at the Query Plan.

Query plan for the previous query in the chapter. To the leftmost side of the query, there is a SELECT operator, followed by the Merge Join (Inner Join) operator, which is fed by a Nested Loops operator, and so on. The details of the query plan are not important here; what matters is that a query plan has operators and data that flows through them from right to left.

Figure 14-1 Example query plan, showing a portion of the plan for the query from the text

Displaying the actual execution plan

You can view the actual execution plan used to execute the query along with the statement’s result set from within SSMS by choosing the Include Actual Execution Plan option in the Query menu, or by pressing Ctrl+M to turn on the setting. After turning on this setting, when you run a statement, you will see an additional tab appear along with the execution results after the results of statements have completed.

Note

Turning on the actual execution plan feature will add extra time to the execution of your query. If you are comparing runs of a query, this could skew the results.

You can also return the actual execution plan as a result set using T-SQL code, returning XML that can be viewed graphically in SSMS, by running the following statement:

SET STATISTICS XML ON;

The actual execution plan is returned as an XML string. In SSMS, in Grid mode, the results display as a link which can be viewed graphically by clicking on the link. Remember to turn off the SET STATISTICS option before you reuse the same session, if you don’t want to get back the actual plan for every query you run on this connection.

You can save both estimated and actual execution plans as a .sqlplan file by right-clicking the neutral space of the plan window.

You might see that the total number of rows processed does not match the total estimated number of rows for that step; rather, it is the product of that step’s estimated number of rows and a preceding step’s estimated number of rows. Back in Figure 14-1, on the Merge Join (Inner Join) operator, you can see that 11111 of 6654 rows were processed. The 6654 was the estimate, and 11111 was the actual number of rows.

Displaying live query statistics

Live Query Statistics is an excellent feature that was introduced with SQL Server 2016 Management Studio. You can generate and display a “live” version of the execution plan by using SQL Server Management Studio. You can access live query statistics on versions of SQL Server starting with SQL Server 2014.

You turn on the Live Execution Statistics option for a connection via the Query menu of SQL Server Management Studio, as demonstrated in Figure 14-2.

Query menu from SQL Server Management Studio, with the Include Live Query Statistics highlighted.

Figure 14-2 The Include Live Query Statistics option

Executing the query we used back in the previous section, you will see something like the following in Figure 14-3.

Live query plan, showing the addition of the row counts for each operator. The image was taken during execution of the query from Figure 14-1, showing that, for example, the Merge Join (Inner Join) operator had processed 10321 rows out of an expected 6126 rows.

Figure 14-3 Example Live Query Statistics

In Figure 14-3, you can see that 10231 of the estimated 6126 rows have been processed by the Merge Join Operator, and 574 rows had been processed by the Nested Loops (Left Semi Join) operator.

The Live Query Statistics window initially displays the execution plan more or less like the estimated plan, but it fills in the details of how the query is being executed as it is processed. If your query runs quickly, you’ll miss the dotted, moving lines and the various progress metrics, including the duration for each step and the overall percentage of completion. The percentage is based on the actual rows processed so far versus the total estimated number of rows for that step.

Live Query Statistics contains more information than the estimated query plan, such as Actual Number Of Rows and Number Of Executions, but less than the actual query plan. Live Query Statistics does not display some data from the actual execution plan, such as Actual Execution Mode, Number Of Rows Read, and Actual Rebinds.

Returning a live execution plan will noticeably slow down the query processing, so be aware that the individual and overall execution durations measured will often be longer than when the query is run without the option to display Live Query Statistics. However, it can be worth it to see where a query is hung up in processing.

If your server is configured correctly to do so, you can see the live query plan of executing queries in action by using Activity Monitor, which is accessed by right-clicking on a server in SSMS. Then you can access the live execution plan by right-clicking any query in the Processes or Active Expensive Queries panes.

To understand the details of setting up your server’s Query Profiling Infrastructure, go to https://docs.microsoft.com/sql/relational-databases/performance/query-profiling-infrastructure. Just note that it is not free to add the ability to capture live query execution statistics, so determine if it is worth it first.

Permissions necessary to view execution plans

Not just anyone can view the execution plans of a query. There are two ways you can view plans, and they require different kinds of permissions.

If you wish to generate and view a query plan, you will first need permissions to execute the query, even to get the estimated plan. Additionally, retrieving the Estimated or Actual execution plan requires the SHOWPLAN permission in each database referenced by the query. The Live Query Statistics feature requires SHOWPLAN in each database, plus the VIEW SERVER STATE permission to see live statistics, so it cannot (and should not) be done by just any user.
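
For example, here is a sketch of granting those permissions; the role and login names are hypothetical placeholders for your own principals.

--In each database referenced by the queries being tuned
GRANT SHOWPLAN TO [DevTeamRole];

--At the server level (run in master), needed for Live Query Statistics
GRANT VIEW SERVER STATE TO [DevTeamLogin];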

It might be appropriate in your environment to grant SHOWPLAN and VIEW SERVER STATE permissions to developers. However, the permission to execute queries against the production database may not be appropriate in your environment. If that is the case, there are alternatives for providing valuable execution plan data to developers without production access:

  • Consider providing database developers with saved execution plan (.sqlplan) files for offline analysis.

  • Consider also configuring the dynamic data masking feature, which may already be appropriate in your environment for hiding sensitive or personally identifying information for users who are not sysadmins on the server. Do not provide UNMASK permission to developers; assign that only to application users.

  • Sometimes, an administrator may need to execute queries on a production server due to differences in environment/hardware, but be cautioned on all of what that means in terms of privacy, etc.

  • Images For more information on dynamic data masking, see Chapter 7.

Providing the ability to see already generated and used execution plans can be done using several tools. SQL Server Extended Events and Profiler have ways to capture execution plans. Activity Monitor, in query reports such as Recent Expensive Queries, will allow you to right-click and see the plan if it is still available in cache. Finally, the DMVs that provide access to queries that have executed (such as sys.dm_exec_cached_plans) or to current requests (sys.dm_exec_requests) have a column named plan_handle that you can pass to the sys.dm_exec_query_plan DMV to retrieve the plan. In order to access plans in this manner, the server principal will need the VIEW SERVER STATE permission or be a member of the sysadmin role.
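
For example, the following sketch returns the text and plan for statements currently in the procedure cache; any filtering by object, database, or use count is left as an assumption for your environment.

SELECT cp.usecounts, cp.objtype,
       st.text AS query_text,
       qp.query_plan
FROM   sys.dm_exec_cached_plans AS cp
CROSS APPLY sys.dm_exec_sql_text(cp.plan_handle) AS st
CROSS APPLY sys.dm_exec_query_plan(cp.plan_handle) AS qp;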

Finally, you can see execution plans in the Query Store, which is covered later in the chapter; for it, you need VIEW DATABASE STATE, or you must be a member of the db_owner fixed database role.

Understanding execution plans

At this point, we have established the basics of what an execution plan is, where to find it, and what permissions you need to see it. After you have a graphical execution plan in front of you, it is important to have a basic understanding of how to read how the statement was processed, and how future queries that use this same plan will operate.

Interpreting graphical execution plans

In the next list, we review some of the most common things to look for as you review execution plans in SSMS.

Note

You can also choose to review execution plans with a well-known third-party tool called Plan Explorer, which is a free download from https://www.sentryone.com/. It provides a number of additional features that are often helpful when working with very large query plans.

First, let’s generate an execution plan. For our purposes, a simple one will do, like the plan for the following query:

SELECT Invoices.InvoiceID
FROM Sales.Invoices
WHERE Invoices.InvoiceID like '1%';

Press Ctrl+L, and you will be presented with the estimated query plan shown in Figure 14-4.

Query plan for the SELECT query from Sales.Invoices with a filter on Invoices.InvoiceID like ‘1%’

Figure 14-4 Simple query plan

To display details for individual steps, position your pointer over a step in the execution plan. A detailed tooltip-style window will appear, much like the one you can see in Figure 14-5. An interesting detail should stand out immediately when you look at the plan: it doesn’t say it is scanning the Sales.Invoices table, but rather an index. This is because an index is set up not only for searching for data; if an index has all the data needed to execute the query, the index “covers” the needs of the query, and the table’s data structures are not touched.

Tool tip window that pops up when you hover over an operator. In this example, it is an Index Scan (NonClustered) that gives details about the operator, including estimated costs.

Figure 14-5 Simple query plan with statistics

You can also click an object, and then open the Properties window by pressing F4 or, in the View menu, clicking Properties Window. You will see the same estimated details in the properties pane. Now, let’s get the Actual Execution Plan, by using Ctrl-M, or one of the other methods discussed earlier. Execute the query, and you will see the actual plan, as seen in Figure 14-6.

Shows the actual query plan for the query. The important detail is that the Index Scan (NonClustered) operator includes a count showing that 11111 of 6346 rows were processed, or 175%.

Figure 14-6 Actual Execution Plan for Example Query

The first thing you should notice is that we have a few more details on our query plan. In particular, that 11111 rows were processed, of 6346. You can see that the 6346 is the estimated number of rows that would be returned by the query. This guess was based on statistics, which do not give perfect answers, and when you are tuning larger queries, very large disparities of such guesses need to be investigated.

Open the properties pane and you’ll notice the estimated and actual values returned for some metrics, including Number of Rows, Executions, IO Statistics, and so on. Look for differences between the estimates that were made and the actual numbers here; they can indicate an inefficient execution plan and the source of a poorly performing query. Your query might be suffering from a poorly chosen plan because of the impact of parameter sniffing or stale, inaccurate index statistics. (We discuss parameter sniffing later in this chapter and index statistics in Chapter 15.)

Notice that some values, like cost information, contain only estimated values, even when you are viewing the actual execution plan. This is because the operator costs aren’t sourced separately; they are generated the same way for both estimated and actual plans and do not change based on statement execution. Furthermore, cost is not composed entirely of duration. You might find that some statements far exceed others in terms of duration, but not in cost.

Start with the upper left operator

The upper-left operator will reflect the basic operation that the statement performed. For example, SELECT, DELETE, UPDATE, INSERT, or any of the DML statements (if your query created an index, you might see CREATE INDEX.) This operator might contain warnings or other items that require your immediate attention. These can show up with a small yellow triangle warning icon, with additional detail when you position your pointer on the operator.

For example, in our example plan, the SELECT operator was the farthest left, and it has a triangle over it. You can see in the ToolTip that it is due to: “Type conversion in expression (CONVERT_IMPLICIT(varchar(12),[WideWorldImporters].[Sales].[Invoices].[InvoiceID],0)) may affect "CardinalityEstimate" in query plan choice, Type conversion in expression (CONVERT_IMPLICIT(varchar(12),[WideWorldImporters].[Sales].[Invoices].[InvoiceID],0)>='1') may affect "SeekPlan" in query plan choice”.

In other words, we used a LIKE on an integer value, so the query plan is warning us that it cannot estimate the number of rows as well as it can if our WHERE clause employed integer comparisons to integers.

Click the upper-left operator, and then press F4 to open the Properties window, or open the Properties window from the View menu in SQL Server Management Studio. In this list are some other things to look for. You’ll see warnings repeated in here, along with additional aggregate information.

Note

Yellow triangles on an operator indicate something that should grab your attention. The alert could tip you off to an implicit conversion—a data type mismatch that could be costly. Investigate any warnings reported before moving on.

Look also for the Optimization Level, which typically says FULL. If the Optimization Level is TRIVIAL, the query optimizer bypassed optimization of the query altogether because it was straightforward enough; for example, if the plan for the query only needed to contain a simple Scan or Seek operation along with the statement’s root operator, like SELECT. If the Optimization Level is not FULL or TRIVIAL, this is something to investigate.

Look next for the presence of a value for Reason For Early Termination, which indicates the query optimizer did not spend as much time as it could have picking the best plan.

  • It may have determined that the plan it had picked was good enough to not need to keep optimizing, so a value of “Good Enough Plan Found” is returned.

  • If the reason is “Time Out,” the optimizer tried as many times as it could to find the best plan before taking the best plan available, which might not be good enough. If you see this case, consider simplifying the query, especially reducing the use of functions, and by potentially modifying the underlying indexes.

  • If the reason is “Memory Limit Exceeded,” this is a rare and critical error indicating severe memory pressure on the SQL Server instance.

Next, look right, then read from right to left

Graphical execution plans build from sources (rightmost objects), and apply operators to join, sort, and filter data from right to left, eventually arriving at the leftmost operator. In the rightmost objects, you’ll see Scans, Seeks, and Lookups of different types. You might find some quick, straightforward insight into how the query is using indexes.

Each of the items in the query plan is referred to as an operator. Each operator is a module of code that performs a certain task to process data. Reading the query plan, the rightmost operators acquire data, possibly filter it, and then pass it off to the left until it reaches the single leftmost operator that represents the action of the query. Two of the main types of operators for fetching data from a table or index are seeks and scans.

A seek operator is used to find a portion of a set of data, through the index structure. For example, if you wanted to find Chapter 7 in this book, you would go to the table of contents, find the page number, and go to that page. They are generally the most efficient operators to see and can rarely be improved by additional indexes. The seek operation will find the leaf page in the index, which contains the keys of the index, plus any included column data (which in the case of a clustered table, will be all the data for the row).

Once you have the leaf data, if it contains everything you need, the operation was covered by the index. However, when you need more data than the index contains, you will see a lookup operator joined to the seek operator using a join operator. This means that although the optimizer used a seek, it needed a second pass at the table in the form of a lookup on another object, typically the clustered index, using a second seek operator. Key lookups (on clustered indexes) and RID lookups (on heaps) are expensive and inefficient, particularly when a large number of rows are being accessed. These lookups can tally up to a very large percentage of the cost of a query.

If key lookup operators are needed frequently, they can usually be eliminated from the execution plan by modifying an existing nonclustered index to include the columns that needed to be looked up.

  • For an example, see the section “Designing nonclustered indexes” in Chapter 15.
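
To sketch that technique: if a plan showed an Index Seek on a SupplierID index of Purchasing.PurchaseOrders followed by a Key Lookup to retrieve OrderDate, an index like the following (the index name and the scenario are hypothetical) would let the seek cover the query and remove the lookup.

CREATE NONCLUSTERED INDEX IX_PurchaseOrders_SupplierID_Covering
    ON Purchasing.PurchaseOrders (SupplierID)
    INCLUDE (OrderDate);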

The other typical data source operator is a scan. Scan operations aren’t great unless your query is intentionally returning a large number of rows from a table or index. Scans read all rows from the table or index, which can be very inefficient when you need to return only a few rows, but efficient when you need a large number of rows. Without a nonclustered index with a well-designed key (if one can be found) to enable a seek for the query, a scan might be the query optimizer’s only option. Scans can be ordered if the source data is sorted, which is useful for some joins and aggregation operations.

Scans on nonclustered indexes are often better than scans of clustered indexes, in part due to what is likely a smaller leaf page size; however, if the scan must be paired with key lookups, it is generally worse, because the optimizer will likely expect to return many rows from the object. If you are expecting to return a low number of rows, test and compare the performance of a new or updated nonclustered index, created based on the predicates and outputs of Index Scans and Clustered Index Scans.

Note

Very few queries are important enough to deserve their own indexes. Think “big picture” when creating indexes. If you try to cover every query’s needs with an index, you may discover more and more issues when creating and modifying data. More than one query should benefit from any nonclustered index you create. Avoid redundant or overlapping nonclustered indexes. See Chapter 15 for more information on creating nonclustered indexes, including “missing” indexes.

Other types of scans include the following:

  • Table Scans. These indicate that the table has no clustered index. We discuss why this is probably not a good idea in Chapter 15.

  • Index Scans. These scan the rows of an index, including the included columns of the index, for values.

  • Remote Scans. This includes any object that is preceded by “remote,” which is the same operation but over a linked server connection. Troubleshoot them the same way, but potentially by making changes to the remote server instead. For more details on PolyBase, see Chapter 20, “Leveraging Big Data and Machine Learning” as an alternative to linked server connections that may be superior in many usages.

  • Constant Scans. These appear when the query optimizer deals with scalar values, repeated numbers, and other “constants.” These are necessary operators for certain tasks and generally not actionable from a performance standpoint.

  • Columnstore Index Scans. These are incredibly efficient operators when you are working with lots of rows, but relatively few columns, and likely will outperform a Clustered Index Scan or Index Seek where millions of rows, for example, must be aggregated. No need to create a nonclustered index to replace this operator, unless your query is searching for a few rows.

Note

Since SQL Server 2016, columnstore indexes are a viable option for read-write tables in a transactional system. In previous versions of SQL Server, nonclustered columnstore indexes did not allow writes to the table, and so couldn’t easily be adopted in transactional databases. If you aren’t using them already to optimize large row count queries, consider adding them to your toolbelt.

Furthermore, since SQL Server 2016 SP1, columnstore indexes are even available to all edition licenses of SQL Server, even Express edition, though editions below Enterprise edition have limits to the amount of columnstore cache in memory. For more information, see Chapter 15, as well as https://docs.microsoft.com/sql/sql-server/editions-and-components-of-sql-server-version-15.

The weight of the lines connecting operators tells part of the story, but isn’t the full story

SQL Server dynamically changes the thickness of the gray lines to reflect the actual number of rows. You can get a visual idea of where the bulk of data is coming from by observing the pipes, drawing your attention to the places where performance tuning could have the biggest impact. If you hover over the line in your query plan, you can see the rows transmitted in each step, as you can see in Figure 14-7.

Figure shows the weight of the line between the SELECT operator and the Index Scan (NonClustered) operator, along with the effect of hovering over the line between operators to see number of rows read, including actual and estimated.

Figure 14-7 Showing number of rows read and estimated in query plan.

The visual weight and the sheer number of rows does not, however, directly translate to cost. Look for where the pipe weight changes from light to heavy, or vice versa. Be aware of when thick pipes are joined or sorted.

Operator cost share isn’t the full story either

When you run multiple queries, the cost of each query relative to the batch is displayed in the query execution plan header, and within each plan, each operator’s cost relative to the rest of the operators in the statement is displayed. SQL Server uses a cost-based process to decide which query plan to use.

When optimizing a query, it is usually going to be useful to start with the costliest operators. But deciding to address only the highest-cost single operator in the execution plan might be a dead end, as there are definitely examples where this will not be the case.

In Figure 14-8, we can see an example of when operator cost might not align with the amount of data. You should investigate performance tuning this execution plan using all the information provided.

In this snippet from an execution plan, much of the cost is associated with the top operator, which has 45% of the cost of the entire query, but more rows are moving on from the bottom operator, which is only 1% of the cost of the query.

Figure 14-8 Plan snippet showing cost and relative data amounts

Look for Join operators and understand the different algorithms

As you read from right to left, in a query of any complexity, you’ll likely see the paths meet at a join operator. Two tables, indexes, or the output from another operator can be joined together. There are three types of join operators to be aware of, particularly as this is where you will often find a large portion of the cost of an execution plan.

The types are hash match, merge join, and nested loops, and each can be the fastest way to join two sets of data, depending on the size of the sets, whether they are indexed, and whether they are sorted already or it won’t be too costly to sort them with a sort operator. (There is also an adaptive join operator, introduced in SQL Server 2017, which allows the optimizer to situationally choose between hash match and nested loops. This will be mentioned again in the “Intelligent Query Processing” section later in the chapter.)

A merge join operator takes two large sorted sets of data and merges them together. The query processor can scan each set in order, matching rows from each input with a single pass through the sets. This can be quite fast, but the requirement for ordered sets is where the cost comes in. If you are joining two sets that are indexed on the join column, they are already sorted, so two ordered scans can be merged. The optimizer can also choose to sort one or both inputs to the merge join, but this is often costly.

Hash match is the join operator used to join two large sets to each other, generally when there is no index to use easily and sorting would be too costly. As such, it has the most overhead: a temporary “index” is created, based on an internal hashing algorithm that bucketizes values, to make it easier to match one row to another. This hash structure is kept in memory if possible but may spill onto disk. The presence of a hash join is not necessarily a bad thing, but know that it is the least efficient algorithm to join two data sets together and is chosen precisely because the inputs are not suited to the other two operators.

Note

Understand that when the optimizer does something that seems weird, like sorting sets of data for an internal operation such as a join, it is usually doing so because it has calculated that, for the design of your database, this is the best way to do it.

As we discussed earlier, sometimes complex queries take too long to find the absolute best way to process your query, and sometimes the plans are just horribly wrong, like where statistics are not up to date, for example. This is one reason to watch for plans that have full optimization.

  • Later in the section named “Analyzing cached execution plans” we will look at a technique to analyze cached plans for different scenarios like partial optimization.

Finally, the most efficient join algorithm is the least optimized-sounding one. The nested loops join algorithm is the basic row-by-row processing that people preach you should never do as a programmer. It takes one row from one input and searches the other input for values that meet the join criteria. When you are joining two sets together, and one is indexed on the join key and doesn’t need to fetch additional data using a key lookup, nested loops are very fast. The additional operators were implemented to support larger, ideally reporting-style workloads.

Each of the following could reduce the cost of a join operator.

  • There may be an opportunity to improve the indexing on the columns being joined, or perhaps, you have a join on a compound key that is incompletely defined.

  • In the case of a merge join, you may see a preceding sort operator. This can be an opportunity to present the data already sorted according to how the Merge Join requires the data to be sorted. If this is a composite key, perhaps changing the ASC/DESC property or the order of an index key column in a composite index could remove the sort operator. As usual, test to make sure that the original sort order isn’t used in other places as well.

  • Make sure that you are filtering at the lowest level possible. Perhaps a WHERE clause could exist in a subquery instead of at the top level of the query, or in the definition of a derived table or common table expression (CTE) instead of in the subsequent query.

  • Hash operators are the most expensive. Reducing the row counts going into a hash match or hash join could allow the query optimizer to use a less memory-intensive and less costly join operator.

  • Nested loops are often necessitated by key lookups and are sometimes quite costly. They may no longer be necessary if a new nonclustered index is added to address the Key Lookup or to make an accompanying Index Seek more capable.

Look for Parallel icons

The left-pointing pair of arrows in a yellow circle shown in Figure 14-9 indicate that your query has been run with a parallel-processing execution plan. We talk more about Parallelism later in this chapter, but the important thing here is to be aware that your query has gone parallel.

This figure shows a Clustered Index Scan that has a cost of 18% of a larger query. On the image for the operator are two arrows pointing left, showing that it is a parallel plan.

Figure 14-9 The parallel indicator on a Clustered Index Scan operator.

This doesn’t mean that multiple sources or pipes are being read in parallel; rather, the work for individual tasks has been broken up behind the scenes. The query optimizer decided it would be faster if your workload were split up and run as multiple parallel streams of rows.

You might see one of the three different parallelism operators—the distribute streams, gather streams, and repartition streams operators—each of which appear only for parallel execution plans.

Cardinality estimation

When a query plan is being generated, one of the most important factors you will deal with is the cardinality estimation. Cardinality is defined as the number of items in a set (hence, a cardinal number is a non-negative integer). The importance of cardinality estimation cannot be overstated and is analogous to how you might do a task. If you run a store and ship 3 products a week, you may just have a stack of products, and walk 2 miles to the post office and be efficient enough. If you have to ship 300,000 products a day, the net effect of each product being shipped needs to be the same, but the way you will achieve this must be far more optimized and include more than just one person.

SQL Server makes the same kinds of choices. Say you join table X with table Y on column Id. If X has 1,000 rows, and Y has 10, the solution is easy. If they each have a billion rows, and you are looking for a specific value in table X, say Value = 'Test', there are many choices to make. How many rows in X have a value of 'Test'? Then, once you know that, how many of the Id values in those rows of X will match Id values in Y?

This estimation is done in two ways. The first is with guesses based on histograms of the data. The table is scanned when creating statistics (for example, by executing UPDATE STATISTICS, or by the automatic statistics gathering that happens as data changes, based on the database setting AUTO_UPDATE_STATISTICS). Take the first case, where the tables are small. The engine stores a histogram that contains something like the following:

RANGE_HI_KEY    RANGE_ROWS    EQ_ROWS    DISTINCT_RANGE_ROWS    AVERAGE_RANGE_ROWS…
Smith           400           200        20                     10
Tests           200           120        23                     5

From this, the number of rows that are equal to 'Test' can be guessed to be < 400 (since Test is between Smith and Tests), less than 200, since approximately 200 rows matched Smith, and there are approximately 20 distinct values, so it might be guessed that there are 10 matches. Exactly how this estimation is done is proprietary, but it is definitely important to keep your statistics up to date using maintenance methods (see Chapter 8 for more details on proper upkeep of SQL Server).
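
You can view the histogram SQL Server has stored for a statistics object with DBCC SHOW_STATISTICS; for example (the table and statistics names here are only examples, so substitute your own):

DBCC SHOW_STATISTICS ('Sales.Invoices', 'FK_Sales_Invoices_CustomerID')
    WITH HISTOGRAM;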

For many versions of SQL Server, much the same cardinality estimation algorithms were used. However, starting in SQL Server 2014 and continuing in SQL Server 2019, Microsoft has been tweaking the cardinality estimation significantly with pretty much every release. Example changes in recent versions of SQL Server are that the cardinality estimator understands that the number of rows may have increased since the samples were taken, or that certain column values may be related (values commonly searched together, like automobile maker and model, go together). While SQL Server 2014 is now 3 releases and 5 years old, there are still very large portions of the user base on earlier versions (at this year’s PASS Summit, a big focus from Microsoft was to get people to upgrade from SQL Server 2008 and 2008 R2, which are no longer supported).

The problem with cardinality estimation is that it is an inexact science. The guesses made are frequently imperfect enough to hurt performance, sometimes due to the changes made from version to version, particularly with very large, very complex queries. There are a few methods to control which cardinality estimator is used. The first is the database compatibility level; for example, the following uses the SQL Server 2016 cardinality estimator (and limits everything else to SQL Server 2016 rules as well):

ALTER DATABASE database_name
    SET COMPATIBILITY_LEVEL = 130;

To use the legacy cardinality estimator (the model that dates back to SQL Server 7.0) while the database remains at compatibility level 120 (SQL Server 2014) or higher, you can use the following database-scoped configuration:

ALTER DATABASE SCOPED CONFIGURATION 
    SET LEGACY_CARDINALITY_ESTIMATION = ON;

In compatibility level 130 and higher (SQL Server 2016 SP1), there is a query hint you can use:

SELECT …
FROM …
OPTION (USE HINT ('FORCE_LEGACY_CARDINALITY_ESTIMATION'));

Alternatively, you can use the compatibility level to use the cardinality estimator of that previous version in all your queries. For the most part, we suggest using the latest cardinality estimator possible, and addressing issues in your queries if possible, but realize this is not always a feasible solution.

Understanding parameterization and “parameter sniffing”

SQL Server parameterization occurs when the query optimizer detects values (such as the search criteria of a WHERE clause) that can be parameterized. For example, the statements in a stored procedure are parameterized based on the stored procedure's parameter declaration. A query can be parameterized when it meets certain conditions.

For example, a query may contain a literal value that could vary between executions, such as:

SELECT Value FROM TableName WHERE Value = 'X';

A simple query such as this can be parameterized and the literal 'X' replaced by a parameter, much like if you were writing a stored procedure. By default, a query where you reference more than one table will not be parameterized, but you can change the database setting of PARAMETERIZATION to FORCED and more complex queries will be parameterized. Finally, you can get a parameterized plan by using a variable:

DECLARE @Value varchar(10) = 'X';
SELECT Value FROM TableName WHERE Value = @Value;

This query will be parameterized no matter how complex it is. There are additional ways that a query will be parameterized through client APIs, but the best way to parameterize your queries is with stored procedures.

With parameterization, it’s possible that two potentially helpful or potentially problematic conditions can occur:

  • You can reuse a query plan for multiple queries for which the query text is exactly the same, except for parameterized values.

  • The same query could use the same execution plan for two different values of a parameter, resulting in vastly different performance.

For example, you might create the following stored procedure to fetch the orders that have been placed for goods from a certain supplier:

CREATE OR ALTER PROCEDURE Purchasing.PurchaseOrders_BySupplierId
      @SupplierId int
AS
SELECT PurchaseOrders.PurchaseOrderID,
       PurchaseOrders.SupplierID,
       PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId;

Note

The CREATE OR ALTER statement is a relatively recent improvement that either creates the object if it doesn’t exist, or alters it if it does. For more detail, see: https://blogs.msdn.microsoft.com/sqlserverstorageengine/2016/11/17/create-or-alter-another-great-language-enhancement-in-sql-server-2016-sp1

Now, the plan for this procedure will depend on which value is passed in on the first compilation. For example, if the larger rowcount query (@SupplierID = 5) runs first and has its query plan cached, the plan will scan the clustered index of the table, because the value of 5 has a relatively high cardinality in the table. If the smaller rowcount query (@SupplierID = 1) runs first, its version of the plan, which uses an index seek and a key lookup, will be cached instead. That plan is different, less efficient for very large row counts, and will be used for all subsequent executions of the parameterized statement.
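
To see this behavior for yourself, you can execute the procedure with the two parameter values in different orders, clearing the plan from cache between tests. The sequence below is only a sketch of the experiment; the clear-cache command is covered later in this chapter and should be run only on a preproduction system.

-- Run the low-cardinality value first; the seek + key lookup plan is cached and reused
EXEC Purchasing.PurchaseOrders_BySupplierId @SupplierId = 1;
EXEC Purchasing.PurchaseOrders_BySupplierId @SupplierId = 5;

-- Clear this database's plan cache, then reverse the order to cache the scan plan instead
ALTER DATABASE SCOPED CONFIGURATION CLEAR PROCEDURE_CACHE;
EXEC Purchasing.PurchaseOrders_BySupplierId @SupplierId = 5;
EXEC Purchasing.PurchaseOrders_BySupplierId @SupplierId = 1;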

Here are a few advanced troubleshooting avenues to alleviate this scenario:

  • You can use the OPTIMIZE FOR query hint to instruct the query optimizer to build the cached execution plan as though a specific provided value had been passed for the parameter (see the sketch after this list). You also can use OPTIMIZE FOR UNKNOWN, which instructs the optimizer to use the average selectivity from the statistics of the underlying data rather than any specific value.

  • The RECOMPILE query hint or procedure option prevents the reuse of a cached plan, forcing a fresh query plan to be generated each time the query is run.

  • You can use the Plan Guide feature (implemented via stored procedures) to guide the query analyzer to a plan currently in cache. You identify the plan via its plan_handle. For information on identifying and analyzing plans in sys.dm_exec_cached_plans, see the upcoming section.

  • You can use the Query Store feature (implemented with a GUI in SQL Server Management Studio and via stored procedures behind the scenes) to visually review plan performance and force a query to use a specific plan that Query Store has captured.

  • Use the USE PLAN query hint to provide the entire XML query plan for any statement execution. This obviously is the least convenient option, and like other approaches that override the query analyzer, you should consider it an advanced and temporary performance tuning technique.

  • Turn off parameter sniffing at the database level using: ALTER DATABASE SCOPED CONFIGURATION SET PARAMETER_SNIFFING = OFF;. This will cause plans to all act like OPTIMIZE FOR UNKNOWN.
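
As a sketch of the first two options above, the hints below could be appended to the SELECT statement inside the earlier procedure; treat the parameter value as an example only.

-- Compile the plan as though the low-cardinality value 1 had been passed
SELECT PurchaseOrders.PurchaseOrderID, PurchaseOrders.SupplierID, PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId
OPTION (OPTIMIZE FOR (@SupplierId = 1));

-- Or compile a fresh plan on every execution, never reusing a cached one
SELECT PurchaseOrders.PurchaseOrderID, PurchaseOrders.SupplierID, PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId
OPTION (RECOMPILE);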

Understanding the Procedure Cache

The Procedure Cache is a portion of memory that contains query plans for statements that have been executed. New execution plans enter the Procedure Cache only when a statement is run. If the procedure cache already contains a plan matching a previous run of the current statement, the execution plan is reused, saving valuable time and resources.

This is one reason that complex statements can appear to run faster the second time they are run, other than the fact that data may be cached on a second execution.

The Procedure Cache is empty when the SQL Server service starts and grows from there. SQL Server manages plans in the cache, removing them as necessary under memory pressure. The size of the Procedure Cache is managed by SQL Server and falls within the memory space configured for the server by the Max Server Memory configuration setting. Plans are removed based on their cost and how recently they have been used. Smaller, older plans and single-use plans are the first to be cleared.

Clearing the Procedure Cache

You might find that manually clearing the Procedure Cache is useful when performance testing or troubleshooting. Typically, you want to reserve this activity for preproduction systems.

There are a few common reasons to clear out cached plans in SQL Server. For example, one common reason is to compare two versions of a query or the performance of a query with different indexes—you can clear the cached plan for the statement to allow for proper comparison.

Note

While this can be a good thing to try, what you are testing is not only your query, but also your hardware’s ability to fetch data from the disk. When you look at the output of SET STATISTICS IO ON, the Logical Reads measurement gives you an accurate comparison of the two query versions’ reads of data from memory. The presence of Physical Reads tells you that data the query needed was not in cache. Higher numbers of physical reads indicate that the server may not have enough RAM to hold everything the workload needs.

You can manually flush the entire Procedure Cache, or individual plans in cache, with the following database-scoped configuration command, which affects only the current database context, as opposed to the entire instance’s procedure cache:

ALTER DATABASE SCOPED CONFIGURATION CLEAR PROCEDURE_CACHE;

This command was introduced in SQL Server 2016 and is effectively the same as running DBCC FREEPROCCACHE within the current database context. It works in both SQL Server and Azure SQL Database, whereas DBCC FREEPROCCACHE is not supported in Azure SQL Database; consider moving away from DBCC FREEPROCCACHE going forward in favor of the ALTER DATABASE SCOPED CONFIGURATION syntax.

Caution

Avoid clearing the Procedure Cache in a live production environment during normal business hours. Doing so will cause all new statements to have their execution plans compiled, dramatically increasing processor utilization and potentially dramatically slowing performance.

You can also remove a single plan from cache by identifying its plan_handle and then providing it as the parameter to the ALTER DATABASE SCOPED CONFIGURATION statement. Perhaps this is a plan you would like to remove for testing or troubleshooting purposes, identified with the script in the upcoming section:

ALTER DATABASE SCOPED CONFIGURATION CLEAR PROCEDURE_CACHE 0x06000700CA920912307B86
7DB701000001000000000000000000000000000000000000000000000000000000;

You can alternatively flush the cache by object type. This command clears cached execution plans that are the result of ad hoc statements and prepared statements (from applications, using sp_prepare, typically through an API):

DBCC FREESYSTEMCACHE ('SQL Plans');

The advantage of this statement is that it does not wipe the cached plans from “Programmability” database objects such as stored procedures, multi-statement table-valued functions, scalar user-defined functions, and triggers. The following command clears the cached plans from those types of objects:

DBCC FREESYSTEMCACHE ('Object Plans');

Note that DBCC FREESYSTEMCACHE is not supported in Azure SQL Database.

You can also use DBCC FREESYSTEMCACHE to clear cached plans associated with a specific Resource Governor resource pool, as follows:

DBCC FREESYSTEMCACHE ('SQL Plans', 'poolname');
Analyzing cached execution plans

You can analyze execution plans in aggregate starting with the dynamic management view sys.dm_exec_cached_plans, which contains a column named plan_handle. The plan_handle column contains a system-generated varbinary(64) string that can be used with a number of other dynamic management views. As seen in the code example that follows, you can use the plan_handle to gather information about aggregate plan usage, plan statement text, and to retrieve the graphical execution plan itself.

You might be used to viewing the graphical execution plan only after a statement is run in SSMS, but you can also analyze and retrieve plans for queries executed in the past by using the following query against a handful of dynamic management objects (DMOs). These DMOs return data for all databases in SQL Server instances, and for the current database in Azure SQL Database. The query can be used to analyze different aspects of cached execution plans. Note that it may take a considerable amount of time to run as written, so you may wish to pare down what is being output for your normal usage.

SELECT
    p.usecounts AS UseCount,
    p.size_in_bytes / 1024 AS PlanSize_KB,
    qs.total_worker_time/1000 AS CPU_ms,
    qs.total_elapsed_time/1000 AS Duration_ms,
    p.cacheobjtype + ' (' + p.objtype + ')' as ObjectType,
    db_name(convert(int, txt.dbid )) as DatabaseName,
    txt.ObjectID,
    qs.total_physical_reads,
    qs.total_logical_writes,
    qs.total_logical_reads,
    qs.last_execution_time,
    qs.statement_start_offset as StatementStartInObject,
    SUBSTRING (txt.[text], qs.statement_start_offset/2 + 1,
      CASE WHEN qs.statement_end_offset = -1
           THEN LEN (CONVERT(nvarchar(max), txt.[text]))
           ELSE qs.statement_end_offset/2 - qs.statement_start_offset/2 + 1 END)
      AS StatementText,
    qp.query_plan as QueryPlan,
    aqp.query_plan as ActualQueryPlan
FROM sys.dm_exec_query_stats AS qs
INNER JOIN sys.dm_exec_cached_plans p ON p.plan_handle = qs.plan_handle
OUTER APPLY sys.dm_exec_sql_text (p.plan_handle) AS txt
OUTER APPLY sys.dm_exec_query_plan (p.plan_handle) AS qp
OUTER APPLY sys.dm_exec_query_plan_stats (p.plan_handle) AS aqp
--tqp is used for filtering on the text version of the query plan
CROSS APPLY sys.dm_exec_text_query_plan(p.plan_handle, qs.statement_start_offset,
    qs.statement_end_offset) AS tqp
WHERE txt.dbid = db_id()
ORDER BY qs.total_worker_time + qs.total_elapsed_time DESC;

Note that the preceding query orders by a sum of the CPU time and duration, descending, returning the longest running queries first. You can adjust the ORDER BY and WHERE clauses in this query to hunt, for example, for the most CPU-intensive or most busy execution plans. Keep in mind that the Query Store feature, as detailed later in this chapter, will help you visualize the process of identifying the most expensive and longest running queries in cache.

As you will see after running the previous query, you can retrieve a wealth of information from these DMOs, including statistics for each statement within an object that generated a query plan. The query plan appears as a blue hyperlink in SQL Server Management Studio’s Results To Grid mode, opening the plan as a new .sqlplan file. You can save and store the .sqlplan file for later analysis. Note too that this query may take quite a long time to execute, as it returns a row for every statement in each cached query.

For more targeted analysis, you can add code to search only for queries that have certain details in the plan, for example, plans that have a Reason For Early Termination. In the execution plan XML, this appears as the StatementOptmEarlyAbortReason attribute. Simply add the search condition before the ORDER BY in the script, using the following logic:

and tqp.query_plan LIKE '%StatementOptmEarlyAbortReason%'

Included in the query is sys.dm_exec_query_plan_stats, which is new in SQL Server 2019 and will return the actual plan, with the details that are added to the plan after an execution.

Permissions required to access cached plan metadata

The only permission needed to run the previous query in SQL Server is the server-level VIEW SERVER STATE permission, which might be appropriate for developers to have access to in a production environment because it does not give them access to any data in user databases.

In Azure SQL Database, because of the differences between the Basic/Standard and Premium tiers, different permissions are needed. In the Basic/Standard tier, you must be the server admin or Azure Active Directory Admin to access objects that would usually require VIEW SERVER STATE. In Premium tier, you can grant VIEW DATABASE STATE in the intended database in Azure SQL Database to a user who needs permission to view the above DMVs.

Understanding parallelism

Parallelism in query processing (let alone in general computing) is a complex topic, but luckily much of the complexity of parallelism in SQL Server is hidden from the DBA and programmer. A query that uses parallelism and one that doesn’t can be the same query, with the same plan (other than allowing one or more operators to work in parallel). When SQL Server decides to split the data needed for a request into multiple streams, it uses more than one processor to get the job done. The number of parallel threads used for the query is called the degree of parallelism. Because parallelism can never exceed the number of logical processors, the maximum degree of parallelism (MAXDOP) is naturally capped.

The main job of the DBA is to tune the MAXDOP for the server, database, and individual queries when the defaults don’t behave well. On a server with a mixed load of OLTP and analytics workloads, some larger analytics queries can overpower other active users.

The max degree of parallelism is set at the server level using the server properties UI in SSMS, or more commonly using the sp_configure system stored procedure. In SQL Server 2019 Setup, there is a MaxDOP tab on the Database Engine Configuration page that calculates what MAXDOP probably ought to be for your server configuration. In previous versions, the default was 0 (allowing all processors to be used by a single statement).
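
A minimal sketch of setting the server-level value with sp_configure follows; the value of 8 is only an example, and the right number depends on your processor and NUMA layout.

EXEC sys.sp_configure 'show advanced options', 1;
RECONFIGURE;
EXEC sys.sp_configure 'max degree of parallelism', 8;  -- example value only
RECONFIGURE;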

Parallelism is a seemingly magical way to make queries run faster (most of the time), but even seeming magic comes at a price. While a query may perform fastest in a vacuum when it goes massively parallel, the overuse of parallelism creates a multithreading bottleneck at scale with multiple users. Split into too many parts, queries slow down en masse as CPU utilization rises and SQL Server records increasing values in the CXPACKET wait type.

  • We talk about CXPACKET here, but for more about wait type statistics, see Chapter 8.

Until SQL Server 2016, MAXDOP was only a server-level setting, a setting enforced at the query level, or a setting applied to sessions selectively via Resource Governor, an Enterprise edition feature. Since SQL Server 2016, the MAXDOP setting is also available as a database-scoped configuration. You can also use the MAXDOP query hint in any statement to override the database- or server-level MAXDOP setting.
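
For example, the database-scoped configuration and the query hint look like the following; the values shown are placeholders rather than recommendations.

-- Database-scoped setting (SQL Server 2016 and later, Azure SQL Database)
ALTER DATABASE SCOPED CONFIGURATION SET MAXDOP = 4;

-- Query-level override
SELECT …
FROM …
OPTION (MAXDOP 1);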

Setting a reasonable value for MAXDOP determines how many CPUs can be used to execute a query, but there is another setting that determines which queries are allowed to use parallelism at all. This is the Cost Threshold for Parallelism (CTFP), which enforces a minimum estimated query cost before a query can use a parallel execution plan. The higher the threshold, the fewer queries go parallel. This setting is fairly low by default, but its proper value in your environment depends heavily on the workload and processor count. More expensive queries usually benefit from parallelism more than simpler queries, so limiting the use of parallelism to the most expensive queries in your workload can help. Conversely, setting the CTFP too high carries an opportunity cost: queries are executed serially and CPU cores go underutilized. Note that CTFP is a server-level setting only.
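
CTFP is also changed with sp_configure; the threshold of 50 below is just an illustrative starting point, not a recommendation for every workload.

EXEC sys.sp_configure 'cost threshold for parallelism', 50;  -- example value only
RECONFIGURE;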

If large queries are already a problem for performance and multiple large queries regularly run simultaneously, raising the CTFP might not solve the problem. In addition to the obvious solutions of query tuning and index changes, it may be worth it to include the use of columnstore indexes for analytic queries and using MAXDOP as a hint instead to limit some very large queries from taking over your server.

A possible indication that parallelism is an issue is when CXPACKET is a dominant wait type experienced over time by your SQL Server. You may need to adjust both MAXDOP and CTFP when performance tuning. You can also view the live and last wait types for a request using sys.dm_exec_requests. Make these changes in small, measured steps, and don’t overreact to performance problems with a small number of queries. Use Query Store to benchmark and trend the performance of high-value and high-cost queries as you change configuration settings.
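
A quick way to peek at what currently running requests are waiting on is sketched below; the columns used here all exist in sys.dm_exec_requests.

SELECT r.session_id, r.status, r.command, r.wait_type, r.last_wait_type, r.wait_time
FROM sys.dm_exec_requests AS r
WHERE r.session_id <> @@SPID;  -- exclude this query's own session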

Another flavor of CPU pressure, and in some ways the opposite of the CXPACKET wait type, is the SOS_SCHEDULER_YIELD wait type. SOS_SCHEDULER_YIELD is an indicator of CPU pressure, showing that SQL Server tasks had to share time or “yield” to other CPU tasks, which may be normal and expected on busy servers. Whereas CXPACKET is SQL Server complaining about too many threads working in parallel, SOS_SCHEDULER_YIELD is the acknowledgment that there were more runnable tasks than the available schedulers could serve at once. In either case, first take the strategy of reducing CPU-intensive queries and rescheduling or optimizing CPU-intensive maintenance operations. This is more economical than simply adding CPU capacity.

Forcing a parallel execution plan

Released with SQL Server 2017 (and also implemented in SQL Server 2016 CU2 and later) is a query hint that can force a statement to compile with a parallel execution plan. This can be valuable in troubleshooting, or to force a behavior in the query optimizer, but is not usually a necessary or recommended option for live code.

Appending the following hint to a query will force a parallel execution plan, which you can see using the Estimate or Actual execution plan output options:

OPTION(USE HINT('ENABLE_PARALLEL_PLAN_PREFERENCE'));

Note

The presence of certain system variables or functions can force a statement to compile to be serial, that is, without any parallelism. This behavior will override the new ENABLE_PARALLEL_PLAN_PREFERENCE option.

The @@TRANCOUNT system variable will force a serial plan, as will any of the built-in error reporting functions, including ERROR_LINE(), ERROR_MESSAGE(), ERROR_NUMBER(), ERROR_PROCEDURE(), ERROR_SEVERITY(), or ERROR_STATE(). Note that this pertains only to using these objects in a query. Using them in the same batch, such as in a TRY...CATCH handler, will not affect the execution plans of other queries in the batch.

Understanding advanced engine features for tuning queries

In the past few versions of SQL Server and Azure SQL DB, the programmers building the core relational engine have been adding especially advanced features, moving from the same cost-based optimizations we have had for many years to tools that can sense when plans need to be adjusted before a DBA does.

None of these features will replace well-written code and a DBA who understands the architecture of how queries work, but as data needs explode, the more the engine can do for you, the better.

Plan Guides and Query Store

Plan Guides and Query Store are two complementary, and slightly overlapping, technologies that will help you tune your queries.

Plan Guides first appeared in SQL Server 2005, and allow you to influence or set a particular plan for a certain query string. You can influence a plan by simply adding a hint (a common example would be OPTION (RECOMPILE)) or by providing an entire plan for a query. This can be very useful if SQL Server is choosing a plan that doesn’t work for you and you have no way to change the query in question. A common example is parameter sniffing, which was discussed earlier in the chapter, but other scenarios exist, like complex queries that don’t work well with a new version of the query optimizer, even though the rest of your system is performing better.

The Query Store feature is a more complete tuning solution, which captures the plans from the plan cache in your local database and allows you to see how a query performs over time. Where it overlaps with Plan Guides is that you can override the query plan chosen when a previous plan worked well but the current one does not.

In this section, we will review aspects of both tools, to help guide you as to which tool to choose. Note however, that tools that force a plan to override what the optimizer has chosen are not considered the best approach to query tuning. If you have an application where you own the source code, forcing a plan might be good to do until you can make a change to code, but should not be your primary tuning tool. If it is a third-party application, you should work with your vendor on a proper solution, but these features will help you to get past a serious issue.

Using Plan Guides

Plan Guides are often maligned because they are complex to work with, and for good reason. There was no UI for them initially, and the UI that exists in SSMS does not really guide you in how they are used. However, once you get the basics down, they allow you to optimize queries when you cannot change the text of a query or coded object.

What makes them powerful is that while you can replace the entire plan of a query, you can also just influence the plan with a query hint, either in a coded object like a stored procedure, or in a textual query that executes outside of an object. You can also affect how an ad hoc query is parameterized.

There are three types of Plan Guides we will cover in the following sections: Object Plan Guides, SQL Plan Guides, and Template Plan Guides.

You can create a Plan Guide using the UI in the Programmability section of the database, or with the following stored procedures, which will be demonstrated.

  • sp_create_plan_guide – allows you to create the plan guide from the text of the query.

  • sp_create_plan_guide_from_handle – allows you to create the plan guide from the plan handle you have retrieved from another DMV or system object.

  • sp_control_plan_guide – used to drop, disable, and enable one or all plan guides in a database.

In the following examples, we tweak the plans of the queries with hints, but if you have a complete plan, you can use the @hints parameter to supply the entire query plan to use instead. However, we generally suggest doing that with Query Store, unless you are doing very advanced tuning and are taking an existing plan and tweaking it. Doing so is beyond the scope of this book.

Object Plan Guides

An Object Plan Guide lets you override the plan of a query inside a coded object. For example, consider the procedure we used in the parameterization section earlier:

CREATE OR ALTER PROCEDURE Purchasing.PurchaseOrders_BySupplierId
      @SupplierId int
AS
SELECT PurchaseOrders.PurchaseOrderID,
       PurchaseOrders.SupplierID,
       PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId;

In our example, this procedure is not changeable (it came as part of a third-party system), and you want to make sure it optimizes for a lower-cardinality value. Sometimes the procedure has been optimized for a parameter value that produced a poorly executing plan, and performance suffers because most of your executions would benefit from the lower-cardinality plan. That optimal plan uses an index seek and a key lookup, which is great for a few rows, though it is more costly on the few occasions when you retrieve the higher-cardinality set of rows.

You apply the hint using the following call, where you provide the query to be guided and the hint to add. The query text must match the statement in the object, though differences such as white space are ignored (for complete details, see: https://docs.microsoft.com/sql/relational-databases/system-stored-procedures/sp-create-plan-guide-transact-sql).

EXECUTE sp_create_plan_guide
    @name = N'SP-Purchasing.PurchaseOrders_BySupplierId_AddOption',
    @stmt = N'SELECT PurchaseOrders.PurchaseOrderID,
       PurchaseOrders.SupplierID,
       PurchaseOrders.OrderDate
 FROM   Purchasing.PurchaseOrders
WHERE  PurchaseOrders.SupplierID = @SupplierId',
    @type = N'OBJECT',
    @module_or_batch = N'Purchasing.PurchaseOrders_BySupplierId',
    @params = NULL,  --NULL for an object, since the parameter is declared on the procedure
    @hints = N'OPTION (OPTIMIZE FOR (@SupplierId = 13))';

To drop the plan guide, you can execute:

EXEC sp_control_plan_guide 'DROP', N'SP-Purchasing.PurchaseOrders_BySupplierId_AddOption';

You can check to see whether your Plan Guide is being used by looking at the properties of the query plan; if it is, you will see the PlanGuideDB and PlanGuideName properties.
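
You can also list the plan guides defined in the current database by querying the sys.plan_guides catalog view, as in this brief sketch.

SELECT pg.name, pg.scope_type_desc, pg.is_disabled, pg.hints
FROM sys.plan_guides AS pg;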

SQL Plan Guides

A SQL Plan Guide works the same way as an Object Plan Guide, except that it matches an ad hoc statement rather than a statement inside a coded object. For example, say the query from the stored procedure we just looked at was actually an ad hoc SQL call. If it is a call that a user types in by hand, it would be almost impossible to use a plan guide, since the query must match character for character, including white space.

But if it is an application, it will use a parameterized version of the query call, such as the following:

EXECUTE sp_executesql @stmt = N'SELECT PurchaseOrders.PurchaseOrderID,
                 PurchaseOrders.SupplierID,
                 PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE PurchaseOrders.SupplierID = @SupplierId',
@params = N'@SupplierId int',
@SupplierId = 2;

You can easily create a plan guide to handle this query, because the application will always be sending the query with the exact same text:

EXECUTE sp_create_plan_guide
      @name = N'SQL-Purchasing_PurchaseOrders$BySupplierId_NoParallelism',
      @stmt = 'SELECT PurchaseOrders.PurchaseOrderID,
                      PurchaseOrders.SupplierID,
                      PurchaseOrders.OrderDate
FROM   Purchasing.PurchaseOrders
WHERE PurchaseOrders.SupplierID = @SupplierId',
      @type = N'SQL',
      @module_or_batch = NULL,
      @params = '@SupplierId int',
      @hints = N'OPTION (OPTIMIZE FOR (@SupplierId = 2))';
Template Plan Guide

Finally, you can use a Template Plan Guide to apply a parameterization hint, making a single query form either parameterize or not parameterize, independent of the database's PARAMETERIZATION setting. In this example, assume forced parameterization is turned off, so the following query will not get a parameterized plan:

USE WideWorldImporters;
SELECT * FROM Sales.Orders AS o
INNER JOIN Sales.OrderLines AS ol
    ON ol.OrderID = o.OrderID
WHERE o.OrderId = 679;

You can make queries of this form parameterized by using the following code. The sp_get_query_template procedure formats and parameterizes the query to make it easier to match. Then we create a plan guide so that further calls issuing the same SELECT statement will be parameterized, instead of compiling an individual plan for each value of OrderId, and without making all complex statements parameterized.

DECLARE @stmt nvarchar(max);
DECLARE @params nvarchar(max);
EXEC sp_get_query_template
    N'SELECT * FROM Sales.Orders AS o
      INNER JOIN Sales.OrderLines AS ol
           ON ol.OrderID = o.OrderID
      WHERE o.OrderId = 679;  ',
    @stmt OUTPUT,
    @params OUTPUT;
EXEC sp_create_plan_guide N'TemplateGuide1',
    @stmt,
    N'TEMPLATE',
    NULL,
    @params,
    N'OPTION (PARAMETERIZATION FORCED)';
Using the Query Store feature

The Query Store provides a practical history of execution plan performance for a single database, which persists even when the server has been restarted (unlike the plan cache itself, which is cleared, eliminating all of the interesting data that one needs for tuning queries over time.) It can be invaluable for the purposes of investigating and troubleshooting sudden negative changes in performance, by allowing the administrator or developer to identify both high-cost queries and the quality of their execution plans, and especially where the same query has multiple plans, where one performs poorly, and the other well.

The Query Store is most useful for looking back in time at the history of statement execution. It can also assist in identifying and overriding execution plans by using a feature similar to, but different from, Plan Guides. As discussed in the previous section, Plan Guides are used to add guidance to a query’s optimization or parameterization, or even to replace the entire plan. The Query Store lets you find plans that are not working well, but only gives you the ability to force an entire plan that worked better from the history it has stored. The Query Store has a major benefit over Plan Guides, in that there is a user interface in SSMS to access it, see the benefits, and find places where you may need to force a plan.

You see live Query Store data as it happens, drawn from a combination of in-memory and on-disk sources. Query Store minimizes overhead and performance impact by capturing cached plan information to an in-memory data structure. The data is “flushed” to disk at an interval defined by the Query Store’s Data Flush Interval setting, 15 minutes by default. This interval also defines how much Query Store data can be lost in the event of an unexpected system shutdown.

Note

Queries are captured in the context of the database where the query is executed. In the following code example, the query’s execution is captured in the Query Store of the WideWorldImporters database.

USE WideWorldImporters;
GO
SELECT *
FROM AdventureWorks.[Purchasing].[PurchaseOrders];

The Query Store is a feature that Microsoft delivered to the Azure SQL Database platform first, and then to the SQL Server product. In fact, Query Store is at the heart of the Azure SQL Database Advisor feature that provides automatic query tuning. The Query Store feature’s overhead is quite manageable, tuned to avoid performance hits, and is already in place on millions of databases in Azure SQL Database.

The VIEW DATABASE STATE permission is all that is needed to view the Query Store data.

Initially configuring the query store

The Query Store feature is identical between Azure SQL DB and SQL Server in operation, but not in how you activate it. Query Store is turned on automatically in Azure SQL Database, but it is not automatically on for new databases in SQL Server. While it is possible to enable it on the model database so that all new databases inherit Query Store turned on, we suggest against this, because it is best to think about your needs and tailor the settings when you create each database.

When should you enable Query Store? Enabling Query Store on all databases you have in your environment is a generally acceptable practice, as it will be useful in discovering performance issues in the future when they arise. You can turn on Query Store via the database Properties dialog box, in which Query Store is a page on the menu on the left. Or, you can turn it on via T-SQL by using the following command:

ALTER DATABASE [DatabaseOne] SET QUERY_STORE = ON;

This will turn on Query Store with the defaults and you can adjust them using the UI.
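
If you prefer to adjust the options in T-SQL rather than the UI, the sketch below sets several of the settings discussed in this section; the values shown mirror the SQL Server 2019 defaults described later and are examples, not recommendations.

ALTER DATABASE [DatabaseOne] SET QUERY_STORE (
    OPERATION_MODE = READ_WRITE,
    QUERY_CAPTURE_MODE = AUTO,
    MAX_STORAGE_SIZE_MB = 1000,
    DATA_FLUSH_INTERVAL_SECONDS = 900,
    CLEANUP_POLICY = (STALE_QUERY_THRESHOLD_DAYS = 30),
    SIZE_BASED_CLEANUP_MODE = AUTO
);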

Note

As with pretty much any configuration task, while it is OK to use the UI the first time or two, it is always better to have a script in T-SQL, PowerShell, or similar to capture any settings in a repeatable manner. Use the Script button found in most of the dialog boxes in SSMS to output a script of what has changed when you are setting new values.

Query Store begins collecting when you activate it. You will not have any historical data when you first turn on the feature on an existing database, but you will begin to immediately see data for live database activity. You can then view plans and statistics about the plan in the Query Store reports.

In versions of SQL Server prior to SQL Server 2019, the Query Store Capture Mode default setting was All, which captured every query that was executed. In SQL Server 2019, this default has been changed to Auto, and for earlier versions it is best to set it the same way, because the additional data from one-use plans might not be useful and can reduce the amount of historical data that can be retained.

Note

The Query Store data is stored in the user database. It is backed up and restored along with the database.

The Query Store retains data up to two limits: a Max Size (1000 MB by default in SQL Server 2019, 100 MB in earlier versions), and a Stale Query Threshold time limit in days (30 by default). If Query Store reaches its Max Size, it cleans up the oldest data. Because Query Store data is saved on disk, its historical data is not affected by the commands we looked at earlier in this chapter to clear the Procedure Cache, such as DBCC FREEPROCCACHE.

You should almost always keep the Size Based Cleanup Mode set to the default Auto. If not, when the Max Size is reached, Query Store will stop collecting data and enter Read Only mode, which does not collect new data. If you find that the Query Store is not storing more historical days of data than your Stale Query Threshold setting in days, increase the Max Size setting.
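
You can check the Query Store’s current state, size, and limits for a database with the sys.database_query_store_options catalog view, as in this sketch.

SELECT actual_state_desc, readonly_reason,
       current_storage_size_mb, max_storage_size_mb,
       query_capture_mode_desc, size_based_cleanup_mode_desc
FROM sys.database_query_store_options;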

Using query store data in your troubleshooting

Query Store has several built-in dashboards, shown in Figure 14-10, to help you examine query performance and overall performance over recent history.

A snippet of SQL Server Management Studio, showing the Object Explorer list of built-in Query Store dashboards, including: Regressed Queries, Overall Resource Consumption, Top Resource Consuming Queries, Queries With Forced Plans, Queries With High Variation, Query Wait Statistics, and Tracked Queries.

Figure 14-10 The SQL Server Object Explorer list of built-in dashboards available for Query Store in SQL Server Management Studio 18.

You can also write your own reports against the collection of system DMOs that present Query Store data to administrators and developers who hold the VIEW DATABASE STATE permission. You can view the schema of the well-documented views and their relationships at https://docs.microsoft.com/sql/relational-databases/performance/how-query-store-collects-data.
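
As a starting point, the following sketch joins the core Query Store views to list the plans with the highest average duration; the view and column names come from the documented schema, but tailor the aggregation and filters to your needs.

SELECT TOP (20)
       q.query_id,
       t.query_sql_text,
       p.plan_id,
       AVG(rs.avg_duration) AS avg_duration_microseconds
FROM sys.query_store_query AS q
INNER JOIN sys.query_store_query_text AS t ON t.query_text_id = q.query_text_id
INNER JOIN sys.query_store_plan AS p ON p.query_id = q.query_id
INNER JOIN sys.query_store_runtime_stats AS rs ON rs.plan_id = p.plan_id
GROUP BY q.query_id, t.query_sql_text, p.plan_id
ORDER BY AVG(rs.avg_duration) DESC;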

On many of the dashboards, there is a button with a crosshairs symbol, as depicted in Figure 14-11. If a query seems interesting, expensive, or is of high value to the business, you can click this button to view a new screen that tracks the query when it’s running as well as various plans identified for that query.

A snippet of the query store tool bar that shows you how to track a specific query in the Query Store

Figure 14-11 The Query Store tool bar at the top of the screen on many of the dashboards, in this example, the tool bar for the Regressed Queries report.

You can review the various plans for the same statement, compare the plans, and if necessary, force a plan that you believe is better than the optimizer will choose, into place. Compare the execution of each plan by CPU Time, Duration, Logical Reads, Logical Writes, Memory Consumption, Physical Reads, and several other metrics.

Most of all, the Query Store can be valuable for informing you when a query started using a new plan. You can see when a plan was generated and the nature of the plan; however, the cause of the plan’s creation and replacement is not easily answered, especially when you cannot correlate it to a specific DDL operation or system change. Query plans can become invalidated automatically because of large changes in statistics after data inserts or deletes, changes made to other statements in the stored procedure, changes to any of the indexes used by the plan, or manual recompilation due to the RECOMPILE option.

As discussed in the Plan Guides section, forcing a statement (see Figure 14-11) to use a specific execution plan is not a recommended everyday activity. If you have access to the source code, work on a code change quickly and use a forced plan only temporarily. For systems where you have no code access, you can use a forced plan or a plan guide for specific performance cases, problematic queries demanding unusual plans, and so on. Note that if the forced plan becomes invalid, such as when an index it relies on is changed or dropped, SQL Server will move on without the forced plan and without a warning or error, although Query Store will still show that the plan is being forced for that statement.
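
Forcing and un-forcing a plan can also be done in T-SQL with the Query Store stored procedures; the query_id and plan_id values below are placeholders that you would look up in the Query Store reports or views.

EXEC sys.sp_query_store_force_plan @query_id = 42, @plan_id = 17;   -- placeholder ids
EXEC sys.sp_query_store_unforce_plan @query_id = 42, @plan_id = 17; -- placeholder ids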

Image showing query performance in the Tracked Queries dialog from 7:49 PM to 8:05 PM. There are 3 executions that take over 140 ms, and 3 that are under 50 ms. Two of the calls that took 40 ms use a forced plan, shown with a circle that has a check over it.

Figure 14-12 The Query Store has recorded the execution results of the query.

Note that one plan has been Forced (using the Force Plan button) for this statement and is displayed with a check mark.

Automatic Plan Correction

Automatic Plan Correction is a tool capable of detecting and reverting plan regression. For example, a commonly executed query runs in 100 milliseconds, but then starts running in 2 minutes. Instead of waiting for complaints about slow performance, the engine can notice this regression and deal with it. SQL Server 2017 introduced this feature to the on-premises version of the relational engine; it was originally released for the Azure SQL Database platform, where the DBA is given considerably less control over tuning than in SQL Server.

In SQL Server 2016, you could use Query Store to manually identify a query that regressed in performance, and then force a past execution plan into use. With Automatic Plan Correction, the database can be configured to detect plan regression and take this action automatically. The sample syntax for enabling automatic plan correction is below:

ALTER DATABASE WideWorldImporters SET AUTOMATIC_TUNING (FORCE_LAST_GOOD_PLAN = ON);

In both SQL Server 2017 and 2019, FORCE_LAST_GOOD_PLAN is the only option for automatic plan tuning.

The DMO sys.dm_db_tuning_recommendations captures plan recommendations based on query performance regression. This doesn’t happen immediately; the feature has an algorithm that requires several executions before regression is identified. When a recommendation appears in sys.dm_db_tuning_recommendations, it includes a large amount of diagnostic data, including a plain-language “reason” explanation for the recommendation and a block of JSON data containing diagnostic information. A sample query to parse this data is available at https://docs.microsoft.com/sql/relational-databases/automatic-tuning/automatic-tuning.
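
If you just want a quick look at what, if anything, has been recommended, a minimal sketch such as the following works; the documentation link above shows how to shred the JSON in the details column to retrieve the forcing script.

SELECT name, reason, valid_since, state, details
FROM sys.dm_db_tuning_recommendations;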

Intelligent Query Processing

Intelligent Query Processing is an umbrella term for a set of features that make the processing of queries more efficient in some manner. Some features help pick or adapt a query plan to current conditions. Others are straightforward changes to how the SQL Server relational engine handles long-existing query constructs. In every case, though, the goal is to change the way queries are processed to give you better performance without much, if any, change to your code. All of these features are available in SQL Server 2019, but only some are available in SQL Server 2017 (as noted in each section).

Adaptive Query Processing

Adaptive Query Processing is a set of features that will allow SQL Server to adapt a given query to new conditions from when the query was originally compiled or last executed. The type of adaptations that can be made currently include:

  • Batch Mode Adaptive Joins – In a query that is executing in batch mode (that is, dealing with more than 1 row at a time), the query can use an adaptive join operator that lets the choice of hash or nested loops join be made after one of the inputs has been scanned. So, if it is a small, selective set, the nested loops join may work better, for example. Or if the two input sets are huge, hash match may be the better choice.

  • Memory Grant Feedback – When a query executes, it uses some amount of memory. The Memory Grant Feedback feature lets future executions of the same query know whether the memory granted for the execution was too much or too little, and adjusts future executions accordingly. This was available in SQL Server 2017 for batch mode executions, and in SQL Server 2019 it is also available for row mode executions.

    This feature can be turned on and off for batch mode queries without breaking compatibility using the following statements:

    -- SQL Server 2017
    
    ALTER DATABASE SCOPED CONFIGURATION SET DISABLE_BATCH_MODE_MEMORY_GRANT_FEEDBACK = OFF;
    
    -- Azure SQL Database, SQL Server 2019
    
    ALTER DATABASE SCOPED CONFIGURATION SET BATCH_MODE_MEMORY_GRANT_FEEDBACK = ON;

    For row mode queries, this feature is controlled using:

    ALTER DATABASE SCOPED CONFIGURATION SET ROW_MODE_MEMORY_GRANT_FEEDBACK = ON;
  • Interleaved Execution – Introduced in SQL Server 2017, this feature currently helps with plans that make use of multi-statement table-valued functions (MSTVFs). Prior to SQL Server 2014, when a plan was generated for a query that used an MSTVF, the number of rows returned was estimated as the literal value of 1. Subsequent versions use 100 as the estimate, but neither estimate is very good unless you actually return very few rows. Interleaved execution lets the optimizer execute parts of the query during optimization to get better estimates, because if your MSTVF is actually going to output 100,000 rows, the plan needs to be considerably different from what the 100-row estimate would have produced.

    Interleaved execution can be controlled using the following statements:

    -- SQL Server 2017
    
    ALTER DATABASE SCOPED CONFIGURATION SET DISABLE_INTERLEAVED_EXECUTION_TVF = ON; --or OFF
    
    -- Azure SQL Database and SQL Server 2019
    
    ALTER DATABASE SCOPED CONFIGURATION SET INTERLEAVED_EXECUTION_TVF = OFF; --or ON
Table Variable Deferred Compilation

Much like interleaved execution of MSTVFs, Table Variable Deferred Compilation deals with the lack of statistics for a table variable at compile time, where the standard guess was that the table variable would contain 1 row. This produced poorly performing plans if the programmer had stored thousands (or more) of rows in the table variable.

Table Variable Deferred Compilation, instead of using the guess of 1 row to build the query plan, waits to complete the actual plan until the table variable has been loaded the first time, and then the rest of the plan is generated. This feature may be one of the most welcome for DBAs who have had performance problems with very large table variables, despite the fact that they sound like they should be lightweight, highly performant tools for programmers to use.
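
This behavior is on by default in compatibility level 150. If it causes a regression for a particular workload, it can be turned off without lowering the compatibility level using the database-scoped configuration sketched below (verify the option name against the documentation for your build).

ALTER DATABASE SCOPED CONFIGURATION SET DEFERRED_COMPILATION_TV = OFF;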

Note

Just because table variables are now optimized better does not make them the best choice for large numbers of rows. Table variables still lack column statistics, a key difference between them and temp tables (prefixed with # or ##) that can make temp tables far superior.

Batch Mode on Rowstore

One of the features added to SQL Server along with columnstore indexes was a type of processing known as batch mode. Typical processing in SQL Server had always used a method that sounds terrible: row by row, now referred to as row mode processing. Columnstore indexes were built to process large numbers of rows, so the engine was updated to work on batches of rows at a time when the heuristics tell the query processor it is worth it.

In SQL Server 2019, this feature is extended to work for certain types of queries against rowstore tables and indexes as well as columnstore tables and indexes. A few examples:

  • Queries that use large quantities of rows in a table, often in analytical queries touching hundreds of thousands of rows.

  • Systems that are CPU bound in nature. (I/O bottlenecks are best handled with a columnstore index).

The feature is enabled in SQL Server and Azure SQL DB when the compatibility level is 150, but if you find it is harming performance, you can turn it off using ALTER DATABASE:

ALTER DATABASE SCOPED CONFIGURATION SET BATCH_MODE_ON_ROWSTORE = OFF;

If you then want to allow this feature only for a specific query that touches a large number of rows, you can use the ALLOW_BATCH_MODE query hint:

SELECT …
FROM …
OPTION (RECOMPILE, USE HINT('ALLOW_BATCH_MODE'));
T-SQL scalar User Defined Functions (UDF) inlining

Of all the features that can make a negative impact on performance, T-SQL scalar UDFs have generally been the worst. Every programmer who has taken any class in programming instinctively wants to modularize their code. So if you have a scenario where you want to classify some data (say something as simple as CASE WHEN value = 1 THEN 'True' ELSE 'False' END), it makes sense to bundle this up into a coded module. So, when user-defined functions were introduced, there were shouts of joy, until performance was taken into consideration.

In WideWorldImporters, we create the following, extremely simple UDF to do exactly what was described:

USE WideWorldImporters;
GO
CREATE SCHEMA Tools;
GO
CREATE FUNCTION Tools.Bit_Translate
(
     @value bit
)
RETURNS varchar(5)
AS
 BEGIN
     RETURN (CASE WHEN @value = 1 THEN 'True' ELSE 'False' END);
 END;

Execute this function in the following query, once in compatibility level 140 (SQL Server 2017) and again in 150 (SQL Server 2019).

SET STATISTICS TIME ON;
ALTER DATABASE WideWorldImporters SET COMPATIBILITY_LEVEL = 140; --SQL Server 2017
GO
SELECT Tools.Bit_Translate(IsCompressed) AS CompressedFlag,
       CASE WHEN IsCompressed = 1 THEN 'True' ELSE 'False' END AS CompressedFlag_Desc
FROM   Warehouse.VehicleTemperatures;
GO
ALTER DATABASE WideWorldImporters SET COMPATIBILITY_LEVEL = 150; --SQL Server 2019
GO
SELECT Tools.Bit_Translate(IsCompressed) AS CompressedFlag,
       CASE WHEN IsCompressed = 1 THEN 'True' ELSE 'False' END
FROM   Warehouse.VehicleTemperatures;

On the 65,998 rows that are returned, you will likely not notice a difference in performance. Checking the output from SET STATISTICS TIME ON, on my test computer it was around 450 ms for the SQL Server 2017 compatibility level version, and 500 ms for the SQL Server 2019 compatibility level version.

Looking at the actual plan CPU used for the two executions in Figure 14-13, you can see an interesting difference.

Image shows the query plans for the two executions of the query in the two compatibility levels. The first plan shows the Compute Scalar operator with each row passing through it, while the second execution, in SQL Server 2019 compatibility level, does not.

Figure 14-13 Query plan output for the two runs: the first in SQL Server 2017 compatibility level, the second in SQL Server 2019.

The big thing to notice between these two executions is that the Compute Scalar in Query 2 appears as a typical Compute Scalar for any scalar expression that doesn't include a UDF. In Query 1, it shows rows passing through, and an amount of time spent calculating the scalar for each row that passes through. Even in this extremely simple case, we saved 0.245 seconds of plan CPU time, because instead of running the function for every row as a separate call, the query behaved as if you had written the scalar expression inline.

While this example function was extremely simple and did not access any tables, inlining is not limited to simple functions, nor must the function avoid accessing data in tables. There are limitations, such as not using time-dependent intrinsic functions like SYSDATETIME(), not changing security context using EXECUTE AS (only EXECUTE AS CALLER, the default, is allowed), and not referencing table variables or table-valued parameters.
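
If you want to verify whether your functions qualify, or switch the behavior off while testing, the following sketch uses the is_inlineable column added to sys.sql_modules in SQL Server 2019 and the database-scoped configuration for the feature.

-- List scalar UDFs and whether the engine considers them inlineable
SELECT OBJECT_SCHEMA_NAME(m.object_id) + '.' + OBJECT_NAME(m.object_id) AS FunctionName,
       m.is_inlineable
FROM sys.sql_modules AS m
INNER JOIN sys.objects AS o ON o.object_id = m.object_id
WHERE o.type = 'FN'; -- scalar user-defined functions

-- Turn scalar UDF inlining off (or back on) for the database
ALTER DATABASE SCOPED CONFIGURATION SET TSQL_SCALAR_UDF_INLINING = OFF;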

This feature will have immediate value for programmers who ignored the common advice to avoid scalar UDFs, but for most, it is a change where something you have long wanted to do is now perfectly acceptable. Formatting functions, or translation functions where writing a function is easier than creating a table, are now possible and will perform very well, as opposed to destroying your performance.

Approximate Query Processing

Approximate Query Processing is the one member of the Intelligent Query Processing family that does require code changes to work. A very costly aggregate operation is counting distinct values. Say you execute a query such as:

SELECT COUNT(DISTINCT(CustomerID))
FROM   Sales.Invoices;

This gives you the exact number of customers that have been invoiced. If this is a very large set (which it is not in this case), it may be a very costly query to execute, using a lot of memory. To combat this, SQL Server 2019 adds a new aggregate, APPROX_COUNT_DISTINCT, which cuts one operation out of the plan and, instead of giving you the exact answer, provides an answer that is close enough for many analytical purposes. The query can be rewritten as:

SELECT APPROX_COUNT_DISTINCT(CustomerID)
FROM Sales.Invoices;

If you execute these two queries in the WideWorldImporters database, the plan for APPROX_COUNT_DISTINCT actually shows as slightly more costly, 51 percent versus 49 percent, as shown in Figure 14-14.

Shows the difference in plans between COUNT(DISTINCT()) and APPROX_COUNT_DISTINCT(). The top half of the image shows that the COUNT(DISTINCT()) query is 49% of the cost and has, from right to left, an Index Scan (NonClustered) operator, a Stream Aggregate (Aggregate) operator, another Stream Aggregate (Aggregate) operator, and a Compute Scalar operator, flowing to the SELECT operator. The APPROX_COUNT_DISTINCT() version, in the lower half of the image, is 51% of the cost, and the primary difference is that it has only a single Stream Aggregate (Aggregate) operator.

Figure 14-14 Plans compared for COUNT(DISTINCT()) versus APPROX_COUNT_DISTINCT().

The first query returns the exact answer, 663, and the second a close value (in this case, 666 was returned). Execute the queries with SET STATISTICS TIME turned on and you will see an interesting cost savings, even on this smaller set. On the test machine, over several test runs:

  • COUNT(DISTINCT): CPU time = 15 ms, elapsed time = 38 ms

  • APPROX_COUNT_DISTINCT: CPU time = 16 ms, elapsed time = 18 ms.

Just as in previous examples, a 20 ms difference seems inconsequential, and it generally is. But when we are working with millions or billions of rows, the amount of memory used by the second query will be far less, and the time savings not inconsequential at all.

Note

While APPROX_COUNT_DISTINCT may very well provide a tremendous increase in performance on very large sets, you need to be very clear about the difference between the value returned by APPROX_COUNT_DISTINCT and the exact answer from COUNT(DISTINCT). It is largely inappropriate for most reporting queries, where an approximate answer may not be acceptable to your client.

However, it can be a useful performance improvement for scenarios counting very large numbers of rows (in the high millions or billions) where a good-enough answer is acceptable.
