Active Session History EARLY ACCESS

Get real-time and historical information about active sessions to analyze and troubleshoot performance issues

Active Session History (ASH) provides a current and historical view of system activity by sampling session activity in the database. A database session or connection is considered active if it is consuming CPU, or has an active RPC call that is waiting on one of the wait events.

ASH exposes session activity in the form of SQL views so that you can run analytical queries, aggregations for analysis, and troubleshoot performance issues. To run ASH queries, you need to enable YSQL.

Currently, ASH is available for YSQL, YCQL, and YB-TServer. ASH facilitates analysis by recording wait events related to YSQL, YCQL, or YB-TServer requests while they are being executed. These wait events belong to the categories including but not limited to CPU, WaitOnCondition, Network, and Disk IO.

Analyzing the wait events and wait event types lets you troubleshoot, answer the following questions, and subsequently tune performance:

Why is a query taking longer than usual to execute?
Why is a particular application slow?
What are the queries that are contributing significantly to database load and performance?

Configure ASH

To configure ASH, you can set the following YB-TServer flags for each node of your cluster.

Flag	Description
ysql_yb_enable_ash	Enables ASH. Changing this flag requires a TServer restart. Default: true
ysql_yb_ash_circular_buffer_size	Size (in KiB) of circular buffer where the samples are stored. Defaults: 32 MiB for 1-2 cores 64 MiB for 3-4 cores 128 MiB for 5-8 cores 256 MiB for 9-16 cores 512 MiB for 17-32 cores 1024 MiB for more than 32 cores Changing this flag requires a TServer restart.
ysql_yb_ash_sampling_interval_ms	Sampling interval (in milliseconds). Changing this flag doesn't require a TServer restart. Default: 1000
ysql_yb_ash_sample_size	Maximum number of events captured per sampling interval. Changing this flag doesn't require a TServer restart. Default: 500

Limitations

Note that the following limitations are subject to change.

ASH is available per node and is not aggregated across the cluster.
ASH is not available for YB-Master processes.
ASH is available for queries and a few background activities like compaction and flushes. ASH support for other background activities will be added in future releases.

YSQL views

ASH exposes the following views in each node to analyze and troubleshoot performance issues.

yb_active_session_history

This view provides a list of wait events and their metadata. The columns of the view are described in the following table.

Column	Type	Description
sample_time	timestamp	Timestamp when the sample was captured
root_request_id	UUID	A 16-byte UUID that is generated per request. Generated for queries at YSQL/YCQL layer.
rpc_request_id	integer	ID for internal requests, it is a monotonically increasing number for the lifetime of a YB-TServer.
wait_event_component	text	Component of the wait event, which can be YSQL, YCQL, or TServer.
wait_event_class	text	Class of the wait event, such as TabletWait, RocksDB, and so on.
wait_event	text	Name of the wait event.
wait_event_type	text	Type of the wait event such as CPU, WaitOnCondition, Network, Disk IO, and so on.
wait_event_aux	text	Additional information for the wait event. For example, tablet ID for TServer wait events.
top_level_node_id	UUID	16-byte TServer UUID of the YSQL/YCQL node where the query is being executed.
query_id	bigint	Query ID as seen on the `/statements` endpoint. This can be used to join with pg_stat_statements/ycql_stat_statements. See Constant query identifiers.
pid	bigint	PID of the process that is executing the query. For YCQL and background activities, this will be the YB-TServer PID.
client_node_ip	text	IP address of the client which sent the query to YSQL/YCQL. Null for background activities.
sample_weight	float	If in any sampling interval there are too many events, YugabyteDB only collects `ysql_yb_ash_sample_size` samples/events. Based on how many were sampled, weights are assigned to the collected events. For example, if there are 200 events, but only 100 events are collected, each of the collected samples will have a weight of (200 / 100) = 2.0
ysql_dbid	oid	Database OID of the YSQL database. This is 0 for YCQL databases.

yb_wait_event_desc

This view displays the class, type, name, and description of each wait event. The columns of the view are described in the following table.

Column	Type	Description
wait_event_class	text	Class of the wait event, such as TabletWait, RocksDB, and so on.
wait_event_type	text	Type of the wait event such as CPU, WaitOnCondition, Network, Disk IO, and so on.
wait_event	text	Name of the wait event.
wait_event_description	text	Description of the wait event.

Constant query identifiers

These fixed constants are used to identify various YugabyteDB background activities. The query IDs of the fixed constants are described in the following table.

Query ID	Wait Event Component	Description
1	TServer	Query ID for write ahead log (WAL) appender thread.
2	TServer	Query ID for background flush tasks.
3	TServer	Query ID for background compaction tasks.
4	TServer	Query ID for Raft update consensus.
5	YSQL/TServer	Default query ID, assigned in the interim before pg_stat_statements calculates a proper ID for the query.
6	TServer	Query ID for write ahead log (WAL) background sync.

To obtain the IP address and location of a node where a query is being executed, use the top_level_node_id from the active session history view in the following command:

SELECT * FROM pg_catalog.yb_servers() WHERE uuid = <top_level_node_id>;

     host     | port | num_connections | node_type | cloud |  region   |    zone    | public_ip |               uuid
--------------+------+-----------------+-----------+-------+-----------+------------+-----------+----------------------------------
 10.9.111.111 | 5433 |               0 | primary   | aws   | us-west-2 | us-west-2a |           | 5cac7c86ba4e4f0e838bf180d75bcad5

Wait events

The following describes the wait events available in the active session history, along with their type and, where applicable, the auxiliary information (wait_event_aux) provided with that type. The events are categorized by wait event class.

YSQL

These are the wait events introduced by YugabyteDB. Some of the following wait events inherited from PostgreSQL might also show up in the yb_active_session_history view.

TServerWait class

Wait Event	Type	Aux	Description
TableRead	Network		A YSQL backend is waiting for a table read from DocDB.
CatalogRead	Network		A YSQL backend is waiting for a catalog read from master.
IndexRead	Network		A YSQL backend is waiting for a secondary index read from DocDB.
StorageFlush	Network		A YSQL backend is waiting for a table/index read/write from DocDB.
TableWrite	Network		A YSQL backend is waiting for a table write from DocDB.
CatalogWrite	Network		A YSQL backend is waiting for a catalog write from master.
IndexWrite	Network		A YSQL backend is waiting for a secondary index write from DocDB.
WaitingOnTServer	Network	RPC name	A YSQL backend is waiting for on TServer for an RPC. The RPC name is present on the wait event aux column.

YSQLQuery class

Wait Event	Type	Description
QueryProcessing	CPU	A YSQL backend is doing CPU work.
yb_ash_metadata	LWLock	A YSQL backend is waiting to update ASH metadata for a query.
YBParallelScanEmpty	IPC	A YSQL backend is waiting on an empty queue while fetching parallel range keys.
CopyCommandStreamRead	IO	A YSQL backend is waiting for a read from a file or program during COPY.
CopyCommandStreamWrite	IO	A YSQL backend is waiting for a write to a file or program during COPY.
YbAshMain	Activity	The YugabyteDB ASH collector background worker is waiting in the main loop.
YbAshCircularBuffer	LWLock	A YSQL backend is waiting for YugabyteDB ASH circular buffer memory access.
QueryDiagnosticsMain	Activity	The YugabyteDB query diagnostics background worker is waiting in the main loop.
YbQueryDiagnostics	LWLock	A YSQL backend is waiting for YugabyteDB query diagnostics hash table memory access.
YbQueryDiagnosticsCircularBuffer	LWLock	A YSQL backend is waiting for YugabyteDB query diagnostics circular buffer memory access.
YBTxnConflictBackoff	Timeout	A YSQL backend is waiting for transaction conflict resolution with an exponential backoff.

YB-TServer

Common class

Wait Event	Type	Description
OnCpu_Passive	CPU	An RPC or task is waiting for a thread to pick it up.
OnCpu_Active	CPU	An RPC or task is being actively processed on a thread.
Idle	WaitOnCondition	The Raft log appender/sync thread is idle.
Rpc_Done	WaitOnCondition	An RPC is done and waiting for the reactor to send the response to a YSQL/YCQL backend.
RetryableRequests_SaveToDisk	DiskIO	The in-memory state of the retryable requests is being saved to the disk.

TabletWait class

Wait Event	Type	Description
MVCC_WaitForSafeTime	WaitOnCondition	A read/write RPC is waiting for the safe time to be at least the desired read-time.
LockedBatchEntry_Lock	WaitOnCondition	A read/write RPC is waiting for a DocDB row-level lock.
BackfillIndex_WaitForAFreeSlot	WaitOnCondition	A backfill index RPC is waiting for a slot to open if there are too many backfill requests at the same time.
CreatingNewTablet	DiskIO	The CreateTablet RPC is creating a new tablet, this may involve writing metadata files, causing I/O wait.
SaveRaftGroupMetadataToDisk	DiskIO	The Raft/tablet metadata is being written to disk, generally during snapshot or restore operations.
TransactionStatusCache_DoGetCommitData	Network	An RPC needs to look up the commit status of a particular transaction.
WaitForYSQLBackendsCatalogVersion	WaitOnCondition	CREATE INDEX is waiting for YSQL backends to have up-to-date pg_catalog.
WriteSysCatalogSnapshotToDisk	DiskIO	Writing initial system catalog snapshot during initdb.
DumpRunningRpc_WaitOnReactor	WaitOnCondition	DumpRunningRpcs is waiting on reactor threads.
ConflictResolution_ResolveConficts	Network	A read/write RPC is waiting to identify conflicting transactions.
ConflictResolution_WaitOnConflictingTxns	WaitOnCondition	A read/write RPC is waiting for conflicting transactions to complete.

Consensus class

Wait Event	Type	Aux	Description
WAL_Append	DiskIO	Tablet ID	A write RPC is persisting WAL edits.
WAL_Sync	DiskIO	Tablet ID	A write RPC is synchronizing WAL edits.
Raft_WaitingForReplication	Network	Tablet ID	A write RPC is waiting for Raft replication.
Raft_ApplyingEdits	WaitOnCondition/CPU	Tablet ID	A write RPC is applying Raft edits locally.
ConsensusMeta_Flush	DiskIO		ConsensusMetadata is flushed, for example, during Raft term, configuration change, remote bootstrap, and so on.
ReplicaState_TakeUpdateLock	WaitOnCondition		A write/alter RPC needs to wait for the ReplicaState lock to replicate a batch of writes through Raft.

RocksDB class

Wait Event	Type	Description
RocksDB_ReadBlockFromFile	DiskIO	RocksDB is reading a block from a file.
RocksDB_OpenFile	DiskIO	RocksDB is opening a file.
RocksDB_WriteToFile	DiskIO	RocksDB is writing to a file.
RocksDB_Flush	CPU	RocksDB is doing a flush.
RocksDB_Compaction	CPU	RocksDB is doing a compaction.
RocksDB_PriorityThreadPoolTaskPaused	WaitOnCondition	RocksDB pauses a flush or compaction task for another flush or compaction task with a higher priority.
RocksDB_CloseFile	DiskIO	RocksDB is closing a file.
RocksDB_RateLimiter	WaitOnCondition	RocksDB flush/compaction is slowing down due to rate limiter throttling access to disk.
RocksDB_WaitForSubcompaction	WaitOnCondition	RocksDB is waiting for a compaction to complete.
RocksDB_NewIterator	DiskIO	RocksDB is waiting for a new iterator to be created.

YCQL

YCQLQuery class

Wait Event	Type	Aux	Description
YCQL_Parse	CPU		YCQL is parsing a query.
YCQL_Read	CPU	Table ID	YCQL is processing a read query.
YCQL_Write	CPU	Table ID	YCQL is processing a write query.
YCQL_Analyze	CPU		YCQL is analyzing a query.
YCQL_Execute	CPU		YCQL is executing a query.

Client class

Wait Event	Type	Description
YBClient_WaitingOnDocDB	Network	YB Client is waiting on DocDB to return a response.
YBClient_LookingUpTablet	Network	YB Client is looking up tablet information from the master.

Examples

Before you start

The examples will run on any YugabyteDB universe.
To create and connect to a universe, see Set up YugabyteDB universe.

Make sure you have an active ysqlsh session (./bin/ysqlsh) to run the following examples.

Distribution of wait events for each query_id

Check the distribution of wait events for each query_id, for the last 20 minutes.

SELECT
    query_id,
    wait_event_component,
    wait_event,
    wait_event_type,
    COUNT(*)
FROM
    yb_active_session_history
WHERE
    sample_time >= current_timestamp - interval '20 minutes'
GROUP BY
    query_id,
    wait_event_component,
    wait_event,
    wait_event_type
ORDER BY
    query_id,
    wait_event_component,
    wait_event_type;

 query_id             | wait_event_component |             wait_event             | wait_event_type | count
 ---------------------+----------------------+------------------------------------+-----------------+------------
 -4157456334073660389 | YSQL                 | CatalogRead                        | Network         |     3
 -1970690938654296136 | TServer              | Raft_ApplyingEdits                 | Cpu             |    54
 -1970690938654296136 | TServer              | OnCpu_Active                       | Cpu             |   107
 -1970690938654296136 | TServer              | OnCpu_Passive                      | Cpu             |   144
 -1970690938654296136 | TServer              | RocksDB_NewIterator                | DiskIO          |     6
 -1970690938654296136 | TServer              | ConflictResolution_ResolveConficts | Network         |    18
 -1970690938654296136 | TServer              | Raft_WaitingForReplication         | Network         |   194
 -1970690938654296136 | TServer              | Rpc_Done                           | WaitOnCondition |    18
 -1970690938654296136 | TServer              | MVCC_WaitForSafeTime               | WaitOnCondition |     5
 -1970690938654296136 | YSQL                 | QueryProcessing                    | Cpu             |  1023
                    0 | TServer              | OnCpu_Passive                      | Cpu             |    10
                    0 | TServer              | OnCpu_Active                       | Cpu             |     9
  6107501747146929242 | TServer              | OnCpu_Active                       | Cpu             |   208
  6107501747146929242 | TServer              | RocksDB_NewIterator                | DiskIO          |     5
  6107501747146929242 | TServer              | MVCC_WaitForSafeTime               | WaitOnCondition |    10
  6107501747146929242 | TServer              | Rpc_Done                           | WaitOnCondition |    15
  6107501747146929242 | YSQL                 | QueryProcessing                    | Cpu             |   285
  6107501747146929242 | YSQL                 | TableRead                        | Network         |   658
  6107501747146929242 | YSQL                 | CatalogRead                        | Network         |     1

Distribution of wait events for each query

As ASH's query_id is the same as the pg_stat_statement queryid, you can join with pg_stat_statements to view the distribution of wait events for each query. This may help in finding what's wrong with a particular query, or determine where most of the time is being spent on. In this example, you can see that in YB-TServer, most of the time is spent on the wait event ConflictResolution_WaitOnConflictingTxns which suggests that there are a lot of conflicts in the DocDB storage layer.

SELECT
    SUBSTRING(query, 1, 50) AS query,
    wait_event_component,
    wait_event,
    wait_event_type,
    COUNT(*)
FROM
    yb_active_session_history
JOIN
    pg_stat_statements
ON
    query_id = queryid
WHERE
    sample_time >= current_timestamp - interval '20 minutes'
GROUP BY
    query,
    wait_event_component,
    wait_event,
    wait_event_type
ORDER BY
    query,
    wait_event_component,
    wait_event_type;

                query                          | wait_event_component |             wait_event                   | wait_event_type | count
-----------------------------------------------+----------------------+------------------------------------------+-----------------+------------
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | OnCpu_Passive                            | Cpu             |    46
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | Raft_ApplyingEdits                       | Cpu             |    34
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | OnCpu_Active                             | Cpu             |    39
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | RocksDB_NewIterator                      | DiskIO          |     3
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | ConflictResolution_ResolveConficts       | Network         |    99
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | Raft_WaitingForReplication               | Network         |    38
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | ConflictResolution_WaitOnConflictingTxns | WaitOnCondition |  1359
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | Rpc_Done                                 | WaitOnCondition |     5
 UPDATE test_table set v = v + $1 where k = $2 | TServer              | LockedBatchEntry_Lock                    | WaitOnCondition |   141
 UPDATE test_table set v = v + $1 where k = $2 | YSQL                 | QueryProcessing                          | Cpu             |  1929

Detect a hot shard

In this example, you can see that a particular tablet is getting a lot of requests as compared to the other tablets. The wait_event_aux field contains the tablet_id in case of YB-TServer events.

SELECT
    wait_event_aux AS tablet_id,
    COUNT(*)
FROM
    yb_active_session_history
WHERE
    wait_event_component = 'TServer' AND
    wait_event_aux IS NOT NULL
GROUP BY
    wait_event_aux
ORDER BY
    count DESC;

    tablet_id    | count
-----------------+-------
 09f26a0bb117411 | 33129
 a1d82ef77aa64a8 |  5235
 31bc90e0c59e4da |  2431
 7b49c915e7fe4f1 |  1518
 6b6a264711a84d2 |   403
 96948dbb19674cb |   338
 e112a0dd35994e5 |   320
 f901168f334f432 |   315
 bddebf9b7d9b485 |   310
 04a37ec2cecf49e |    70
 70f6e424970c44c |    66
 77bdebc4f7e3400 |    65
 b4ae6f1115fc4a9 |    63
 8674a0708cba422 |    63
 9cf4fc4a834040d |    61
 e66879054249434 |    61
 c2cfa997bf63463 |    59
 9d64f3479792499 |    58
 e70fd34078e84fe |    58
 8ea1aa0f2e4749a |    56
 b3bbaec3014f4f1 |    53
 d3bbd37828ab422 |    53
 542c6f91ff6a403 |    52
 27780cde5a1b445 |    50
 4a64a9f25e414ce |    44
 09bb0274a41146a |     5
 d58c56ce3fc7458 |     4
 0350744dad944bd |     4
 219fd39bafee44a |     3

You can join with yb_local_tablets to get more information about the table type, table_name, and partition keys. As the wait_event_aux has only the first 15 characters of the tablet_id as a string, you have to join with only the first 15 characters from yb_local_tablets.

SELECT
    tablet_id,
    table_type,
    namespace_name,
    table_name,
    partition_key_start,
    partition_key_end,
    COUNT(*)
FROM
    yb_active_session_history
JOIN
    yb_local_tablets
ON
    wait_event_aux = SUBSTRING(tablet_id, 1, 15)
GROUP BY
    tablet_id,
    table_type,
    namespace_name,
    table_name,
    partition_key_start,
    partition_key_end
ORDER BY
    table_name,
    count DESC;

            tablet_id             | table_type | namespace_name |  table_name  | partition_key_start | partition_key_end | count
----------------------------------+------------+----------------+--------------+---------------------+-------------------+-------
 09f26a0bb117411fa068df13420ea643 | YSQL       | yugabyte       | test_table   |                     | \x2aaa            | 33129
 d58c56ce3fc74584bd2eb892cea51e2a | YSQL       | yugabyte       | test_table   | \x5555              | \x8000            |    10
 0350744dad944bd9ba70c91432c5d8e2 | YSQL       | yugabyte       | test_table   | \xaaaa              | \xd555            |     9
 219fd39bafee44a59117c4089a2f71bf | YSQL       | yugabyte       | test_table   | \x2aaa              | \x5555            |     8
 09bb0274a41146abbb6fe70e41b4f3c1 | YSQL       | yugabyte       | test_table   | \xd555              |                   |     7
 a1d82ef77aa64a85987eef4aa2322c11 | System     | system         | transactions | \xf555              |                   |  7973
 31bc90e0c59e4da5b36e38e9e48554a0 | System     | system         | transactions | \x9555              | \xa000            |  5169
 7b49c915e7fe4f13afdee958028d4446 | System     | system         | transactions | \xb555              | \xc000            |  4256
 6b6a264711a84d25b3226f78b9dd4d6f | System     | system         | transactions |                     | \x0aaa            |   403
 96948dbb19674cb596c7f69177533000 | System     | system         | transactions | \x7555              | \x8000            |   338
 e112a0dd35994e5990b25c4ae8a00eb6 | System     | system         | transactions | \x5555              | \x6000            |   320
 f901168f334f43289007ee8385fedb67 | System     | system         | transactions | \x8aaa              | \x9555            |   315
 bddebf9b7d9b4858b4e502b8a1d93e01 | System     | system         | transactions | \x6aaa              | \x7555            |   310
 04a37ec2cecf49e5bdb974ee80bb5c97 | System     | system         | transactions | \x2000              | \x2aaa            |    73
 70f6e424970c44c391edbbc18225ccb0 | System     | system         | transactions | \x0aaa              | \x1555            |    69
 77bdebc4f7e340068ad3d8c6ccf74b35 | System     | system         | transactions | \xeaaa              | \xf555            |    69
 b4ae6f1115fc4a94b2d9e7aa514b5f28 | System     | system         | transactions | \xe000              | \xeaaa            |    67
 8674a0708cba4228bd1d435a642134bb | System     | system         | transactions | \xaaaa              | \xb555            |    66
 8ea1aa0f2e4749a4802986fefbf54d43 | System     | system         | transactions | \xd555              | \xe000            |    66
 e70fd34078e84fef945f28fe6b004309 | System     | system         | transactions | \x8000              | \x8aaa            |    64
 e66879054249434da01b3e7a314b983c | System     | system         | transactions | \x4000              | \x4aaa            |    63
 9d64f34797924998911922563d3ad0ec | System     | system         | transactions | \xa000              | \xaaaa            |    63
 9cf4fc4a834040df91987b26bf9831ed | System     | system         | transactions | \x4aaa              | \x5555            |    63
 c2cfa997bf634637880137870cee25a1 | System     | system         | transactions | \xcaaa              | \xd555            |    61
 d3bbd37828ab4225a68bf4dd54924138 | System     | system         | transactions | \x1555              | \x2000            |    56
 b3bbaec3014f4f1aa01e04b42c5f40c7 | System     | system         | transactions | \x3555              | \x4000            |    56
 542c6f91ff6a403399ddf46e4d5f29bb | System     | system         | transactions | \x6000              | \x6aaa            |    56
 27780cde5a1b445d83ff83c865b9bea5 | System     | system         | transactions | \xc000              | \xcaaa            |    51
 4a64a9f25e414ce7a649be2e6d24c7ee | System     | system         | transactions | \x2aaa              | \x3555            |    46

You can see that only a single tablet of the table test_table is getting most of the requests, and you have the partition range of that particular tablet.

Distribution of the type of the wait events for each component

Detect where most time is being spent to help understand if the issue is related to disk IO, network operations, waiting for a condition or lock, or intense CPU work.

SELECT
    wait_event_component,
    wait_event_type,
    COUNT(*)
FROM
    yb_active_session_history
GROUP BY
    wait_event_component,
    wait_event_type
ORDER BY
    wait_event_component,
    count;

 wait_event_component | wait_event_type | count
----------------------+-----------------+-------
 TServer              | WaitOnCondition |    47
 TServer              | Network         |   910
 TServer              | Cpu             |  2665
 TServer              | DiskIO          |  8193
 YSQL                 | LWLock          |     1
 YSQL                 | Cpu             |  1479
 YSQL                 | Network         |  4575

Detect which client/application is sending the most amount of queries

Suppose you have a rogue application which is sending a lot of queries, you can identify the application using the following example query:

SELECT
    client_node_ip,
    COUNT(*)
FROM
    yb_active_session_history
WHERE
    client_node_ip IS NOT NULL
GROUP BY
    client_node_ip;

 client_node_ip  |  count
-----------------+-------
 127.0.0.2:56471 |    92
 127.0.0.1:56473 |   106
 127.0.0.1:56477 |    91
 127.0.0.2:56481 |    51
 127.0.0.3:56475 |    53
 127.0.0.3:56485 |    18
 127.0.0.3:56479 | 10997