Appendix B. Performance Counters

<feature><title>In This Chapter</title> <objective>

Counters Maintained by the Monitored Computer

</objective>
<objective>

Counters Maintained by the Management Server

</objective>
<objective>

Counters Maintained by the OpsMgr Connector

</objective>
<objective>

Counters Maintained by the OpsMgr Web Console

</objective>
<objective>

Counters Maintained by the ACS Collector

</objective>
</feature>

Operations Manager (OpsMgr) 2007 collects a number of performance counters for the agent (monitored computer), management server, and Audit Collection System (ACS) collector. These counters are created when the corresponding component is installed on a computer.

Counters Maintained by the Monitored Computer

Each managed agent maintains two sets of performance counters. Management servers maintain these counters as well.

  • Health Service counters—These are performance counters for the Health Service running on that computer (see Table B.1).

    Table B.1. Health Service Counters

    Counter[*]

    Description

    Malformed Data Items Dropped

    The total number of malformed data items that have been dropped

    Malformed Data Items Dropped Rate

    The rate at which malformed data items are being dropped

    Module Count

    The number of modules running in the Health Service

    Task Count

    The number of tasks running in the Health Service

    Workflow Count

    The number of workflows running in the Health Service

    [*] There will be an instance of each of these per management group.

  • Health Service Management Group counters—These are performance counters relating to activity for the management group associated with that agent (see Table B.2). If the agent reports to multiple management groups (multihomed), there will be multiple instances of the Health Service Management Group counters.

    Table B.2. Health Service Management Group Counters

    Counter[*]

    Description

    Active File Downloads

    The number of file downloads to this management group that are currently active.

    Active File Uploads

    The number of file uploads from this management group that are currently active.

    Active Batch Processing Time

    The average amount of time in milliseconds it takes to process a batch of data.

    Average Bind Write Action Batch Size

    The average amount of time in milliseconds it takes to process a batch of data.

    Batch Processing Rate

    The rate at which batches of data are being processed.

    Bind Data Source Average Batch Size

    The average number of items inserted per batch into the workflow which originated from a remote Health Service.

    Bind Data Source Batch Post Rate

    The rate at which batches that originated from a remote Health Service inserted into the workflow.

    Bind Data Source Item Drop Rate

    The rate at which items are dropped by the bind data source because the workflow cannot process them quickly enough.

    Bind Data Source Item Incoming Rate

    The rate at which items are received from another Health Service.

    Bind Data Source Item Post Rate

    The rate at which items received from another Health Service are inserted into the workflow.

    Bind Write Action Batch Insertion Rate

    The rate at which batches are sent through the workflow destined for a remote Health Service.

    Bind Write Action Multiple Item Batches

    The number of batches sent through the workflow destined for a remote Health Service with more than one item in them.

    Bind Write Action Single Item Batches

    The number of batches sent through the workflow destined for a remote Health Service that have exactly one item in them.

    Bind Write Action Zero Item Batches

    The number of batches sent through the workflow destined for a remote Health Service that have zero items in them (NULL Sets).

    Incoming Management Data Rate

    The rate at which management data is received by the Health Service.

    Item Incoming Rate

    The rate at which items are being received by the Health Service.

    Maximum Send Queue Size

    The maximum size of the Send Queue.

    Send Queue % Used

    The percentage of the Send Queue in use.

    Send Queue Ack Timeout Rate (>4 send attempt)

    The number of Ack timeouts for items that have been sent more than four times.

    Send Queue Ack Timeout Rate (1 send attempt)

    The number of Ack timeouts for items that have been sent only once.

    Send Queue Ack Timeout Rate (2 send attempts)

    The number of Ack timeouts for items that have been sent twice.

    Send Queue Ack Timeout Rate (3 send attempts)

    The number of Ack timeouts for items that have been sent three times.

    Send Queue Ack Timeout Rate (all send attempts)

    The rate of items scheduled to re-send because they have not been acknowledged and are marked as requiring guaranteed delivery.

    Send Queue Incoming Acknowledgement Rate

    The rate of incoming acknowledgements for items sent with guaranteed delivery.

    Send Queue Insertion Rate

    The number of items inserted into the queue.

    Send Queue Insertion Rate (Guaranteed Delivery)

    The number of items marked as guaranteed delivery inserted into the queue.

    Send Queue Insertion Rate (Non-Guaranteed Delivery)

    The number of items marked as nonguaranteed delivery inserted into the queue.

    Send Queue Item Expiration Rate

    The rate at which items are being purged from the queue because they have not been sent within the queue expiration time.

    Send Queue Send Rate

    The number of items sent from the queue.

    Send Queue Send Rate (Guaranteed Delivery)

    The number of items sent that are marked for guaranteed delivery. This only records items sent; it does not account for whether or not the items are acknowledged.

    Send Queue Send Rate (Non-Guaranteed Delivery)

    The number of items sent that are not marked for guaranteed delivery.

    Send Queue Size

    The size of the send queue in bytes, not including any journaled data.

    [*] There will be an instance of each of these per management group.

 

Counters Maintained by the Management Server

In addition to the counters referenced in Tables B.1 and B.2, all management servers maintain performance counters for a number of items.

The first three groups of performance counters are located on each management server, but the associated service only runs when that management server is a Root Management Server (RMS). When the server is not acting as the RMS, these counters do not collect data.

  • OpsMgr Config Service (see Table B.3)

    Table B.3. OpsMgr Config Service

    Counter[*]

    Description

    Avg Management Pack Request

    Average time to service a management pack (MP) request

    Avg Request Queue Length

    Average length of the request queue

    Avg Request Queue Wait Time

    Average wait time of requests in the queue

    Change Notifications Failed

    Number of change notifications that failed

    Dirty State Notifications Failed

    Number of dirty state notifications that could not be sent

    Management Pack Requests Failed

    Number of MP requests that could not be processed

    Number of Active Requests

    Number of requests being processed

    Number of Queued Requests

    Number of requests waiting in the queue

    Number of Requests per Sec

    Requests competed per second

    [*] These are active on the RMS and are per management group.

  • OpsMgr Config Service per Agent (see Table B.4)

    Table B.4. OpsMgr Config Service per Agent

    Counter[*]

    Description

    Avg. State Sync Request (sec)

    Average time to service a State Sync Request from this agent

    State Sync Requests Failed

    Number of State Sync Requests received from this agent that could not be processed

    State Sync Requests Received

    Number of State Sync Requests received from this agent

    [*] These are active on the RMS and are per management group.

  • OpsMgr SDK Service (see Table B.5)

    Table B.5. OpsMgr SDK Service

    Counter[*]

    Description

    Client Connections

    Number of current clients connected to the OpsMgr SDK service

    Client Connections Using Cache

    Number of current clients connected to the OpsMgr SDK service that are using the cache

    Pending Client Type Cache Refresh

    Number of currently executing type cache refresh threads in the OpsMgr SDK service

    [*] These are active on the RMS and are per management group.

The next sets of counters are maintained on each management server.

  • OpsMgr DB Write Action Cache (see Table B.6)

    Table B.6. OpsMgr DB Write Action Cache

    Counter[*]

    Description

    Cache Hit Ratio

    Hit ratio for the cache

    Cache Hit Size

    Number of entries in the cache

    [*] These are active on each management server.

  • OpsMgr DB Write Action Modules (see Table B.7)

    Table B.7. OpsMgr Write Action Modules

    Counter[*]

    Description

    Avg. Batch Size

    Average number of entries processed by the module

    Avg. Processing Time

    Amount of time it takes to process an incoming batch on average

    [*] These occur on each management server for the following instances: Alert Write Module, Discover Write Module, Event Write Module, Performance Signature Write Module, Performance Write Module, SQLobj Write Module, and State Change Module.

  • OpsMgr DW Synchronization Module (see Table B.8)

    Table B.8. OpsMgr DW Synchronization Module

    Counter[*]

    Description

    Avg Batch Processing Time, ms.

    Average batch processing time in milliseconds

    Avg Batch Size

    Average batch size

    Batch Age/sec

    Batch age in seconds

    Batch Size

    Batch size

    Batches/sec

    Batches per second

    Data Items/sec

    Data items per second

    Errors/sec

    Errors per second

    Total Error Count

    Total error count

    [*] These occur on each management server for the following instances: Configuration, Domain Snapshot, Health Service Outage, Maintenance Mode, Managed Identity, Relationship, and Type Managed Entity.

  • OpsMgr DW Write Module (see Table B.9)

    Table B.9. OpsMgr DW Writer Module

    Counter[*]

    Description

    Avg. Batch Processing Time, ms

    Average batch processing time in milliseconds

    Avg Batch Size

    Average batch size

    Batches/sec

    Batches per second

    Data Items/sec

    Data items per second

    Dropped Batch Count

    Dropped batch count

    Dropped Data Item Count

    Dropped data item count

    Errors/sec

    Errors per second

    Total Error Count

    Total error count

    [*] These occur on each management server for the Alert, Event, Performance, and State instances.

Counters Maintained by the OpsMgr Connector

The OpsMgr Connector is located on management servers and database servers. The Connector is an interface to the OpsMgr SDK Service that allows applications to communicate synchronously with the RMS. Counters used by the OpsMgr Connector are listed in Table B.10.

Table B.10. OpsMgr Connector Counters

Counter

Description

Bytes Decrypted

The number of bytes decrypted by the connector per second. This may be more or less than the number of data bytes received by the connector due to compression.

Bytes Encrypted

The number of bytes encrypted by the connector per second. This may be more or less than the number of data bytes submitted for transmission due to compression.

Bytes Received

The total number of network bytes received by the OpsMgr connector per second. This may be more or less than the number of data bytes received due to compression and encryption.

Bytes Transmitted

The total number of network bytes transmitted by the OpsMgr connector per second. This may be more or less than the number of data bytes submitted for transmission due to compression and encryption.

Fragmented Compression Packets

The number of compression packets that were received by the connector and had to be reassembled per second. This can be caused by a number of factors, including the size of the packet, the Maximum Transmission Unit (MTU) of the network interface, the quantity of traffic on the network, and/or the amount of load on the connector. Packet reassembly can hurt performance and the ability of the server to scale.

Fragmented Data Packets

The number of data packets that were received by the connector and had to be reassembled per second. This can be caused by a number of factors, including the size of the packet, the MTU of the network interface, the quantity of traffic on the network, and/or the amount of load on the connector. Packet reassembly can hurt performance and the ability of the server to scale.

Fragmented Encryption Packets

The number of encrypted packets that were received by the connector and had to be reassembled per second. This can be caused by factors including the size of the packet, the MTU of the network interface, the quantity of traffic on the network, the encryption algorithm used, and/or the amount of load on the connector. Packet reassembly can hurt performance and the ability of the server to scale.

Fragmented Session Packets

The number of session packets that were received by the connector and had to be reassembled per second. This can be caused by a number of factors, including the size of the packet, the MTU of the network interface, the quantity of traffic on the network, and/or the amount of load on the connector. Packet reassembly can hurt performance and the ability of the server to scale.

Fragmented SSPI Packets

The number of Security Support Provider Interface (SSPI) negotiation packets that were received by the connector and had to be reassembled per second. This can be caused by a number of factors, including the size of the packet, the MTU of the network interface, the quantity of traffic on the network, and/or the amount of load on the connector. Packet reassembly can hurt performance and the ability of the server to scale.

I/O Errors

The number of I/O errors encountered by the connector per second.

I/O Operations Timed Out

The number of I/O operations on the connector that were timed out due to inactivity per second.

Open Connections

The number of TCP/IP (Transmission Control Protocol/Internet Protocol) connections currently open in the connector.

Server Listen Queue Length

The number of outstanding listen requests that are queued. If this drops to zero, the connector will not be able to accept incoming connections.

 

Counters Maintained by the OpsMgr Web Console

Table B.11 lists the counters used the OpsMgr Web Console application.

Table B.11. OpsMgr Web Console Counters

Counter

Description

CacheItems

Number of items that have been cached

Connect

Number of connects

ConnectErrors

Connection errors

PrepareView Time

Amount of time it takes to prepare a view

Reconnect

Number of reconnects

Session

Number of sessions

Threads

Number of threads

Counters Maintained by the ACS Collector

On the ACS Collector, we find two specific performance objects:

  • ACS Collector (see Table B.12)

    Table B.12. ACS Collector Counters

    Counter

    Description

    Connected Clients

    Number of clients currently connected.

    Database Queue % Full

    This is the ratio of the number of events currently in the database loader queue divided by the size of the database loader queue, expressed as a percentage.

    Database Queue Backoff Threshold

    The collector will not allow new connections while the database queue length is greater than this threshold.

    Database Queue Disconnect Threshold

    The collector will disconnect existing connections while the database queue length is greater than this threshold.

    Database Queue Length

    This is the number of events currently in the database loader queue.

    DB Loader Event Inserts/sec

    This is the number of records per second inserted into the dtEvent database table.

    DB Loaded Principal Inserts/sec

    This is the number of records per second inserted into the dtPrincipal database table.

    DB Loader String Inserts/sec

    This is the number of records per second inserted into the dtString database table.

    DB Principal Cache Hit %

    This is the percentage of all principal handling requests handled by the string cache, avoiding a principal insert or lookup in the database.

    DB Request Queue Length

    This is the number of requests from the collector currently waiting to be serviced by the ACS database. These requests are used during forwarder handshake and database maintenance and are not part of normal event handling.

    DB String Cache Hit %

    This is the percentage of all string-handling requests handled by the string cache, avoiding a string insert or lookup in the database.

    Event time in collector in milliseconds

    This is the current time between event arrival at the collector and insertion in the database queues in milliseconds.

    Incoming Events/sec

    This is the current time between event arrival at the collector and insertion in the database queues in milliseconds.

    Interface Audit Insertions/sec

    Number of interface audit insertions per second.

    Interface Queue Length

    Length of the internal queue to the subscriber interface.

    Registered Queries

    Number of queries currently registered.

  • ACS Collector Client (see Table B.13)

    Table B.13. ACS Collector Client Counters

    Counter[*]

    Description

    Average Time between event generation and collection and seconds

    This is the average time between event generation time (creation timestamp) and collection time in milliseconds.

    Incoming Audit Size

    This is the total size of events per second arriving at the collector from a specific forwarder.

    Incoming Audits/sec

    This is the total number of events per second arriving at the collector from a specific forwarder.

    [*] These occur for each collector/client pair.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset