Performance and throughput considerations
Performance, like beauty, is in the eye of the beholder. It can be very subjective, but at the same time it can be precisely measured. This chapter discusses the performance attributes of JES2 and JES3. Though it cannot answer the question of how JES2 will perform in your environment, it will improve your understanding of the performance differences between the two.
13.1 Performance of JES2 compared to JES3
Measuring performance involves looking at factors such as CPU usage, I/O rates, and memory usage. Comparing one version of a product to another generally involves keeping the workload constant, updating the products being compared, rerunning the workload, and comparing the measurements. But when comparing JES2 to JES3, the problem is complicated by the differences in where each JES does its processing.
In JES3, there is a JES3 address space on each z/OS image. One of those address spaces is designated the JES3 global. The other JES3 address spaces are designated JES3 locals. The JES3 global maintains the job queues, selects work to be processed on each z/OS image in the JESplex, and processes many of the requests made to JES3. The JES3 locals normally do relatively little processing. A local's primary purpose is to act as a hot standby in case the global goes down, and to process requests that must run on that particular z/OS image, for example, data set locate processing that uses a catalog or volume accessible only from that image, and commands that must be processed on the local's image.
In JES2, similar to JES3, each z/OS image has a JES2 address space. However, in JES2, all instances of JES2 are the same. There is no centralized global processor responsible for distributing work to each member of the JESplex. Instead, each JES2 address space is responsible for providing work to its z/OS system and for servicing requests from that system. Each address space maintains a copy of the work queues, selects work to process on its system, and handles all requests made of JES on that member.
There is no straightforward comparison that can be made between a JES3 address space and a JES2 address space. Perhaps in a single-member JESplex some comparisons can be made, but there are few of these environments in the real world. You could aggregate the consumption of all the JES3 address spaces and all the JES2 address spaces, but that would miss another important component: the communications used by each JES to accomplish the needed tasks. JES3 uses XCF (through a component called JESXCF) to perform the required JESplex communications. JES2 also uses XCF (through JESXCF), but to a lesser degree. However, JES2 uses the checkpoint to share the work queues among the members.
Processing for the XCF messaging done by JES is part of the JESXCF and XCF address spaces. Since JES3 does more of its communications using JESXCF, that also has to be considered when trying to measure performance. JES2 checkpoint processing occurs in the JES2 address space. If the JES2 checkpoint is in a Coupling Facility structure, there is the cost of XES processing and the Coupling Facility overhead that is also part of the performance picture.
The comparison is further complicated by the differences in where each JES performs different parts of its processing. Depending on the function (and sometimes the options), processing might be done in the JES address space or in another address space. Here are some examples of processing differences:
C/I Conversion and interpretation processing converts the JCL that was submitted into an internal representation that can be used by z/OS to run a job. In JES3, C/I processing can be done on the JES3 global or it can be offloaded to a JES3 C/I FSS. In JES2 V2R1, it works the same way. Before that release, conversion was always done in the JES2 address space and interpretation was done when work was selected for execution. That processing might occur on the system that the job was submitted on, but it does not necessarily have to occur there.
Input Input processing creates the basic structure for a new job that is being submitted. During the input phase, a first pass through the submitted JCL is performed, processing any JECL cards and building the required instream data sets. In JES3, input processing is performed in the JES3 address space on the global. In JES2, internal reader and NJE over TCP/IP processing is primarily done in the submitting (application or NETSERV) address space. For other input sources, processing is done in the JES2 address space.
Extended Status Extended status is an interface into JES to query the status of jobs and output in the system. JES3 does most of the processing in the JES3 address space on the global. All the results are sent from the global to the requesting address space via JESXCF. In JES2, all the processing for extended status is performed in the requesting address space.
Processing for the JES Property SSI and the JES Device SSI also is done by the global address space in JES3 and in the requesting address space in JES2.
These interfaces are used by products like SDSF, FTP, z/OSMF, and others that display information about jobs and devices associated with JES.
SNA NJE NJE processing involves CPU-intensive processing to build the data records required to transmit data across NJE. Since JES3 uses BDT to perform SNA NJE processing, that CPU-intensive processing occurs in the BDT address space. JES2 does all the SNA processing in the JES2 address space.
Application SPOOL I/O
When applications write SYSOUT to SPOOL, they use a JES-specific access method under the covers to perform the SPOOL I/O. In JES3, SPOOL write I/Os from all address spaces on a system are queued to a single write queue (per SPOOL extent) to be written. The actual STARTIO request will be issued in whatever address space is in control when the current I/O ends or when the first element is added to the queue. JES3 maintains one active I/O per SPOOL extent per system.
In JES2, each address space performs its own SPOOL I/O. Each address space can start up to 2 I/Os per SPOOL extent. Each I/O is recorded (charged) to the address space that issued the I/O. I/O queuing priority also applies to SPOOL I/Os. JES2 can also take advantage of multiple paths (PAV) to the SPOOL volumes.
Monitor Each JES has a real-time monitor that looks at what the JES is doing and reports anomalous situations. These monitors tend to poll JES looking for processes that are looping or waiting for extended periods of time. By their nature, they tend to use a noticeable amount of CPU. In JES3, the monitor runs in the JES3 address space. In JES2, there is a separate address space (JES2MON) that performs the monitoring.
There are other more subtle examples of these differences. Because of the complexity of where processing is done and trying to identify what processing is JES-related versus application-related, we cannot make simple statements about how much CPU one JES uses compared to the other. What we can do is look at factors like overall system overhead to understand the requirements of each JES. We can also look at wall clock time to complete specific activities as a way to compare the performance of the two JESs.
13.1.1 Steady state processing
When trying to compare the performance of JES3 and JES2, one easy environment to measure is steady state processing: in an idle system, how much CPU is being used by each JES? In JES3's case, there is nothing for it to do when there are no requests to process, and thus the global is mostly idle. It is awakened by requests to perform processing as they occur. However, each JES2 acts independently and primarily discovers work by reading the JES2 checkpoint data set. In effect, it polls the checkpoint on a periodic basis to see whether any other member has added work that needs to be processed.
As a result, even in an idle system, JES2 is constantly reading and writing the JES2 checkpoint data set and consuming resources. How often this is done can be controlled by JES2 tuning parameters on the JES2 MASDEF statement. For more information about tuning JES2, see z/OS JES2 Initialization and Tuning Guide, SA22-7532.
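As a hedged illustration, the MASDEF parameters most often involved in this tuning are HOLD= (how long a member keeps the checkpoint after acquiring it) and DORMANCY= (the minimum and maximum time a member waits before trying to reacquire it), both in hundredths of a second. The values below are examples only, not recommendations:
MASDEF HOLD=50,            /* Hold the checkpoint 0.5 seconds      */
       DORMANCY=(100,500)  /* Wait 1 to 5 seconds before reacquire */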
Another thing that occurs on an idle JES2 system is the maintenance of data areas used by various interfaces. Even though JES2 processes extended status requests in the user address space, the data that is used to satisfy the request comes from a copy of the JES2 work queues that is maintained in a data space. The data space copy is kept up to date every time the JES2 checkpoint is written. Similarly, device information is stored in 64-bit common storage for use by the JES device information requests.
13.2 Throughput considerations
Where performance is often looked at as how much resource is needed to perform a task, throughput is the amount of work that can get done in a given time. In some cases, throwing more resources at a problem can improve throughput. In other cases, throwing more resources at a problem will not get it done any faster.
Often the solution to a throughput problem is not more resources, but a better way of utilizing the existing resources to get more work done per unit of time.
13.2.1 MAS considerations
Both JES3 and JES2 have a single main task that is running in the JES address space. This task services the requests that are made of each JES.
In JES3 though, each JES3 address space has a main task, and most of the work occurs in the main task on the JES3 global. This task must handle most of the requests made of JES3 throughout the JESplex.
In JES2, the main task in each JES2 address space handles JES2 requests that originate on that member (they are not sent to other members to process). However, the JES2 main task must access the JES2 checkpoint to respond to many requests. Since only one member of a MAS can access the checkpoint at a time, there can be only one main task actively processing these requests in the JES2 MAS. However, not all processing done by the JES2 main task requires access to the checkpoint, meaning that some progress can be made on requests even when the checkpoint is not owned.
If JES2 is running as a single-member MAS, it can own the checkpoint constantly, and no delays are introduced by not owning the checkpoint. However, when a second member joins the MAS, checkpoint access introduces a delay in processing many requests (this is known as a MAS penalty).
Tuning of the JES2 checkpoint can greatly reduce the impact of checkpoint serialization. This can make the checkpoint available in a more timely manner. But there is only so much tuning that can be done for the checkpoint. Eventually, you need to start thinking about parallelizing processes to improve throughput.
Submitting jobs
Submitting jobs to JES involves allocating an internal reader, writing a JCL stream to the internal reader, then closing and freeing the internal reader. As part of this process, JES must examine every card that is submitted, process cards that have information needed at JES input processing time (JECL, JOB card information, and so on), and separate instream data from JCL cards.
In JES3, this processing is done by writing to SPOOL all the cards passed to the internal reader for the job. Once finished, the application must close the internal reader or issue an ENDREQ macro to get the SPOOL data set queued to the JES3 global. The JES3 local does not get involved in the submitting of jobs. So when the address space completes writing the job to SPOOL, a request is queued to the global to process the job. The task in the submitting address space waits for input processing to complete before returning from the ENDREQ or CLOSE that was issued. To perform input processing for the job, the global reads the cards written to the internal reader data set and processes the JCL stream. Each stream (internal reader submission) is processed sequentially; however, multiple streams can be processed by separate threads (DSPs) in the JES3 main task. In this scheme, adding more DSPs that can process internal reader data streams (via the OPTIONS INTRDR= keyword) can improve throughput, but only to the point that the single main task TCB can interleave the I/Os that read and write the data. Parallelizing internal reader submission (using 20 internal readers each submitting one job versus one internal reader submitting 20 jobs) only helps if there are enough DSPs to process the work.
JES2 performs input processing in a completely different way. In JES2, the bulk of the input processing work occurs in the address space that allocated the internal reader. However, the JES2 main task must place the job in the job queue at the start of job submission (when the JOB card is detected) and move the job to its next queue at the end of the job. These processes must access the checkpoint and are subject to checkpoint delays. Due to the nature of the main task processing, a single checkpoint update (delay) can process one job or 100 or more jobs. So by submitting jobs using multiple internal readers, you can greatly increase the number of jobs per minute that can be submitted: one hundred internal readers can submit 100 jobs (one job per internal reader) in the same time one internal reader takes to submit one job. In JES2's case, there are no resources that can make input processing for a single internal reader go faster, but by parallelizing the processing (using multiple internal readers), you can get a significant throughput improvement.
Many job scheduler products have options to increase the number of internal readers they use to submit jobs. Though these options might not have been needed in JES3, use them when you run JES2.
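For example, a batch application typically submits work by writing a JCL stream to an internal reader DD. A minimal sketch of one such submission step follows (the library and member names are hypothetical); running several of these concurrently, each with its own INTRDR allocation, provides the parallelism described above:
//SUBMIT   EXEC PGM=IEBGENER
//SYSPRINT DD SYSOUT=*
//SYSIN    DD DUMMY
//SYSUT1   DD DSN=MY.JCL.LIB(JOB1),DISP=SHR
//SYSUT2   DD SYSOUT=(A,INTRDR)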
13.2.2 Setting limits and counts
Limits are a way to manage resources in a system. Limits can be used to manage a constrained resource like memory, or they can be used to limit the effect of runaway processes. Counts control the number of processors available to accomplish a function; these processors are themselves consumers of resources. Some processes can be enhanced just by adding resources. Other processes can be enhanced by adding more processors. Still others need both more resources and more processors.
There are a number of ways to manage constrained resources. Applications could choose to not have any external limit and react when a resource is no longer available. Alternatively, an application could set a hard limit that causes processing to wait or fail when the limit is reached. Or the application could include code that cleverly manages limits.
Understanding what limits exist and how to properly manage those limits can improve throughput and prevent outages.
JES2 processing generally has either no limit or a hard limit. No limit means the resource is bounded only by the architecture (for example, a data space can be at most 2 gigabytes in size). A hard limit is one specified via a JES2 parameter (such as the number of data buffers in JES2). Hard limits can be changed, but someone must take an action to change them.
As we will see, most limits in JES2 can be displayed using commands and products such as SDSF. However, no matter how many ways an application notifies users when limits are being reached, it does no good if no one is looking, or they are looking in the wrong place. JES2 has a number of ways to assist installations in managing resource limits. Understanding them will ensure that JES2 continues to operate smoothly.
Monitoring JES2 resources
The resource limits in JES2 are monitored by a process in JES2 called the resource monitor. The monitor is looking for resources whose use has reached an installation-specified limit. Every resource has an installation-specifiable warning level used by the resource monitor to determine at what point a HASP050 message will be issued. The warning level is specified as a percentage on WARN= keywords on various JES2 statements. The default is always WARN=80 (start issuing warning messages when the resource usage reaches 80%).
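The warning level can also be adjusted dynamically by operator command. For example, to raise the SPOOL space warning threshold to an illustrative 85% (using the WARN= subparameter of SPOOLDEF TGSPACE=):
$T SPOOLDEF,TGSPACE=(WARN=85)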
When resource usage reaches the warning level, a highlighted HASP050 message is issued. The HASP050 message contains the current utilization percentage. For example, a message for a shortage of SPOOL space (track groups - TGS) would look like this:
$HASP050 JES2 RESOURCE SHORTAGE OF TGS - 93% UTILIZATION REACHED
Every 30 seconds, the current utilization of any resource whose usage has exceeded the threshold is examined. If the utilization has dropped below the warning level, the message is deleted (DOMed). If the percentage has changed but is still above the threshold, the old message is deleted (DOMed) and a new message is issued with the new percentage. If the utilization is at or near 100%, the old message is deleted and a new message is issued every 30 seconds. This ensures that the installation is reminded that the resource has been exhausted.
If this is a MAS-scope resource (such as SPOOL space), the first member that discovers the shortage issues the message. Other members defer messaging to that member. This prevents a flood of HASP050 messages from each member. However, if resource utilization nears 100%, the HASP050 message is issued on every member.
The HASP050 message for some resources includes additional information to help determine a more appropriate resource limit. This occurs when a process requires a resource that has been exhausted. In this case, a longer form of the HASP050 message is used as shown in Example 13-1.
Example 13-1 Example of a long form HASP050 message
*$HASP050 JES2 RESOURCE SHORTAGE OF BUFX - 100% UTILIZATION REACHED
A TOTAL OF 292 BUFX ARE CURRENTLY DEFINED, OF WHICH:
292 (100%) ARE IN USE
3 (1%) ARE BEING WAITED FOR
0 PROCESSORS REQUESTED BUFX BUT DID NOT WAIT
THE LARGEST UNFULFILLED REQUEST WAS FOR 0 BUFX
A MINIMUM OF 295 BUFX IS REQUIRED TO SATISFY CURRENT DEMAND
In this example, the BUFX resource (specified on BUFDEF EXTBUF) has been completely exhausted and at least 295 buffers are needed to satisfy current demand. Use an operator command to increase the limit (to 295 plus at least another 20% so that you will not be above the warning threshold again). The long form of the message is used primarily for buffers used by multiple processes in JES2 (such as data, control block, VTAM, BSC, and header buffers).
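Following that guidance (295 plus 20% is 354, rounded here to 360), a command along these lines would relieve the shortage, assuming the LIMIT= subparameter of BUFDEF EXTBUF=:
$T BUFDEF,EXTBUF=(LIMIT=360)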
Many installations automate on the HASP050 message and, based on the resource, trigger a text message or an email to someone responsible for the system. However, in many installations the HASP050 message has become so frequent that it is ignored. This typically happens when the warning level for a resource is set too low and the system normally operates above that warning level. As a result, the operations staff learns to ignore that HASP050 message because of its frequency, and ends up ignoring all HASP050 messages.
Because of this, you must scan SYSLOGs for instances of HASP050 messages and take the appropriate action to address the cause of each message. You can raise the resource limit, raise the warning level, or remove monitoring of the resource (by setting the warning level to 0). This ensures that you only see the HASP050 message when there is a real problem, and operations can be told to take action whenever they see one.
JES2 resource usage history
One of the functions of the JES2 health monitor is to maintain a history of the utilization of the major JES2 resources. The history is maintained in memory and encompasses the life of this JES2 subsystem. There are two ways to retrieve this information. The JES2 command $JDHISTORY will display this on the console as shown in Example 13-2.
Example 13-2 Display of the history of JES2 resource usage
$HASP9130 D HISTORY 866
$HASP9131 JES2 BERT USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 650 539 389 539 435
2012.250 10:27:14 650 389 129 490 310
$HASP9131 JES2 BSCB USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 42 0 0 0 0
2012.250 10:27:14 42 0 0 0 0
$HASP9131 JES2 BUFX USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 300 290 0 298 268
2012.250 10:27:14 292 0 0 22 2
$HASP9131 JES2 CKVR USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 2 0 0 0 0
2012.250 10:27:14 2 0 0 0 0
$HASP9131 JES2 CMBS USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 201 0 0 0 0
2012.250 10:27:14 201 0 0 0 0
$HASP9131 JES2 CMDS USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 200 0 0 0 0
2012.250 10:27:14 200 0 0 0 0
$HASP9131 JES2 ICES USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 100 0 0 0 0
2012.250 10:27:14 100 0 0 0 0
$HASP9131 JES2 JNUM USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 9999 212 111 212 142
2012.250 10:27:14 9999 111 0 211 104
$HASP9131 JES2 JOES USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 200 57 6 57 21
2012.250 10:27:14 200 6 0 104 46
$HASP9131 JES2 JQES USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 500 212 111 212 142
2012.250 10:27:14 500 111 0 211 104
$HASP9131 JES2 LBUF USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 166 0 0 1 1
2012.250 10:27:14 166 0 0 1 0
$HASP9131 JES2 NHBS USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 23 0 0 0 0
2012.250 10:27:14 23 0 0 0 0
$HASP9131 JES2 SMFB USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 52 0 0 0 0
2012.250 10:27:14 52 0 0 0 0
$HASP9131 JES2 TBUF USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 104 0 0 0 0
2012.250 10:27:14 104 0 0 0 0
$HASP9131 JES2 TGS USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 525 484 230 484 309
2012.250 10:27:14 525 230 21 525 254
$HASP9131 JES2 TTAB USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 3 0 0 0 0
2012.250 10:27:14 3 0 0 0 0
$HASP9131 JES2 VTMB USAGE HISTORY
DATE TIME LIMIT USAGE LOW HIGH AVERAGE
-------- -------- --------- --------- --------- --------- ---------
2012.250 11:00:01 24 0 0 0 0
2012.250 10:27:14 24 0 0 0 0
$HASP9132 MAIN TASK SAMPLING PERCENT HISTORY
DATE TIME COUNT ACTIVE W-DISP IDLE WAIT L-LOCK N-DISP PAGING
-------- ----- ------ ------ ------ ------ ------ ------ ------ ------
2012.250 11:00 46885 0.06 0.00 99.93 0.00 0.00 0.00 0.00
2012.250 10:27 36608 1.84 0.00 97.91 0.24 0.00 0.00 0.00
$HASP9133 JES2 ERROR HISTORY
DATE TIME MAIN DISTERR CBIO SUBTASK NODUMP OTHER
-------- -------- -------- -------- -------- -------- -------- --------
2012.250 11:00:01 0 0 0 0 0 0
2012.250 10:27:14 0 0 0 0 0 0
$HASP9134 JES2 <16M USER STORAGE USE HISTORY (PAGES)
DATE TIME REGION USAGE LOW HIGH AVERAGE
-------- -------- ------- ------- ------- ------- -------
2012.250 11:00:01 1,274 357 341 357 356
2012.250 10:27:14 1,274 341 299 341 340
$HASP9134 JES2 <16M SYSTEM STORAGE USE HISTORY (PAGES)
DATE TIME REGION USAGE LOW HIGH AVERAGE
-------- -------- ------- ------- ------- ------- -------
2012.250 11:00:01 1,274 88 88 90 88
2012.250 10:27:14 1,274 88 61 90 86
$HASP9134 JES2 >16M USER STORAGE USE HISTORY (PAGES)
DATE TIME REGION USAGE LOW HIGH AVERAGE
-------- -------- ------- ------- ------- ------- -------
2012.250 11:00:01 476,928 163,550 163,450 163,551 163,541
2012.250 10:27:14 476,928 163,450 162,984 163,452 163,432
$HASP9134 JES2 >16M SYSTEM STORAGE USE HISTORY (PAGES)
DATE TIME REGION USAGE LOW HIGH AVERAGE
-------- -------- ------- ------- ------- ------- -------
2012.250 11:00:01 476,928 3,422 3,422 3,464 3,422
2012.250 10:27:14 476,928 3,422 3,339 3,474 3,426
Entries in this display include one line for every time interval. Intervals start at the top of every hour and when the monitor is started (this system was IPLed at about 10:20, thus the first entry). The $JDHISTORY command by default displays all resources and up to 24 hours of history. The operands on the command can alter what is displayed. See z/OS JES2 Commands, SA22-7526 for more information about the command syntax.
The response includes information about other resources that JES2 uses. The sampling history contains information about what the main task is doing at each sample (active, idle, waiting, and so on) and there is also a history of the memory usage in the JES2 address space.
Another way to view the resource usage history is to use the SDSF RM display. By default, it displays only the current interval, but additional history is available. One advantage to the SDSF display is that it allows you to set a new limit for a resource or adjust the warning level via an overtype on the window. Example 13-3 on page 164 contains a sample display.
Example 13-3 Sample SDSF RM display
Display Filter View Print Options Search Help
------------------------------------------------------------------------------------------
SDSF RESOURCE MONITOR DISPLAY SY1 LINE 1-17 (17)
COMMAND INPUT ===> SCROLL ===> CSR
PREFIX=* DEST=(ALL) OWNER=* SYSNAME=SY1
NP RESOURCE SysId Status Limit InUse InUse% Warn% IntAvg IntHigh IntLow OverWarn%
BERT IBM1 WARNING 650 541 83.23 80 464 541 389 49.22
BSCB IBM1 42 0 0.00 80 0 0 0 0.00
BUFX IBM1 WARNING 300 290 96.66 90 274 298 0 93.46
CKVR IBM1 2 0 0.00 80 0 1 0 0.00
CMBS IBM1 201 0 0.00 80 0 0 0 0.00
CMDS IBM1 200 0 0.00 80 0 0 0 0.00
ICES IBM1 100 0 0.00 80 0 0 0 0.00
JNUM IBM1 9999 213 2.13 80 162 213 111 0.00
JOES IBM1 200 57 28.50 80 31 57 6 0.00
JQES IBM1 500 213 42.60 80 162 213 111 0.00
LBUF IBM1 166 0 0.00 80 0 1 0 0.00
NHBS IBM1 23 0 0.00 80 0 0 0 0.00
SMFB IBM1 52 0 0.00 80 0 0 0 0.00
TBUF IBM1 104 0 0.00 0 0 0 0 0.00
TGS IBM1 WARNING 525 487 92.76 80 358 487 230 49.28
TTAB IBM1 3 0 0.00 80 0 0 0 0.00
VTMB IBM1 24 0 0.00 80 0 0 0 0.00
The information displayed contains some additional columns that the $JDHISTORY output does not include. The Status column tells you whether the current utilization is above the warning level, and the OverWarn% column tells you the percentage of time in the interval that the resource was over the warning level. This percentage is very useful for determining whether a shortage is due to a short-term spike in usage or is indicative of a longer-term trend.
 
Note: The list of resource short names and the commands that control them can be found in the description of the $HASP050 message in z/OS JES2 Messages, SA22-7537.
JES2 resource limits
Limits can be grouped into checkpoint limits (MAS scope limits) and local limits (single-member-scope limits). The key difference is the impact of a shortage and the effort needed to increase or manage the limit.
Checkpoint limits
The JES2 checkpoint, whether stored in a DASD data set or in a CF structure, is limited in size. This limit is imposed by the medium on which the checkpoint is stored and can be altered only by installation actions. As a result, data structures placed in the checkpoint are also limited in number. These structures are also limited by the internal structure of the data.
Because these resources exist in the checkpoint, any shortage impacts all members of the MAS. To prevent a MAS-wide impact, these resources must be monitored and appropriate notifications made when a shortage exists.
The following is the list of checkpoint-related limits (a combined initialization sketch follows the list):
JOBDEF JOBNUM This defines the number of jobs that JES2 can store in the checkpoint at any one time. This is similar to the joblim value on the JES3 OPTIONS JOBNO initialization statement. When this value is reached, no new jobs (started tasks, time sharing users, or batch jobs) can enter the system. New jobs wait in their appropriate input processing routine for existing jobs to be purged.
JOBDEF RANGE This defines the valid range of job numbers that can be assigned on this system. This is similar to the lowest and highest values on the JES3 OPTIONS JOBNO initialization statement. Jobs entering from NJE (including SPOOL reload processing) can be assigned numbers outside this range if JOBDEF RASSIGN is set to YES. If all the job numbers in the specified range are in use, new jobs entering the system that require a job number within the range (non-NJE jobs, for example) wait in their appropriate input processing routine for an appropriate job number to be freed. The number of jobs in the range must be at least as large as the number of jobs defined on JOBDEF JOBNUM.
OUTDEF JOENUM Unlike JES3, JES2 groups SYSOUT data sets for processing when a spin data set is created or when a job goes through output processing. When this is done, the group of SYSOUT data sets that will be processed together is said to be placed into a Job Output Element (JOE). For JES2 to process a group of SYSOUT data sets it must be grouped into a JOE. You need to define enough JOEs to hold all the spin SYSOUT data sets and non-spin output data sets that will be ready to process on SPOOL. If there are no JOEs available, spin data sets will be placed in an unspun state and processed when JOEs become available. Jobs that complete will wait, queued for output phase processing. However, jobs will continue to run even though their output cannot be processed. In addition to output not being processed, SPOOL utilization will increase since data sets and jobs are not being purged. A general rule of thumb is to define JOENUM to be at least 2.5 times JOBNUM.
CKPTSPACE BERTNUM Another JES2-unique concept is BERTs. BERTs address the need to both extend the contents of data structures and add new data structures to the JES2 checkpoint. BERTs are generic 64-byte blocks of data that are stored in the JES2 checkpoint. They are used to extend the job and output group control blocks with optional data, and they also store information on job class properties, duplicate job names, and WLM service class queues. The number of BERTs associated with an object depends on the data being stored in them, so their usage can change based on the properties of the jobs and SYSOUT groups represented in the checkpoint. Because of their broad usage, a shortage of BERTs can create extreme problems for a JES2 system. JES2 takes a number of actions as the number of free BERTs falls, including stopping SYSOUT creation (similar to a JOE shortage), stopping batch job input processing, and eventually stopping all job input processing. However, running out of BERTs has caused systems to unexpectedly fail in ways that forced a JES2 cold start.
JES2 will automatically increase the limit for BERTs if JOBNUM or JOENUM is increased and BERTNUM is outside what is considered normal. The checkpoint data set must be large enough for the increased number of elements being requested and the potential increase in the number of BERTs.
 
Warning: Do not allow your MAS to run out of BERTs. If you do encounter a severe BERT shortage, do not shut down JES2 or IPL the system. You must resolve the BERT shortage via operator commands such as increasing the number of BERTs or purging jobs that use BERTs before restarting a JES2 member. Failure to do so might force a JES2 cold start.
To increase the number of BERTs, you might need to decrease some other resource in the checkpoint to free up space in the checkpoint before increasing the number of BERTs.
For more information about BERTs, see “BERT shortage special processing” on page 169.
SPOOLDEF TGSPACE JES2 keeps track of what SPOOL space is available using a bitmap in the JES2 checkpoint. Each bit represents a track group of SPOOL space. The size of the track group bitmap is controlled by the MAX parameter on SPOOLDEF TGSPACE. This is not the amount of total SPOOL space that is available for use by JES2. Rather it is the amount that can be defined to JES2. If this value is too low, JES2 will not be able to start new SPOOL volumes. However, if the amount of defined SPOOL space is exhausted, jobs will stop processing data. Also, since job termination uses a small amount of SPOOL space, jobs often cannot come out of execution if they have run out of SPOOL space and no additional space is available. Unlike JES3, JES2 does not reserve SPOOL space for use by a certain class of jobs.
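Putting these limits together, the checkpoint-related portion of an initialization stream might look like the following sketch. Every value is illustrative only and must be sized against your own workload and checkpoint capacity:
JOBDEF    JOBNUM=5000,                 /* Jobs in the checkpoint    */
          RANGE=(1,99999)              /* Valid job number range    */
OUTDEF    JOENUM=12500                 /* About 2.5 times JOBNUM    */
CKPTSPACE BERTNUM=7500                 /* BERT pool                 */
SPOOLDEF  TGSPACE=(MAX=16288,WARN=80)  /* Track group bitmap size   */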
Non-checkpointed limits
There are a number of limits that deal with how storage is used. These limits prevent a process from consuming all available storage, thus ensuring that some storage will be available for general short term use. If the limit of any of these resources is reached, the process requesting the storage normally waits for elements to be freed. However, in some cases, the limit will cause additional processing that needs the resource to fail.
The following is the list of local (non-checkpoint-related) limits (a combined initialization sketch follows the list):
TPDEF BSCBUF These are buffers used in BSC communication. They are used for NJE (including NJE over CTCs) and RJE. These buffers are in 24-bit storage (below the 16-megabyte line). If you do not have BSC lines (LINExxxx TYPE=unit_addr), set the maximum for these to 0. If you do have BSC lines, JES2 calculates a default based on the number of lines defined. Use the default for this parameter unless buffer shortages are reported by the HASP050 message. A shortage of these buffers prevents BSC connections from being established. Because these buffers are in 24-bit virtual and real storage, it is recommended that you avoid using BSC for NJE or RJE, that BSC lines be deleted, and that the number of BSC buffers be allowed to default (or be set) to 0.
If you dynamically add BSC lines using an operator command, this value must also be increased (by command) to prevent any shortages.
TPDEF SNABUF These are buffers used in SNA communication. They are used for both NJE and RJE. These buffers are in 31-bit storage (above the 16-megabyte line). If you do not have SNA lines (LINExxxx TYPE=SNA), set the maximum for these to 0. If you do have SNA lines, JES2 calculates a default based on the number of lines defined. Use the default for this parameter unless buffer shortages are reported by the HASP050 message. A shortage of these buffers prevents SNA connections from being established. If you are not using SNA for NJE or RJE, you can delete the SNA lines and set the number of SNA buffers to zero (or allow it to default to zero). If you dynamically add SNA lines using an operator command, this value must also be increased (by command) to prevent any shortages.
BUFDEF BELOWBUF These are buffers in the JES2 address space used to perform I/O to JES2 data blocks (SYSOUT data sets). There are a number of older functions in JES2 (in particular JES2-managed printers, including RJE) that use 24-bit buffers (below the 16-megabyte line). JES2 will calculate the default number of buffers based on a number of initialization parameters. This is normally sufficient for most installations. However, some installations can benefit from an increased number of these buffers. The HASP050 message provides extended information on what to set this value to if a shortage is encountered. A shortage of these buffers will delay processing of SYSOUT. Over-specifying this parameter can result in storage shortages for other processes, especially those that require 24-bit storage.
BUFDEF EXTBUF These are buffers in the JES2 address space used to perform I/O to various control blocks. Most processes in the JES2 address space use these buffers for I/O. They are in 31-bit virtual and real storage. They also have a small associated data area in 24-bit storage that is used for I/O. JES2 calculates the default number of buffers based on a number of initialization parameters. This is normally sufficient for most installations. However, some installations can benefit from an increased number of these buffers. The HASP050 message provides extended information on what to set this value to if a shortage is encountered. A shortage of these buffers delays processing of jobs and SYSOUT. Over-specifying this parameter can result in storage shortages for other processes (especially with the 24-bit area that is associated with these buffers).
 
Note: JES2 uses both 24-bit (BELOWBUF) and 31-bit (EXTBUF) buffers to perform I/O. With any new release, the distribution of buffers used can change greatly as functions are reworked to use more 31-bit buffers and as work is shifted out of the JES2 main task. If a non-default number of buffers is specified in the JES2 initialization deck, then when a new release of JES2 is implemented, determine the new defaults JES2 is using and adjust the buffer counts accordingly.
CONDEF BUFNUM When JES2 issues messages, it does not issue a WTO from the JES2 main task, because a WTO could do an MVS WAIT. To prevent the wait, WTOs are issued from a subtask in the JES2 address space. This is similar to what JES3 does with its WTOs. The WTOs are queued to the subtask using the CMB data area. The CMBs are in 31-bit virtual storage. To prevent a runaway process from flooding the system with messages, the number of CMBs is limited by this parameter. JES2 defaults this value to 100. If there is a shortage of CMBs, the requester of the WTO is suspended ($WAITed) until a CMB is available. This can slow command response processing. Over-specifying this value impacts available 31-bit storage. If the HASP050 message reports a shortage, increase this value as needed.
There is the concept of a single emergency CMB for use by the command processor when the pool of CMBs is exhausted. This will allow commands to be processed, though very slowly.
CONDEF CMDNUM When a JES2 command is issued by the operator or received over NJE, a command buffer in ECSA (31-bit common storage) is used to hold the command. These commands are queued to the JES2 command processor for processing. Unlike JES3, JES2 has one main processor for all commands (they are not distributed to other processes within JES2 for processing). The exception is JES2 monitor commands, which are processed by the monitor in the monitor address space. There are two instances of the JES2 command processor, but most commands are processed by the main one.
If the command processor becomes backed up, new commands can consume command buffers until they are exhausted. At that point, no new JES2 commands can be entered. Increasing the number of command buffers uses additional ECSA. If the HASP050 message indicates a shortage of command buffers, investigate the source and nature of the commands to determine why they are not being processed in a timely manner. If necessary, increase the number of command buffers.
CKPTDEF VERSIONS Checkpoint versions allow an address space to access checkpoint data (job and output queues) without having to send a request to the JES2 address space. They provide a stable copy of the checkpoint that the application (either directly or through some other API) can use to satisfy query requests. JES2 maintains these versions as a background process that runs whether there are requesters for versions or not. There is always an available current version and one that is ready to be updated. When an application is assigned a version, JES2 is locked out of updating that particular version (note that one version can be assigned to multiple applications if it was current when they requested a version). If an application takes a long time to process its version (and thus keeps it locked), JES2 creates an additional version to satisfy the requirement of a current version and one waiting to be updated. The more applications that request and hold versions, the more concurrent versions are needed. CKPTDEF VERSIONS puts an upper limit on the number of versions that can exist at any time. Allowing more versions uses additional real storage to manage the copies of the checkpoint. The default of 2 is low for most systems that have processes that use tools to query or manage the JES2 work queues. A value of 10 is reasonable for most systems. If there is an HASP050 message warning of a shortage of versions, investigate the users of versions to determine why the applications using them are holding versions so long.
TPDEF SESSIONS Every JES2 SNA session (NJE or RJE) that is established uses a data area called an ICE to represent the LU-LU session. These are in 31-bit storage in the JES2 address space. If no ICEs are available, JES2 is unable to establish any additional SNA sessions. In general, you must define at least as many sessions as you have SNA lines (this is the default). However, certain RJE connections can have multiple LUs for a single RJE (a single line). If that is the case in your environment, you must define additional sessions for those RJEs.
NJEDEF HDRBUF SNA and BSC NJE need to process NJE data headers, which can be up to 32 KB. Header buffers are used to hold these headers. They are also used in some cases outside of NJE to hold NJE headers. The header buffers reside in 31-bit storage in the JES2 address space. JES2 calculates a default for the number of header buffers based on various initialization statements. Unless the HASP050 message indicates a shortage, the default value is normally sufficient.
SMFDEF BUFNUM Like many things in JES2, SMF records cannot be written from the JES2 main task because of concerns about MVS waits. SMF records created by the main task are placed in SMF buffers and queued to a subtask for processing. The buffers are in 31-bit storage in the JES2 address space. If the buffers are exhausted, processes needing them are suspended until buffers are freed. The default number of SMF buffers is a fixed value. Depending on your workload and the ability of the SMF process to record the SMF records, you might need to increase the number of SMF buffers you have defined. Monitor the HASP050 message to see if the number you have defined is sufficient. Note that this is one of the few buffer limits that cannot be increased via operator command.
TRACEDEF TABLES Trace tables are used when JES2 tracing is active (using the TRACEDEF statement). Tracing can be done in memory only, or trace records can be written to a SYSOUT data set. The more tables that you define, the more data that can be buffered waiting to be written. In general, many smaller tables are better than a few large tables. If there are no trace tables available when a trace record needs to be written, the trace record is discarded. Generally, IBM service recommends a trace table size and number for the traces they request. Trace tables are in 31-bit common storage (ECSA). The size and the number of tables are controlled by the TRACEDEF statement.
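As with the checkpoint limits, here is a hedged sketch of how some of these local limits might appear in an initialization stream. All values are examples only; most of these parameters have sensible defaults that should be kept unless the HASP050 message indicates otherwise:
TPDEF    BSCBUF=(LIMIT=0),           /* No BSC lines in use     */
         SNABUF=(LIMIT=30,WARN=80),
         SESSIONS=30
BUFDEF   BELOWBUF=(LIMIT=50,WARN=80),
         EXTBUF=(LIMIT=360,WARN=80)
CONDEF   BUFNUM=200,CMDNUM=100
CKPTDEF  VERSIONS=(NUMBER=10)
NJEDEF   HDRBUF=(LIMIT=50,WARN=80)
SMFDEF   BUFNUM=100
TRACEDEF TABLES=10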
BERT shortage special processing
The block extension reuse table (BERT) resource, because of the nature of how it is used, requires some special processing and careful monitoring. BERTs are extensions to multiple control blocks. The data placed in these extensions is variable in size, and if an extension is not needed for a particular control block, no storage (that is, no BERT) is needed to back it. But at any time, if the data associated with a control block changes and needs to be placed in a BERT, the number of BERTs required increases. BERTs also provide a serialization mechanism known as the BERT lock.
The difficulty occurs when a process needs to acquire a BERT either for additional data or for a lock and there are no BERTs available. That process must wait for a BERT to become available. Unfortunately, this can occur with processes that do not normally wait and often results in major processing becoming unavailable. For example, jobs often cannot be purged if there is a BERT shortage.
A further complication can occur at warm start time. There are processes that require the BERT lock or additional BERTs to complete warm start processing. If they are not available, some aspect of warm start processing might have to be delayed. However, there have been unfortunate cases where the shortage of BERTs at warm start has resulted in warm start failing. Usually these problems have been addressed, but there might still be situations where a warm start cannot complete. Because of problems in the past, customers have had to cold start to resolve cases where a BERT shortage prevented a warm start.
The best practice is to avoid BERT shortages. This can be done by paying particular attention to BERT usage at your installation and monitoring for HASP050 messages that indicate BERT shortages. JES2 also has added two messages specifically for when BERTs are about to run out. The first message, shown in Example 13-4, indicates that the number of BERTs is getting critically low.
Example 13-4 Critical BERT shortage message
$HASP052 JES2 BERT resource shortage is critical --
IMMEDIATE action required
 
DO NOT TERMINATE JES2 OR IPL. Doing so might result in a COLD
start.
 
CURRENT BERTNUM=650, Free BERTs=10
 
Correct BERT shortage by --
- $T CKPTSPACE,BERTNUM=nnnn (increase BERTs)
- $P Jnnnn (purge pre-execution jobs)
Depending on the number of BERTs remaining, certain processes will be suspended to prevent all BERTs being used. The following are additional lines that can be added to the message to indicate what processes have been suspended:
No batch jobs can be submitted
JOEs requiring BERTs cannot be added
No batch jobs, STCs, or TSUs can be submitted
If the number of BERTs drops to the point where certain processes have failed because of the inability to obtain the needed BERTs, the message shown in Example 13-5 is issued.
Example 13-5 Extreme BERT shortage message
$HASP051 EXTREME BERT SHORTAGE DETECTED.
JES2 PROCESSING IS IN A NON-STABLE STATE. RESTART JES2 AS
SOON AS MORE BERTS HAVE BEEN MADE AVAILABLE.
In this case, there might be errors in the data structures on this member or errors in one of the JES2 work queues.
In the event of a critical BERT shortage, an action must be taken immediately to relieve the shortage. Increasing the number of BERTs via the $T CKPTSPACE command is the simplest action (assuming the checkpoint is large enough to contain additional BERTs). If the checkpoints are not large enough, you can either move to larger checkpoints using the checkpoint reconfiguration dialog or you can reduce the limit on some other checkpointed resource to make room in the checkpoint for additional BERTs.
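For example, the critical shortage shown in Example 13-4 might be relieved and then verified with commands along these lines (the new count is illustrative and assumes the checkpoint has room):
$T CKPTSPACE,BERTNUM=1000
$D CKPTSPACE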
Another way to address a critical shortage is to reduce the number of BERTs in use. You can use the $D CKPTSPACE,BERTUSE command to determine what processes are using the most BERTs. Example 13-6 shows a sample of the output from that command.
Example 13-6 Output of the $D CKPTSPACE,BERTUSE command
$HASP852 CKPTSPACE
$HASP852 CKPTSPACE CURRENT BERT UTILIZATION
$HASP852 TYPE COUNT CB COUNT
$HASP852 -------- --------- ---------
$HASP852 INTERNAL 14 1
$HASP852 JQE 1,067 827
$HASP852 CAT 114 38
$HASP852 WSCQ 12 3
$HASP852 DJBQ 6 3
$HASP852 JOE 151 151
$HASP852 DAS 0 0
$HASP852 FREE 62,736 0
The command output helps you understand what data areas are using BERTs and whether you need to purge jobs (JQEs) or output (JOEs) to free up BERTs. It can also help you identify a runaway process that is consuming BERTs, and what it is using them for.
An extreme BERT shortage must be dealt with in the same way: BERTs must be freed to allow the system to return to normal processing. But in the case of an extreme shortage, any member that issued the HASP051 message must be hot-started to clean up any residual effects of the shortage.
JES2 processor counts
Just as JES3 has DSPs and FCTs whose counts can be specified, JES2 has PCEs whose counts can be adjusted. Adjusting some of these counts alters the amount of work a particular JES2 address space can perform. However, unlike JES3, which does all its work on the global, JES2 performs work in any JES2 address space that has available processors to do the work. Decide whether JES2 work will be spread across all members of the MAS (a purely symmetrical arrangement) or whether some members will do less JES2 work than others (an asymmetrical arrangement). When you have decided, set processor counts to influence what JES2 processing a member does.
The PCE counts that an installation can influence are specified on the PCEDEF initialization statement. They can be set only at JES2 initialization and cannot be altered by an operator command. The following counts can be specified on PCEDEF (a combined sketch follows the descriptions):
CNVTNUM JES3 specifies the number of converters for demand select and batch when defining converter DSPs on the global or in a CI FSS. JES2 does not make any such distinction. Any JES2 converters can process any job type. Converters in JES2 run on all members of a MAS. Which system a job gets converted on is controlled by the system affinity (SYSAFF) of a job. Demand select jobs always have an affinity for the member they were started on or logged in to. Therefore, all JES2 members must have at least one converter.
JES2 has code to deal with waits that might be introduced by JCLLIB data sets. If a data set specified in a JCLLIB cannot be allocated due to ENQ contention (data set is allocated exclusively and cannot be allocated shared by conversion) or if a JCLLIB data set is archived, the converter could potentially wait for the data set. If a number of jobs that need archived or ENQed data sets are submitted, they could tie up most if not all of the JES2 converters. Since JES2 does not distinguish between demand select jobs and batch jobs, if the converters were all tied up, no one could log on to that member and no started tasks could be started.
To prevent this, JES2 does make a distinction between converters. The odd-numbered converters are defined as ones that can wait and the even-numbered ones cannot. This needs to be considered when defining the number of converters on a member. In general, define at least two converters, even if the member is not doing a significant amount of JES2 work. This prevents a single job waiting in converter from preventing any starts or logons on the member.
The main resources used by converters are buffers (EXTBUFs) and TCBs (which consume below-the-line storage). The default number of converters is 10 and the maximum is 25. Each converter uses at least three buffers (more for jobs with more instream data). If a member is to do minimal JES2 work, reduce the number of converters to two. If the member will be processing many jobs, increase the value to the maximum and potentially adjust the number of EXTBUFs up.
PURGENUM JES2 purge processing handles not only the purging of jobs but also the purging of spin data sets and output groups that require ENF 58 processing. All these operations tend to be I/O-intensive because JES2 uses record 0 at the start of each track group to track ownership of the track group. When a track group is freed, record 0 must be updated to indicate that the track group is free. Because purge processing is so I/O-intensive, it can benefit from additional PCEs. It primarily uses its own buffers to read and write the record 0 buffers, but it does use a couple of EXTBUFs to hold various control blocks (two or three per purge processor).
Purge processing is not affected by any affinity rules and can be performed on any member. It is generally looked at as a background task. The exception is when there is a resource shortage, where purge processing becomes the bottleneck to freeing many checkpoint resources. The default number of purge processors is 10 and the maximum is 25. A JES2 member that does little JES2 work can have fewer purge processors. Other members must have the default number or more purge processors defined. If jobs or output elements (JOEs) are not being purged in a timely manner, increase the number of purge processors.
PSONUM The PSO interface is an older interface used to process requests from the TSO OUTPUT and RECEIVE commands and requests from processes that use the SYSOUT external writer interface. These are lightly used interfaces in today's world, where the SYSOUT API (SAPI) is one of the primary ways to access SYSOUT. PSO processors handle requests made from the local member (not from other MAS members).
The default number of PSO processors is two and the maximum is 10. Unless you have a particularly old application that uses the PSO interface, the default on all members is generally sufficient. If you do have a large PSO user, increase this number on the member that the application uses and any alternate member the application could use. Additional PSO processors use additional EXTBUFs.
OUTNUM Much like the output services phase in JES3, the JES2 output phase makes SYSOUT created by a job available for processing by printers and application interfaces such as SAPI. It is during the output phase that SYSOUT data sets are grouped into output groups (JOEs). All non-spin output is processed during the output phase. Output processing can occur on any member of the MAS. When a job is added to the output queue, its affinity is set to any member (thus system affinity does not apply to the output phase).
Output processing can be a heavy user of EXTBUFs. All of a job's non-spin output control blocks (PDDB IOTs) must be read into buffers before output processing starts on the job. When a job completes output processing, the buffers are all written back out.
The default number of output processors is 10 and the maximum is 25. A JES2 member that does little JES2 work can have fewer output processors defined. Other members must have the default number or more output processors defined. If jobs are spending a long time awaiting output, consider raising the number of output processors. However, first ensure that there are no shortages of EXTBUFs that are slowing down output processing. When increasing the number of output processors, watch the EXTBUF usage to ensure that you do not create a shortage.
STACNUM Another, older, interface to access JES data is the TSO STATUS and CANCEL commands. These send requests to JES to inquire about the status of a job or to cancel (and potentially purge) a job. Though the CANCEL interface is still used today by some applications that want to cancel jobs without resorting to an operator command, the STATUS interface has been largely replaced by the extended status interface (except for the actual TSO STATUS command).
These commands both get processed in the JES2 address space by the STAC (STAtus/Cancel) processors. These processors use very few resources but must make a SAF call to authorize any cancel request. They process all requests for the status and cancel interface issued on the local member. The default number of STAC processors is two and the maximum is 10. Unless you have an old application that uses the status or cancel interface, the default on all members is generally sufficient. If you do have a large user of the status or cancel interface, increase this number on the member that the application uses and any alternate member the application could use.
SPINNUM There are two basic ways SYSOUT data sets are created. The traditional type of data set is processed when a job or step completes execution and is created through output processing. However, applications can choose to create output that is available for processing immediately upon deallocating the SYSOUT data set. These data sets are called spin data sets by JES.
JES2 queues these data sets to the JES2 address space by using an ECSA buffer. The spin process in the JES2 address space will build the appropriate output element (JOE) for the data set and queue it for processing. This process does a number of I/Os for each data set as part of the process of building the output element. All processing is done in the ECSA buffer that was obtained.
Additionally, this process performs work when there is an ECSA shortage, a JOE shortage, when spin data sets arrive via NJE, or when a system restarts. This is called unspun processing. It is an I/O-intensive process that reads in the SPOOL control blocks for spin data sets and looks for data sets that could not be processed. Certain spin processors can do the unspun processing, while others are reserved to handle just the ECSA requests.
Spin processors handle ECSA requests from the local member and unspun requests from any member. In general, ECSA requests can be processed by one or two spin processors. Unspun requests can benefit from additional spin processors. The default number of spin processors is three and the maximum is 10. Increasing the number of spin processors can help unspun processing, but that is not a common process. Because of this, consider increasing the number of spin processors on the members of the MAS that do most of the JES2 intensive work and use the default on members that do little other JES2 work.
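The spin and STAC processor counts are specified on the same PCEDEF initialization statement as the output processor count. As a sketch, a member that does most of the JES2 intensive work might specify values like the following (the numbers are illustrative only):
PCEDEF SPINNUM=6,STACNUM=4
Members that do little other JES2 work would simply omit these keywords and take the defaults.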
JES2 subtask counts
Like JES3, JES2 has general-purpose subtasks that can be used to perform functions that must wait (that is, issue an MVS WAIT). Originally these were developed to deal with SAF security requests, but they are now also used by other processes in JES2. In JES3, the number of these subtasks is managed by JES3; JES2, however, has a parameter to control the number defined. The JES2 SUBTDEF GSUBNUM parameter controls the number of subtasks JES2 creates during initialization. The default is 10 with a maximum of 50. Each subtask creates a TCB in the JES2 address space.
The number of TCBs that can be defined in an address space is limited by the amount of 24-bit storage that is available for z/OS control blocks, which is also influenced by the number of 24-bit buffers that you define. The default is acceptable for most installations. However, if you are a heavy user of security profiles and that causes delays in processing SAF requests, you might need to increase the number of subtasks defined. There is an internal command, $D PERFDATA(SUBTSTAT), that tracks the usage of general-purpose subtasks and, among other things, displays the queue time of general subtask requests (AVG_QUEUE_TIME=) in seconds. If the time is more than 0.0001 seconds for the most common requests (for example, ENFISSUE or $RACROUT), consider increasing the number of subtasks.
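For example, you might define 20 general-purpose subtasks and then check the effect on queue time with the display command (the value 20 is illustrative):
SUBTDEF GSUBNUM=20
$D PERFDATA(SUBTSTAT)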
13.2.3 Configuring work queues
To maximize throughput, one area you need to understand is how work is queued within JES and how JES accesses those work queues. One thing to avoid is long queues that must be scanned looking for work to process. Long queues cause delays both because of the CPU needed to traverse the queue and because of the cache misses that occur if the queue is particularly long. The two queues of most concern are the job queue and the SYSOUT (or JOE) queue.
Job queue considerations
The job queue is divided by phase of processing that the job is in. Installations can only influence the execution queue. All other queues are managed totally by JES2.
The execution queue is processed by initiators selecting jobs to run. There are two types of initiators: JES-managed and WLM-managed. Jobs are placed on the WLM work queue if the jobclass they are in is set to MODE=WLM. The WLM service class queues are ordered by arrival time, with the oldest jobs earlier in the queue. Since this is the intended order of selection, there is little consideration needed when setting up the WLM service queue. Jobs on a WLM work queue are also located on a JES2 jobclass queue. But this jobclass queue only exists to organize the jobs and is not used for selection purposes.
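For example, placing class W jobs under WLM management takes a single keyword on the JOBCLASS initialization statement; a minimal sketch (the class name is arbitrary):
JOBCLASS(W) MODE=WLM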
Jobs that are not in WLM-managed job classes are selected by JES-mode initiators. These initiators have a list of job classes that they select from. This is different from JES3 where job classes are associated with job class groups and initiators are then assigned to a job class group. A JES-mode initiator in JES3 will only select jobs from the job class group that it is assigned to.
Since in JES2 initiators can be associated with one or more job classes, one question is which classes to assign to each initiator. The list of job classes is an ordered list, and selection proceeds from the first class in the list to the last looking for eligible work. The more classes that are associated with an initiator, the more classes it must search looking for work. So is it better to have 10 initiators with 10 classes each, or 10 initiators with one class each?
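In initialization statement terms, the two alternatives look something like the following sketch (the initiator numbers and class lists are arbitrary, and the CLASS= list syntax should be verified for your release):
INIT(1) CLASS=(A,B,C,D)
INIT(2) CLASS=(A,B,C,D)
versus dedicating each initiator to a single class:
INIT(1) CLASS=(A)
INIT(2) CLASS=(B)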
It depends on how you want to spend your resources. Idle initiators consume an address space and associated storage, and they must be examined whenever work is added to the awaiting-execution queue. When work becomes available on any execution queue, all idle initiators search for work, because there is no indication of which queue the work arrived on. The result is that many idle initiators cause all of their related job queues to be scanned whenever work becomes available.
There is some performance code for this case. If a class is found to not have any work for one initiator, it is assumed to not have work for any initiator on this member. So only one initiator needs to scan a class looking for work each time initiators are scanned. This implies that one or 100 idle class A initiators will consume similar amounts of CPU when looking for work.
Any exit 14 or exit 49 code that makes a distinction between which initiators can select a job, based on some property other than class, negates this performance benefit. For example, if exit 49 were to prevent odd-numbered jobs from running on even-numbered initiators (for whatever reason), a job that cannot be selected on one class A initiator might be selectable on another. This forces each initiator to scan every job queue associated with it every time.
The advantage of having just one class per initiator is that, though there are fewer initiators per class, those initiators are dedicated to the class and are waiting for its work.
Alternatively, if all 10 classes are associated with all 10 initiators, the only time an initiator goes idle is when all 10 classes have no work. But every time an initiator finishes its work, it needs to scan all the associated class queues looking for another job. This scan cannot use the optimization described above because it occurs when a single initiator goes idle, and its result cannot be applied to other initiators. However, in this case there will be fewer idle initiators and fewer scans when work is added to the execution work queue. It also might be possible to get away with fewer initiators.
In general, even with the performance benefit that occurs when idle initiators share the same classes, the processing needed to scan all the class queues when any job is added to an execution queue is greater than the cost of scanning all classes when an initiator goes idle. So loading initiators with job classes will perform better than having idle initiators waiting for a particular job class. The risk is that an initiator might not be available when new work starts arriving for a job class.
But assuming one of the criteria you use to assign job classes is expected execution time, you can minimize the wait time by limiting the number of jobs in long-running classes that can be active at one time. This can be set at the job class level with the XEQCOUNT and XEQMEMBER keywords, which work similarly to the TDEPTH and MDEPTH parameters on the CLASS statement in JES3.
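For example, to limit a long-running class to five active jobs across the MAS, you might code something like this sketch (the class name and limit are arbitrary; verify the exact subparameter syntax for your release):
JOBCLASS(L) XEQCOUNT=(MAXIMUM=5)
A per-member limit can be set similarly with the XEQMEMBER keyword.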
SYSOUT queue considerations
Unlike with job classes, it is much more important to configure your SYSOUT queues properly. This is because there are generally many different types of processes and devices selecting SYSOUT, and far more characteristics to select on. Also, most jobs in an installation are awaiting hardcopy processing (the output phase, in JES3 terms); there are relatively few jobs in the execution queues, and all SYSOUT is eligible to be processed on one queue or another.
In JES2, there are 36 SYSOUT classes (A-Z and 0-9). Each SYSOUT class has three queue heads that represent the type of destination that the SYSOUT is routed to (these are described in “SYSOUT destination types” on page 176). There is also a separate queue for held SYSOUT and one for SYSOUT destined for another NJE node.
Within each of the class queues (A-Z and 0-9), output elements (JOEs) are queued by route code (destination) and within destination, by priority. JOEs with the same destination and priority are queued first-in first-out (FIFO). The network queue is arranged in priority order with JOEs that have the same priority arranged FIFO. The held JOE queue is simply a last-in first-out (LIFO) queue (stack) of JOEs that are held.
Remember in JES2, the held queue is used for all output groups that are “not ready” to be processed. Output groups on the hold queue can be associated with TSO or external writers as is the case for JES3.
Properly balancing the use of these queues can have a large impact on SYSOUT selection performance. Additionally, using selection criteria that line up with the queuing methods can also lead to better throughput.
SYSOUT destination types
JES2 has three types of non-NJE SYSOUT destination: local, remote, and user. These represent the three queue heads associated with each SYSOUT class. JES2 stores the destination (or route code) of a SYSOUT data set as a 2-byte node number, a 2-byte binary destination, and an 8-byte user route code.
Remote destinations traditionally routed output to an RJE workstation. Internally they are stored as the 2-byte local node number, a 2-byte remote number, and a blank user route code. These destinations are normally specified as Rnnnnn, where nnnnn is the remote number.
There are two forms of local destinations. The first is the standard LOCAL or ANYLOCAL destination, where the 2-byte node number is set to the local node, the binary destination is 0, and the 8-character user route code is blank. The second form is called special local. Here the 2-byte node number is set to 0, the 2-byte binary destination is set to a special local route code, and the user portion is blank. This is generally specified as Unnnnn, where nnnnn is an arbitrary number assigned to the destination.
The user destination is any output that is routed to an 8-character destination that is not represented by a local or remote destination. It has the 2-byte local node number, a zero binary destination, and an 8-character user destination.
Any printer or writer can be set up to select from any of these destination types. A local printer can be set to select output with a remote destination. SAPI can select output routed to a special local route code.
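For example, a printer set up to select output routed to remote 12 might be defined as in this sketch (the printer name and route code are arbitrary):
PRT2 WS=(R),ROUTECDE=R12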
By understanding how JES2 separates output on queues, you can spread SYSOUT across multiple queues and avoid performance problems associated with long SYSOUT work queues.
Using DESTIDs to route output
DESTIDs in JES2 are a way to assign a name to a destination. They are in effect an aliasing function that allows installations to rearrange how SYSOUT is managed without having to alter JCL.
Using DESTIDs, JCL and application writers do not need to know what the underlying route code is when directing output. For example, assume you want a printer to select route code U12 (special local 12) output. Applications can code DEST=U12. There are two problems with this approach. The first is that applications need to be aware of the U12 and associate it with a particular printer. Second, if you decide to consolidate the printers selecting U11 and U12, you need to set up the new printer to select multiple route codes. This can be done but is less efficient. Also, a printer can only select up to four route codes.
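For illustration, such a consolidated printer would have to list both route codes, as in this sketch (the printer name is arbitrary):
PRT1 WS=(R),ROUTECDE=(U11,U12)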
Using DESTIDs, you can assign a name to a route code. Assume you want to assign the name FRANK to route code U12. This is for the printer in Frank’s office. You would then define a DESTID of:
DESTID(FRANK) DEST=U12
This statement works in two directions. Any JCL or operator command (or other external) that references FRANK will internally be converted to U12. It will also change all displays of U12 to instead display FRANK. In effect, the U12 internal route code has been replaced with the destination named FRANK.
Now assume that after looking at how output is being processed, you decide to move output for Frank’s printer to a remote queue instead of the local queue (to improve some throughput issues). You can alter the DESTID for FRANK to a remote destination:
DESTID(FRANK) DEST=R12
This defines the FRANK DESTID to now be a remote route code and places it on the remote queue instead of the local queue. Note that no JCL needs to be changed; commands that use FRANK are unaffected, printer applications are not impacted, and so on. If there is existing output that might have been routed to the old U12 destination, you would need to issue the command:
$TOJ(*),/DEST=U12,DEST=FRANK
This might need to be issued a couple of times in case there are any jobs actively creating output for the old destination for FRANK.
You can also have two DESTIDs resolve to the same route code. So if Frank shares an office with John, you might want to add a DESTID for JOHN that goes to the same route code as the updated FRANK:
DESTID(JOHN) DEST=R12
But this introduces a problem. If you display something with a route code of R12, should it display FRANK or JOHN? The default processing is to display neither and instead display R12. This is not ideal. To address this issue, there is another keyword on DESTID that indicates which is the primary destination (and therefore, which are the alternates). Assuming you want the printer to display as FRANK, even though it can also be referred to as JOHN, you would specify:
DESTID(FRANK) DEST=R12,PRIMARY=YES
Now anything that references a destination of JOHN or FRANK or R12 will display as FRANK.
DESTIDs can also be used with the 8-character user route code. So you can set a device to select a route code of POK that has no DESTID. If you want NY to also be routed to POK, you can define a DESTID:
DESTID(NY) DEST=POK
In this case, assuming there is no DESTID for POK, all output with a route code of POK will display as NY (since the DESTID takes precedence for displays). If you want the destination to display as POK, define the DESTID:
DESTID(POK) DEST=POK,PRIMARY=YES
Effectively, this just changes how output routed to the 8-character user route code is displayed.
There are many other features and capabilities of DESTIDs, including the ability to alter processing for NJE-routed output and output received via NJE. There is also a DESTDEF statement that allows more global control over how destinations are handled. A complete description of how DESTIDs work can be found in the "Output Routing" section of z/OS JES2 Initialization and Tuning Guide, SA22-7532.
Configuring SYSOUT queues for performance
Now that we understand what SYSOUT queues exist (class queues, hold queue and network queue), and how we can get output placed on various queues, we can begin to look at how to configure the MAS to get the best performance.
The first step is to understand the producers and consumers of SYSOUT. Is the consumer a series of real printers? How many devices do we have? Can they all be thought of in the same way but in separate locations? Do you have an archive product that manages SYSOUT groups? How does it select what to archive? Is most output sent over NJE to other nodes for processing?
On the producer side, does output get created in bursts or is it a steady pace? Does it produce output faster than the consumers can process it and you catch up during a quiet time? What kind of controls are there to alter the SYSOUT class or destination created by applications (what is possible to change easily)? Are the producers associated with some property of the SYSOUT class when they create their output, or is the class just there to help manage the output?
Looking at the consumers, consider how they select output. Both JESes have a rich set of criteria for selecting output. There are logical factors such as class, external writer name, PRMODE, limits, and job name. There are physical factors such as destination (where it is printing), forms, and burst. But when defining how your system manages SYSOUT, you need to consider how the output is organized.
Let us look at a simple example. Assume that you have a printer that will process all output associated with an external writer. In JES2, output routed to an external writer can be processed by any printer, by SAPI, or by an actual external writer. To simplify the example, assume this is a printer selecting the output (since we can then look at the printer parameters being used). So in setting this up, the simplest thing to do would be to place only the external writer specification in the work selection list and set the writer value to what you want to process. Here is how you would specify that:
PRT1 WS=(W),WRITER=TEST
From a configuration standpoint, this is easy to do. But remember how JES2 arranges output. Based on this definition, you are not selecting on a class or a destination. So to find an element to process, the printer needs to look at all the class queues and the destination queues off the class queues. The result is that this simple selection of criteria will scan all the output elements on the system looking for something to print (except output elements queued for NJE or on the hold queue).
But it is worse than that. JES2 has code that can do a quick scan of output queues to locate where on the existing chains an output group might be found. However, this optimization depends on the device selecting on class and destination. Since our simple setup selects on neither, no optimization is possible. So this simple setup is about as bad as it can get from an overhead perspective.
The held queue is not scanned in this case because real printers can only select ready output. But had this been a SAPI application, it would also be capable of scanning the held queue.
So let us try to improve this a bit. We probably do not want this printer to select output routed to a specific destination, so we can restrict it to locally destined output. Our improved printer specification is:
PRT1 WS=(W,R),WRITER=TEST,ROUTECDE=LOCAL
Now you have reduced the number of output groups scanned significantly. JES2 will only examine the local queues of each output class and stop scanning that queue when it finds an output group that is not LOCAL. This simple but logical change can significantly reduce the overhead of scanning for work.
However, this points out a potential conflict that can occur when creating output. If you create a SYSOUT data set that has a writer of TEST specified and a routecode of R15 (remote 15), where should it be processed? Printers that print route code R15 output generally specify that the writer value be blank. So output with conflicting routing information might never be picked up for processing.
We can further improve this processing if we decide that output for writers will be sent to a specific class. We could then add the class to the selection criteria and further reduce the number of output groups scanned. Your printer definitions would then look something like this:
PRT1 WS=(W,Q,R),WRITER=TEST,Q=T,ROUTECDE=LOCAL
Now the printer will only look at class T output routed to destination LOCAL for output groups. Since class and destination are how JES2 arranges SYSOUT, this greatly reduces the overhead in searching for work for a printer. Since JES3 SYSOUT destined for an external writer already has to go to an external writer SYSOUT class, using a specific class for this purpose will be simple when moving from JES3 to JES2.
Routing using writer name is one example, but let us look at sending output to a printer using destination. In JES3, printers have names associated with them and SYSOUT is sent to the printer using the printer name. JES2 is similar in that output is sent to a destination, but the destination is not a printer name. It is one of the three types of destinations described in “SYSOUT destination types” on page 176. Generally, applications will not use the Uxxxxx or Rxxxxx formats of a destination to route output. Instead, the installation will define a DESTID that maps a meaningful name to a Uxxxxx or Rxxxxx route code.
The simplest destination type to use is the 8-character user route code. This is an arbitrary 8-character value that the creator and consumer of the output agree to use to identify the output to process. For example, an application can be created that runs a series of jobs and then runs a final job that uses SAPI to collect all the output. This could use the application name as the destination for the output and the selection criteria for the SAPI application.
This is fine for a simple application, but not for a major print processing application. For this type of application, we need to understand the nature of the output being created and the capability of the application to control what is selected.
If the application can only access a single SYSOUT class, assign it to a dedicated class (do not use that class for anything other than that application). If the application can use multiple classes, spread the SYSOUT over a logical mix of classes.
If the application is selecting on route codes, be sure to spread the route codes across the three queues that can be used for a class. If there are route codes that traditionally build large queues (for example, output that never actually gets processed but ages off the system), assign those to higher-collating route codes (U9999, for example). Place the smaller, more transient destinations at lower route codes. This avoids having to step over a large queue of output to get to frequently used output.
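As a sketch of this technique, a destination whose output tends to sit and age off the system gets a high-collating route code, while a busy, transient destination gets a low one (all names and numbers are illustrative):
DESTID(ARCHIVE) DEST=U9999
DESTID(DAILYRPT) DEST=U10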
One mistake that often occurs is an application that by default creates the majority of its output using CLASS A with a destination of LOCAL. Then, the output never prints because there are no local printers left at the installation. Eventually the output gets deleted off the system because of age. By itself this is not a problem. But it can become a problem when installations decide that the convenience printers will also use CLASS A and a special local route code (using the form Uxxxx). As the class A local work starts to back up, the overhead to select the output for the convenience printers starts to increase.
To prevent this, address the convenience printers by using Rxxxx route codes or send their output to other classes (always using DESTIDs to do the translation). Because DESTIDs make it easy to change how a JCL DEST resolves, periodically review these assignments.
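For example, a convenience printer might be given a remote route code and its own class, with a DESTID doing the translation, as in this sketch (all names are illustrative):
DESTID(LOBBY) DEST=R101
PRT3 WS=(Q,R),Q=C,ROUTECDE=R101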
Another mistake to avoid is setting up applications that create held output (OUTDISP of HOLD or LEAVE). These output groups are placed on the hold queue in JES2 with no particular order. They were intended to be accessed through the job they are associated with (such as when using the TSO OUTPUT command). The SYSOUT API (SAPI) does support selecting held output; this is functionally compatible with what the PSO interface can do, but it has a much higher overhead than selecting ready output groups.
The period when you are migrating from JES3 to JES2 is a good time to set up new rules. This is when you can easily get work spread across multiple SYSOUT classes. The goal in all this is to use as many of the 108 SYSOUT queues JES2 provides as practical.