I have a job that is taking a long time to execute in production. The last execution took 177 minutes, with a total CPU time of 5.15.
We have identified the problem steps as some sorts which allocate huge files on tape.
As the standard approach, the JCL copied the tape to disk (SYSDA), merged it via a sort with the new data, and then deleted and recreated the tape. This last step was taking a long time (clock time: 01:16:32, CPU time: 165.59s).
We discarded that and decided to simply copy the new records to the tape with an ICEGENER and DISP=MOD, which should reduce the execution time drastically.
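A minimal sketch of the append step we have in mind (the dataset names are hypothetical):
Code:
//APPEND   EXEC PGM=ICEGENER
//SYSPRINT DD SYSOUT=*
//SYSUT1   DD DSN=PROD.DAILY.NEWRECS,DISP=SHR
//SYSUT2   DD DSN=PROD.HIST.TAPE,DISP=MOD
//SYSIN    DD DUMMY
With DISP=MOD on SYSUT2, the new records are written after the existing ones instead of replacing the dataset.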
However, a support person told me to add the BUFNO parameter to the files. I've been reading on this forum that the BUFNO parameter is not taken into consideration when the program processing the file is DFSORT.
We believe this should decrease the processing time, since there won't be any more sorting needed, nor deletion/recreation of the tape before processing.
So my questions are:
BUFNO won't do anything to improve performance on that step, right? And can anything else be done, besides changing the sort and tape recreation to a copy as in the example above?
Are you aware of the dangers of using DISP=MOD? You might be clobbering things.
Why not concatenate the tape and DASD datasets and create a new tape? But
investigate how ICEGENER I/O behaves for concatenations of datasets on different device types.
A MERGE, if the datasets are properly sorted, might give better results from an I/O perspective
(for MERGE, the I/O technique used might be different and optimized for each input involved).
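A minimal sketch of such a MERGE step (the dataset names and the key field are hypothetical, and both inputs must already be in sequence on that key):
Code:
//MERGE    EXEC PGM=SORT
//SYSOUT   DD SYSOUT=*
//SORTIN01 DD DSN=PROD.HIST.TAPE,DISP=OLD
//SORTIN02 DD DSN=PROD.DAILY.DASD,DISP=SHR
//SORTOUT  DD DSN=PROD.HIST.NEWTAPE,DISP=(NEW,CATLG),UNIT=TAPE
//SYSIN    DD *
  MERGE FIELDS=(1,10,CH,A)
/*
Unlike a SORT, a MERGE only interleaves the already-ordered inputs, so no sort work files are needed.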
Beware: this is not a solution, just a hint at a few issues to think about.
Frank or Kolusu are certainly the ones to have the last word about it.
That's exactly what we did before, just with a sort, and it took quite some time to execute. We tried changing to creating a new tape with ICEGENER and the times were still high; that's when we decided to use DISP=MOD.
As for the dangers, what I know so far is that if the device runs out of space, the file could be damaged beyond repair. However, some storage support people told me that "space is almost infinite", which is not the best technical or reassuring answer I could get, but that's what they've told me.
We have made some tests with DISP=MOD and have been successful so far, with around 120 executions in production. Here is one of the sysouts of a step with an ICEGENER and DISP=MOD:
Code:
BLOCKSET COPY TECHNIQUE SELECTED
VISIT http://www.ibm.com/storage/dfsort FOR DFSORT PAPERS, EXAMPLES AND MORE
- CONTROL STATEMENTS FOR 5694-A01, Z/OS DFSORT V1R5 - 10:24 ON WED SEP 09, 2009
OPTION COPY,MSGDDN=SYSPRINT,SORTIN=SYSUT1,SORTOUT=SYSUT2
RECORD TYPE IS F - DATA STARTS IN POSITION 1
C5-K90013 C6-K90013 C7-K90000 C8-K42135 E9-K90013 C9-BASE E5-K44563 E7-K44563
ICEAM2 ENVIRONMENT IN EFFECT - ICEAM2 INSTALLATION MODULE SELECTED
HAMDMHI2.PASO040 . , INPUT LRECL = 300, BLKSIZE = 27900, TYPE = FB
MAIN STORAGE = (MAX,6291456,6291456)
MAIN STORAGE ABOVE 16MB = (6234096,6234096)
OPTIONS: OVFLO=RC0 ,PAD=RC0 ,TRUNC=RC0 ,SPANINC=RC16,VLSCMP=N,SZERO=Y,RESET=Y,VS
OPTIONS: SIZE=6291456,MAXLIM=1048576,MINLIM=450560,EQUALS=N,LIST=Y,ERET=RC16 ,MS
OPTIONS: VIO=N,RESDNT=ALL ,SMF=NO ,WRKSEC=Y,OUTSEC=Y,VERIFY=N,CHALT=N,DYNALOC=
OPTIONS: RESALL=8192,RESINV=32768,SVC=109 ,CHECK=Y,WRKREL=Y,OUTREL=Y,CKPT=N,STIM
OPTIONS: TMAXLIM=6291456,ARESALL=0,ARESINV=0,OVERRGN=16384,CINV=Y,CFW=Y,DSA=0
OPTIONS: VLSHRT=N,ZDPRINT=Y,IEXIT=N,TEXIT=N,LISTX=N,EFS=NONE ,EXITCK=S,PARMDD
OPTIONS: HIPRMAX=OPTIMAL,DSPSIZE=MAX ,ODMAXBF=0,SOLRF=Y,VLLONG=N,VSAMIO=N,MOSIZE
OPTIONS: NULLOUT=RC0
EXCP ACCESS METHOD USED FOR SYSUT2
EXCP ACCESS METHOD USED FOR SYSUT1
EF-K10929 F0-K30362 E8-K44563
OUTPUT LRECL = 300, BLKSIZE = 32700, TYPE = FB
INSERT 0, DELETE 0
RECORDS - IN: 3292704, OUT: 3292704
END OF DFSORT
The execution times were acceptable: ELAPSED-TIME 00:03:39 and CPU-TIME 2.17S.
Right now, we have decided that the records aren't needed in order; hence the use of the append option with DISP=MOD and the ICEGENER.
That said, is there anything additional I should be aware of with DISP=MOD? I tend to avoid using it for large files, but in this case it's the only approach we have found so far. Could you please elaborate a little bit more on what you mean when you say:
That's what I heard before. Well, we built this process so that recovery of a damaged file could be done from a DB2 table that is loaded beforehand with the data in the file. We are appending the records so as to have a historical file that won't be deleted. That historical file holds the accounting movements of the current month, so when a new month arrives, a new file is created. The table, which is loaded and holds only the last 3 months of data, can be unloaded and used to recreate the file if something goes wrong.
Joined: 15 Feb 2005 Posts: 7129 Location: San Jose, CA
If you don't need the records in order, then you should be doing a COPY, not a SORT. COPY is generally much faster than SORT.
You can do the COPY with the PGM=SORT job by using OPTION COPY. You can also do it with PGM=ICEGENER. BUFNO will have no effect if DFSORT is used (DFSORT does its own buffering).
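A minimal sketch of such a copy under PGM=SORT with OPTION COPY (the dataset names are hypothetical):
Code:
//COPY     EXEC PGM=SORT
//SYSOUT   DD SYSOUT=*
//SORTIN   DD DSN=PROD.DAILY.NEWRECS,DISP=SHR
//SORTOUT  DD DSN=PROD.HIST.TAPE,DISP=MOD
//SYSIN    DD *
  OPTION COPY
/*
The PGM=ICEGENER form works the same way, using SYSUT1/SYSUT2 instead of SORTIN/SORTOUT and no control statements.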
I don't understand what the use of DISP=MOD vs. DISP=NEW has to do with the runtime, unless it's some kind of special processing done for tapes in your installation. AFAIK, using DISP=NEW for a tape does not normally add significant runtime to a job.
If you want, add the following to the job:
//SORTDIAG DD DUMMY
to get diagnostic messages. Then run the job with DISP=NEW and DISP=MOD and send me (yaeger@us.ibm.com) the JES messages for both runs and I'll take a look.
Maybe I wasn't clear enough when I addressed the issue of DISP=NEW vs. DISP=MOD; I'm sorry for the confusion. I'll explain what the job used to do in detail:
First, we have a tape which receives records incrementally. On the first day of the month the tape is created and receives around 10 million records, and every day more records are added, from 6 million up to 12 million on a given day.
With that, we dropped the sort steps as well as the copy to disk and later upload to the tape. That's what I meant when I said we changed DISP=NEW to DISP=MOD.
Today it was executed like this, and the whole job took only around 15 minutes of clock time. Previously it took around 3 hours. Now the question is: is it too risky to copy the records to the tape like this? As I stated earlier, I'm no friend of DISP=MOD, but I think this case justifies it.
Well, that sums it up.
I'll add the //SORTDIAG DD DUMMY card to the job, test it, and then send you the results.
If you just want to keep adding the unsorted new records to the end of the tape, then this should work, although I won't comment on the dangers of DISP=MOD since I don't know much about that. My only comment about DISP=MOD had to do with performance vs DISP=NEW, but now that you've explained what you were doing before and what you're doing now, I see that DISP=MOD vs DISP=NEW had nothing to do with performance.
I'm sure you realize that what you're doing with the one ICEGENER step is NOT EQUIVALENT to what you were doing before. The job you used before resulted in all of the records (old and new) being sorted (that's why it took longer). The job you're using now results in appending new records without sorting them which requires much less processing.
Quote:
I'll Add the //SORTDIAG DD DUMMY card to the job and test it and then I'll send you the results.
Given what you've said, that's not necessary.
Certainly, Frank. We discussed the need to have the records sorted on the tape and realized it was not an issue, since it is only a historical file. After all, the records may well be needed in a different sort order once they have to be used.
Once we reached that conclusion, we decided to drop the sort and resort to the ICEGENER. Since there is no sort process involved (that is, no sorting of the records of both files), we save that processing time entirely.
OK, then I won't send you the result of the //SORTDIAG. However, I had it executed tonight and I'll take a look at it out of curiosity, to see what relevant or important information it could provide. Maybe I'll learn something from it.