Joined: 10 May 2007 Posts: 2454 Location: Hampshire, UK
As you are using SYNCSORT and this is the DFSORT part of the forum, you will not get far. SYNCSORT questions get posted in the JCL part of the forum. No doubt some kind moderator will move it.
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
If you contact Syncsort support, I'm sure they will be glad to help. If you search around in the JCL forum, not in the DFSORT forum, you will even find some appropriate contact e-mail addresses.
One thing I'd do, rather than writing out two files that are pretty much the same, with just a bit more data at the end of the record in one of them, is to write out one file and, if possible, change the programs reading the "new" file so they can read a bigger record but still do nothing with the extra bit of data.
EDIT: Sheesh, you are only copying. What do you do with the output files? Why is a COPY taking so much CPU? I suppose that is your question.
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
Yes, it seems odd to save 63 bytes of DASD per record, only to waste 24,000+ per block (if on DASD). The bigger rip-off then would be reading it back later.
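(Rough arithmetic, assuming 3390 geometry: a track holds 56,664 bytes, so a block just over half-track fits only once per track and leaves 24,000+ bytes of that track unusable.)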
You are doing 222 million "moves" of around 1,700 bytes each, just to shorten the records. There has to be a better way, even if only by putting in a dummy "IFTHEN" with IFOUTLEN= set to the desired record length.
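Something like this, as an untested sketch (assuming your SyncSort release accepts the DFSORT-style IFTHEN/OVERLAY/IFOUTLEN syntax; the WHEN=NONE overlay of byte 1 onto itself is only a do-nothing placeholder so that IFOUTLEN can set the shorter length without a full record rebuild):
Code:
  OUTFIL FNAMES=SORTOUT2,INCLUDE=(1709,10,PD,EQ,1000001),
         IFOUTLEN=1708,IFTHEN=(WHEN=NONE,OVERLAY=(1:1,1))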
Joined: 20 Oct 2006 Posts: 6966 Location: porcelain throne
If DASD, the poor blocking would cause the EXCP count to be very high (1 block is a physical I/O). So I would hazard a guess that (if DASD is the output; we have no idea, since pertinent info is missing: DD statements from the JCL, JESMSG info on I/O, what else is going on in the machine while this job runs) the job is waiting on I/O channel activity.
For someone with 133 posts, the TS has provided us with nothing except "My job is running a long time", and posted in the wrong forum.
I confirmed by checking the JCL that all three output datasets are on DASD, not tape. They have DCB=(RECFM=FB,LRECL=1745,BLKSIZE=0) coded, so the BLKSIZE is calculated by the system.
Sorry for the delayed response. Here is the expanded JCL for this step:
Code:
XX PARM='PGMN=SORT,ABEND=0004,DYNALLOC=(SYSDA,16),VSCORET=64M'
XXSYSPRINT DD SYSOUT=*
XXSYSOUT DD SYSOUT=*
XXSYSUDUMP DD SYSOUT=I
&&SORTIN DD DSN=&SORTIN.,DISP=SHR
XXSORTIN DD DSN=INVP.PRDM.INVOICE.MISC.UNLD.FULL,DISP=SHR
&&SORTOUT1 DD DSN=&SORTOUT1.,
&& DISP=(NEW,CATLG,DELETE),
&& SPACE=&SPACE,DATACLAS=DCCOMP,
&& DCB=(RECFM=FB,LRECL=1745,BLKSIZE=0)
XXSORTOUT1 DD DSN=INVP.PRDM.INVOICE.MISC.UNLD.TX,
XX DISP=(NEW,CATLG,DELETE),
XX SPACE=(CYL,(200,100),RLSE),DATACLAS=DCCOMP,
XX DCB=(RECFM=FB,LRECL=1745,BLKSIZE=0)
&&SORTOUT2 DD DSN=&SORTOUT2.,
&& DISP=(NEW,CATLG,DELETE),
&& SPACE=&SPACE,DATACLAS=DCCOMP,
&& DCB=(RECFM=FB,LRECL=1708,BLKSIZE=0)
XXSORTOUT2 DD DSN=INVP.PRDM.INVOICE.MISC.UNLD.NEW,
XX DISP=(NEW,CATLG,DELETE),
XX SPACE=(CYL,(200,100),RLSE),DATACLAS=DCCOMP,
XX DCB=(RECFM=FB,LRECL=1708,BLKSIZE=0)
&&SORTOUT3 DD DSN=&SORTOUT3.,
&& DISP=(NEW,CATLG,DELETE),
&& SPACE=&SPACE,DATACLAS=&DATACLAS,
&& DCB=(RECFM=FB,LRECL=1682,BLKSIZE=0)
XXSORTOUT3 DD DSN=INVP.PRDM.INVOICE.MISC.UNLD.IL,
XX DISP=(NEW,CATLG,DELETE),
XX SPACE=(CYL,(200,100),RLSE),DATACLAS=DCCOMP,
XX DCB=(RECFM=FB,LRECL=1682,BLKSIZE=0)
&&SYSIN DD DSN=&PARMLIB.(&MEM.),
&& DISP=SHR
XXSYSIN DD DSN=NBSP.WCC.PARMLIB(WC3SE63B),
XX DISP=SHR
SORT FIELDS=COPY
OUTFIL FNAMES=SORTOUT1,INCLUDE=(1709,10,PD,NE,1000001)
OUTFIL FNAMES=SORTOUT2,INCLUDE=(1709,10,PD,EQ,1000001),
  OUTREC=(1:1,1708)
OUTFIL FNAMES=SORTOUT3,INCLUDE=(1709,10,PD,EQ,1000001),
  OUTREC=(1:1,1682)
JESYSMSG:
Code:
IEF373I STEP/DB2ABEND/START 2012078.1109
IEF374I STEP/DB2ABEND/STOP 2012078.1109 CPU 0MIN 00.00SEC SRB 0MIN 00.0
IEF236I ALLOC. FOR WRTGMR56 PS0100 JS0200
IGD103I SMS ALLOCATED TO DDNAME JOBLIB
IGD103I SMS ALLOCATED TO DDNAME
IGD103I SMS ALLOCATED TO DDNAME
IEF237I JES2 ALLOCATED TO SYSPRINT
IEF237I JES2 ALLOCATED TO SYSOUT
IEF237I JES2 ALLOCATED TO SYSUDUMP
IGD103I SMS ALLOCATED TO DDNAME SORTIN
IGD17070I DATA SET INVP.PRDM.INVOICE.MISC.UNLD.TX
ALLOCATED SUCCESSFULLY WITH 1 STRIPE(S).
IGD17160I DATA SET INVP.PRDM.INVOICE.MISC.UNLD.TX
IS ELIGIBLE FOR COMPRESSION
IGD101I SMS ALLOCATED TO DDNAME (SORTOUT1)
        DSN (INVP.PRDM.INVOICE.MISC.UNLD.TX )
        STORCLAS (PRD1000S) MGMTCLAS (PRDLG) DATACLAS (DCCOM)
        VOL SER NOS= 3SS#JB
IGD17070I DATA SET INVP.PRDM.INVOICE.MISC.UNLD.NEW
ALLOCATED SUCCESSFULLY WITH 1 STRIPE(S).
IGD17160I DATA SET INVP.PRDM.INVOICE.MISC.UNLD.NEW
IS ELIGIBLE FOR COMPRESSION
IGD101I SMS ALLOCATED TO DDNAME (SORTOUT2)
        DSN (INVP.PRDM.INVOICE.MISC.UNLD.NEW )
        STORCLAS (PRD1000S) MGMTCLAS (PRDLG) DATACLAS (DCCOM)
        VOL SER NOS= 3SS#1R
IGD17070I DATA SET INVP.PRDM.INVOICE.MISC.UNLD.IL
ALLOCATED SUCCESSFULLY WITH 1 STRIPE(S).
IGD17160I DATA SET INVP.PRDM.INVOICE.MISC.UNLD.IL
IS ELIGIBLE FOR COMPRESSION
IGD101I SMS ALLOCATED TO DDNAME (SORTOUT3)
        DSN (INVP.PRDM.INVOICE.MISC.UNLD.IL )
        STORCLAS (PRD1000S) MGMTCLAS (PRDLG) DATACLAS (DCCOM)
        VOL SER NOS= 3SSLQ2
IGD103I SMS ALLOCATED TO DDNAME SYSIN
ACC20210-A ADDVOL FOR DD=SORTOUT2 DSN=INVP.PRDM.INVOICE.MISC.UNLD.NEW VOL=3SS
ACC20600-A VOLUME * WAS ADDED TO DATA SET
ACC20210-A ADDVOL FOR DD=SORTOUT3 DSN=INVP.PRDM.INVOICE.MISC.UNLD.IL VOL=3SSLQ2
ACC20600-A VOLUME * WAS ADDED TO DATA SET
IEF142I WRTGMR56 PS0100 JS0200 - STEP WAS EXECUTED - COND CODE 0000
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
Looking further led to my previous question about what you are actually doing with the files.
The little one obviously makes sense to have separate at some point. The bigger ones, less so (now that we know they aren't on tape, going to different locations, or similar).
How is the input being created? Does it come out of a SORT/TOOL step at any point? That could be a good time to create the small file.
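For example, if the unload ends in a sort step, a second OUTFIL there would peel the small file off in the same pass at almost no extra cost. A sketch only, with made-up DD names, keeping whatever SORT statement the step already has:
Code:
* existing SORT statement stays as-is
  OUTFIL FNAMES=MAINOUT
  OUTFIL FNAMES=SMALLOUT,INCLUDE=(1709,10,PD,NE,1000001)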
The big output files are being processed at some point. How about reading the big input file for your first application, adding logic there to select the small sub-set off to a new file and ignore those records, and just going with the big file for the rest of the processing, as mentioned above.
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
I meant "big file" singular in the above. Just use the main input file for the rest of the processing. You are "saving" little bits of DASD (subject to your blocking problem) yet having three copies of 111 million records of approx. 1700 bytes!
If you just use the main input file in place of the two you are creating, you save all that time/processing/cost and a whole heap of DASD, backup time/media, JCL simplicity, design simplicity, etc.
The best way to "save" CPU/IO is not usually by "tuning" as such, but by working out how things which are being done don't need to be done.
Pity I don't have some way to charge you for all those savings... :-)
I checked: the 3rd file, with LRECL 1682, can be avoided. There is an Easytrieve program which uses the 3rd file to get some data from it, and it can get that data from the bigger file as well. Thanks a ton!
No problem. Now, if you can just use the big file instead of the 1708 one as well, and get the little file created somewhere else...
We need at least the first two files for our programs to work.
Anyway, the little file has to be created in a downstream job, as it is also required by another program, so it will consume the same CPU later in that step.
But I guess the advantage here is that if we create the little file up front, then SORTOUT2 will have fewer records for the EZT pgm to process in the same job; it will have the records of the little file (SORTOUT1) eliminated in the sort step. Am I correct?
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
The following is extracted from what you posted earlier.
Code:
SORT FIELDS=COPY
OUTFIL FNAMES=SORTOUT1,INCLUDE=(1709,10,PD,NE,1000001)
OUTFIL FNAMES=SORTOUT2,INCLUDE=(1709,10,PD,EQ,1000001),
  OUTREC=(1:1,1708)
OUTFIL FNAMES=SORTOUT3,INCLUDE=(1709,10,PD,EQ,1000001),
  OUTREC=(1:1,1682)
WER108I SORTIN : RECFM=FB ; LRECL= 1745; BLKSIZE= 27920
WER110I SORTOUT1 : RECFM=FB ; LRECL= 1745; BLKSIZE= 31410
WER110I SORTOUT2 : RECFM=FB ; LRECL= 1708; BLKSIZE= 32452
WER110I SORTOUT3 : RECFM=FB ; LRECL= 1682; BLKSIZE= 31958
WER405I SORTOUT1 : DATA RECORDS OUT 78437; TOTAL RECORDS OUT 78437
WER405I SORTOUT2 : DATA RECORDS OUT 111363849; TOTAL RECORDS OUT 111363849
WER405I SORTOUT3 : DATA RECORDS OUT 111363849; TOTAL RECORDS OUT 111363849
You have one input file with 111 million records on it.
You have three output files.
The first output file contains 78,437 records. That is about 0.07% of the input. The Easytrieve will hardly notice if it has to read those extra records and ignore them.
The two large output files are almost the same as the large input file, just missing a little bit at the back: 37 bytes for the first and 63 for the second. If you change whatever uses those files to work with the little extra length, then you save yourself writing and storing those two files.
Now, you don't have to read the main input specifically to create the small file. In the first place where a program has to read the big file anyway, you can amend it to also create the small file. This will add very little processing time to that program and hardly overburden it with complexity.
So, if you run everything off your existing main input file, creating the small file the first time the main input file is read by a program (or at least some time before you need to read the small file) then:
You change all programs to operate with the size of the existing main file
You need new code to create the small file from somewhere where the main input file is already being read
You need code to ignore the records of the small file, as they now remain on the big file
In exchange for the above work, you completely remove the job which is running slowly. The additional resources necessary on a daily basis are just the minimal stuff to identify and write the records to the small file from an application program of your choice which is already reading the main input file.
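Even if no application program can conveniently be amended, a cut-down copy step that writes only the small file would do (a sketch, reusing the INCLUDE from your existing control cards; SMALL is a made-up ddname). It still reads the 111 million records, but writes only 78,437 of them, dropping the two big writes and the 222 million shortening moves:
Code:
  SORT FIELDS=COPY
  OUTFIL FNAMES=SMALL,INCLUDE=(1709,10,PD,NE,1000001)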
To get these benefits, you have minor coding changes to application programs.
These benefits are, in a year, about a month of elapsed time and five-and-a-half days of CPU time saved. Plus you save on holding two additional copies of the vast majority of your main input file on DASD. I just tried to calculate how much that would be in cylinders and broke my calculator.
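(A rough go at it, assuming plain 3390 geometry of 849,960 bytes per cylinder and ignoring your compression data class: 111,363,849 records at around 1,700 bytes is roughly 190 GB per copy, something over 220,000 cylinders each, before the one-block-per-track waste is even counted.)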
If there is any sort of genuine business, or technical, reason why that can't be done for a file containing 111 million 1700-byte records, I'd like to know it. The "designer" who came up with this in the first place should be encouraged to wear an absurd paper-hat at work for at least a month.
Just change all the programs to read the main file, ignoring any data they don't want. Ditch the SORT/COPY. Create the small file the first time a big file is read. OK, if that can't be done while keeping your programs "working", what more can I say? :-)