Sort Merge tune performance

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

We are processing a huge VSAM file (around 35 million records, record length 1282) and adding around 50,000 records to it from another VSAM file each day. It is like a history file, and we maintain both VSAM and flat-file versions of it as output (we can't change that; it's a business requirement). It takes around 30 minutes to process as of now. We are going to add another 35 million records because we acquired a new company. Any performance tuning help is greatly appreciated.

Skolusu · Posted: Fri Jul 20, 2012 2:00 am

divya_maddi,

You can move the INCLUDE condition upfront. Try these control cards

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

Just tried running the job with the new control card by having the INCLUDE upfront. Could not see an improvement. Please let me know if there is another way.

Skolusu · Posted: Fri Jul 20, 2012 5:17 am

divya_maddi,

Please add //SORTDIAG DD DUMMY to your JCL, re-run the job, and send the complete sysout to the DFSORT hotline: dfsort@us.ibm.com

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

I am going to send the sysout to the email ID you mentioned above, but I am pasting it here as well just in case. Any performance tuning help is greatly appreciated.

Thanks!

Skolusu · Posted: Fri Jul 20, 2012 10:28 pm

divya_maddi,

Here are some recommendations from our resident performance expert.

Your input is a KSDS with a CI size of 1536, which would mean there are 390 CIs per CA. Therefore, I recommend updating SORTIN01 and SORTIN02 as follows:

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

Still could not see an improvement

30.13 minutes with your recommendation
30.78 minutes with our production code

dbzTHEdinosauer · Posted: Sat Jul 21, 2012 12:21 am

is that wall clock time?

what about actual cpu time and a few other run statistics?
start with the simple stuff,
like what is found in the first part of the jes display.

your insistence on wall clock time may not be valid........
on your machine, it may just take that long.

dick scherrer · Posted: Sat Jul 21, 2012 12:32 am

Hello,

How many other processes (online or batch) might be using these files at the same time this job is run?

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

Yes that is wall clock time. CPU time is pretty low - under 3 mins.
No other process uses the file when we use it for this job.
Job runs at midnight and the file is closed down in the online region as well.

Before this we used REPRO for the VSAM copy and then a SORT for flat file copy. Now I have everything in one MERGE. Brought down the clock time from 2 hrs to 30 mins. But just wondering if there is more we can do.

Thanks!

dick scherrer · Posted: Sat Jul 21, 2012 7:49 am

Hello,

If I understand your process, more than 100 million 1.25K records are being read and written, and most of the reads/writes are VSAM.

30 minutes may not be so bad . . . It sounds like the process is running at or near the speed of the DASD.

Just a thought.
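
To put a rough number on "the speed of the DASD", here is a back-of-the-envelope sketch in Python. It assumes the job reads the history file plus the daily adds once and writes the merged result twice (VSAM and flat file), and it ignores records dropped by the INCLUDE and any compression on disk, so treat the result as an order-of-magnitude estimate only. The function names are made up for illustration.

```python
# Rough I/O volume and throughput estimate from the figures quoted in the thread.
RECORD_LENGTH = 1282          # bytes per record
HISTORY_RECORDS = 35_000_000  # records in the history file
DAILY_ADDS = 50_000           # records added per day (quoted average)
ELAPSED_SECONDS = 30 * 60     # about 30 minutes of wall clock time


def records_moved(history=HISTORY_RECORDS, adds=DAILY_ADDS):
    """Records read once plus records written to two outputs."""
    reads = history + adds
    writes = 2 * (history + adds)  # VSAM copy and flat-file copy
    return reads + writes


def throughput_mb_per_sec(records, record_length=RECORD_LENGTH,
                          seconds=ELAPSED_SECONDS):
    """Average data rate in decimal megabytes per second."""
    return records * record_length / seconds / 1_000_000


total = records_moved()
print(total)                                   # 105150000
print(round(throughput_mb_per_sec(total), 1))  # 74.9
```

Under those assumptions the job is moving roughly 75 MB per second of record data on average, with under 3 minutes of CPU, which fits the picture of an I/O-bound job.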

Bill Woodger · Posted: Sat Jul 21, 2012 1:33 pm

If you can show the JCL for the steps from the close of the online to your merge, there may be something. Perhaps to replace the REPRO with SORT, depending on exactly what it is doing.

dick scherrer · Posted: Sun Jul 22, 2012 1:42 am

Hi Bill,

If I understand, the REPRO process took 2 hours.

One thought I had was to use sort to "unload" the VSAM files, merge these sequential files, and then copy the result file into a newly deleted/defined VSAM file.

Might be worth a timing test. . .

Bill Woodger · Posted: Sun Jul 22, 2012 4:39 am

Hi Dick,

I was thinking along those lines. Reviewing the topic, the REPRO has already gone :-)

divya_maddi,

In your first run one record was deleted, so moving the INCLUDE would not have helped, but in your second run nearly 10% of the total was dropped. That will make some difference on a MERGE, but not a huge one.

The BUFND change should be noticeable, even if not always in elapsed time. You have one record per CI. Does the file get used much in batch? How (serial, random, mixture)?

How many levels of index are there?

50k vs 35m is a very small number of inserts. Because of the small CI it might be worth checking on a program to do keyed inserts.

A listcat after the online and after the sort would be interesting.
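
On the "one record per CI" point, a small Python sketch of the arithmetic may help. It assumes a native VSAM KSDS with fixed-length, unspanned records, where each CI carries 10 bytes of control information (a 4-byte CIDF plus two 3-byte RDFs). That overhead figure and the alternative CI sizes are assumptions for illustration; a LISTCAT is the authoritative source, and the numbers may not apply if the file is really managed by a different access method.

```python
# How many fixed-length records fit in one control interval (CI)?
# Assumption: 10 bytes of control information per CI
# (4-byte CIDF + two 3-byte RDFs for fixed-length, unspanned records).
CONTROL_BYTES = 10


def records_per_ci(ci_size, record_length, control_bytes=CONTROL_BYTES):
    """Whole records that fit in the usable part of a CI."""
    return (ci_size - control_bytes) // record_length


def unused_bytes(ci_size, record_length, control_bytes=CONTROL_BYTES):
    """Bytes left over in each CI after the records and control information."""
    usable = ci_size - control_bytes
    return usable - records_per_ci(ci_size, record_length) * record_length


for ci_size in (1536, 4096, 8192):  # 4096 and 8192 are hypothetical alternatives
    print(ci_size, records_per_ci(ci_size, 1282), unused_bytes(ci_size, 1282))
# 1536 1 244
# 4096 3 240
# 8192 6 490
```

With one record per CI and the 390 CIs per CA quoted earlier in the thread, each CA would hold 390 records.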

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

I am sorry for the delay in response.
Following is the definition of the VSAM we are using (for all of them).
We almost always use the VSAM only for inserts and never for reads.
Any help in tuning this would be appreciated.

Yes the move from REPRO to SORT Merge already happened.
We FTP the file with 50,000 records from unix and then sort it into VSAM.
We then sort merge this VSAM with 50,000 records to our history VSAM (and flat file) with 35 million records.
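
For readers less familiar with what the MERGE step does, here is a minimal Python sketch of the idea: two inputs that are each already in key order are combined in one sequential pass. It assumes the key is the first 52 bytes of each record, as in the KEYS(52 0) definition in this post. It is an illustration of the technique only, not how DFSORT is implemented, and the function names are made up.

```python
import heapq

KEY_LENGTH = 52  # KEYS(52 0): 52-byte key starting at offset 0


def record_key(record: bytes) -> bytes:
    """Return the key portion of a record."""
    return record[:KEY_LENGTH]


def merge_sorted(history, daily):
    """Merge two iterables of records that are each already in key order.

    heapq.merge reads both inputs sequentially and never holds more than
    one record per input in memory, which is why a merge suits a large
    history file plus a small daily file.
    """
    return heapq.merge(history, daily, key=record_key)


# Tiny example with made-up records: key padded to 52 bytes, then data.
history = [b"A".ljust(KEY_LENGTH) + b"old1", b"C".ljust(KEY_LENGTH) + b"old2"]
daily = [b"B".ljust(KEY_LENGTH) + b"new1"]
for rec in merge_sorted(history, daily):
    print(rec[:1], rec[KEY_LENGTH:])
```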

We are at liberty to change the definition of the VSAM if it is going to improve performance.

 DEFINE CLUSTER( -
          NAME(VSAM) -
          FOR(0000) -
          FREESPACE(0 0) -
          BUFFERSPACE(24576) -
          OWNER($IAM) -
          INDEXED -
          SPEED -
          SHAREOPTIONS(2 3) -
          KEYS(52 0)) -
        DATA( -
          NAME(VSAM.DATA) -
          CYL(3000 3000) -
          RECSZ(1282 1282)) -
        INDEX( -
          NAME(VSAM.INDEX) -
          CYL(1 1))

enrico-sorichetti · Posted: Thu Jul 26, 2012 9:36 pm

Skolusu · Posted: Thu Jul 26, 2012 9:53 pm

dick scherrer · Posted: Thu Jul 26, 2012 10:07 pm

Hello,

I'd run a timing test to see how this works when doing the merge with QSAM files and then reloading the VSAM.

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

We hold history only for 120 days and it has to be in one file. It is a KSDS VSAM, but our DASD Manager asked us to use IAM as the owner, which helps with compression. I have never worked with IAM files, so I do not know if one needs to call this an IAM file. It is compressed, though: the sequential file occupies much more space than the VSAM even though both have the same data.

BUFFERSPACE is 24576 for all our VSAM file definitions, both input and output.

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

I already tried QSAM followed by VSAM loading. It takes longer, maybe due to the multiple writes?

dick scherrer · Posted: Fri Jul 27, 2012 8:40 pm

Hello,

If there are 50,000 new adds per day and the file keeps data for 120 days, how are there 35 million records? At 50,000 a day, this would be about 1.5 million per month, which would be less than 7 million for 120 days, would it not?
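
The arithmetic behind that question can be checked with a few lines of Python. The sketch assumes a constant daily volume and a strict 120-day retention window; the function names are made up for illustration.

```python
# Steady-state file size for a rolling retention window.
RETENTION_DAYS = 120


def expected_records(adds_per_day, retention_days=RETENTION_DAYS):
    """Records held if the same number arrives every day."""
    return adds_per_day * retention_days


def implied_adds_per_day(total_records, retention_days=RETENTION_DAYS):
    """Average daily volume needed to reach a given total."""
    return total_records / retention_days


print(expected_records(50_000))                 # 6000000
print(round(implied_adds_per_day(35_000_000)))  # 291667
```

So 50,000 a day would account for about 6 million records, while 35 million records in 120 days implies an average closer to 290,000 a day.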

Yes, IAM is not the same as VSAM - program code will work as though it is VSAM, but the underlying mechanics are quite different.

I still don't understand why this needs to be one huge file. As requested earlier, please post how/when this history data is used.

Also, when each monthly job is run, how are the "old" records removed from the file? Hopefully, they are not just "marked" as deleted and are actually removed.

enrico-sorichetti · Posted: Fri Jul 27, 2012 9:11 pm

Skolusu · Posted: Fri Jul 27, 2012 11:12 pm

divya_maddi · New User Joined: 03 Nov 2005 Posts: 33

50,000 a day is an average (a more recent average), and it might go higher than that. But we currently have the total at 35 million for 120 days. Old records are removed each day with the sort I already gave in the beginning. Mentioning it again for your reference:

INCLUDE=(23,10,CH,GE,DATE1(-)-120)
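
For anyone reading along who does not know DFSORT, this is roughly what that condition does, sketched in Python. It assumes the 10 bytes starting at position 23 (1-based) hold a date as 'yyyy-mm-dd' text, which is the format DATE1(-) produces, so a plain character comparison orders dates chronologically. Records are modelled as Python strings here rather than EBCDIC data, and the function names are made up for illustration.

```python
from datetime import date, timedelta

DATE_OFFSET = 22   # position 23 in 1-based DFSORT terms
DATE_LENGTH = 10   # 'yyyy-mm-dd'


def cutoff(today: date, days: int = 120) -> str:
    """Equivalent of DATE1(-)-120: today's date minus 120 days as 'yyyy-mm-dd'."""
    return (today - timedelta(days=days)).isoformat()


def keep(record: str, today: date) -> bool:
    """True if the record's date field is on or after the cutoff (CH,GE)."""
    field = record[DATE_OFFSET:DATE_OFFSET + DATE_LENGTH]
    return field >= cutoff(today)


today = date(2012, 7, 28)
print(cutoff(today))                                    # 2012-03-30
print(keep("x" * 22 + "2012-03-30" + "data", today))    # True
print(keep("x" * 22 + "2012-03-29" + "data", today))    # False
```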

dick scherrer · Posted: Sat Jul 28, 2012 1:50 am

Hello,