Optimal usage of Joinkeys

Gururajan Balaji · New User Joined: 10 Jan 2011 Posts: 6 Location: Chennai

Hi All,
I Have to compare 2 files on (1,9 bytes) and get matching records from file2.

First file lrecl=10,recfm=fb -> Has no dups on key (May contain 1 million records)
second file lrecl=274,recfm=fb --> has dups on key (may contain 500 million records or more)
both the files are sorted on key.

I use the below sort card for comparison.

daveporcelan · Posted: Fri Jan 21, 2011 7:01 pm

If file two is already sorted on the key, then why not use

Frank Yaeger · Posted: Sat Jan 22, 2011 2:20 am

In addition to doing a COPY instead of a SORT (as suggested by Dave), I would also try reversing the files so the file with dups and more records is used as F1 to see if that performs better:

Gururajan Balaji · New User Joined: 10 Jan 2011 Posts: 6 Location: Chennai

Thanks Dave & Frank.
Will there be an Improvement in performance if I use PARM='CORE=MAX'?

Frank Yaeger · Posted: Tue Jan 25, 2011 12:13 am

CORE=MAX is equivalent to MAINSIZE=MAX which is the shipped default for DFSORT so you won't see any difference unless your site has changed the shipped default. Since I don't know what your site defaults are, I can't tell you if CORE=MAX will have any effect.

David Eisenberg · New User Joined: 15 Nov 2007 Posts: 39 Location: New York

>I would also try reversing the files so the file with dups and more records is used as F1 to see if that performs better<

Frank,

Can you tell us a little more about what prompted this suggestion? Are there generally-applicable performance considerations when choosing which file should be F1 and which should be F2 in a JOIN when one (or both) of the files is very large?

Thanks,

David

Frank Yaeger · Posted: Tue Jan 25, 2011 11:54 pm

The only thing I can say is that using F1 for the file that has "more" duplicates may result in more efficient processing. I can't really be more specific than that as it depends on a lot of factors. The only way to know for sure is to try it both ways.

David Eisenberg · New User Joined: 15 Nov 2007 Posts: 39 Location: New York

Frank,

That's helpful; thanks.

David