Search and find the best possible match from a file

chagoyal · New User Joined: 03 May 2007 Posts: 11 Location: India

Hello, I have a scenario where I have two files, FILE1 and FILE2. FILE1 has the following structure

murmohk1 · Posted: Thu May 03, 2007 3:08 pm

chagoyal,

I dont think you can acheive your requirement through DFSORT or JCL. You need some programming skills for this.

chagoyal · New User Joined: 03 May 2007 Posts: 11 Location: India

Yup.. I felt so ..
But still thought there could be some <if then else logic> that could be incorporated in a jcl through dfsort to achieve this ..

Frank Yaeger · Posted: Thu May 03, 2007 11:06 pm

I'm not convinced yet that this can't be done with DFSORT/ICETOOL, but I need more information to know for sure.

What is the RECFM and LRECL of the each input file?

Please show an example of the records in each input file (relevant values) and what you expect for output. Please show as many variations as you can. You can use something simple for each key value and col value such as a letter or number to keep it simple.

chagoyal · New User Joined: 03 May 2007 Posts: 11 Location: India

Hello Frank,

Thanks for your response. Actually FILE1 and FILE2 are table unloads and can be made any RECFM and although the record size for both the input tables vary, the files can still be made to have the same lrecls by adding a filler in one or both the input files.

Lets assume all the columns in both the files are char(5).

Now lets assume the following data in FILE1
KKKK1KKKK2KKKK3KKKK4KKKK5CCCC1CCCC2CCCC3CCCC4CCCC5
KKKK6KKKK7KKKK8KKKK8KKK10CCCC6CCCC7CCCC8CCCC9CCC10

Lets assume FILE2 has the following data

KKKK1DDDDDDDDDDDDDDDKKKK4NNNN1
KKKK1DDDDDKKKK3KKKK3KKKK4NNNN2
KKKK1DDDDDKKKK4KKKKKKKKK5NNNN3
KKKK1KKKK2DDDDDDDDDDDDDDDNNNN4
KKKK1KKKK2KKKK3DDDDDDDDDDNNNN5
KKKK1KKKK2KKKK3KKKK4KKKK5NNNN6
KKKK4KKKK7KKKK8KKKK9KKK10NNNN7

Corresponding to the first record of FILE1, 6th record of file2 is the best match, i.e NNNN6. For record 2 of FILE1 there is no match hence it will not go in the o/p file.

Lets assume file2 has the first two records only..i.e.

KKKK1DDDDDDDDDDDDDDDKKKK4NNNN1
KKKK1DDDDDKKKK3KKKK3KKKK4NNNN2

In this case both the records occupy same position priority wise but since first record is found first, the effective value for the output will be NNNN1

Again if we assume the FILE2 just has the following three records.

KKKK1DDDDDDDDDDDDDDDKKKK4NNNN1
KKKK1DDDDDKKKK3KKKK3KKKK4NNNN2
KKKK1DDDDDKKKK4KKKKKKKKK5NNNN3

Even now the value for COL6 will be NNNN1

Frank Yaeger · Posted: Fri May 04, 2007 11:19 pm

I'm still not sure I understand your priority scheme. I think that a match of key1 and key2 and key3 has a higher priority than a match of key1 and key2. But does a match of key1 and key3 and key5 have a higher or lower priority than a match of key1 and key2?

Are you only interested in these matches:

key1, key2, key3, key4, key5
key1, key2, key3, key4
key1, key2, key3
key1, key2
Key1

or are you also interested in other matches such as:

Key2
Key1, Key3
Key1, Key2, Key5

and if you are interested in those other matches, then what's the priority scheme for all of the other possible matches?

chagoyal · New User Joined: 03 May 2007 Posts: 11 Location: India

Answers-

1. A match of key1 and key3 and key5 has lower priority than a match of key1 and key2 coz that is equivalent to match on key1 only.
matches only.

2. yes I am interested in the following

key1, key2, key3, key4, key5
key1, key2, key3, key4
key1, key2, key3
key1, key2
Key1

3. Nope- Not interested in those matches..

Frank Yaeger · Posted: Sun May 06, 2007 9:56 pm

chagoyal,

Ok, I think I understand what you want and the DFSORT/ICETOOL job below should do it. To test it, I used the following for the input files:

Input file1

chagoyal · New User Joined: 03 May 2007 Posts: 11 Location: India

Hello Frank,

I tried the solution posted by you .. and it's working !!! Thanks a lot !
YIPEE !!