remove duplicates and merge two files

Gopalakrishnan V · Posted: Mon Aug 23, 2010 6:35 pm

Y000263.RMSFEED.FILE1 :

010110011025247 XXXXXX
010110012300940 051461
010110012310940 111111

Y000263.REJECT.FILE1:

010110011025247 YYYYYY
010110011026855 051466

EXPECTED RESULT:
010110011025247 XXXXXX
010110011026855 051466
010110012300940 051461
010110012310940 111111

MY CODING:

Robert Sample · Posted: Mon Aug 23, 2010 6:49 pm

Why is this in the COBOL forum? If you use SYNCSORT, it should be in the JCL forum; if you use DFSORT, it should be in the SORT forum.

sqlcode1 · Active Member Joined: 08 Apr 2010 Posts: 577 Location: USA

Gopalakrishnan V,
What is the LRECL and RECFM of the input file(s)?

Also, why couldn't you feed just 1 input file (Y000263.RMSFEED.FILE1) to this step, remove duplicates and then later in the next step/program concatenate both the files?

Thanks,

Garry Carroll · Posted: Mon Aug 23, 2010 7:48 pm

sqlcode1

Garry Carroll,
I agree with you, and yes, the OP can't use the method I described above.

From his sample input and expected output, he is looking for the unpaired records from File1 and File2 along with the paired File1 records. If he has duplicates in File1 (010110012310940), then he wants all the dups in the output. In other words, he wants a record from File2 only if its key is not present in File1.
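To make that reading of the requirement concrete, here is a minimal Python sketch (not DFSORT, just an illustration of the logic). It assumes the key is the first 15 characters of each record, which is inferred from the sample data rather than confirmed by the OP; `merge_prefer_file1` and `KEY_LEN` are hypothetical names.

```python
# Sketch only: the 15-character key width is an assumption based on the sample data.
KEY_LEN = 15


def merge_prefer_file1(file1_records, file2_records, key_len=KEY_LEN):
    """Keep every File1 record; add File2 records whose key is absent from File1."""
    file1_keys = {rec[:key_len] for rec in file1_records}
    unmatched_file2 = [
        rec for rec in file2_records if rec[:key_len] not in file1_keys
    ]
    # sorted() is stable, so any File1 duplicates keep their original order.
    return sorted(file1_records + unmatched_file2, key=lambda rec: rec[:key_len])


# Sample records copied from the first post.
rmsfeed = [
    "010110011025247 XXXXXX",
    "010110012300940 051461",
    "010110012310940 111111",
]
reject = [
    "010110011025247 YYYYYY",
    "010110011026855 051466",
]

merged = merge_prefer_file1(rmsfeed, reject)
for rec in merged:
    print(rec)
```

On the sample data this keeps the XXXXXX record from the feed file, drops the YYYYYY record from the reject file, and brings in 010110011026855 from the reject file.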

Gopalakrishnan V,
Could you also give us your DFSORT function level?

Thanks,

dneufarth · Posted: Mon Aug 23, 2010 9:05 pm

Help me understand the OP's examples. It's been a while since I've slept.

I see no dups in the first 15 characters except for the first record in each file. Shouldn't all records with no dups be in the output?

EQUALS will get the first record.
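A rough Python analogue of that idea, for illustration only: Python's `sorted()` is stable, which plays the role of EQUALS by keeping records with equal keys in their input order. The sketch assumes a 15-character key, that the feed file is concatenated ahead of the reject file, and that everything after the first record of each key is then dropped (in DFSORT this is commonly paired with SUM FIELDS=NONE; that pairing is my assumption, not something stated in this thread). `sort_keep_first` is a hypothetical name.

```python
from itertools import groupby

KEY_LEN = 15  # assumed key width, taken from the sample data


def key_of(rec):
    """Return the sort key: the first KEY_LEN characters of the record."""
    return rec[:KEY_LEN]


def sort_keep_first(records):
    """Stable sort by key, then keep only the first record of each key."""
    ordered = sorted(records, key=key_of)  # stable, so input order breaks ties
    return [next(group) for _, group in groupby(ordered, key=key_of)]


feed = [
    "010110011025247 XXXXXX",
    "010110012300940 051461",
    "010110012310940 111111",
]
reject = [
    "010110011025247 YYYYYY",
    "010110011026855 051466",
]

# Feed first, reject second: on a key collision the feed record wins.
result = sort_keep_first(feed + reject)
for rec in result:
    print(rec)
```

Note that this also collapses any duplicates inside the feed file itself, so it only fits if the OP does not need to keep those.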

sqlcode1

dneufarth,

Frank Yaeger · Posted: Mon Aug 23, 2010 11:32 pm

Gopalakrishnan V · Posted: Tue Aug 24, 2010 9:56 am

Hi,
A COBOL program reads the input file y000263.rmsfeed.file1; any invalid record it finds is written to y000263.reject.file1.

The next day, a support team corrects the reject file, and it is merged with y000263.rmsfeed.file1.

The main point is that if a record is missing from y000263.rmsfeed.file1, we have to take that record from y000263.reject.file1, if it is available there.
Otherwise, if a record is duplicated, we should drop the y000263.reject.file1 copy.

LRECL=60, RECFM=FB

Garry Carroll · Posted: Tue Aug 24, 2010 12:14 pm

dneufarth · Posted: Tue Aug 24, 2010 12:32 pm

Seems like this should all be in a single program that processes 'rmsfeed' while resolving the processing order of the corrected records and creating a 'reject' file for correction later. Then the cycle repeats on the next scheduled run.

Corrected records seem to supersede a more current feed.

dneufarth · Posted: Tue Aug 24, 2010 12:49 pm

I'm no SORT guru, so in plain speak:

As the corrected rejects are concatenated last, perhaps:

BUILD a record with a sequence number at the end of the record.

SORT on the key ascending and the sequence number descending.

With OPTION EQUALS, this should always resolve a dup between an 'rmsfeed' record and a 'reject' record in favor of the reject.

BUILD the record again without the sequence number.

Now take this file into the program that processes whatever and yields the latest reject file.

Search the DFSORT or JCL forums for BUILD examples.
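The steps above can be sketched in Python to see what they produce. This is an illustration of the logic, not DFSORT syntax; the 15-character key and the 8-digit sequence number width are assumptions, and `dedup_with_seq` is a hypothetical name.

```python
KEY_LEN = 15    # assumed key width, taken from the sample data
SEQ_WIDTH = 8   # hypothetical width for the appended sequence number


def dedup_with_seq(records, key_len=KEY_LEN, seq_width=SEQ_WIDTH):
    """Append a sequence number, sort key asc / seq desc, keep first per key."""
    # Step 1: build each record with a zero-padded sequence number at the end.
    tagged = [
        f"{rec}{seq:0{seq_width}d}" for seq, rec in enumerate(records, start=1)
    ]
    # Step 2: sort on key ascending and sequence number descending.
    tagged.sort(key=lambda r: (r[:key_len], -int(r[-seq_width:])))
    # Step 3: keep only the first record of each key.
    kept = []
    last_key = None
    for rec in tagged:
        key = rec[:key_len]
        if key != last_key:
            kept.append(rec)
            last_key = key
    # Step 4: rebuild each record without the sequence number.
    return [rec[:-seq_width] for rec in kept]


feed = [
    "010110011025247 XXXXXX",
    "010110012300940 051461",
    "010110012310940 111111",
]
reject = [
    "010110011025247 YYYYYY",
    "010110011026855 051466",
]

# Rejects concatenated last, as the post describes.
rejects_last = dedup_with_seq(feed + reject)
for rec in rejects_last:
    print(rec)
```

With the rejects concatenated last, the descending sequence number puts the reject copy first, so the YYYYYY record survives. The expected result at the top of the thread keeps the XXXXXX feed record instead, so this recipe matches the OP's request only if the concatenation order (or the sequence number direction) is reversed.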

Garry Carroll · Posted: Tue Aug 24, 2010 12:56 pm

dneufarth · Posted: Tue Aug 24, 2010 1:07 pm

Garry,

Correct - had it backwards in my mind. Darn fine solution to get those corrected records in there first though.

Have no clue what I was thinking to bother with all that diatribe that is much ado about nothing.

sleepless in Cincinnati - got hung up on corrected records for some reason