how to remove matching records from both the input files.

saravanakumar N · New User Joined: 24 Mar 2010 Posts: 5 Location: Chennai

Hi

I have sequential tape file with record length 3900.
I need to compare yesterday's file with today's file and process only the modifed records.
Every day the file is having more 1.5 million records and more than 80% of these records are unchaged. So I want to remove the unchanged records from both the file (yes'day and today file). By this my COBOL program will run with records that need comparison.

The record can have modification on any of its part. So we need to compare the entire 3900 bytes to find the unmodified records.

What will be the approach to achieve this. Please help me on this

Bill Woodger · Posted: Fri Apr 06, 2012 4:46 pm

Fixed or variable-length records?

If only your Cobol program needs this, and the files are in sequence, you could do it as a "two-file match" (sticky at the top of the forum).

Are the records in sequence?

The location of the data (dasd or tape) is irrelevant. Having said that, tape is generally "sequential" isn't it?

saravanakumar N · New User Joined: 24 Mar 2010 Posts: 5 Location: Chennai

Hi Bill

Its a Fixed length record.
these are sequential files. Records are not in sorted order. we can sort it based on a key of 17 bytes., which is starting from 1st position of the record.

Bill Woodger · Posted: Fri Apr 06, 2012 6:46 pm

There is a "sticky" in the Cobol forum of a program which performs the two-file match (assorted other names are possible).

Basically, it takes two input files and matches them on a key. What you code for what happens when a match or mismatch occurs is up to you.

However, your files are unsorted, so you need to sort anyway, so may as well do what you can in the sort with anything left being done in the program.

So, you need a JOINKEYS application. You'll need to match with the whole record, so the application will automatically sort on the whole record.

Can you knock up a "sample" of inputs and expected output. Some short (<80 bytes) records but which show fully the situation you want to identify.

enrico-sorichetti · Posted: Fri Apr 06, 2012 7:05 pm

dick scherrer · Posted: Fri Apr 06, 2012 8:30 pm

Hello,

Skolusu · Posted: Fri Apr 06, 2012 10:08 pm