EZT prog to print duplicate records in an output file

grayWolf · Posted: Wed Nov 02, 2011 12:38 pm

Hi All,
We got a requirement to write all the duplicate records from the input file into an output file using Eztrieve.

Input file layout:

101001ABCDXYZQWERTY
102002GHJJRWQWEWTUY
102002ASDSGFHRJKNXZ
102002AGHHASDHHHJDF
103004BNVBNGNDDFGSF

First 3 bytes -> Department number
Second 3 bytes -> Stock number

If there are duplicates with respect to the Dept number/Stock number combination, we are supposed to write those records into an output file.
In the example above, since 102/002 is repeated 3 times, we must write these 3 records into an output file.

The logic that we followed is as follows:

1) Read the file
2) Move the file variables into Working storage variables
3) read the file
4) If the present record is equal to the previous record, write it into the output file.

So in my logic, only 1 record is written into the output file if there are TWO records as duplicates.

Please let me know what should be the logic that has to followed in case I need to write all the duplicates into the output file.
For example if there are 100 records as duplicates, all the 100 records should be in the Output file.

Thanks!

Bill Woodger · Posted: Wed Nov 02, 2011 12:56 pm

This is not really an Easytrieve question, it is "language independent" logic.

From what you have shown, I don't know why you don't get them all.

dbzTHEdinosauer · Posted: Wed Nov 02, 2011 12:57 pm

answer not appropriate, deleted by DBZ

dbzTHEdinosauer · Posted: Wed Nov 02, 2011 2:23 pm

maybe this:

dupped = ""
save_area = ""
save_key = ""

LOOP:

read record

if EOF

if dupped = "Y"
- write save_area
- END-OF-JOB

if save_key = record_key

write save_area
dupped = "Y"

if save_key not = record_key

if dupped = "Y"
- write save_area
- dupped = ""

move record to save_area
move record_key to save_key

GOTO LOOP

Bill Woodger · Posted: Wed Nov 02, 2011 3:07 pm

There's me wondering why dbz has all that stuff in, so I go back and look, and yes you need the record it is duplicating against as well.

So, all "one behind", store the whole record, and remember to check for the last stored one to write at the end of processing, or when encountering the first duplicate, write out the one that it duplicates, then the duplicate, then again no need for extra at end of file.

grayWolf · Posted: Wed Nov 02, 2011 3:22 pm

dbzTHEdinosauer · Posted: Wed Nov 02, 2011 3:46 pm

enrico-sorichetti · Posted: Wed Nov 02, 2011 3:54 pm

Bill Woodger · Posted: Wed Nov 02, 2011 4:01 pm

PeterHolland · Posted: Wed Nov 02, 2011 7:10 pm

Chapter 12 of the EZT Reference Guide 6.2 describes : Single File Keyed Processing.

Using Synchronized File Processing on a single file enables you to compare the
contents of a key field or fields from one record to the next and use IF tests to
group records according to the key fields. The file name is coded on the JOB
INPUT statement as follows:
JOB INPUT (filename KEY (keyfield...))

Bill Woodger · Posted: Wed Nov 02, 2011 7:34 pm

Perfect, Peter.

dick scherrer · Posted: Wed Nov 02, 2011 7:58 pm

Hello,

What is the maximum number of duplicates possible for "a key"?

enrico-sorichetti · Posted: Wed Nov 02, 2011 8:26 pm