Need to sort the INPUT file based on the key

sindhuvava · New User Joined: 05 Dec 2006 Posts: 17 Location: Chennai

This is the copybook layout of my input file:
Code:

05 SSN PIC X(09).
05 HICN PIC X(12).
05 FIRST-NAME PIC X(30).
05 MID-INT PIC X(1).
05 LAST-NAME PIC X(40).
05 BIRTH-DT PIC 9(8).
05 GNDR-CD PIC X(1).
05 COV-EFF-DT PIC 9(8).
05 COV-CANC-DT PIC 9(8).
05 UBOI PIC X(20).
05 REL-CD PIC X(2).
05 TRANS-TYPE PIC X(3).
05 DET-IND PIC X(1).
05 REASON PIC X(2).
05 SBSDY-EFF-DT PIC 9(8).
05 SBSDY-CANC-DT PIC 9(8).
05 APPL-ID PIC X(10).
05 NASCO-GRP-ID PIC X(9).
05 NASCO-SUBGRP-ID PIC X(4).
05 NASCO-PKG-NBR PIC X(3).
05 NASCO-SUB-ID PIC 9(13).
05 NASCO-DEP-ID PIC 9(2).
05 SRC-SYS-KEY PIC 9(3).
05 FILLER PIC X(15).
05 NOTIF-REAS PIC X(4).
05 FILLER PIC X(35).
05 Timestamp PIC X(14).
05 FILLER PIC X(127).

So i need to sort the INPUT file based on the key(NASCO-SUB-ID,NASCO-DEP-ID,SRC-SYS-KEY) to get the latest record with respect to the Timestamp.

The key which i have mentioned above forms the unique key for that input file. so i need to retrieve the latest record containing the highest timestamp from the input file for each unique key.

In the Input file, some of the records contains 'blanks' for the columns NASCO-SUB-ID,NASCO-DEP-ID respectively.

when i tried to sort using JCL, the records conatining blanks in the Key field are getting eliminated.

All the records contain SRC-SYS-KEY column in the input file.

please suggest me how to proceed.

nuthan · Posted: Wed Oct 17, 2007 10:47 am

Can you give ur JCl sort card which u have used for this.

sindhuvava · New User Joined: 05 Dec 2006 Posts: 17 Location: Chennai

//STEP110 EXEC PGM=SORT
//SYSPRINT DD SYSOUT=*
//SYSOUT DD SYSOUT=*
//SYSUDUMP DD SYSOUT=*
//SORTIN DD DSN=TEST.RDSRSP.HISTORY.NASCO.FILE.OCT07,
// DISP=(OLD,KEEP)
//SORTOUT DD DSN=TEST.HISTORY.NASCO.FILE.OCT15.SORT,
// DISP=(NEW,CATLG,DELETE),
// UNIT=SYSDA,MGMTCLAS=WORK6M,
// LIKE=TEST.RDSRSP.HISTORY.NASCO.FILE.OCT07
//SYSIN DD *
SORT FIELDS=(188,13,CH,A,201,2,CH,A,203,3,CH,A,260,14,CH,D)
//*
//STEP120 EXEC PGM=SORT
//SYSPRINT DD SYSOUT=*
//SYSOUT DD SYSOUT=*
//SYSUDUMP DD SYSOUT=*
//SORTIN DD DSN=TEST.HISTORY.NASCO.FILE.OCT15.SORT,
// DISP=(OLD,KEEP)
//SORTOUT DD DSN=TEST.HISTORY.NASCO.FILE.OCT15.SORT.NEW,
// DISP=(NEW,CATLG,DELETE),
// UNIT=SYSDA,MGMTCLAS=WORK6M,
// LIKE=TEST.HISTORY.NASCO.FILE.OCT15.SORT
//SYSIN DD *
SORT FIELDS=(188,13,CH,A,201,2,CH,A,203,3,CH,A)
OPTION EQUALS
SUM FIELDS=NONE
//*

this was the jcl i used.

the lrecl is 400.

nuthan · Posted: Wed Oct 17, 2007 11:33 am

The key( the three fields which u specified) wont be unique if the records contains 'blanks' for the columns NASCO-SUB-ID, NASCO-DEP-ID. so these recods fall under duplicate cond and was eliminated due to SUM fields.
Correct me if i am wrong, there will be one record which has SRC-SYS-KEY but NASCO-SUB-ID and NASCO-DEP-ID are balnk respectively.

sindhuvava · New User Joined: 05 Dec 2006 Posts: 17 Location: Chennai

Yes!! u r absolutely right!!!

can you provide me an solution how to get the required output

it would be really very much greatful to me..

thanks in advance!!!

nuthan · Posted: Wed Oct 17, 2007 11:46 am

let me know how will you decide the uniqueness of records if the fields NASCO-SUB-ID, NASCO-DEP-ID are balnk. are there any other fields to decide this, if not then it is difficult to separate them out as they fall under one category.

murmohk1 · Posted: Wed Oct 17, 2007 12:07 pm

Sindhu,

Aaru · Posted: Wed Oct 17, 2007 12:08 pm

sindhuvava · New User Joined: 05 Dec 2006 Posts: 17 Location: Chennai

Aaru,

Yes u r absolutely right!!!

i dont want those to be considered as duplicates.

05 NASCO-SUB-ID PIC 9(13).
05 NASCO-DEP-ID PIC 9(2).
05 SRC-SYS-KEY PIC 9(3).

Actually these are the three fields which forms the unique key for the table to be laoded.

murmohk1 · Posted: Wed Oct 17, 2007 12:19 pm

Sindhu,

Aaru · Posted: Wed Oct 17, 2007 2:38 pm

Could you please try this.

1) Seperate the input file into two, one (File 1)without blanks in those two fields and the other (File 2) with blanks in those 2 fields. This can be done using OUTFIL\INCLUDE.

2) Sort the file 1 on those 3 fields and then eliminate the duplicates using SUM FIELDS=NONE

3) Sort the file on the 3rd key field and then eliminate the duplicates.

4) concatenate both the files to get the final output.

Techies, please advise if this can be done in a better way than is.

sindhuvava · New User Joined: 05 Dec 2006 Posts: 17 Location: Chennai

Aaru · Posted: Wed Oct 17, 2007 3:25 pm

Sindhu,

Frank Yaeger · Posted: Wed Oct 17, 2007 9:43 pm

Sindhu,

Here's a DFSORT/ICETOOL job that will do what you asked for: