JCL SORT to split the file and put record count in trailer

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Hi
I am trying to write a sort that splits a file at a particular record identifier (IVR000000), counts those records (i.e. the IVR000000 records, not the number of rows), and puts the count in a trailer record at the end.

My File looks like below:

PTX000000~C5408461~N~MTF~FEL~49948558~Mackenzie Saxon
PTX000000~K6157615~N~MTF~NL~38766490~Mawer Balanced
IVR000000~2036~9567183~E~Mrs.~Syt Est~~C (RECORD STARTING)
IVRADV000~2036~Ptes~Mte~~(416) 555-
PLNSUM000~~CI Investments~46429602~330.28~0.00~0.06~-16.52
PLNSUM000~SDRSP~SLFISI~K5837557~124034.53~4400.0449.30~6014.
IVRSUM000~124718.99~4400.00~0.00~135095.83~5976.84
PTX000000~K5837557~N~MTF~DSC~7291202~CI Harbou
PTX000000~47790432~N~MTF~DSC~7291202~CI Har (RECORD END)
IVR000000~6248~9762202~E~~Jltes~~Cmtes (NEW RECORD STARTING)
IVRADV000~6248~Mctestm Ctestmc~Whtestwht~~(519
IVR999999~000000013 (TRAILER RECORD)

Each record set starts with an 'IVR000000' row, so when the file is split, all the rows up to RECORD END must stay in the same output file, and each new record set must begin with its 'IVR000000' row.

I am not sure whether this can be done with SORT, ICETOOL, or some other utility.
Please help!!

Bill Woodger · Posted: Wed Feb 17, 2016 4:52 pm

Can you show your input again, in the code-tags, not with bolding or other highlighting?

Can you also show the output you expect for that data, not just say something woolly and general? In the code-tags again, please.

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Thanks for your quick reply.

My sample input file is attached. It has 4 record sets (i.e. 4 IVR000000 records).

If the input file is split into 2 files (File1 with 3 record sets and File2 with 1 record set), the output files should look like the attached Output file1 and Output file2.

I hope this clears up my query. Please let me know if more clarification is required.

Thanks a ton

Terry Heinze · Posted: Wed Feb 17, 2016 8:00 pm

Yogesh Jaiswal,
Please use code tags, not attachments.

Nic Clouston · Posted: Wed Feb 17, 2016 8:33 pm

Still not clear - your first example shows a trailer saying 13 whereas there are only 2 IVR000000 records present. Your second example shows only one set being extracted whereas it is implied ALL IVR000000 sets are to be selected. If you want only one set then what are the criteria for distinguishing that set from the other sets?

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Sorry for the confusion and for using attachments.

In the first example, the trailer should be 2, not 13 (my mistake).
In the second example, I have given only 4 IVR000000 record sets. If they are split into two files, with 3 sets in File1 and 1 set in File2, then the record set count of each file must also be calculated and inserted as its trailer, which is 3 and 1 respectively.

In reality, I have more than 3-4 lacs IVR000000 record sets, which I need to split into 4-5 files. The record sets should be distributed so that file1 contains 1 lac record sets, file2 contains 1 lac, ....., and the nth file contains the rest of the record sets.

There are no criteria for selecting IVR000000 record sets (no record should be excluded; the big file is only split into 4-5 files). The only condition is that every new client record starts with an IVR000000 entry and its details are in the subsequent lines, up to the line before the next IVR000000. All of this is one record set and must not get mixed with the details of another IVR000000.

Hope this time I am clear. Thank you all for your patience.
If still there is any confusion, please let me know.

Nic Clouston · Posted: Thu Feb 18, 2016 3:06 am

What are/is 'lacs'? It does not seem to be a computing term.

enrico-sorichetti · Posted: Thu Feb 18, 2016 3:11 am

Some kind of Indian unit ... 1 lac = 100,000, IIRC.

RahulG31 · Active User Joined: 20 Dec 2014 Posts: 446 Location: USA

You need to create groups with BEGIN=(1,6,CH,EQ,C'IVR000'). If you are not bothered about the order/sequencing of groups then, I think, you can do it by using WHEN=GROUP with ID=1.

The ID value starts at 1 and is incremented by 1 for each new group, but since we have the ID as 1 byte it can only hold the values 0 - 9.

Now you can send separate groups to separate output files depending on ID values.

The trailer with count can be added later with TRAILER1.

I haven't tried it but I believe it should be along the same lines.
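
A minimal sketch of the idea above, not tested against the poster's data. It assumes fixed-length 80-byte records; the LRECL of 80, the use of columns 81-88 for the group number, and the DD names OUT1 and OUT2 are all illustrative. It uses ID=8 rather than ID=1 so that the group number does not run out after nine groups, and it compares the number as characters, which works because the ID is generated with leading zeros. Groups 1 to 3 go to the first file and everything else to the second:

```jcl
* Illustrative only: assumes RECFM=FB, LRECL=80, output DDs OUT1/OUT2.
  OPTION COPY
* Leave the existing trailer out so it does not end up in a group.
  OMIT COND=(1,9,CH,EQ,C'IVR999999')
* Each 'IVR000000' record starts a new group; every record in the
* group gets the same 8-digit group number in columns 81-88.
  INREC IFTHEN=(WHEN=GROUP,
                BEGIN=(1,9,CH,EQ,C'IVR000000'),
                PUSH=(81:ID=8))
* Route whole groups by number, then cut the number off again.
  OUTFIL FNAMES=OUT1,INCLUDE=(81,8,CH,LE,C'00000003'),BUILD=(1,80)
  OUTFIL FNAMES=OUT2,SAVE,BUILD=(1,80)
```

Because the split is made on the group number, a group can never be divided between two files. For more files, add one OUTFIL per range of group numbers and keep SAVE on the last one.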

kranthikumarb · Posted: Thu Feb 18, 2016 12:57 pm

Try this. This splits the file into 2 files. Tailor it according to your needs and use include NE C'IVR999999' for your last file.

Bill Woodger · Posted: Thu Feb 18, 2016 4:29 pm

Is your file made up entirely of IVR...PTX... records as the groups, plus the trailer, where you want to know the number of groups?

Then you want to put approximately "n" records in each of three files, with the rest in a fourth file? The reason for "approximately" is that you don't want to split a group across files? And each file needs a trailer generated?

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Thank you so much all for the replies.

One more thing: the file is a VB file with LRECL=2052, so putting the indicator in the last bytes might not work.

Yes Bill, my entire file is made up of records with prefixes like IVR, PTX, PLN, TRX, ...., with a trailer record at the end. So I want to split it in a way that keeps all of these in the split files, so that the split files can be fed to the next job just as the big file was, but with a lower volume of records.

We plan to have 100,000 record sets in one file, and it could be the case that the last 2-3 files will not have any data if there are fewer records.

Bill Woodger · Posted: Thu Feb 18, 2016 11:35 pm

For variable-length records, you extend at the "front" of the record.

If you have only one type of group, the only thing you have to worry about is accidentally including the trailer in a group. So you can OMIT COND= the trailer.

Add a sequence number using INREC and IFTHEN=(WHEN=INIT. Use IFTHEN=(WHEN=GROUP for your group-starter value. PUSH the sequence number to the position of the sequence number. This will give all the records in the group the same sequence number.

Have four OUTFILs. First can be LT 100000 in the sequence number. Second GE 100000 and LT 200000, third GE 200000 and LT 300000, fourth with SAVE (to catch any remainder).
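
The steps above could be sketched roughly as follows. This is an untested illustration under stated assumptions, not Bill's actual control cards: the DD names OUT1 to OUT4 are made up, every record is assumed to hold at least 9 data bytes, and the one-byte flag, the TOT in the trailer, and REMOVECC (to stop the trailer output from getting a carriage-control byte) are additions made here so that the trailer holds the number of groups rather than the number of records.

```jcl
* Illustrative only: VB input, output DDs OUT1-OUT4 are assumed names.
  OPTION COPY
* Drop the input trailer; each output file gets its own trailer.
  OMIT COND=(5,9,CH,EQ,C'IVR999999')
* Extend every record just after the 4-byte RDW:
*   5-12  sequence number, later replaced by the group's first number
*   13    flag, '1' on a group-starter record, '0' otherwise
*   14-   the original data
  INREC IFTHEN=(WHEN=INIT,
                BUILD=(1,4,SEQNUM,8,ZD,C'0',5)),
        IFTHEN=(WHEN=GROUP,
                BEGIN=(14,9,CH,EQ,C'IVR000000'),
                PUSH=(5:5,8)),
        IFTHEN=(WHEN=(14,9,CH,EQ,C'IVR000000'),
                OVERLAY=(13:C'1'))
* Split on the pushed number. BUILD removes the 9 added bytes and
* TOT adds up the flags, which gives the number of groups per file.
  OUTFIL FNAMES=OUT1,REMOVECC,
         INCLUDE=(5,8,ZD,LT,100000),
         BUILD=(1,4,14),
         TRAILER1=('IVR999999~',TOT=(13,1,ZD,M11,LENGTH=9))
  OUTFIL FNAMES=OUT2,REMOVECC,
         INCLUDE=(5,8,ZD,GE,100000,AND,5,8,ZD,LT,200000),
         BUILD=(1,4,14),
         TRAILER1=('IVR999999~',TOT=(13,1,ZD,M11,LENGTH=9))
  OUTFIL FNAMES=OUT3,REMOVECC,
         INCLUDE=(5,8,ZD,GE,200000,AND,5,8,ZD,LT,300000),
         BUILD=(1,4,14),
         TRAILER1=('IVR999999~',TOT=(13,1,ZD,M11,LENGTH=9))
  OUTFIL FNAMES=OUT4,REMOVECC,SAVE,
         BUILD=(1,4,14),
         TRAILER1=('IVR999999~',TOT=(13,1,ZD,M11,LENGTH=9))
```

Since every record in a group carries the sequence number of its group-starter, a whole group always lands in one file, and each file holds roughly 100,000 records rather than exactly 100,000 groups. Pushing ID=8 instead of the sequence number would make the same ranges count groups.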

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Thanks, Bill, for the suggestion, but it's not working. I also dug out some info and tried many things with the BEGIN/GROUP approach, but the file is not getting split. [edit: the word is 'split' not 'splitted' which does not exist]

I am using the below code and also tried many other combinations:

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Also tried this one and getting same result:

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Thank you all for looking into this.

Now I am able to split the files and also put the trailer at the end of each by using 2 steps as below:

Bill Woodger · Posted: Mon Feb 22, 2016 9:10 pm

I'm not sure why you felt two steps would be needed.

Yogesh Jaiswal · New User Joined: 31 May 2015 Posts: 12 Location: India

Thanks a lot, Bill. With your suggestions and help, I have achieved what I wanted in a single sort step.
As you suggested, I tried a single OMIT instead of checking in every OUTFIL, and also tried using the sequence number to form the group, but the results were not correct, maybe because I am performing multiple things in a single sort step.
My final sort is as below: