Use SORT to Split file and omit some records

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

I would like to split my input file into several output files using the following rules:
1. Write each group of lines that starts with a "B" line, followed by several "C", "D", "E" lines, to a new file.
2. If columns 23-25 of the "C" line are not equal to "203", omit the whole group of "B", "C", "D", "E", "C", "D", "E" lines.

My input looks like this:

Bill Woodger · Posted: Mon Apr 21, 2014 1:10 pm

You are going to need to be clearer.

"B" defines the start of a group, no other types of group?

Why isn't your entire input included in output file 1?

Does a "C" record immediately follow a "B" record?

What do you want to do when there are two (or more) "C" records within a "B" group?

Probably more, but it is difficult to tell when so little is known.

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

Bill Woodger · Posted: Mon Apr 21, 2014 2:16 pm

OK, that makes things clearer, and easier.

Use OPTION COPY. On INREC, use IFTHEN=(WHEN=GROUP with a BEGIN that identifies the B record, and PUSH the entire record to a temporary extension of your data: PUSH=(513:1,512).

The B record is now tucked away safely for when you need it.

In OUTFIL, use OMIT= to get rid of all the original B records. Use IFTHEN=(WHEN=(logical expression) to identify the C records, then use the / (Slash Operator) in BUILD to create two records for output, the B record first, followed by the C: BUILD=(513,512,/,1,512). Use IFOUTLEN=512 to set the record length (which also chops the copy of the B record, from position 513 onwards, off all the other records).

I would write something to check, 100%, the structure of your file, and consider how it remains valid, but you've probably already dealt with this...
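
For anyone following along, the description above might translate into control cards roughly like the following. This is only a sketch, not the code that was actually run: it assumes fixed-length 512-byte records with the record type in column 1, and it assumes every C record should be preceded by a copy of its group's B record.

```text
* Sketch only: assumes FB input, LRECL=512, record type in column 1
  OPTION COPY
* Copy each B record to positions 513-1024 of every record in its group
  INREC IFTHEN=(WHEN=GROUP,BEGIN=(1,1,CH,EQ,C'B'),
                PUSH=(513:1,512))
* Drop the original Bs; for each C write the saved B, then the C itself
  OUTFIL OMIT=(1,1,CH,EQ,C'B'),
         IFOUTLEN=512,
         IFTHEN=(WHEN=(1,1,CH,EQ,C'C'),
                 BUILD=(513,512,/,1,512))
```

Records that are neither B nor C match no IFTHEN clause, so they pass through unchanged and are simply cut back to 512 bytes by IFOUTLEN.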

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

Bill Woodger · Posted: Mon Apr 21, 2014 3:13 pm

It is in the OUTFIL that the records need cutting down (using IFOUTLEN in this case).

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

Bill Woodger · Posted: Mon Apr 21, 2014 4:18 pm

The original solution was removing all the B records, as they were being inserted when a C record was encountered.

Input

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

Bill Woodger · Posted: Tue Apr 22, 2014 12:11 pm

I think you've got that, including the NE.

I missed/mis-interpreted the need to drop some stuff.

So back to dropping all the Bs, and generating all the Bs, since it is only the 203 groups you need.

Add a new WHEN=GROUP with a PUSH of the positions that hold the 203. BEGIN this group only when the SEQ added in the first PUSH is 2 (the first C record), and END it on a 'B'. Ensure that the SEQ is big enough for the maximum number of records in a group, and then add another digit.

Use the 203 (which is now on all relevant records if present).

Note that other than the first B, each B will have the 203-position-value of the previous group (as they will end the group), but this does not matter, as all the Bs are ignored.

Your INCLUDE=/OMIT= should:

Ignore all Bs in position 1.
Keep all other records which have 203 in your new PUSHed field.

In the OUTFIL, you need to generate a B with the Slash Operator for every C.

Although you no longer need to reference the sequence number in the OUTFIL, it is now needed earlier for the new WHEN=GROUP.
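
Put together, the steps above might look something like this. Again it is a hedged sketch rather than the tested solution: the 512-byte fixed-length records and columns 23-25 come from the thread, but the SEQ length of 5 and the work positions 1025 and 1030 are arbitrary choices made for illustration, and it assumes the first C record always immediately follows its B record.

```text
* Sketch only: assumes FB input, LRECL=512, record type in column 1
  OPTION COPY
* Group 1: save the B record at 513 and number the records from 1025
* Group 2: starts on record 2 of group 1 (the first C), ends on the
*          next B, and copies columns 23-25 of that C to 1030
  INREC IFTHEN=(WHEN=GROUP,BEGIN=(1,1,CH,EQ,C'B'),
                PUSH=(513:1,512,1025:SEQ=5)),
        IFTHEN=(WHEN=GROUP,BEGIN=(1025,5,ZD,EQ,2),
                END=(1,1,CH,EQ,C'B'),
                PUSH=(1030:23,3))
* Drop all original Bs and every record whose pushed value is not 203,
* then regenerate a B in front of each surviving C
  OUTFIL OMIT=(1,1,CH,EQ,C'B',OR,1030,3,CH,NE,C'203'),
         IFOUTLEN=512,
         IFTHEN=(WHEN=(1,1,CH,EQ,C'C'),
                 BUILD=(513,512,/,1,512))
```

The second WHEN=GROUP can test the SEQ at 1025 because IFTHEN clauses are processed in order, so the first clause's PUSH has already been applied by the time the second is evaluated.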

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

I tried this and it works. I added one more condition to the OMIT statement to delete all the 227 lines.

Bill Woodger · Posted: Tue Apr 22, 2014 3:46 pm

OK. Good going. I'd have used NE 203, but if there are only 227 and 203 the results would be the same, if a little less clear (as that knowledge of the data is required to understand the code).

If you have a new question, please post it as a new question.

abby.qiong.zhang · New User Joined: 07 Jun 2012 Posts: 26 Location: China

Sure Bill.

I posted a new topic regarding the count matter, looking forward to your suggestion on this.

ibmmainframes.com/viewtopic.php?p=322658#322658