Grouping of records in a file using sort/DFsort

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Hi All,

I need help for grouping the records in a file, not sure whether we can do this using sort/ICEtool.Here is my requirement.

I need to group the records in my input file based on few fileds in the file

Input file is 2000 bytes long.

Grouping based on

Group 1)

Only Unique records ,Key starts at 13 length 12 ( no duplicates )

Group 2)

All Records with same values in Position starting from 13 ,12 bytes long( char) and position starting from 62,20 bytes long( PD)

Group 3)

All Records with same values in Position starting from 13 ,12 bytes long( char) and position starting from 62,10 bytes long( PD) and different values in position starting from 72,10 byte long (PD).

Group 4)

All Records with same values in Position starting from 13 ,12 bytes long( char) and position starting from 72,10 bytes long( PD) and different values in position starting from 62,10 byte long (PD).

Group 5)

All Records with Position startin from 13 ,12 bytes long( char) and different values in position starting from 62,20 bytes long( PD).

Expected Output file :-

The group name needs to be appended to the end of the each record and needs to be in sorted order based on group name and 13,12 as key.

Frank Yaeger · Posted: Thu Aug 20, 2009 10:33 pm

Please show an example of the records in your input file (relevant fields only) and what you expect for output, for all of the different cases.

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Frank,

Thanks for your quick response.

I am attaching the input and expected output.

20 byte fields is a combination of 4 fields.

20 byte PD can be split into 10 and 10 and we can use.

Group name is just an identifier to identify to which group the record belongs( as i stated earlier the records needs to be grouped to 5 groups)

Frank Yaeger · Posted: Fri Aug 21, 2009 1:28 am

Your expected output attachment doesn't show any group numbers, so it doesn't help me figure out which records in your input example should go to which groups.

Could you please just show the example input records and output records inline here like this:

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Thanks Frank for the response.

The input file is as follows.

Frank Yaeger · Posted: Fri Aug 21, 2009 3:20 am

Are there only one or two records with each 13,12 key as shown in your example, or could there be more (for example, three D...D keys)?

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

There will be only one or two records in the file with the same key value.
There wont be more than 2 for each key ( 13,12).

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Something to add...

If there is only one record for the key (13,12) then it will fall in group 1 and if there are two records for the key (13,12) then it will fall in any of other groups depending on other key values.

Thanks for your help.

Frank Yaeger · Posted: Sat Aug 22, 2009 1:12 am

Here's a DFSORT/ICETOOL job that will do what you asked for:

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Thanks for the response.

I tried using this Job and I'm getting RC 16.

Seems like T2 is exceeding the max record length.

"END OF T2 FIELD BEYOND MAXIMUM RECORD LENGTH "

Please find the attachment for ToolMsg and DFSMSG.

Regards,
Kham

Frank Yaeger · Posted: Sat Aug 22, 2009 2:35 am

It appears that your site has changed DFSORT's shipped default of SOLRF=YES to SOLRF=NO. This change is NOT recommended and can cause unwanted results as in your case. To use SOLRF, you can add the following to your job:

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Frank,

You are the best !!! Its perfect !!!

I was trying to decode what you are doing ? Can you just let me know why are you doing overlay or what is CTL1 and why in the CTL2 you are splicing the records...

Thanks for your support !!!

Frank Yaeger · Posted: Tue Aug 25, 2009 1:48 am

I'm using OVERLAY and SPLICE to create one record for each pair of keys that has the data for the first record of the pair in positions 1-2000 and the data for the second record of the pair in positions 2001-4000. For example:

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Thanks Frank

The document was very useful.

In the CTL2 you are you building a 4005 byte record ?

Can you please help me in understanding what you are trying to build in CTL2 and what is the significance of "/" in the below statement?

BUILD=(1,2000,C'CASE5',/,2001,2000,C'CASE5'))

Frank Yaeger · Posted: Wed Aug 26, 2009 1:20 am

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Thank you very much Frnak !!! I got it..I was looking for the meaning of "/".

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Frank,

A small change in reqirement...

Apart from Key in postion 13, 12 long I am going to have one more key field in 82nd postion which is 5 byte long packed decimal, (date field ex:- "20040804").

So I need to add this field also in the sort cards, but i am getting error 1when i am adding this field in the RESTART of seq number.

Skolusu · Posted: Fri Aug 28, 2009 2:03 am

khamarutheen,

The restart parm needs the keys to be contiguous bytes. So you need to pad the keys together. Change your CTL1CNTL to the following and re-run your job.

khamarutheen · Active Member Joined: 23 Aug 2005 Posts: 677 Location: NJ

Kolusu,
It worked !!!
I changed IFTHEN=(WHEN=INIT from below code as it was giving syntax error

Skolusu · Posted: Fri Aug 28, 2009 9:43 pm

khamarutheen,

oops my control cards had a typo when i was extending the control cards to the next line. sorry about that , but l am glad that you fixed them.

The correct control cards are