Syncsort: Eliminating the first occurance of dup record.

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Hi,

Here is my req..
I have a file which has four columns and i am sorting based on the first three columns.
c1 c2 c3 c4
------------------------
a1 a2 a3 v1
a1 a2 a3 v2
a1 a2 a3 v3

I have to sort based on only the first three columns and eliminate the duplicates.But i wanted the third row in my output sorted file..
a1 a2 a3 v3 (Required row)

Please help....

nelson.pandian · Posted: Thu Feb 12, 2009 4:16 pm

Hi Naresh Kareti,

The DFSORT/ICETOOL job will gives you desire output.

Arun Raj · Posted: Thu Feb 12, 2009 7:24 pm

Naresh Kareti,

What if your input has records without duplicates? Say something like this.

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Arcvns,

Ur right..my file can contain few duplicates and few non-duplicates..My output should always contain the last occurance of duplicate..

In a file, if there are 'n' duplicates, i want the n'th record in the o/p file..

I have given the values in the first post as an example only..we cannot give the values directly to the INDD.

I have to give the input file to the sortin and want the last occurance of duplicate record in the sortout file..i have to sort based on few columns and note that all the rows will be at same level(refer for the example in the first post)

Please let me know if the req is not clear...

Arun Raj · Posted: Thu Feb 12, 2009 11:41 pm

Naresh,

Post the expected output for the above input data.

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Arun,

From ur example..i wanted the the following in the sorted file

A1 A2 A3 V3---last duplicate from the top three rows as we r sorting based on first three columns
A1 A2 A4 V1
A1 A2 A5 V1

gcicchet · Posted: Fri Feb 13, 2009 8:27 am

Hi,

try

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Let me tell wat i am doing currently

the pgm is as follows

Arun Raj · Posted: Fri Feb 13, 2009 1:04 pm

Naresh,

For your requirement, you can slightly modify Gerry's card as

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

if i want to sort the 1st, 3rd and fourth fields, then how shud i give the sort condition in the ON fileds..

Arun Raj · Posted: Fri Feb 13, 2009 8:50 pm

Naresh Kareti,

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Thanks everyone...this is working absolutely fine...but cant we achieve the same result using 'SORT" techniques..

In my pjct i may not use ICETOOL..

Arun Raj · Posted: Mon Feb 16, 2009 10:32 am

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Hi,

I am actually sorting based on the following fileds
SORT FIELDS=(1,7,CH,A,26,18,CH,A,472,18,CH,A).

So how to give these fileds in the SECTIONS()...I am using lrecl of 1000 for the o/p file.

Arun Raj · Posted: Mon Feb 16, 2009 1:04 pm

Naresh,

As per your latest post, you would need the below SyncSort job to achieve this.

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

Arun,

Thank you very much..it is working as i expected.
Could you please explain how did u arrive at these numbers in the TRAILER3 command.

Arun Raj · Posted: Mon Feb 16, 2009 2:55 pm

nareshkareti · New User Joined: 22 Jul 2008 Posts: 33 Location: Chennai

I have a small issue again...I had the same logic and working fine when the input file have some records.

But when the input file is empty, then this logic is resulting in one row with all spaces in the sorted output.

Can anyone please tell the reason for this.