DB2 LOAD utility (how to avoid duplicate records)

Nagendran · New User Joined: 24 Jan 2007 Posts: 89 Location: USA

Hi,

I am using the DB2 Load utility to load records into a DB2 DB. While loading, I want to discard duplicate records so they are not inserted into the DB.

Note: There is no primary key in the DB.

For example, say a record is 50 bytes long. I have to enforce a constraint on these 50 bytes to check for duplicates.

Can anybody please help me?

Regards,
Nagu

expat · Posted: Sun Jul 29, 2007 6:15 pm

Why not sort the input file first and use the capabilities of the sort product to remove duplicate records?
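
To illustrate the sort-first idea, here is a minimal sketch in Python rather than in any particular sort product's control statements. It assumes fixed-length 50-byte records already read into memory; the name `dedupe_records` is hypothetical and used only for this example.

```python
from itertools import groupby

RECORD_LENGTH = 50  # assumed fixed record length, taken from the question


def dedupe_records(records):
    """Sort the records and keep one copy of each distinct record."""
    for rec in records:
        if len(rec) != RECORD_LENGTH:
            raise ValueError(f"expected {RECORD_LENGTH} bytes, got {len(rec)}")
    # After sorting, identical records sit next to each other,
    # so groupby collapses each run of duplicates into one key.
    return [key for key, _ in groupby(sorted(records))]


if __name__ == "__main__":
    sample = [b"B".ljust(50), b"A".ljust(50), b"B".ljust(50)]
    for rec in dedupe_records(sample):
        print(rec[:1].decode())
```

This only removes duplicates within the input file. It does nothing about records that are already in the table.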

Nagendran · New User Joined: 24 Jan 2007 Posts: 89 Location: USA

That's not my problem.

My problem is that when I try to insert a record which is already in the DB, it should be discarded.

stodolas · Posted: Sun Jul 29, 2007 8:00 pm

Write a program to check for existence in the table first. Or define a primary key across all fields in the table. I don't believe the load utility can perform any logic.
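
A rough sketch of the "check for existence first" approach, written in Python with an in-memory SQLite table standing in for the DB2 table. The table name `load_target`, the single column `rec`, and the function names are all assumptions made for this example, not anything from the original poster's system.

```python
import sqlite3


def make_demo_table():
    # In-memory SQLite stands in for the DB2 table (assumption for this sketch).
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE load_target (rec TEXT NOT NULL)")
    return conn


def insert_if_absent(conn, record):
    """Insert the record unless an identical row exists; return True if inserted."""
    cur = conn.execute("SELECT 1 FROM load_target WHERE rec = ?", (record,))
    if cur.fetchone() is not None:
        return False  # already in the table: discard the incoming record
    conn.execute("INSERT INTO load_target (rec) VALUES (?)", (record,))
    return True


if __name__ == "__main__":
    conn = make_demo_table()
    for rec in ["AAA", "BBB", "AAA"]:
        status = "inserted" if insert_if_absent(conn, rec) else "discarded"
        print(rec, status)
```

With a unique key defined across the columns, the database would reject the second insert itself, so the explicit SELECT would not be needed.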

SharathG · New User Joined: 23 Jan 2007 Posts: 12 Location: India

Hi,

The best option for loading a table from a sequential dataset is to use Load with the Replace option. It replaces all the existing data in the table with the data from the PS dataset. Here's the syntax:
LOAD DATA LOG NO
REPLACE
NOCOPYPEND
DISCARDS 1
STATISTICS TABLE(ALL) INDEX(ALL)
INDDN SYSREC00
INTO TABLE <table_name>
(Columns)

You can add this to the customized load control card and use it while loading.

ashwinreddy · Posted: Thu Aug 02, 2007 7:43 pm

Hi,

We use the Replace option to replace all the data in a table space.

It does not eliminate duplicates.

As far as I know, we can't avoid duplicates with the Load utility.

As other members suggested, write a program or a JCL step to remove the duplicates.

My knowledge may be limited; let's see what other members say.

Cheers
Ashwin

dick scherrer · Posted: Thu Aug 02, 2007 8:16 pm

Hello,

From 2 of your posts:

stodolas · Posted: Thu Aug 02, 2007 8:21 pm

Table access without a primary key will be fine on small tables, but once the table is bigger than a few thousand records, you are going to start seeing poor performance without one.

There are very few reasons for not defining a primary key.

Nagendran · New User Joined: 24 Jan 2007 Posts: 89 Location: USA

I got the solution from your replies.

Thanks to all.

Suryanarayana.tadala · New User Joined: 03 Nov 2005 Posts: 43 Location: St.Louis

Nagendran,
Could you please elaborate on the solution that you have?

Nagendran · New User Joined: 24 Jan 2007 Posts: 89 Location: USA

Hi,

1) I wrote a query in QMF to identify the duplicate records,
and then moved those duplicate records to a file.

2) Then I wrote a program to delete all the duplicate records from the DB.

3) Next, using the Load utility, I loaded the records in the file into
the DB.

thanks,
Nagu
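
One possible reading of the three steps above, sketched in Python with an in-memory SQLite table standing in for DB2, QMF and the Load utility. It assumes the file holds one copy of each duplicated record; the table name `load_target`, the column `rec`, and all function names are hypothetical.

```python
import sqlite3


def find_duplicates(conn):
    """Step 1: return one copy of every record that occurs more than once."""
    rows = conn.execute(
        "SELECT rec FROM load_target GROUP BY rec HAVING COUNT(*) > 1 ORDER BY rec"
    )
    return [row[0] for row in rows]


def delete_records(conn, records):
    """Step 2: delete every copy of the given records from the table."""
    conn.executemany("DELETE FROM load_target WHERE rec = ?", [(r,) for r in records])


def reload_records(conn, records):
    """Step 3: load a single copy of each saved record back into the table."""
    conn.executemany("INSERT INTO load_target (rec) VALUES (?)", [(r,) for r in records])


def remove_duplicates(conn):
    saved = find_duplicates(conn)  # plays the role of the file in step 1
    delete_records(conn, saved)
    reload_records(conn, saved)
    return saved


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE load_target (rec TEXT NOT NULL)")
    conn.executemany(
        "INSERT INTO load_target (rec) VALUES (?)",
        [("A",), ("B",), ("A",), ("C",), ("A",)],
    )
    print(remove_duplicates(conn))
    print([r[0] for r in conn.execute("SELECT rec FROM load_target ORDER BY rec")])
```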

Suryanarayana.tadala · New User Joined: 03 Nov 2005 Posts: 43 Location: St.Louis

Thanks. I was under the impression that you had some utility which does all this!