Removing selective rows using DELETE SQL

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

Hello

I have a query ( bad pun)

Suppose there is a table with following rows

Kjeld · Posted: Wed Feb 09, 2011 2:55 am

Is this question a student assigment?

Mass deletions are not DB2's favorite. Depending on data quantities, I would consider unloading the data, sort the unload in Branch #, Account #, Date, Type, Amount. The sort routine could possibly be configured only to output the first occurrence of the records of type 'C' for a given combination of the first 4 keys. Alternatively, I would code a small batch program to do it.

Then you load the result dataset to the table with replace option.

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

this is not a student assignment..One time data cleanup as described above needs to be done that is why SQL DELETE would be preferred

dick scherrer · Posted: Wed Feb 09, 2011 9:10 am

Hello,

The better solution would to be do as suggested and retain the "footprints" of what was done. A blind delete provides no audit trail. . .

Also, as this has happened, having a solid solution will help if something similar happens one day.

There is no reason to limit solutions to the smallest amount of sql code that can be written.

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

Sure Dick...

I concur with your point that DELETE does leave no audit trail.. I had the routine safety measures in mind like taking a pre DELETE IMAGE COPY , exhaustive checkouts once DELETE is done so ensure that no unwanted data is remaining and no genuine data is lost.

Development effort for batch program would have made sense had this been a regular cleanup ...

However I just thought of a altogether different approach to tweak the data retrieval SELECT queries so that the records which I want to be deleted are not FETCHED in the first place in any application program.

Something like the below

dick scherrer · Posted: Wed Feb 09, 2011 9:40 am

Hello,

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

Hi Dick

I am wanting to tweak the SELECT because very few rows are corrupted and the development effort required to cleanup them up would be difficult to justify in my establishment.

The above SELECT query which I have highlighted is helping me to get the required rows where TYPE is "C"

But the requirement is to get all the possible rows for each branch / account between two dates. The requirement for TYPE = C still stands . So for a branch / account if on a give day between the given date range , if there are multiple 'C' type records we only select single minimum amount from it.

GuyC · Posted: Thu Feb 10, 2011 1:18 pm

you shouldn't use HAVING for fields which are know before grouping.
there is no point in grouping everything and then discarding <> "C"

Kjeld · Posted: Thu Feb 10, 2011 1:26 pm

The requirement was to DELETE the rows. I am not sure if it is at all possible to delete specific rows from a select with aggregate functions imbedded in it.

GuyC · Posted: Thu Feb 10, 2011 1:30 pm

you can use aggregate functions in a subselect of a delete, but :
I don't think you can Delete 1 of 2 "completely the same rows".
maybe with RID(), but haven't tried that yet.

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

GuyC · Posted: Thu Feb 10, 2011 4:41 pm

DB2 is mostly limited by the knowledge of its users.

Kjeld · Posted: Thu Feb 10, 2011 5:07 pm

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

himanshupant · New User Joined: 21 Mar 2007 Posts: 46 Location: India

dick scherrer · Posted: Fri Feb 11, 2011 12:55 am

Hello,

I'm a bit late to the party, but this:

GuyC · Posted: Fri Feb 11, 2011 1:52 pm

DB2 version ?
I think RID() came in DB2 v9 NFM.

If two rows are completely identical(all columns), the only difference is the RID.

pre v9 you have a few possibilities ( some already explained) :
1) do an unload/sort/reload
2) write a small program with a cursor and do a DELETE WHERE CURRENT OF
3) create a new table and do an insert (select group by)
...

dbzTHEdinosauer · Posted: Fri Feb 11, 2011 3:07 pm

dick scherrer · Posted: Fri Feb 11, 2011 8:33 pm

Testing - we don' need no steenkin' testing. . .

d