Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
Hi all,
I have program logic that will require an OPEN-FETCH-CLOSE of a cursor as many as 1 million times (once per record read from a file).
I am concerned and confused about the overhead this may cause on DB2.
Is it advisable to go ahead with this logic?
Thanks,
Asif
priyesh.agrawal
Senior Member
Joined: 28 Mar 2005 Posts: 1448 Location: Chicago, IL
It will of course cause overhead... what's the need to open and close the cursor so many times?
If it is not really required, move those statements (OPEN and CLOSE) out of the processing loop...
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
I understand your point, Priyesh, but my requirement is:
1) Read a record from the input file.
2) For each record read, query the table and get the result table created. The result table will have multiple rows, so I need to use cursors.
3) In essence, for every record read from the file there has to be an OPEN of the cursor (see the sketch below).
So this is the case... I do have an alternate logic which will use an unload FILE of the table, but first I wanted to confirm with you all...
Please let me know your views.
Thanks,
Asif
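For illustration, a minimal sketch of the per-record pattern described in points 1-3 above; the column and host-variable names are assumptions, not Asif's actual ones:
Code:
-- declared once; the predicate depends on a host variable set from each input record
DECLARE CUST_CSR CURSOR FOR
    SELECT MONTH, REV
      FROM <tablename>
     WHERE CUST_ID = :HV-CUST-ID

-- then, for EVERY record read from the input file (up to 1 million times):
OPEN CUST_CSR
FETCH CUST_CSR INTO :HV-MONTH, :HV-REV    -- repeated until SQLCODE = +100
CLOSE CUST_CSR
Each OPEN makes DB2 build (or at least position) a new result table for that one key, which is where the overhead being asked about comes from.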
William Thompson
Global Moderator
Joined: 18 Nov 2006 Posts: 3156 Location: Tucson AZ
Are you using the rows retrieved, or just looking for their existence? If it is only existence, you should be able to optimize the SELECT for one row.
If your requirements need this, use it. Parallel timing against the unloaded table might be useful.
Possibly creating a summarized version of the larger table could be employed?
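A minimal sketch of one way an existence-only check can be coded for a single row, along the lines William suggests (table, column, and host-variable names are hypothetical):
Code:
SELECT 1
  INTO :HV-FOUND
  FROM SYSIBM.SYSDUMMY1
 WHERE EXISTS (SELECT 1
                 FROM <tablename>
                WHERE CUST_ID = :HV-CUST-ID)
An SQLCODE of 0 then means at least one row exists and +100 means none, with no cursor needed at all.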
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
Yes, I have taken the last approach suggested by you:
Quote:
Possibly creating a summarized version of the larger table could be employed?
But I have one other doubt:
My OPEN cursor will create a HUGE result table (> 1 million rows).
I don't know if that is OK or if it will create some performance issues.
Please suggest.
Thanks.
William Thompson
Global Moderator
Joined: 18 Nov 2006 Posts: 3156 Location: Tucson AZ
Asif Iqbal wrote:
My OPEN cursor will create a HUGE result table (> 1 million rows).
Do you need to process all "> 1 million rows"? If not, how many and which ones?
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
I have to process all the rows in my program.
William Thompson
Global Moderator
Joined: 18 Nov 2006 Posts: 3156 Location: Tucson AZ
It will probably cause some performance issues...
It sounds like you might need to take periodic commits and program for restartability... That would improve things...
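A minimal sketch of the commit/restart idea, assuming a hypothetical checkpoint table and a commit taken every N fetched rows (none of this is from the actual program):
Code:
-- hypothetical checkpoint table, created once
CREATE TABLE CHKPT_RESTART
      (JOB_NAME   CHAR(8)   NOT NULL,
       LAST_KEY   CHAR(10)  NOT NULL,
       ROWS_DONE  INTEGER   NOT NULL,
       PRIMARY KEY (JOB_NAME))

-- executed every N fetched rows, then committed,
-- so an abend can be restarted from LAST_KEY
UPDATE CHKPT_RESTART
   SET LAST_KEY  = :HV-LAST-KEY,
       ROWS_DONE = :HV-ROWS-DONE
 WHERE JOB_NAME  = :HV-JOB-NAME

COMMIT
The read cursor would normally be declared WITH HOLD so that it stays open across the COMMITs.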
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
OK William, thanks for your response.
Let me look into this and I will let you know if there are any issues...
adarsha
New User
Joined: 28 Dec 2006 Posts: 8 Location: Noida,Delhi
The idea of downloading all the data into a flat file in a previous step is better than using SQL statements simply to retrieve the data from the tables, which usually affects the performance of the system (including precompile, compile, bind & execution).
One more concern is that sometimes the data retrieved into the flat file will contain junk values added in place of spaces by the unload utility...
You should be careful about that if your program is sensitive to those issues!
DavidatK
Active Member
Joined: 22 Nov 2005 Posts: 700 Location: Troy, Michigan USA
Asif,
What kind of processing are you going to be doing with the rows fetched from the DB2 tables? Are you doing any table updates? Give us some basic pseudo code of the processing (reading, writing, updating, summarizing, etc.). I'm sure there are many good suggestions to be found here.
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
OK, I am providing the details below.
My cursor query is:
Code:
SELECT COL1, SUM(COL2), SUM(COL3)
FROM <tablename>
WHERE COL4 = 'O' AND
      COL5 BETWEEN <range>
GROUP BY COL1
The OPEN of the cursor will create a huge result table. I need to FETCH each row and write the content to an output file.
I am not doing any table update.
Thanks,
Asif
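A minimal sketch of how such a read-only reporting cursor is often declared to keep the overhead down; FOR FETCH ONLY tells DB2 there will be no positioned updates, and WITH UR (uncommitted read) is only an assumption here, appropriate only if dirty reads are acceptable for this report:
Code:
DECLARE RPT_CSR CURSOR FOR
    SELECT COL1, SUM(COL2), SUM(COL3)
      FROM <tablename>
     WHERE COL4 = 'O' AND
           COL5 BETWEEN <range>
     GROUP BY COL1
    FOR FETCH ONLY
    WITH UR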
dick scherrer
Moderator Emeritus
Joined: 23 Nov 2006 Posts: 19243 Location: Inside the Matrix
Hello,
For my $.02, you'll be better off with the unload rather than the SQL.
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
Oh, OK...
But there is also the concept of TEMPORARY tables, which I think is used in cases where the result table is large.
Can you shed some light on this...
Thanks,
Asif
DavidatK
Active Member
Joined: 22 Nov 2005 Posts: 700 Location: Troy, Michigan USA
Global temporary tables can be very useful, but they do have limitations on size, dependent on the site setup.
When you say the result table will be large, what are you talking about? How many rows do you expect, and what will the structure be? i.e., how many total bytes will the table take?
I find that I can get more rows in a cursor than I can with a temporary table.
In the example that you posted, it looks like the only variable in the cursor is the 'range'. For each record you read from your input file, the range is going to change, forcing a new cursor?
Please explain some more. (I know that sometimes when I try to explain something, especially to my wife, I think it's crystal clear because I understand what I'm saying, but she doesn't understand. I have to explain in basic, complete terms.)
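For reference, a minimal sketch of a declared global temporary table; the name and column layout are assumptions chosen to match the kind of summary being discussed, and the space available for it depends on the site's TEMP/work-file database setup:
Code:
DECLARE GLOBAL TEMPORARY TABLE SESSION.QTR_TOTALS
      (CUST_ID  CHAR(10)       NOT NULL,
       TOT_REV  DECIMAL(15,2)  NOT NULL)
 ON COMMIT PRESERVE ROWS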
dick scherrer
Moderator Emeritus
Joined: 23 Nov 2006 Posts: 19243 Location: Inside the Matrix
Hello,
Yes, temporary tables can be used. . . .
Keep in mind that if the data selected to populate the temporary table is massive, the overhead to insert the selected data into the temporary table will also be massive.
Then there will be the overhead to read it back from the temporary table.
If, on the other hand, you unload the data and process it as QSAM, it will use fewer resources directly, and indirectly it will keep this large process from impacting the rest of the database environment.
While database/SQL is our very good friend, it isn't a good answer for everything.
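A minimal sketch of the two steps Dick describes (populate the temporary table, then read it back), using the hypothetical SESSION.QTR_TOTALS from the earlier sketch; :M1 and :M3 are assumed to be host variables holding the quarter's first and third months:
Code:
-- step 1: the massive insert Dick mentions
INSERT INTO SESSION.QTR_TOTALS (CUST_ID, TOT_REV)
    SELECT CUST_ID, SUM(REV)
      FROM <tablename>
     WHERE MONTH BETWEEN :M1 AND :M3
     GROUP BY CUST_ID

-- step 2: the overhead of reading it all back
DECLARE TMP_CSR CURSOR FOR
    SELECT CUST_ID, TOT_REV
      FROM SESSION.QTR_TOTALS
     ORDER BY CUST_ID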
DavidatK
Active Member
Joined: 22 Nov 2005 Posts: 700 Location: Troy, Michigan USA
I agree with you, Dick; using an unload of the table(s) can be much more efficient than going against the DB2 table if the percentage of rows used is high.
What I'm having trouble getting my arms around is why the cursor needs to be opened/closed for every input record being processed. If the cursor gives a different result each time, I'm not sure how he would process a flat file and get the same result. Opening/closing and processing the flat file for each input record would also incur heavy resource use.
I'm thinking that if we knew the whole specification and data structure, we could probably come up with a reasonably efficient logic.
William Thompson
Global Moderator
Joined: 18 Nov 2006 Posts: 3156 Location: Tucson AZ
DavidatK wrote:
What I'm having trouble getting my arms around is why the cursor needs to be opened/closed for every input record being processed. If the cursor gives a different result each time,
If the input file had different criteria for the subsequent select, wouldn't that require the close/open?
Quote:
1) Read a record from the input file.
2) For each record read, query the table and get the result table created. The result table will have multiple rows, so I need to use cursors.
3) In essence, for every record read from the file there has to be an OPEN of the cursor.
Asif Iqbal
New User
Joined: 17 May 2005 Posts: 27 Location: Pune, India.
Quote:
if we knew the whole specification and data structure, we could probably come up with a reasonably efficient logic
I am presenting the whole issue in a simplified form:
pardon me for the length :-)
1) The DB2 table has monthly revenue data for each customer, so it has columns like Cust-ID, MONTH, REV, etc. Primary key: 'Cust-ID + MONTH'.
It's not that there are 12 rows in the table for every customer. If customer 'A' has done business (REV > 0) in 4 months, then there will be only 4 rows for him, and so on.
2) The requirement is to prepare a report with the revenue totals of the last quarter for each customer.
3) My first approach was to use an input file as a list of all Cust-IDs: read it one by one, query the table using the Cust-ID from the file, process the result table to do the totals, and write the output (a sketch of this per-Cust-ID query follows below).
But this would have required as many OPEN-CLOSEs of the cursor as there are Cust-IDs in the input file (which is in the millions).
4) The other approach was to use the query below:
Code:
SELECT CUST_ID, SUM(REV)
FROM <tablename>
WHERE MONTH BETWEEN M1 AND M3
GROUP BY CUST_ID
M1 and M3 are the 1st and 3rd months of the last quarter.
Quote:
When you say the result table will be large, what are you talking about? How many rows do you expect, and what will the structure be? i.e., how many total bytes will the table take?
The query would create a result table having millions of rows (one per Cust-ID).
Each row will be 52 bytes.
5) The next approach is to use an unload file of the table: do some initial SORTing, read it sequentially in the program, compute the quarter totals for each Cust-ID, and write the output.
Quote:
I'm not sure how he would process a flat file and get the same result. Opening/closing and processing the flat file for each input record would also incur heavy resource use.
This approach would require only one open/close of the file.
So this is the whole story; I hope I could make myself clear.
Waiting for your replies...
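For approach 3, since only a quarter total is needed per customer, the per-record query could even be a singleton SELECT INTO rather than a cursor; a minimal sketch, with host-variable names assumed:
Code:
SELECT SUM(REV)
  INTO :HV-QTR-REV
  FROM <tablename>
 WHERE CUST_ID = :HV-CUST-ID
   AND MONTH BETWEEN :M1 AND :M3
An indicator variable would be needed for customers with no rows in the quarter (SUM returns NULL), and it still means one SQL call per Cust-ID, so the comparison with approaches 4 and 5 still stands.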
dick scherrer
Moderator Emeritus
Joined: 23 Nov 2006 Posts: 19243 Location: Inside the Matrix
Hello,
I'd suggest the unload and sort, and then matching your list of Cust-IDs against the unloaded/sorted data.
When you unload and sort the data, use a VERY large block size. While you'll only need one open/close of the flat files, using a large block size will dramatically reduce your run time.
This will far outperform the SQL alternatives.
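A minimal sketch of what the unload step's SELECT could look like if a program such as DSNTIAUL is used, with the column names used earlier in the thread and M1/M3 standing for the quarter's first and third months; whether to ORDER BY in the unload or leave the ordering to a separate SORT step, as Dick suggests, is a site/runtime trade-off:
Code:
SELECT CUST_ID, MONTH, REV
  FROM <tablename>
 WHERE MONTH BETWEEN M1 AND M3
 ORDER BY CUST_ID
Once the flat file is in Cust-ID order, the program can read it sequentially and accumulate the quarter totals with a simple control break on Cust-ID.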