Frequent -911 abends

Rijit · Active User Joined: 15 Apr 2010 Posts: 168 Location: Pune

In my project we are getting frequent contention problems, with jobs failing with -911. The temporary fix I use is to wait for the contending job to complete and then restart the failed job.

But I am looking for a permanent solution. Seniors, please tell me how I can resolve these types of issues. Surprisingly, some of the SELECT queries are also failing with -911.

dick scherrer · Posted: Sun Mar 13, 2011 11:51 am

Hello,

There is a design problem. . .

Without changing the design (which may or may not completely resolve this), there needs to be better scheduling. Do not allow these to be running at the same time.

Rijit · Active User Joined: 15 Apr 2010 Posts: 168 Location: Pune

Hmm, I thought of that too, but the contending jobs run multiple times a day. How do we handle these cases in scheduling? For example, we have two jobs, A and B, both updating table C. Job A runs 6 times a day and job B runs every 2 hours. I am worried that a scheduling fix may mess things up a little.

The other area where I don't have much knowledge is the locking involved for the tablespaces. Why should a SELECT statement fail? The SELECT statement in the contending program did not have "WITH UR" in the query, though.

dick scherrer · Posted: Mon Mar 14, 2011 2:27 am

Hello,

Ronald Burr · Posted: Mon Mar 14, 2011 3:08 am

Much depends on how long either job runs when it does run successfully.
If both jobs only run for a minute or two, setting negative dependencies on both jobs in the scheduler should suffice ( a negative dependency is one in which both jobs are flagged in such a way that the job scheduler will not release one job if the other has already been released but has not yet ended ). Sure, one job may have to wait a minute or two, but it will avoid the -911's.
If either job runs for an extended period, it may be possible that ONE of the two could have its queries changed to specify FOR FETCH ONLY WITH UR - that would avoid contention, though doing so at the risk of the WITH UR job not having the most up-to-date table rows, especially those being simultaneously manipulated by the other job.
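
To make the negative dependency idea concrete, here is a minimal sketch in Python. It is not how any particular job scheduler implements it; the class and method names (NegativeDependency, try_release, ended) are hypothetical and only illustrate the rule "do not release one job while the other has been released but has not yet ended".

```python
import threading


class NegativeDependency:
    """Hypothetical helper: at most one job of a mutually exclusive pair runs at a time."""

    def __init__(self):
        self._lock = threading.Lock()
        self._running = None  # name of the job currently released, if any

    def try_release(self, job):
        """Return True if the job may start now, False if the other job has not ended."""
        with self._lock:
            if self._running is not None:
                return False
            self._running = job
            return True

    def ended(self, job):
        """Mark the job as ended so that the held job can be released."""
        with self._lock:
            if self._running == job:
                self._running = None


dep = NegativeDependency()
print(dep.try_release("A"))  # job A is released
print(dep.try_release("B"))  # job B is held while A is running
dep.ended("A")
print(dep.try_release("B"))  # job B can be released once A has ended
```

A real scheduler would keep the held job queued and release it when the other one ends, rather than just returning False.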

don.leahy · Posted: Mon Mar 14, 2011 6:56 am

Suggest you also look at the checkpoint or commit frequency of each process.
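
A rough sketch of what a commit interval with a restart checkpoint can look like, assuming a batch job that updates rows in key order. It is written in Python against sqlite3 only so that it runs anywhere; the table names (c, restart), the job name 'A', the interval of 100 and the INSERT OR REPLACE syntax are all assumptions for illustration, not DB2 specifics. The idea is that shorter units of work hold their locks for less time, and saving the last processed key in the same unit of work lets a rerun resume after the last commit.

```python
import sqlite3

COMMIT_EVERY = 100  # hypothetical commit interval; tune it for the workload


def make_demo_db(rows=250):
    """Build an in-memory stand-in for table C plus a restart (checkpoint) table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE c (id INTEGER PRIMARY KEY, val INTEGER)")
    conn.execute("CREATE TABLE restart (job TEXT PRIMARY KEY, last_key INTEGER)")
    conn.executemany("INSERT INTO c VALUES (?, 0)", [(i,) for i in range(rows)])
    conn.commit()
    return conn


def run_batch(conn, keys, commit_every=COMMIT_EVERY):
    """Update rows in key order, committing every `commit_every` rows.

    The last processed key is written in the same unit of work as the
    updates, so a rerun skips everything up to the last committed key.
    Returns the number of commits issued.
    """
    cur = conn.cursor()
    row = cur.execute("SELECT last_key FROM restart WHERE job = 'A'").fetchone()
    last_key = row[0] if row else -1
    pending = 0
    commits = 0
    for key in sorted(k for k in keys if k > last_key):
        cur.execute("UPDATE c SET val = val + 1 WHERE id = ?", (key,))
        cur.execute(
            "INSERT OR REPLACE INTO restart (job, last_key) VALUES ('A', ?)", (key,)
        )
        pending += 1
        if pending == commit_every:
            conn.commit()  # ends the unit of work and releases its locks
            commits += 1
            pending = 0
    if pending:
        conn.commit()  # commit the final partial interval
        commits += 1
    return commits
```

For example, `run_batch(make_demo_db(), range(250))` issues three commits (100, 100 and 50 rows), and calling run_batch again on the same connection does nothing because the checkpoint is already at the last key.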

Rijit · Active User Joined: 15 Apr 2010 Posts: 168 Location: Pune

Anuj Dhawan · Posted: Mon Mar 14, 2011 3:37 pm

dick scherrer · Posted: Mon Mar 14, 2011 8:25 pm

Hello,

Rijit · Active User Joined: 15 Apr 2010 Posts: 168 Location: Pune

Got it, thanks!

Rijit · Active User Joined: 15 Apr 2010 Posts: 168 Location: Pune

Scheduling is definitely a better option, but I would also like to explore other options such as tuning the queries.

I saw that many SELECT queries are also abending with -911, and that is a concern for me. If we change the isolation level of the SELECT queries to "WITH UR", do we run the risk of fetching wrong data? I read that isolation level UR permits uncommitted reads. How do we make sure that it does not pick up uncommitted data?

Secondly, I have come across situations where the update job failed because the tablespace was in read-only status, when ideally it should be in RW status. Any idea what may have altered the tablespace status? I am not aware of how the tablespace status gets changed. Maybe the DBA has something to do with that!

dick scherrer · Posted: Tue Mar 15, 2011 2:05 am

Hello,

Rijit · Active User Joined: 15 Apr 2010 Posts: 168 Location: Pune

Is it a good idea to reduce the COMMIT frequency for minimizing the possibility of a deadlock?

dick scherrer · Posted: Wed Mar 16, 2011 2:04 am

Hello,

Not if there is no way to "get back to where you were". How many commits were issued won't matter if some of the data is lost or corrupted.

I believe the search for a "magic bullet" should cease and the application be corrected. . .

rocky_balboa · Posted: Thu Mar 17, 2011 12:18 pm

Find out with the help of your DBAs whether lock escalation is causing the entire table to be locked because of insufficient memory for new locks. Make sure you aren't issuing LOCK TABLE statements in any of your programs. You may need to revisit your commit frequency.

Also issue frequent commits even for read-only access, to release all your share locks.