priyankassharma
New User
Joined: 29 May 2007 Posts: 16 Location: mumbai
|
|
|
|
Hi,
Could somebody please help me understand why a considerable number of records would be missed while writing to a DASD file? The missing records belong to one block, which was dropped for some reason.
The records were read and processed in the application, so it seems to be an environment issue.
Thanks! |
|
William Thompson
Global Moderator
Joined: 18 Nov 2006 Posts: 3156 Location: Tucson AZ
|
|
|
|
Take a look at the EREP information. |
|
priyankassharma
New User
Joined: 29 May 2007 Posts: 16 Location: mumbai
|
|
|
|
Could you please suggest where I can get the EREP information?
Thanks! |
|
dbzTHEdinosauer
Global Moderator
Joined: 20 Oct 2006 Posts: 6966 Location: porcelain throne
|
|
|
|
Start with this link and then pick a reference guide and user manual, download them to your PC and do a little reading. It will take you a little time, the first time, but then you have added a valuable tool to your knowledge skills. |
|
dbzTHEdinosauer
Global Moderator
Joined: 20 Oct 2006 Posts: 6966 Location: porcelain throne
|
|
|
|
Quote: |
The records were read and processed in the application, so it seems to be an environment issue.
|
How do you know that they were read and processed? And what do you mean by 'environment issue' and why, since others were supposedly read and processed?
Is this 'missing block' (are you using IBM terminology?) first, last or in the middle?
You have provided little information about what transpired, but, unfortunately, you have conveyed a lot about your lack of understanding and experience with computers. I would be careful about using terminology that does not fit the situation.
If indeed this is a hardware problem, the EREP will help. But, I suspect you have a bug in your code OR incomplete or faulty JCL.
You should start by looking in the JES output of the JOB. |
|
priyankassharma
New User
Joined: 29 May 2007 Posts: 16 Location: mumbai
|
|
|
|
Thanks a lot .. will go thru it...
I believe it's going to help!
Thanks! |
|
aru_bec
New User
Joined: 23 Dec 2005 Posts: 19 Location: Bangalore
|
|
priyankassharma
New User
Joined: 29 May 2007 Posts: 16 Location: mumbai
|
|
|
|
Thanks for the info! But the case mentioned above does not fit my scenario.
I checked the JES output first but didn't get any useful information from there.
The missing block is from the middle of the file. Some 57 K records were missed.
I tried generating EREP reports, but the job didn't run because access to the SYS1.LOGREC dataset was denied. I then tried copying SYS1.LOGREC to a test qualifier to run the job; this time the job ran, but TOURIST reported that the dataset does not have valid attributes.
Thanks! |
|
dick scherrer
Moderator Emeritus
Joined: 23 Nov 2006 Posts: 19243 Location: Inside the Matrix
|
|
|
|
Hello,
How many records does the input file contain?
As was asked before, how are you sure that the records were processed when the file was read? How do you know they were actually written to the output file?
How many records are in the output file?
What does the JES output from the problem job show? What (if any) diagnostic information was provided?
Can a sort or some other program successfully read the file that is missing the 57k records?
Has this been re-run since the problem run?
If you explain the process that "lost" the records as well as the jcl for the problem job, we may be able to offer suggestions. |
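One way to check whether another program can read the file is to count its records independently of the application's own counters. The following is only a minimal Python sketch, assuming the dataset has been copied off the host in binary with each record's 4-byte RDW preserved and any block descriptor words already stripped. Neither assumption comes from this thread, and `count_vb_records` is a hypothetical helper name.

```python
import io
import struct


def count_vb_records(stream):
    """Count records in a binary stream of RDW-prefixed variable-length records.

    Assumes each record starts with a 4-byte record descriptor word:
    a 2-byte big-endian length (which includes the 4 RDW bytes) followed
    by 2 bytes that are zero for unspanned records.
    """
    count = 0
    while True:
        rdw = stream.read(4)
        if not rdw:
            return count  # clean end of data
        if len(rdw) < 4:
            raise ValueError(f"truncated RDW after {count} records")
        length, _segment = struct.unpack(">HH", rdw)
        if length < 4:
            raise ValueError(f"invalid RDW length {length} after {count} records")
        payload = stream.read(length - 4)
        if len(payload) != length - 4:
            raise ValueError(f"truncated record after {count} records")
        count += 1


# Small self-check with three synthetic records (payloads of 3, 0 and 5 bytes).
sample = b"".join(
    struct.pack(">HH", len(p) + 4, 0) + p for p in [b"abc", b"", b"hello"]
)
print(count_vb_records(io.BytesIO(sample)))
```

On a real copy the stream would come from `open(path, "rb")`. If this count disagrees with the counter printed in SYSOUT, that points at the write path rather than at the read logic.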
|
priyankassharma
New User
Joined: 29 May 2007 Posts: 16 Location: mumbai
|
|
|
|
Hi ,
Please find below the answers to the queries you raised :
Hello,
How many records does the input file contain?
Input file contains 1146557 records.
As was asked before, how are you sure that the records were processed when the file was read? How do you know they were actually written to the output file?
We have a counter in the code that is incremented by one for each record read. Moreover, we came to know that data was missing by rerunning the job.
On comparing both outputs we noticed that the first 54 K records were written, then 57 K records were skipped, and then 567 K records were written.
No pattern was found in those 57 K missing records, i.e. the records were not skipped because of any logic inside the code.
How many records are in the output file?
621,653 in the file with the missing records and 678,739 in the run that gave the correct output. Moreover, in the SYSOUT the record counts for both runs were the same; the records were lost while writing to the o/p file in the first run.
What does the JES output from the problem job show? What (if any) diagnostic information was provided?
When comparing JESYSMSG in the JHS outputs, we see that the DA MSS counts for the outputs are different, e.g. 3608 in the first run and 9608 in the second run.
We didn't notice anything other than this going wrong.
Can a sort or some other program successfully read the file that is missing the 57k records?
We had a successful run thereafter with the same data and no modifications.
Has this been re-run since the problem run?
Yes, and we got the expected output, i.e. the same statistics that SYSOUT rendered through the counts for the records read and written to the o/p file.
If you explain the process that "lost" the records as well as the jcl for the problem job, we may be able to offer suggestions.
The file in question is an o/p file with the following definition:
OUTFILE1 DD DSN=&STAT.QW.QW456.OUTFIL1,
DISP=(NEW,CATLG,DELETE),
DCB=AMX.MODLDSCB,
RECFM=VB,
LRECL=10756,
BLKSIZE=0,
BUFNO=20,
UNIT=&SYSDA,
DATACLAS=&DCHUGE
The same o/p file is used as input in the next SORT step. For the erroneous run this step didn't run, as the job abended under the audit procedure because of the missing records.
Thanks! |
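The comparison described above (a good run and a bad run that differ by one contiguous block) can be sketched as follows. This is only an illustration in Python, not the method actually used here: `find_missing_block` is a hypothetical helper, and holding both files in memory as lists is a simplification that would need streaming for files of this size.

```python
def find_missing_block(good, bad):
    """Locate the single contiguous run of records present in `good` but not in `bad`.

    Returns (start, end) as indexes into `good` (end exclusive), or None
    if `bad` is not simply `good` with one contiguous block removed.
    """
    missing = len(good) - len(bad)
    if missing <= 0:
        return None
    # Walk the common prefix of the two outputs.
    start = 0
    while start < len(bad) and good[start] == bad[start]:
        start += 1
    # Everything after the gap in `good` must match the rest of `bad`.
    if good[start + missing:] != bad[start:]:
        return None
    return start, start + missing


# Scaled-down example: records 5..10 are dropped from the middle.
good_run = [f"rec{i:04d}" for i in range(20)]
bad_run = good_run[:5] + good_run[11:]
print(find_missing_block(good_run, bad_run))
```

With the counts quoted above, the gap would be 678,739 - 621,653 = 57,086 records, which matches the "57 K" figure. A `None` result would mean the loss is not one clean block, which makes a logic cause more likely than a single dropped physical block.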
|
dick scherrer
Moderator Emeritus
Joined: 23 Nov 2006 Posts: 19243 Location: Inside the Matrix
|
|
|
|
Hello,
Quite a mystery. . . .
There is no possibility that a different version of the code ran for the problem run?
Have the SMF records for that run been looked at? If there were temporary or permanent i/o errors, they should be recorded. If there were permanent i/o errors, though, you should have seen them without looking at SMF.
You might check the "volume" where the "bad" output file was created to see if there have been any errors reported against that part of the dasd.
It sounds like it failed only that one time and the failure cannot be repeated. Unless some "footprint" is discovered, i'm not sure what else you can do to diagnose.
My guess is that somewhere there was an issue with the code, but i don't know how to verify this.
We're here if more info is discovered. |
|
William Thompson
Global Moderator
Joined: 18 Nov 2006 Posts: 3156 Location: Tucson AZ
|
|
|
|
dick scherrer wrote: |
There is no possibility that a different version of the code ran for the problem run? |
I don't know about you guys, but, after the exhaustive testing and rerunning with no failure, I would have logged this one to "unknown" several days ago....
Did you ever have an application programmer come to you with a "system" problem, with a dump and a listing to prove it was a "system" problem, only to finally discover the compile did not match the dump? |
|
priyankassharma
New User
Joined: 29 May 2007 Posts: 16 Location: mumbai
|
|
|
|
Hi,
The whole purpose of the analysis is to find a permanent fix so that this problem, which had a major impact this time, can be avoided in future.
I am still looking for clues and am thankful to all of you for your valuable inputs.
I was just wondering if there is some way that I, as an application programmer, can generate an EREP report. As mentioned before, I have tried doing it but didn't have any success.
Do you have any suggestions on that part as well?
All I have looked into is JES; I do not know much about SMF. I will do some analysis in that regard and get back to you if I find anything.
Thanks Again! |
|
dick scherrer
Moderator Emeritus
Joined: 23 Nov 2006 Posts: 19243 Location: Inside the Matrix
|
|
|
|
You're welcome
Good Luck.
We're here when there is new info and/or questions. |
|
jasorn Warnings : 1 Active User
Joined: 12 Jul 2006 Posts: 191 Location: USA
|
|
|
|
I suppose it could be hardware, but my money is on a logic issue. Usually when I've encountered similar issues in the wild, the cause has been either an issue with the code itself or maybe some table that is used to determine how the program runs, or passed parms, program not compiled, etc. It doesn't look like the step of having sort or something try to read the 'bad' output file was performed.
And I've NEVER seen counters get incremented improperly.
I'll count this one as, "someone made a silly mistake and would rather it not get out". |
|
dbzTHEdinosauer
Global Moderator
Joined: 20 Oct 2006 Posts: 6966 Location: porcelain throne
|
|
|
|
Hardware problems do not usually pick on one program. If the OP had a hardware problem, sure-as-shoot'n, someone else did also. |
|