Joined: 07 Dec 2007 Posts: 2205 Location: San Jose
kitchu84,
The following DFSORT/ICETOOL JCL will give you the desired results. I assumed that the list file has the DSN in the first 44 bytes. I also limited the job to 1000 datasets, even though the maximum number of DD statements per job step is 3273. This job takes care of both FB and VB files.
The JCL is generated in step0200. Look at the output from SORTOUT of that step. It should have generated the JCL needed to count the records from each file. If the generated JCL looks good, then change the following statement and resubmit the job.
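For anyone who doesn't have the earlier post handy, a counting step for a single dataset can look roughly like the sketch below. This is only an illustration using ICETOOL's COUNT operator, not necessarily the exact JCL the generated job uses, and the DD and dataset names are placeholders:

Code:

//STEP0001 EXEC PGM=ICETOOL
//TOOLMSG  DD SYSOUT=*
//DFSMSG   DD SYSOUT=*
//* One input DD per dataset to be counted (placeholder DSN)
//IN001    DD DISP=SHR,DSN=YOUR.INPUT.DATASET
//* One output DD per dataset to receive the count record
//CT001    DD DSN=YOUR.COUNT.OUTPUT,DISP=(NEW,CATLG,DELETE),
//            UNIT=SYSDA,SPACE=(TRK,(1,1))
//TOOLIN   DD *
* Write 'dataset-name count' to CT001 for the dataset read via IN001
  COUNT FROM(IN001) WRITE(CT001) TEXT('YOUR.INPUT.DATASET') DIGITS(10)
/*

Note that each dataset in the list ends up needing two DD statements (the input and the count output), which is where the STOPAFT=1000 headroom discussed further down comes from.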
Thanks for the solution. I am currently trying to correct the creation of DD names in the dynamic JCL, as it was throwing this error:
JCP0427E DD NAME 'CT000CNTL' MUST BE 8 CHARACTERS OR LESS
JCP0427E DD NAME 'CT001CNTL' MUST BE 8 CHARACTERS OR LESS
JCP0427E DD NAME 'CT002CNTL' MUST BE 8 CHARACTERS OR LESS
JCP0427E DD NAME 'CT003CNTL' MUST BE 8 CHARACTERS OR LESS
JCP0427E DD NAME 'CT004CNTL' MUST BE 8 CHARACTERS OR LESS
Please let me know if there is a way to handle more than 3273 files. Do I need to write another dynamic JCL for that? ... Please suggest.
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
While we are back on this, remember some of the things we brought up before.
When you run, you'd like all your files to be from the same production run, but have no way to know this.
When the client gets his output, at least to start with, he's going to look at it. Then you're going to get queries. He's going to say "on this report from three weeks ago..." so make sure you can relate his copy of the report to one you can look at, and all the files.
What about periodic files?
What about starting with the "main" parts of the system, so you are concentrating on the important ones first (if he finds problems)?
How about you spend some time running with the production data before you give the client the first report, so you can check for anything "obvious" before he gets to see it.
It's not just producing the report, it is everything that goes with it.
Do you have a file archiver? When you re-run an old set of reports one time, it'll take hours to get everything back.
Please let me know if there is a way to handle more than 3273 files.
I wonder who is ever going to read the report...
anyway, the 3273 DD names is a JCL constraint.
You will have to analyze a bit and submit in multiple JCLs,
remembering that each dataset to be counted implies two DDs; that's the reason for the STOPAFT=1000.
to squeeze everything out of the JCL You could have used STOPAFT=1500,
but maybe 1000 is easier to remember.
With the proper DFSORT knowledge it will be possible to build, in one pass, as many jobs as needed, each one counting fewer than <somenumber> datasets.
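For the splitting part, a minimal sketch of one way to do it (the list and group dataset names are placeholders, and it assumes the usual 80-byte list file): DFSORT's OUTFIL SPLIT1R deals the list out in contiguous chunks, so each chunk can drive one generated counting job.

Code:

//SPLIT    EXEC PGM=SORT
//SYSOUT   DD SYSOUT=*
//SORTIN   DD DISP=SHR,DSN=YOUR.DSN.LIST
//GRP01    DD DSN=&&GRP01,DISP=(,PASS),UNIT=SYSDA,SPACE=(TRK,(5,5))
//GRP02    DD DSN=&&GRP02,DISP=(,PASS),UNIT=SYSDA,SPACE=(TRK,(5,5))
//GRP03    DD DSN=&&GRP03,DISP=(,PASS),UNIT=SYSDA,SPACE=(TRK,(5,5))
//SYSIN    DD *
* Copy the DSN list: records 1-1000 go to GRP01, 1001-2000 to GRP02,
* and everything from 2001 onwards to GRP03 (the last FNAMES dataset
* gets the leftovers). Add more GRPnn DDs/FNAMES to make more jobs.
  OPTION COPY
  OUTFIL FNAMES=(GRP01,GRP02,GRP03),SPLIT1R=1000
/*

Each GRPnn file would then feed the generation of one counting job, keeping every job comfortably under the DD limit.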
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
kitchu84 wrote:
[...]
Please let me know if there is a way to handle more than 3273 files. [...]
I missed that. I agree with enrico. If the client gets the report, you definitely won't get the queries until three weeks later (or however long), or until he gets his Excel macro "working", at which point you will get hit with everything daily. Most of the queries you get won't mean much to start with.
Remember, you don't know which file is from which production run. It is just likely that they will be from the same run; you won't know for certain.
How are you getting the list of files which is input for all this? Is there anything you can do to group them together "logically" so you can do some automatic number checking before the client can? In the end, you can give him that report as well, saving you from looking for errors in his Excel macro.
We will run this job periodically, every 30 mins, with the last 30 mins' SAR unload, and we will populate the data in DB2 tables for a particular job run with that particular job ID.
For example: if a job named ABCXXXXX ran with job ID JOB27909,
I will take unloads from SAR for the last 30 mins (the time will be controlled by a parm parameter in a COBOL module) and then I will filter out the file names for all the jobs along with their job IDs.
This will tell me specifically which run of the job had those files and what the count was. All this information will be loaded into a table along with a timestamp. The data from the table will be pulled through Java code and shown on URLs so that we can run some reports to know specifically, for a particular job on a particular date, what the file count was.
@enrico - Sorry I am not clear on this part:
"remembering that each dataset to be counted implies two dd, that' s the reason for the STOPAFT=1000,
to sqeeze everything out of jcl You could have used STOPAFT=1500"
Any pointers to handle more than 3273 files would be helpful.
Any pointers to handle more than 3273 files would be helpful.
did You care to read my previous reply completely?
no way with a single job...
depending on Your skills there might be a REXX alternative!
maybe it would be wiser to wait for Kolusu so that He may suggest how to build multiple Jobs in one pass
Joined: 07 Dec 2007 Posts: 2205 Location: San Jose
kitchu84 wrote:
I am currently trying to correct the creation of DD names in the dynamic JCL, as it was throwing this error:
JCP0427E DD NAME 'CT000CNTL' MUST BE 8 CHARACTERS OR LESS
Kitchu84,
As enrico pointed out, I was creating the JCL with CXXXCNTL and not CTXXXCNTL. You seem to have picked up Sqlcode's JCL, and I can't help you fix that.
kitchu84 wrote:
Please let me know if there is a way to handle more than 3273 files. Do I need to write another dynamic jcl for that ... Please suggest.
Just because I said the DD limit is 3273 doesn't mean your shop has the same limit. The limit of 3273 is based on the number of single unit DD statements for a 64K TIOT (task input output table). This limit can be different depending on the installation-defined TIOT size. 32K is the default TIOT size. The limit for a 32K TIOT is 1635. (In a JES3 system, the installation might further reduce the limit.)
kitchu84 wrote:
Sorry I am not clear on this part:
enrico-sorichetti wrote:
remembering that each dataset to be counted implies two dd, that' s the reason for the STOPAFT=1000,to sqeeze everything out of jcl You could have used STOPAFT=1500
The dynamic JCL being generated uses 1 DD name for the input DSN and the other (CXXXCNTL) for writing the count to the output file. So for every record in your list file, 2 DD names are used. If the limit is 3273, you would only be able to process about 3273/2 = 1636 datasets. Enrico rounded it down to 1500, leaving some buffer for other DD names.
However, you simply can't use STOPAFT=1500, because once you cross 999 entries the sequence number becomes 4 digits and the generated DD name would be longer than 8 characters.
Here is a JCL which will read up to a max of 26,000 DSNs and generate dynamic jobs with 1000 DSNs per job. Please don't come back and ask me how to generate JCL for more than 26,000.
I am showing you the submission of 10 jobs; however, you can extend it to 26 jobs, which will process a total of 26,000 DSNs. In order to do that, allocate the output files in step0200 from
JCOUNT01 thru JCOUNT26 and also add the control cards.
The JCL is generated in step0200. Look at the output from JCOUNTnn of that step. It should have generated the JCL needed to count the records from each file. If the generated JCL looks good, then change the following statement and resubmit the job.
If you're not familiar with DFSORT and DFSORT's ICETOOL, I'd suggest reading through "z/OS DFSORT: Getting Started". It's an excellent tutorial, with lots of examples, that will show you how to use DFSORT, DFSORT's ICETOOL and DFSORT Symbols. You can access it online, along with all of the other DFSORT books, from:
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
I'm not sure I have followed this correctly.
A)
1) You run the DFSORT mega-file-counter about every 30 minutes
2) You check on SAR for all jobs completed in that time
3) You extract Job info from SAR, a sub-set of dataset info from the mega-file-counter
4) You load everything into DB2 so that you know which file relates to which job that ran at a particular time
or
B)
1) You check on SAR for all jobs complete in last 30 minutes
2) Create extract file-of-files for the mega-file-counter from the complete jobs in SAR
3) Run the DFSORT mega-file-counter
4) You load everything into DB2 so that you know which file relates to which job that ran at a particular time
or
C)
Something else I've missed completely
If A), is the mega-file-counter going to run in less than 30 minutes? That is only 1800 seconds, and it seems you might have more than 6500 DDs to open/close and read. Plus more questions if A) is confirmed.
If B), why do you feel you need so many files? There will not be more than 3000 files created in a 30-minute window, will there? Plus more questions if B) is confirmed.
However, you simply can't use STOPAFT=1500, because once you cross 999 entries the sequence number becomes 4 digits and the generated DD name would be longer than 8 characters.
I will take unloads from SAR for the last 30 mins (the time will be controlled by a parm parameter in a COBOL module) and then I will filter out the file names for all the jobs along with their job IDs.
This will tell me specifically which run of the job had those files and what the count was. All this information will be loaded into a table along with a timestamp. The data from the table will be pulled through Java code and shown on URLs so that we can run some reports to know specifically, for a particular job on a particular date, what the file count was.
and since there will be quite a number of read-only datasets, You will keep wasting resources counting something that did not change
(as You know, You cannot determine just by looking at the JCL whether a dataset is input or output)
and what about the datasets on tape? heck of a mount activity, every half an hour
and what about the possible GDGs?
it would be wiser for the powers of Your organization to review the whole process
Also, since we need to run the jobs automatically through CA7: suppose we submitted 5 dynamic JCLs, each of them creating a different file with counts. I have a challenge to merge these and then use them in another job.
Say there are 5000 file names. Hence the main job creates 5 dynamic JCLs and submits them, each of which in turn creates one output file of file counts.
Now, since these dynamic JCLs are not defined in CA7, I need a way to put a dependency of all these jobs/output files on another final job which merges the 5 output files and then uses the data. I cannot put the dependency of the main job on the final job, because the final job might abend due to "file not found" if the dynamic JCLs are still running.
Also, is there a possibility that it might take the previous version of the file into consideration?
I am sorry if this isn't the right place to ask this query.
Joined: 07 Dec 2007 Posts: 2205 Location: San Jose
kitchu84 wrote:
I have a challenge to merge and then use it in another Job.
Well, I am not Bill, but since I was the one who provided you with the dynamic JCL solution, I am gonna answer it. You don't have a challenge; it is quite simple. Change my job to have the same job card on all the dynamically created JCLs, and no matter how many jobs you create, they will simply be queued up one after another. They will run sequentially, and you can use just 1 output file. You don't need another job to merge them. Before you ask: I am not going to help you with that. It is a simple change, and you should be able to do it now that you have the necessary framework.
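To make the idea concrete, a rough sketch of what that could look like (job name, dataset names, and step contents are placeholders; it assumes your installation lets JES serialize duplicate job names, which is the usual default):

Code:

//COUNTJOB JOB (ACCT),'FILE COUNTS',CLASS=A,MSGCLASS=X
//* Every generated job carries this identical job card, so JES will
//* not start two of them at the same time; they queue and run one
//* after another in the order they were submitted.
//STEP0001 EXEC PGM=ICETOOL
//TOOLMSG  DD SYSOUT=*
//DFSMSG   DD SYSOUT=*
//IN001    DD DISP=SHR,DSN=ONE.INPUT.DATASET
//* Every generated job points its count output at the same dataset.
//* DISP=MOD appends (allocate the dataset empty once, up front), so
//* when the last job ends, one dataset holds all the counts and no
//* separate merge job is needed.
//OUT      DD DISP=MOD,DSN=YOUR.COUNT.OUTPUT
//TOOLIN   DD *
  COUNT FROM(IN001) WRITE(OUT) TEXT('ONE.INPUT.DATASET') DIGITS(10)
/*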
kitchu84 wrote:
Also, is there a possibility that it might take previous version of the file into consideration?
You need to at least read the comments in the generated JCL. The 7th line of the generated JCL will have this comment:
Code:
//**********************************************************
//* DELETE THE OUTPUT COUNT DATASET IF EXISTED *
//**********************************************************
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
Skolusu wrote:
[...]
Well, I am not Bill
[...]
I don't think the whole thing was directed at me.
[...]
kitchu84 wrote:
We will run this job periodically, every 30 mins, with the last 30 mins' SAR unload, and we will populate the data in DB2 tables for a particular job run with that particular job ID.
For example: if a job named ABCXXXXX ran with job ID JOB27909,
I will take unloads from SAR for the last 30 mins (the time will be controlled by a parm parameter in a COBOL module) and then I will filter out the file names for all the jobs along with their job IDs.
This will tell me specifically which run of the job had those files and what the count was.
[...]
B)
1) You check on SAR for all jobs complete in last 30 minutes
2) Create extract file-of-files for the mega-file-counter from the complete jobs in SAR
3) Run the DFSORT mega-file-counter
4) You load everything into DB2 so that you know which file relates to which job that ran at a particular time
There is something I'm not getting.
The reason I want to know more exactly how you are doing it is that there are different problems for either route. Are you running the full mega-file-counter? If so, how many times a day? Do you run it once and the SAR extract multiple times?
Some problems are the same, whichever route. Like, how do you deal with a re-run from a previous day? Are your timestamps "logical" or actual - "logical" being so that you can identify files from the same batch runs, assuming that you will be running over midnight? Etc.
Do you have something of the actual design that you can share with us?
This type of error suggests that it's because WHEN=GROUP is not available in the current release we are using ... Could you please suggest an alternative?
Hi Bill: we will run the file counter every time after we run the SAR unloads. The SAR unloads run every 30 mins (for the entire 24 hrs). Regarding the rerun, the timestamps are actual ... we are planning to load the data at that particular instant of time. So say a job runs at 12:30 pm: the SAR unload running at 1 pm will pick up that job's details and we will run the file counter for the job. If the job runs again at, say, 2:50 pm with a different job ID, the next SAR unload will pick up the details of the job and the file counter will count the records at that instant of time, which will be loaded with that particular timestamp.
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
kitchu84 wrote:
[...]
Hi Bill: we will run the file counter every time after we run the SAR unloads. The SAR unloads run every 30 mins (for the entire 24 hrs). Regarding the rerun, the timestamps are actual ... we are planning to load the data at that particular instant of time. So say a job runs at 12:30 pm: the SAR unload running at 1 pm will pick up that job's details and we will run the file counter for the job. If the job runs again at, say, 2:50 pm with a different job ID, the next SAR unload will pick up the details of the job and the file counter will count the records at that instant of time, which will be loaded with that particular timestamp.
Sorry if I am not clear. Please let me know ...
Hi Kitchu84,
Sorry, but again, is this the full file counter that you are talking about? No, I'm not clear.
How long is the file counter going to take to run, every 30 minutes?
If the full run is running 48 times a day (theoretically), then 47 times for each dataset it is not needed (roughly speaking).
Are you using a SAR time loaded or the Job-finished time for your selection? If the latter, is there any "latency" between a Job finishing and appearing in SAR? If so, you will miss Jobs occasionally.
What DISP are your production jobs running with for output files, generally speaking? I get to this sort of point, and I think yet again I must be missing something crucial.
Joined: 09 Mar 2011 Posts: 7309 Location: Inside the Matrix
enrico-sorichetti wrote:
Bill, looks like we are just wasting time here, as does the TS's organization by counting things every half an hour
they made up their mind/bed, let them sleep in it
enrico, I get that feeling as well. Extracting job/file information from one source, but not using that to trigger the counts? So, mismatched timings for sure. Won't run inside 24 hours, won't get all the jobs updated, will lock production jobs, doesn't know about business days, mixes re-runs with current data. Etc. And uses the wrong sort package...