Volume chain using DFSORT

MFsys · New User Joined: 16 Feb 2018 Posts: 3 Location: India

Hello,

I am trying to solve a problem using DFSORT and wondering if it is even possible. I am able to achieve this using a REXX program, however, due to the volume of data, wondering if DFSORT could do it faster.

There are two columns containing TAPE volser for a dataset, which can extend on multiple tape volumes, but not in any particular order. So I need to find the head, tail and basically the entire chain. All I know is that head of any chain can only exist in first column with the next volume in the adjacent column.

The problem is similar to the one described in the question below.
ibmmainframes.com/viewtopic.php?t=58800&highlight=chain

Below is a small snippet of how the data might look like:

1ST VOL NEXT VOL
A00003 B00002
B00002 A00001
E00003 F00005
F00005 A00007
D00002 D00001
A00007 E00004
Expected Output

A00003 B00002 A00001
E00003 F00005 A00007 E00003
D00002 D00001

Joerg.Findeisen · Posted: Tue Apr 11, 2023 11:57 pm

I don't think any SORT product is a good choice for your requirement.

sergeyken · Posted: Wed Apr 12, 2023 12:13 am

With SORT it can be done only for a reasonably limited length of the chain.

In case you do not have 100,000-100,000,000 of volume names, using SORT in your case would give no benefits except disadvantages.

MFsys · New User Joined: 16 Feb 2018 Posts: 3 Location: India

Thanks Joerg and sergeyken for the inputs. After having dig through the DFSORT manuals for possible solutions, I too felt the same (and learnt a lof of new things along the way), but obviously couldn't get to a solution (hence this post). Can you suggest any possible approaches you would take to solve it, if it MUST BE done using DFSORT.

And yes, the list of volumes in this case is quite high, roughly 400k on just one system.

Joerg.Findeisen · Posted: Wed Apr 12, 2023 1:20 pm

Somehow your input data seems to be strangely sorted (or non-sorted).

MFsys · New User Joined: 16 Feb 2018 Posts: 3 Location: India

Yes, the sample data I showed up was not sorted. I just made it up to bring forward the point that it is not necessary that A0001 may link to A0002 for example.

What you have presented above is correct.

I think you are right about the technical limitation of upto 255 volumes, I would have to look it up. But in this scenario, based on my analysis using REXX, the maximum depth of the tapes I found was about 10. I am not really sure if it would be a good idea to assume that as a limit (or maybe a little more, say 15), as long as the solution is dynamic and can be adapted for higher limit in future, I would want to learn about it.

sergeyken · Posted: Thu Apr 13, 2023 6:59 am

The major trick is: how to use the same JOIN group for any number of join steps? Otherwise the total number of SORT control statements would exceed the size of input data

Rohit Umarjikar · Posted: Thu Apr 13, 2023 10:36 am

Not sure if this helps to explore or try options to get formatted volume numbers .
www.ibm.com/docs/en/zos/2.2.0?topic=execs-tailoring-edgjrpt-sample-jcl

Joerg.Findeisen · Posted: Thu Apr 13, 2023 11:18 am

@Rohit: One does not know if the source is DFRMM or CA1 (maybe something else).

Joerg.Findeisen · Posted: Thu Apr 13, 2023 11:25 am

The ICETOOL solution looks really nice, but wouldn't it be quite a bit of a SORT excess? Well, the other solution would have the opposite of IFTHEN excesses for what I have tried. I don't know if it's worth to be shared yet.

sergeyken · Posted: Thu Apr 13, 2023 4:49 pm

sergeyken · Posted: Thu Apr 13, 2023 5:11 pm

With ICETOOL/SYNCTOOL some performance improvements can be done if needed. Like this one:

Joerg.Findeisen · Posted: Thu Apr 13, 2023 5:22 pm

The bottleneck here is that the input must be in correct order for what I have attempted. I would like to read some comments if I am on the right path or not. It's just what came into my mind solving the requirement.

sergeyken · Posted: Thu Apr 13, 2023 6:02 pm

Joerg.Findeisen · Posted: Thu Apr 13, 2023 6:28 pm

I have not yet looked deeper into pre-sorting stuff before our both approaches come into place. Maybe you have an idea that can be deployed? I am afraid to not have a good solution at the moment for that (possible) issue.

sergeyken · Posted: Thu Apr 13, 2023 10:32 pm

Joerg.Findeisen · Posted: Thu Apr 13, 2023 11:40 pm

Thank you for your reply. Let's assume the input is in the correct order.

In the wild, the situation is in general more complex with multivolume and multifile chains. Not really fun to work with from my experience.

sergeyken · Posted: Fri Apr 14, 2023 9:00 pm