Can this be done with DFSORT?

Claes Norreen · Posted: Thu Apr 16, 2015 1:00 pm

Hi,

I've got the following requirement, that I'd love DFSORT to do for me - although I realize I may need to use an application program..

Basically, I've got a list of streets (identified by an unique id) and housenumbers on that street. My requirement is to make a list of streets with housenumber intervals. Also, a record must appear for even houseno and another for odd. The trick being, that "holes" in the housenumbers may appear. Take these examples:

Input:

Bill Woodger · Posted: Thu Apr 16, 2015 2:30 pm

DFSORT can check the bits for odd/even :-)

How much data is there?

Is it safe to assume that there are no duplicate house-numbers (like for apartment blocks) or compound house-numbers like 3-5 (for a building which has consumed multiple adjacent numbers)? No sub-divided house-numbers, like 14, 14A, 14B for an old building sub-divided into apartments? No "house names" in place of the number (like Dun Roamin, Bill's Gaff)?

Another way to put it, all correct, unique, numbers?

Are they left-aligned? What is the maximum number that can exist?

There are several ways to do it (almost certainly). Do you want for performance, clarity, or something else?

Claes Norreen · Posted: Thu Apr 16, 2015 2:37 pm

Hi Bill,

Thanks for your fast reply! (thought it was night in the US now?)

Well, it's just Danish addresses, so there's not a whole lot of data - so clarity first, I guess

There CAN be duplicate numbers - but only because of a letter (separate field) - nothing else. So we can have 14A, 14B and so on. We could remove duplicates easily using SELECT FIRST.

Housenumbers in Denmark are numeric 3 - letters (separate field) A-Z.

Looking forward to see some more!

RahulG31 · Active User Joined: 20 Dec 2014 Posts: 446 Location: USA

I believe this could be done but not that we should.

per me, it would involve multiple sort steps (and a decent amount of logic as well).

This is what I thought of doing:

1. Separate odd/even numbers in separate files by checking the last bit (as stated by bill). Let's consider the odd numbers file has input like this:
01
03
07
11

2. Take this file and see if the difference in consecutive numbers is greater than 2. There will be a separate step for doing this using JOINKEYS to get records like below:
01
03
07 A
11 A

3. Put the file in reverse order and do as in step 2 to get:
11 B
07 B
03 B
01

4. If the records are merged again with JOINKEYS then we would get:
01 A
03 B
07 A B
11 A B

5. 'A' signifies start of group and 'B' signifies end of group. If you PUSH what is present in records with 'A' (i.e. 01 for first record and so on..), you should get something like this:
01 A 01
03 B 01
07 A B 07
11 A B 11

6. A simple BUILD on this will give you the required numbers.

So, it looks to me that writing an application program would be much simpler.

I am curious to know a simpler solution. Waiting for reply from Bill. :-)

.

Claes Norreen · Posted: Mon Apr 20, 2015 1:21 pm

RahulG31,

Thanks for your reply.

However, I don't understand what you are trying to do - can you put some code for it maybe?

RahulG31 · Active User Joined: 20 Dec 2014 Posts: 446 Location: USA

Claes,

I am trying to create a group for the data in vertical format, so that you can get your desired output in horizontal format.

I am marking 'A' for the start of the group and 'B' for the end of group.
Where 'A' and 'B' both are present, that means that is the only element in the group e.g. 7 and 11 in my sample data.

The points I mentioned, are part of the idea on how to do it.

1. To identify odd/even:

Rohit Umarjikar · Posted: Tue Apr 21, 2015 4:31 am

I think, this can be done with DFSORT with the experts help as seen. But the point is why to make things more complex than already it is unless there is no COBOL in your shop? SO, it is easy to write a COBOL program for such complex requirements and also easy to maintain. A new person would go crazy if he is not a DFSORT expert

Also , a SAS card could be easy to achieve this in ase you are running out of choice.

Bill Woodger · Posted: Tue Apr 21, 2015 5:16 am

Claes,

Here's some code.

Claes Norreen · Posted: Tue Apr 21, 2015 12:07 pm

Wauw, this is brilliant, Bill! Thanks a lot!

I will try to grasp what is going on now..

Also want to thank you, RahulG31. After your 2nd post, I understood what you were trying to do - which wasn't that far from what Bill came up with.

Claes Norreen · Posted: Tue Apr 21, 2015 12:23 pm

Oh, I tried this:

Bill Woodger · Posted: Tue Apr 21, 2015 12:33 pm

Thanks Claes,

I suspect it can be fixed :-)

I probably can't look for about an hour, but I'll update it here.

Claes Norreen · Posted: Tue Apr 21, 2015 2:59 pm

Thanks, Bill

I've detected the flaw to be somewhere here:

Bill Woodger · Posted: Tue Apr 21, 2015 5:12 pm

Yes, thanks. That is the place. The problem is with consecutive "breaks" in the range within a street. Causes the GROUP to have acted on the previous record and then the GROUP acts again before the data from the previous record can be used.

A sequence number on the GROUP then store in different locations depending on the sequence number being odd/even would do it. On it now :-)

RahulG31 · Active User Joined: 20 Dec 2014 Posts: 446 Location: USA

Thanks for your thoughts Bill. I really never thought that this was achievable in a single step. :-)

Bill Woodger · Posted: Tue Apr 21, 2015 6:34 pm

Claes,

A couple of changes, so easiest to provide the entire OUTREC:

Bill Woodger · Posted: Tue Apr 21, 2015 6:42 pm

RahulG31,

That's why I'm recommending you master it and add it to your store of SORT tools :-)

Bill Woodger · Posted: Tue Apr 21, 2015 6:45 pm

Forgot to mention this little trick. If your OUTFIL is not presenting all the data from the input to it (because it is redundant) as is common with this type of solution, include something like this:

Claes Norreen · Posted: Tue Apr 21, 2015 6:50 pm

Thanks, Bill. It looks very promising!

I'll look at it in depths tomorrow, as I've already been here much too long today...

chandan.inst · Posted: Wed Apr 22, 2015 8:55 am

Awesome Bill.. As per RahulG31 I also never thought its doable with single step.

But yes we have DFSORT masters like you

Claes Norreen · Posted: Wed Apr 22, 2015 12:06 pm

Claes Norreen · Posted: Wed Apr 22, 2015 4:59 pm

Hi Bill,

I've now done a real test against the Danish streets. To do this, I first made the Streetcode 8,ZD, and the Housenumber 3,ZD - using your brilliant symbol file, that was a walk in the park..

Now, there's an issue with my collegues street, Anyvej (we do call it Anyway

).. There are some strange holes on this street. Here's the input:

Bill Woodger · Posted: Wed Apr 22, 2015 6:13 pm

Hi Claes,

I made that street number 7, and used only two-digit house numbers and got this:

Claes Norreen · Posted: Wed Apr 22, 2015 7:20 pm

Hi Bill,

I've got the exact same symbol file - except that I've defined

Bill Woodger · Posted: Wed Apr 22, 2015 7:38 pm

That data was the same as I had. I ran it anyway, and it produced the same results as I showed previously:

Bill Woodger · Posted: Wed Apr 22, 2015 7:49 pm

Using IFOUTLEN=61 didn't affect the results.

I'm wondering if something went wrong when you re-indented?

Here's the cards I'm running after the symbol conversion: