Merging records, but not all the time

prino · Posted: Sat Feb 17, 2018 4:20 pm

Sample of input file, lrecl=121:

sergeyken · Posted: Sun Feb 18, 2018 8:10 am

Which of ASA control characters to recognize?

For instance, '-' in lines 3, 7. ... is one of control characters.
As well as '+' in line 3 from bottom is control character, too.

If only '1' is considered it seems to be simple. Otherwise more specification details are needed.

sergeyken · Posted: Sun Feb 18, 2018 8:58 am

FYI:

prino · Posted: Sun Feb 18, 2018 10:24 pm

Oops, mea culpa, mea maxima culpa!

The current output file, like all others, is FBA with an LRECL=121. However, the '-' that is, among others, present in line 3 of the first sample is not an ASA control character, but part of the to-be-merged line of data. The only characters that should be exempted from the merge process are the '1' Form Feed characters, the minus'es on lines 3, 7 & 18 are data. The '+' on line is the result of an erroneous Cut&Paste, it should have been preceded with a space!
In other words a '1' form feed al always followed by 2n lines of data, of which the "even" lines need to me merged with the preceding "odd" lines, where the first "odd" line is the first line after the line with the '1' ASA character.

expat · Posted: Mon Feb 19, 2018 12:48 pm

Ages since I've played with sort, but could you not just EXCLUDE records with 1 in col 1 during the process ?

Arun Raj · Posted: Tue Feb 20, 2018 9:55 am

I tried to rearrange the sample input manually to get to the expected output and ended up in this (looks like an extra output record).

prino · Posted: Tue Feb 20, 2018 4:13 pm

Arun Raj · Posted: Tue Feb 20, 2018 7:07 pm

Arun Raj · Posted: Tue Feb 20, 2018 10:00 pm

Can you try something like this? Did not give much thought into it, and probably can make this better if I find some time later.

prino · Posted: Wed Feb 21, 2018 1:47 am

Thanks, I'll have to open the Tricks manual and check out what's happening, and how to adapt it to what I need.

I'll get back if I'll get stuck.

prino · Posted: Wed Feb 21, 2018 2:44 am

Just a hint for others:

prino · Posted: Wed Feb 21, 2018 3:58 am

Just another omission in the original and updated problem statement, there are a number of "section separator lines", but the following:

Arun Raj · Posted: Wed Feb 21, 2018 8:02 am

You might want to add an IFOUTLEN parameter to the OUTFIL (as shown in my example) to limit the output record length. From a quick look I believe your OUTREC is extending the record length to 189 and then the FTOV adds the RDW to it to make it 193.

prino · Posted: Wed Feb 21, 2018 5:16 pm

Nic Clouston · Posted: Wed Feb 21, 2018 7:30 pm

1 byte for the ASA character - possibly?

Arun Raj · Posted: Wed Feb 21, 2018 10:28 pm

prino - If the output is written to a data set you should see the expected results. DFSORT messages in SYSOUT would show the same LRECL in both the cases.

I ran a small test and when the output is routed to SYSOUT, regardless of DFSORT/IDCAMS/IEBGENER, I 'see' the extra one byte in SYSOUT output.

sergeyken · Posted: Thu Feb 22, 2018 10:43 pm

prino · Posted: Fri Feb 23, 2018 3:22 am

At some stage I might add an IFOUTLEN= statement, but right now I've got another far more serious problem, which has nothing to do with SORT (and cannot be solved by it).

The output file, after a further bit of post-processing to convert UTF8 into RTF "\uNNNN" escapes, does not display correctly for CJK characters (not in M$ Word, not in LO Writer) due to the facts that

the "Courier New" font does not contain CJK characters,
the substitution fonts (different for Word & Writer) have wholly incompatible font metrics,
LO Writer has a font-reset bug, making the layout even worse