Exploiting z16 performance

jzhardy · Posted: Tue Aug 08, 2023 3:58 pm

I had a day off work recently and, being something of a nerd, decided to run some performance tests on a z16.

The task was to convert 256 bytes of text to upper case. The four tests were:

1. Third-party library function, generated in Trace.
2. As above, but generated in non-Trace.
3. COBOL, using the intrinsic function UPPER-CASE.
4. An HLASM module I wrote to exploit the SIMD (vector) instruction set. I cheated slightly by bulk loading INSTR into V0-V15.
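
For reference, test 3 amounts to something like the minimal sketch below. The program name, field names and loop count are hypothetical, chosen only for illustration, and this is not the harness used for the timings.

```cobol
       IDENTIFICATION DIVISION.
       PROGRAM-ID. UPCASE3.
       DATA DIVISION.
       WORKING-STORAGE SECTION.
      * Hypothetical names. 256 bytes matches the size under test.
       01  WS-TEXT        PIC X(256) VALUE 'mixed Case text'.
       01  WS-LOOP        PIC 9(9) COMP.
       PROCEDURE DIVISION.
      * Repeat the conversion so the elapsed time is measurable.
      * The loop count is an arbitrary assumption.
           PERFORM VARYING WS-LOOP FROM 1 BY 1
                   UNTIL WS-LOOP > 1000000
               MOVE FUNCTION UPPER-CASE(WS-TEXT) TO WS-TEXT
           END-PERFORM
           DISPLAY WS-TEXT(1:15)
           GOBACK.
```

After the first pass the field is already upper case, so later iterations convert unchanged data; a real measurement might reload the input on each pass.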

The test harness was written in COBOL and followed the form:

Allan Winston · Posted: Wed Aug 09, 2023 4:18 am

Regarding whether or not advanced processor features are exploited by the COBOL compiler for this specific code fragment, I would recommend using the LIST compiler option in conjunction with various settings of the ARCH compiler option. While an ARCH value of 14 targets the z16 (and exploits the features of all preceding processors), it may well be that a lower ARCH value is all that is needed to generate the fastest code.
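
One way to try this, sketched under the assumption that the installation allows these options to be overridden in the source, is a CBL (PROCESS) statement ahead of the program. The program name is hypothetical.

```cobol
       CBL LIST,ARCH(14)
      * LIST adds the generated code to the compiler listing.
      * ARCH(14) asks for z16-level instructions. Recompile with
      * lower ARCH values and compare listings and timings.
      * UPCASE3 is a hypothetical program name.
       IDENTIFICATION DIVISION.
       PROGRAM-ID. UPCASE3.
```

The options actually in effect are reported in the listing, so it is worth checking there that the requested ARCH value was honoured.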

jzhardy · Posted: Wed Aug 09, 2023 5:36 am

Interesting - when I checked the listing, the ARCH level in effect was 7, not 14 as specified in my PARM parameter.

I suspect this is because I was using CWPCMAIN (the Xpediter compiler), which seems to override ARCH. It is possibly a local environment setting rather than a limitation in Xpediter.

I'll go back to IGYCRCTL when I get some time ...