Assembler - Performance of Compare Instructions

Binop B · Posted: Mon Aug 01, 2011 12:31 pm

Hi All,

Its been sometime since I have been here... Hope everyone is doing good ...

Query
Have a doubt regarding the usage of Compare Instructions and its usage in Assembler Programming. Am not sure, but I guess somewhere in my college days it was taught that using COMPARE instructions would actually take more time to get executed than a normal MOVE instruction. Is this true ? I cant remember the reason behind this, but it was something related to the usage of NSI in PSW. Did some search in google for the last two days but am not able to find anything concrete ...

Scenario
Currently we have coded a program (subroutine) with a check for a particular field. This field could have values A1, A2, A3 or A4 and the processing is slightly different for each. For every call of this program, would it be a good idea to recode is such a way to use the second byte (1, 2, 3, 4 ) and put a logic to branch accordingly. Is it worth the effort or would it be better to simple leave it like that ... For a big transaction (functionality wise) this routine could be called thousands of times during a single task (online transaction).

Bill Woodger · Posted: Mon Aug 01, 2011 1:05 pm

Does the transaction appear "slow" when it is used? If it is getting a "normal" sort of response (hit the key causing transmission, and available for input almost straight away) then it's not going to be worth doing anything to improve it, no one will notice.

If the response is "slow", and the users want something done about it, and you've been given the task, then you have a different question, which is "why does it take so long" and you start looking at it from the highest level, and narrowing it down with what tools you have available.

It is unlikely, these days, that making the sort of change that you are talking about will make any difference unless you already know that there is some very heavily used code, which the stuff you are talking about is part of.

You are right, arranging branching on the 2nd byte would be "faster". With today's machines, it is much better to concentrate on understandability and maintainability than to look for performance which no-one is ever going to notice.

Do it how everyone else does it at your site (which probably means leaving the code alone). The thinking, coding, admin, testing, etc is not going to be worth it, then someone is going to come along on maintenance, not recognise it for what it is, and screw up, or waste time until they understand it.

Binop B · Posted: Mon Aug 01, 2011 1:37 pm

Thanks a lot Bill for your guidance ...

As of now, the users are having a little concern regarding the response time but they haven't raised any "official concern" or request as such .. As suggested, will leave the code as it is now but will come back to it in case any questions start to pop up ...

Also, thanks again for the confimation that - for the above scenario, the branching would be faster than the comparison logic.

Bill Woodger · Posted: Mon Aug 01, 2011 1:53 pm

Binop,

If you pick up the performance thing in the future, don't go straight for that bit of code. Unless in a huge iteration, it is more likely something else, most likely I/O related.

PeterHolland · Posted: Mon Aug 01, 2011 1:55 pm

Whatever you do, you have to do some comparing. By means of a CLC or
TM to trigger your branch.
The only way to do branching without comparing you need to set up a branch table, see : en.wikibooks.org/wiki/360_Assembly/Branch_Instructions

You need then 4 instructions to execute your branch, instead of 2 for CLC or TM.

Binop B · Posted: Mon Aug 01, 2011 1:58 pm

Sure Bill... will keep that in mind ..
there are a lot of I/O operations in the routine, but while coding itself we have tried to put it in the best possible way we could think of...

Binop B · Posted: Mon Aug 01, 2011 2:09 pm

Hi Peter.. Hope you doing good ...

dick scherrer · Posted: Mon Aug 01, 2011 8:13 pm

Hi Binop,

Yes, the compare should take a bit more cpu than the move.

Having said that, you will not be able to measure the difference unless you write loops that do a billion comparea and another that does a billion moves. Display the time at the beginning of the first loop, between the loops and after the 3rd loop to see what little difference.

If you have a situation that has performance problems, this would be one of the last places to spend time on (imho).

Bill O'Boyle · Posted: Mon Aug 01, 2011 8:54 pm

Binop,

You could verify the contents with three CLI's. The first one for an 'A' and the second and third for not less than C'1' and not greater than C'4'.

If none of the above are false, then store the 2nd-byte (STC) in a work-register and "AND" this register with a F'15' (result is F'1' through F'4') and you're ready for branching. This will remove the need for a PACK and CVB. An SLL,3 will then result in a F'4' through F'16' in the work-register, if so desired.

The above instructions are very cheap.

Also, for testing a label (from 2-256 bytes in length) for X'00's, consider the OC (into itself) with a BZ indicating X'00's.

An alternative for loading a 4-Byte label into a register to test for X'00's, without having to clear the register beforehand, take a look at the ICM instruction, with BZ indicating X'00's.

ICM's and STCM's are used by COBOL for COMP-5 internal manipulation.

HTH....

Bill

Bill O'Boyle · Posted: Tue Aug 02, 2011 8:58 am

Binop B · Posted: Tue Aug 02, 2011 3:24 pm

Binop B · Posted: Tue Aug 02, 2011 3:44 pm

Out of curiosity ... ...

A quote from the link the Peter has shared ...

Robert Sample · Posted: Tue Aug 02, 2011 4:09 pm

dbzTHEdinosauer · Posted: Tue Aug 02, 2011 5:06 pm

not mentioned is the fact
nowadays, it seems that batch functions/processes are being performed in CICS Regions.

CLUE: CALLed thousands of times....

Bill Woodger · Posted: Tue Aug 02, 2011 5:18 pm

Binop B · Posted: Tue Aug 02, 2011 5:19 pm

Thanks Robert for your inputs ...

Bill Woodger · Posted: Tue Aug 02, 2011 5:52 pm

dick scherrer · Posted: Tue Aug 02, 2011 7:50 pm

Hello,

If the code really does "run too slow" one of the big ways to waste resources is to do things that aren't needed (again).

Often code will re-read the exact same record(s) it read on the previous iteration. Why not use the ones already read?

Code often searches an array when the search value is the same as the previous time thru the search.

Look to the code for processes that provide nothing to the process really, but merely waste reqources.

The work to flip-flop between a compare and a move is nothing useful - might be entertaining and even educational, but will not help the performance.

Binop B · Posted: Tue Aug 02, 2011 10:04 pm