Calling GPL routines from assembly

The subroutine stack
The CALL / RTN mechanism
The DSWGR / RTGR mechanism
The >0010 and >0012 routines

The first thing to consider is that there are different types of routines in GPL:

  • Routines called with CALL and returning with RTN or RTNC
  • Routines called with DSWGR and returning with RTGR
  • Routines called by name with routine G&>0010 and returning with routine G&>0012
  • All three make use of the subprogram stack to save their return parameters, so let's first talk about this stack.


    The GPL subroutine stack

    This stack has to be located in the scratch-pad RAM, between addresses >8300 and >83FF. Upon normal GPL execution, the stack starts at >8380 and grow upward to >839F.

    A stack pointer is maintained in byte >8373. Since this pointer is only one byte, the most significant byte of the address is assumed to be >83, which is why the stack must be in the scratch-pad. The pointer points at the last entry saved on the stack. Entries are 2-byte in length and can be assumed to be word-aligned.

          +---------+
    >83FE : :
    : (free) :
    : :
    +---------+
    >8388 | |<-,
    +---------+ |
    | | |
    +---------+ |
    | | |
    +---------+ |
    | | |
    +---------+ |
    >8380 | | |
    +---------+ |
    |
    +---------+ |
    >8372 | xx | 88 |--'
    +---------+

    Calling instructions save parameters such as the return address on the stack, after incrementing the pointer. Return instructions fetch the proper return parameters from the stack, then decrement the pointer.

    There is no safety mechanism to detect stack overflow (i.e. overwriting the 'data' stack that begins at >83A0), nor underflow (i.e. fetching meaningless bytes from under >8380).


    The CALL and RTN mechanism

    The GPL opcode CALL (opcode >06) gets the current address from the GROMs, saves it on the stack, then branches to the subroutine specified in the code flow:

           CALL G@>8234       which is encoded as >06,>82,>34  

    Unfortunately, no provision is made to pass the address as a variable. To do so you would have to resort to complicated tricks like: 1) save the return address on the stack, 2) save the desired address on the stack, and 3) perform a dummy RTN.

    The called routine can return with either RTN (opcode >00) or RTNC (opcode >01). The difference being that RTN clears the Cnd bit (the GPL equivalent of the Eq bit) in the GPL status byte, whereas RTNC leaves it intact. This gives the caller a chance to test the result of a subroutine:

           CALL G@>62A5
    BS G@SET
    BR G@NOTSET

    NB If the called subroutine (>62A5) returned with RTN, the BR will always be taken.

    Note that no mechanism is provided to return with the Cnd bit always set. But you can easily do it with:

           CEQ  @>8300,@>8300        compare a byte to itself 
    RTNC

    Since a byte is always equal to itself, the Cnd bit will be set. We're just wasting memory for the 3 bytes required to encode the CEQ instruction...


    Calling from assembly: the easy way

    If you have a cartridge like Editor/Assembler or Mini-Memory, you have a very simple way of calling GPL routines: the GPLLNK subprogram. Its syntax is very simple:

           BLWP @GPLLNK
    DATA >6234 GROM address of the routine to call

    The subroutine returns after the DATA statement, the Cnd bit will unfortunately be lost in the process and cannot be recovered, even from >837C directly.

    Also, there is no provision made to pass parameters in the form of data statements following the call. For instance, in GPL you could write:

           CALL  G@>7123
    BYTE >F5,>A2,>83

    The called subroutine would then retrieve the three data bytes with FETCh instructions. FETC looks up the return address on the stack, fetches bytes from GROM and updates the return address accordingly. Obviously this is not going to work from assembly, since FETC always accesses GROM, not cpu memory.


    Calling from assembly: the hard way

    Assuming you don't have a cartridge with a GPLLNK routine, you will need to do the call yourself, which is easy, and to implement a return mechanism, which is not.

    The first thing to do to emulate a call is to pass a return address on the stack. This will be the address of a GPL routine that enters assembly at the address you want (e.g. your return address). We'll talk about that later, let's first see the calling mechanism:

           INCT @>8373                Increment the stack pointer
    MOVB @>8373,R1 Get stack pointer
    SRL R1,8 Make it LSB
    LI R0,>xxxx GROM address of our GPL return routine
    MOV R0,@>8300(R1) Save it on the stack

    Now that we have set a return address, we can branch to a GPL routine:

           MOV  @>83FA,R9            Get current GROM base 
    LI R0,>6234 GROM address of the GPL routine to call
    MOV R0,@>0402(R9) Set the address
    SWPB R0
    MOV R0,@>0402(R9)
    LWPI >83E0 GPL workspace
    B @>006A Enter GPL interpreter

    Returning to assembly: the easy way

    If you have a GRAM card and can write your own GPL routines, it is a simple matter to write one that will return to a predetermined address in your assembly program. You just use the XML instruction. This instructions gets the address where to enter assembly from 16 tables located at various places in memory. Each table can hold upto 16 vectors, i.e. addresses where to branch.

    The XML instruction (code >0F) is followed by a byte of data whose first nibble indicates the table, and the second nibble indicates the entry within the table. Each entry is one word in length, being a cpu address.

    The address of the 16 tables are:

    XML >0x  >0D1A through >0D39   In console ROM 
    XML >1x >12A0 through >12BF "
    XML >2x >2000 through >201F Low memory expansion
    XML >3x >3FC0 through >3FDF "
    XML >4x >3FE0 through >3FFF "
    XML >5x >4010 through >402F Peripheral cards DSR space
    XML >6x >4030 through >404F "
    XML >7x >6010 through >602F Cartridge ROM
    XML >8x >6030 through >604F "
    XML >9x >7000 through >701F "
    XML >Ax >8000 through >801F Decoded as >8300, i.e. scratch-pad
    XML >Bx >A000 through >A01F High memory expansion
    XML >Cx >B000 through >B01F "
    XML >Dx >C000 through >C01F "
    XML >Ex >D000 through >D01F "
    XML >Fx >8300 through >831F Scratch-pad RAM

    So for instance:

            XML >C3        Coded as >0F,>C3

    will get its vector from address >B006 and enter assembly language at whatever address is found in >B006.

    Now all you have to do is:

  • Place the desired assembly return address at >B006
  • Place the GROM address of your XML instruction on the subprogram stack (shown above as >xxxx )
  • Call the desired GPL subroutine (shown above as >6234).
  • When the GPL subroutine performs its RTN (or RTNC) it will return to the XML instruction at address >xxxx, wich will cause the GPL interpreter to relinquish control to your assembly program, at whatever address you placed at >B006. Remember that the workspace will still be >83E0, the GPL workspace, so the first thing to do will probably be to switch to your own workspace.

    Note that XML does not affect the status byte, so you will be able to test the Cnd bit, if the GPL subroutine returned with RTNC.

    GPLRET LWPI >20BA         My workspace 
    MOV @>837C,R1 Get GPL status byte
    SLA R1,3 Test Cnd bit (value >20)
    JOC SET
    JNC NOTSET


    Returning to assembly: the hard way

    If you don't have a GRAM card, things are getting though: how are you going to re-enter assembly? It should be possible since GPL is entered upon power-up and you are executing an assembly program: obviously there is a way in! However it may not be possible to resume assembly at an arbitrarily chosen location...

    One way around this problem is to search the console GROMs for sequences of bytes that resemble an XML instrustion, i.e. >0F >xx, where >xx is a suitable table location.

    One such sequence is found at GROM address >36B8 : it contains >0F >4B, which could be interpreted as an XML to a vector found at >3FF6. (It is actually part of the instruction MOVE >000F,@>834A,@>834B but we are reading it out of sequence.) The actual location of this instruction may vary according to the GROM version, so you'd better use your favorite memory editor to locate the >0F, >4B string.

    All we have to do then is to use >36B8 as a GPL return address, and push it on the stack as shown above (>xxxx). Then place our assembly return address at >3FF6 and enter the GPL interpreter, just as above.


    The DSWGR and RTGR mechanism

    As you know, GROMs are accessed via a set of four addresses in cpu memory:
    >9800 to read data (this is known as the GROM base)
    >9802 to read the current address
    >9C00 to write data (GRAM only)
    >9C02 to set the address.

    However, provision is made to use upto 16 sets of such addresses, >0004 bytes apart, which give access to as much as one megabyte of GROM/GRAM. This never became a reality at Texas Intruments (in fact the console GROMs answer to each and every base), but most GRAM cards make use of this trick to implement more than 64K of GRAM.

    Interestingly, TI provided GPL opcodes to call routines located in another base. Unfortunately, one of them is buggy!

    The DSWGR opcode has the following syntax:

           DSWGR base,address  

    Where base can be either a variable or a constant, whereas address has to be a variable. So, in practice you could type:

           DST   >604A,@>8300        Address where to branch
    DSWRG >9804,@>8300 Branch at >604A using GROM port >9804

    DSWGR pushes both the current base and the return address on the subroutine stack, then changes the base value at >83FA and takes the branch.

    Routines called in this way simply return with:

           RTGR   

    which recovers the base and the return address from the stack.

    Unfortunately, RTGR also performs a dummy write to GROM/GRAM memory, at the address corresponding to the current base. That is, it erases GRAM byte >9800 when returning to base >9800, GRAM byte >9804 when returning to >9804, etc. Why TI did that is beyond my understanding. Maybe they thought it didn't matter since GROMs cannot be written to? Or because GROMs were limited to >1800 bytes, GROM 8 would cover >8000-97FF so the >98xx addresses would not be a problem? Well, it may be a problem with GRAM cards, so be aware of it.

    Anyhow, there is no DSWGR-callable routine in the console GROMs, and I'm not aware of any cartridge that contains one. Which does not mean that you cannot write your own!


    Calling from assembly

    To call a DSWGR-type routine from assembly, you cannot use GPLLNK. You have to resort to the 'manual' technique explained above. Namely:

  • Push the return address on the subroutine stack
  • Push the current GROM base (that you can get from >83FA)
  • Call the desired GPL subroutine.

  • The >0010 and >0012 calling mechanism

    At address >0010 in the console GROMs, is a routine which is the GPL equivalent of DSRLNK. It is used to call by name DSRs and subprograms (e.g. CALLs) that can be located either in peripheral card ROMs, or in GROMs.

    The routine expects the name of the DSR or subprogram to be placed in VDP memory, at an address specified by word >8356 (which points at the size byte of the string). The >0010 routine also FETChes one data byte from GROM memory: if it's >08 it will scan the DSR lists, if it's >0A it will scan the subprogram lists.

    The routine first checks the name and returns with the Cnd bit set if its size is zero, or larger than 7. Then it calls XML >19 which scans the peripheral cards DSR space at >4000-5FFF. If the desired subprogram/DSR is found there, the XML returns with the Cnd bit reset.

    If the subprogram/DSR is not found in any card ROM, the routine goes on scanning GROM headers (i.e. >0000,>2000, >4000, etc upto >E000). To this end, it first saves the GROM base on the subprogram stack. Then it calls XML >1A with performs the scanning. Note that XML >1A will scan all sixteen GROM bases and change the current base in >83FA accordingly.

    GPL subprograms and DSRs called by this mechanism must return by branching to a subroutine found at address >0012 in the console GROMs. This routine simply retrieves the initial GROM base from the top of the stack, sets it, then returns directly to the caller of the >0010 routine.

    Note that for some reason, subroutine >0010 leaves two bytes of garbage on the stack before it saves the current GROM base. This is why routine >0012 must be used: it knows that it should retrieve the base, then decrement the pointer, then perform a RTN.


    Calling from assembly

    This is not as easy as it seems. We cannot call routine >0010 directly, because it expects a data byte from GROM, that we cannot pass from assembly. Fortunately, the FETC instruction is the very first one in the routine, so we could just skip it as follow:

  • Read the 2-byte instruction at GROM address >0010: it will be a branch the actual location of the routine.
  • Doctor it so as to increment the address by two (ANDI with >1FFF, then INCT it)
  • Place >08 or >0A in byte >836D.
  • Now enter GPL at our doctored address, by performing a 'manual' GPL call as described above.
  • Another way around is to do the scanning ourselves, but that's not easy either. We can obviously use the DSRLNK routine to search peripheral card ROMs. But assuming the subprogram/DSR is not found there, how to search GROM headers?

    We cannot call XML >1A directly because it does not return to its caller, instead it re-enters the GPL interpreter. Plus, routine >0010 contains a number of intricacies to repeatedly call XML >1A, that would make our life really difficult.

    So the best way is probably to write a routine that scans the GROM headers and calls the appropriate GPL subprogram/DSR. This would be cleaner than calling subroutine >0010 with the dirty trick described above.

    Initial version 3/6/02 posted on OLUG
    Revision 1. 1120/02 Turned into a webpage


    Back to the TI-99/4A Tech Pages