NeoResearch: NVM Learn

Introduction
Push Some Numbers
Arithmetic operations
Stack Operations
Double-Stack model
Arrays
Value and Reference types
Syscalls (“Hello World!”)
A NVM snake game
Learn more

Introduction

This tutorial intends to teach you the basics of Neo Blockchain Virtual Machine, usually called NeoVM (or NVM). NeoVM is a stack-based machine for processing Smart Contract applications (and also transaction verifications) on Neo Blockchain, inspired by many successful stack languages like Bitcoin Script, Microsoft CIL, Java Virtual Machine and, finally, the FORTH language.

FORTH is older of these languages, being proposed in 1970 by Chuck Moore, currently maintained by Forth Inc and many independent implementations, such as Gforth and also this nice website/javascript implementation by Nick Morgan called EasyForth. Stack languages have many practical applications, specially due to their simplicity, and easier verification for code correctness (specially on a deterministic environment such as a blockchain).

(*) If you are new on the Stack Programming world, we strongly recommend reading more on Stack Data Structure first. If you are curious on FORTH stack machine, we also recommend EasyForth tutorial.

(**) The NVM/FORTH implementation of this tutorial is taken from NeoResearch nvm-forth project.

NVM Stack Items

NeoVM supports seven different types of stack items.

Integers: which are in fact Big Integers with positive/negative values limited to 32-bytes (or 256 bits)
Byte Arrays: general byte arrays
Booleans: a true/false value
Arrays: an array can contain a collection of other stack items, including more Arrays (important: this is a reference type, not value type)
Structs: similar to an array, it can contain several stack items inside it (struct is a value type)
Maps: a map can contain a byte array mapping from a key to a value, that may be another stack item
Interop Interfaces: these stack items are only meant to used for interoperating with high-level implementations of NeoVM, such as NeoContract (the Application Engine for Neo Blockchain)

NVM Opcodes

A NeoVM program is called script (or NeoVM script), which is composed by several operations (called opcodes). Each opcode has a unique number and a unique name, in order to identify the operation. For example: opcode named PUSH1, number 81 (in hex, 81 = 0x51), generates the number 1; opcode named ADD, number 147 (in hex, 147 = 0x93) adds two numbers.

Push Some Numbers

All information needs to be on NeoVM stack in order to be processed. For example, if you want to add two numbers, first you need to put them on the stack. One difference from other languages is that you need to first put the operands, and then put the operation, e.g., operation 1 + 2 is done as 1 2 + on a stack machine (put one, put two, then sum the top two elements). Where is the result of the operation stored? Again, the result of the operation is also put back on the stack.

`push1-push16` (opcodes `0x51-0x60`)

Let’s try it on practice! Type (don’t copy-paste) the following into the interpreter, typing Enter after each line.

PUSH1
PUSH2
PUSH3

\ FORTH implementation of NeoVM 2.x (Neo Blockchain Virtual Machine)
\ fast tutorial on forth: https://learnxinyminutes.com/docs/forth
\ https://yosefk.com/blog/my-history-with-forth-stack-machines.html
\ https://www.complang.tuwien.ac.at/forth/gforth/Docs-html/Characters-and-Strings-Tutorial.html
\ -----------------------------------------------------------
\ usage: online application https://neoresearch.io/nvm-learn/
\ locally on linux you can install gforth (apt install gforth)
\ however, FORTH syntax is not standard for both interpreters
\ this is meant to work only on the web application above
\ -----------------------------------------------------------

\ ================
\ define constants
\ ================

\ create empty bytearray
\variable empty 0 cells allot 
\ pushes empty bytearray to stack
\: push0 empty @ ;
\ for now, pushing ZERO instead of empty array... TODO: improve this
: push0 0 ;                          \ 0x00
: pushf push0 ;

\ push value -1 on main stack
: pushm1 -1 ;                        \ 0x4f
\ unused
\                                    \ 0x50                                       
\ push value 1 on main stack
: push1 1 ;                          \ 0x51
\ push value 1 on main stack
: pusht push1 ;                      \ 0x51
\ push value 2 on main stack
: push2 2 ;                          \ 0x52
\ push value 3 on main stack
: push3 3 ;                          \ 0x53
\ push value 4 on main stack
: push4 4 ;                          \ 0x54
\ push value 5 on main stack
: push5 5 ;                          \ 0x53
\ push value 6 on main stack
: push6 6 ;                          \ 0x53
\ push value 7 on main stack
: push7 7 ;                          \ 0x53
\ push value 8 on main stack
: push8 8 ;                          \ 0x53
\ push value 9 on main stack
: push9 9 ;                          \ 0x59
\ push value 10 on main stack
: push10 10 ;                        \ 0x5a
\ push value 11 on main stack
: push11 11 ;                        \ 0x5b
\ push value 12 on main stack
: push12 12 ;                        \ 0x5c
\ push value 13 on main stack
: push13 13 ;                        \ 0x5d
\ push value 14 on main stack
: push14 14 ;                        \ 0x5e
\ push value 15 on main stack
: push15 15 ;                        \ 0x5f
\ push value 16 on main stack
: push16 16 ;                        \ 0x60

\ ===========
\ control ops
\ ===========
\ nop: no operation
: nop ;                              \ 0x61

\ skip jumps, skip calls

\ skip ret

\ skip appcals, syscalls, tail call

\ ==========
\ stack ops
\ ==========

\ note: using return stack (rstack) as alternative stack. perhaps better using another software stack

\ duplicate data from alternative stack (could be `fromaltstack dup toaltstack`)
: dupfromaltstack r@ ;               \ 0x6a

\ move data to alternative stack
: toaltstack >r ;                    \ 0x6b

\ move data from alternative stack
: fromaltstack r> ;                  \ 0x6c

\ The item n back in the main stack is removed.
: xdrop roll drop ;                  \ 0x6d

\  The item n back in the main stack in swapped with top stack item.
\        XSWAP = 0x72 (requires loop/if)

\ The item on top of the main stack is copied and inserted to the position n in the main stack.
\        XTUCK = 0x73, (requires loop/if)

\ Puts the number of stack items onto the stack.
\ depth native defined (opcode 0x74)

\ Removes the top stack item.
\ drop native defined (opcode 0x75)

\ Duplicates the top stack item.
\ dup native defined (opcode 0x76)

\ Removes the second-to-top stack item.
\ nip native defined (opcode 0x77)

\ Copies the second-to-top stack item to the top.
\ over native defined (opcode 0x78)

\ The item n back in the stack is copied to the top.
\ pick native defined (opcode 0x79)

\ The item n back in the stack is moved to the top.
\ roll native defined (opcode 0x7a)

\ The top three items on the stack are rotated to the left.
\ rot native defined (opcode 0x7b)

\ The top two items on the stack are swapped.
\ swap native defined (opcode 0x7c)

\ The item at the top of the stack is copied and inserted before the second-to-top item.
\ tuck native defined (opcode 0x7d)

\ ==========================
\ begin arithmetic operators

\ inc 0x8b (defined after add)
\ dec 0x8c (defined after sub)

\ sign 8d (IF)

\ negate 8f (IF)

\ abs 90 (IF)

\ not 91 (IF)

\ nz 92 (IF)

\ add values on main stack
: add + ;                        \ 0x93

\ subtract values on main stack
: sub - ;                        \ 0x94

\ adds 1 to the input (defined here because of add)
: inc 1 add ;                    \ 0x8b

\ subtracts 1 from the input (defined here because of sub)
: dec 1 sub ;                    \ 0x8c

\ multiply values on main stack
: mul * ;                        \ 0x95

\ a is divided by b
: div / ;                        \ 0x96

\ mod (native)                   \ 0x97

\ shl (c# bigint) 0x98

\ shr (c# bigint) 0x99

\ booland (IF) 0x9a

\ boolor (IF) 0x9b

\ Returns 1 if the numbers are equal, 0 otherwise (note that forth true is -1)
: numequal = -1 mul ;              \ 0x9c

\ 9d reserved ?

\ Returns 1 if the numbers are not equal, 0 otherwise. (note that forth true is -1)
: numnotequal = 1 add ;            \ 0x9e

\ Returns 1 if a is less than b, 0 otherwise. (note that forth true is -1)
: lt < -1 mul ;            \ 0x9f

\ Returns 1 if a is greater than b, 0 otherwise. (note that forth true is -1)
: gt > -1 mul ;            \ 0xa0

\ lte (IF ? OR? <= ?)   \ 0xa1

\ gte (IF ? OR? >= ?)   \ 0xa2

\ Returns the smaller of a and b. \ 0xa3
\ min (native)

\ Returns the larger of a and b. \ 0xa4
\ max (native)

\ Returns 1 if x is within the specified range (left-inclusive), 0 otherwise.
\ WITHIN = 0xA5,

\ ====================
variable nvmarraytest            \ single global for array tests (warming-up var system)

\ arraysize is position zero of array
: arraysize push0 add @ ;                                          \ 0xc0

\ pack (opcode 0xc1) - presented in a few lines after

\ unpack (opcode 0xc2) - not implemented, yet

\ An input index n (or key) and an array (or map) are taken from main stack. Element array[n] (or map[n]) is put on top of the main stack.
: pickitem cells push1 add add @ ;                                 \ 0xc3
\ 1600 0 pickitem -> 1600[1]
        \ PICKITEM = 0xC3,
\        /// <summary>
\        /// A value v, index n (or key) and an array (or map) are taken from main stack. Attribution array[n]=v (or map[n]=v) is performed.
\        /// </summary>
\ SETITEM = 0xC4,
: setitem swap push1 add rot add ! ;                               \ 0xc4
\ 1600 0 10 -> 1600[1] = 10

\ newarray  (alloc n spaces + 1 for count)
: newarray dup here swap push1 add cells allot dup rot swap ! ;    \ 0xc5

\ pack (using `do loop` in return stack, may not work on gforth)    \0xc1 (after newarray/setitem)
: pack dup newarray toaltstack push0 do fromaltstack dupfromaltstack swap dup toaltstack rot setitem loop fromaltstack ;
\ warning: `0 pack` may break the FORTH loop

\ bye

\ clean page (command implemented manually)
page

What happened? Every time you type a line followed by the Enter key, the NVM opcode is executed (state HALT means that no errors happened during NeoVM execution). You should also notice that as you execute each line, the area at the top fills up with numbers. That area is our visualization of the stack. It should look like this:

1 2 3 <- Top

Now, into the same interpreter, try the opcode ADD followed by the Enter key. The top two elements on the stack, 2 and 3, have been replaced by 5.

1 5 <- Top

At this point, your editor window should look like this:

push1 HALT push2 HALT push3 HALT add HALT

Type ADD again and press Enter, and the top two elements will be replaced by 6. If you try ADD one more time NVM would abort execution, because it will try to pop the top two elements off the stack, even though there’s only one element on the stack! This results in a FAULT state on NeoVM:

push1 HALT push2 HALT push3 HALT add HALT add HALT add FAULT (caused by Stack Underflow)

You can also write everything in a single line and press Enter:

push10 push3 add