2021-02-15 15:29:55 +08:00
|
|
|
# Nasal Script Language
|
2019-07-25 02:14:33 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
## Introduction
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
[Nasal](http://wiki.flightgear.org/Nasal_scripting_language)
|
2021-10-14 13:42:07 +08:00
|
|
|
is an ECMAscript-like programming language that used in [FlightGear](https://www.flightgear.org/).
|
2019-07-25 02:14:33 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
The interpreter is totally rewritten by [ValKmjolnir](https://github.com/ValKmjolnir) using C++(`-std=c++11`)
|
|
|
|
without reusing the code in [Andy Ross's nasal interpreter](<https://github.com/andyross/nasal>).
|
2021-10-08 23:18:26 +08:00
|
|
|
But we really appreciate that Andy created this amazing programming language and his interpreter project.
|
2021-05-04 17:39:24 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
Now this project uses MIT license (2021/5/4).
|
|
|
|
Edit it if you want,
|
|
|
|
use this project to learn or create more interesting things
|
|
|
|
(But don't forget me XD).
|
2021-05-04 17:39:24 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
## Why Writing Nasal Interpreter
|
2020-09-14 13:52:25 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Nasal is a script language first used in Flightgear,
|
2021-10-14 13:42:07 +08:00
|
|
|
created by [Andy Ross](<https://github.com/andyross>).
|
|
|
|
But in 2019 summer holiday,
|
|
|
|
members in [FGPRC](https://www.fgprc.org/) told me that it is hard to debug with nasal-console in Flightgear,
|
2021-10-08 23:18:26 +08:00
|
|
|
especially when checking syntax errors.
|
2021-10-14 13:42:07 +08:00
|
|
|
So i tried to write a new interpreter to help them checking syntax error and even, runtime error.
|
2020-09-14 13:52:25 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
I wrote the lexer,
|
2021-10-14 13:42:07 +08:00
|
|
|
parser and
|
|
|
|
bytecode virtual machine(there was an ast-interpreter,
|
2021-10-08 23:18:26 +08:00
|
|
|
but i deleted it after version4.0) to help checking errors.
|
2021-10-14 13:42:07 +08:00
|
|
|
We found it much easier to check syntax and runtime
|
2021-10-08 23:18:26 +08:00
|
|
|
errors before copying nasal-codes in nasal-console in Flightgear to test.
|
2021-05-04 01:13:53 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Also, you could use this language to write some
|
|
|
|
interesting programs and run them without the lib of Flightgear.
|
|
|
|
You could add your own built-in functions to change
|
2021-10-14 13:42:07 +08:00
|
|
|
this interpreter to a useful tool in your own projects (such as a script in your own game).
|
2020-11-20 00:18:17 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
## How to Compile
|
2020-11-20 00:18:17 +08:00
|
|
|
|
2021-05-04 01:13:53 +08:00
|
|
|
Better choose the latest update of the interpreter.
|
2021-10-13 22:59:15 +08:00
|
|
|
Download the source code and build it!
|
|
|
|
It's quite easy to build this interpreter.
|
2021-04-19 19:12:41 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
MUST USE `-O2/-O3` if want to optimize the interpreter!
|
2021-04-19 19:12:41 +08:00
|
|
|
|
2021-07-19 17:04:45 +08:00
|
|
|
Also remember to use g++ or clang++.
|
2021-06-29 17:18:05 +08:00
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
> [cpp compiler] -std=c++11 -O3 main.cpp -o nasal.exe -fno-exceptions
|
2021-06-11 15:16:06 +08:00
|
|
|
|
|
|
|
Or use this in linux/macOS/Unix
|
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
> [cpp compiler] -std=c++11 -O3 main.cpp -o nasal -fno-exceptions
|
2021-06-11 15:16:06 +08:00
|
|
|
|
|
|
|
## How to Use?
|
|
|
|
|
|
|
|
Input this command to run scripts directly:
|
|
|
|
|
|
|
|
> ./nasal filename
|
|
|
|
|
2021-06-11 15:28:25 +08:00
|
|
|
Use these commands to get version of interpreter:
|
|
|
|
|
2021-07-16 17:18:13 +08:00
|
|
|
> ./nasal -v | --version
|
2021-06-11 15:16:06 +08:00
|
|
|
|
2021-08-04 00:03:49 +08:00
|
|
|
Use these commands to get help(see more debug commands in help):
|
2021-06-11 15:28:25 +08:00
|
|
|
|
2021-07-16 17:18:13 +08:00
|
|
|
> ./nasal -h | --help
|
2020-12-12 20:13:23 +08:00
|
|
|
|
2021-06-13 01:01:32 +08:00
|
|
|
If your system is Windows and you want to output unicode,please use this command before running nasal interpreter:
|
|
|
|
|
|
|
|
> chcp 65001
|
|
|
|
|
|
|
|
The interpreter's interactive mode will do this automatically,so you don't need to run this command if you use the interactive interpreter.
|
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
## Parser
|
2019-09-25 20:49:06 +08:00
|
|
|
|
2021-01-23 19:21:37 +08:00
|
|
|
LL(k) parser.
|
2020-01-22 19:07:33 +08:00
|
|
|
|
2021-01-23 19:21:37 +08:00
|
|
|
```javascript
|
2021-08-01 02:11:27 +08:00
|
|
|
(var a,b,c)=[{b:nil},[1,2],func return 0;];
|
2021-01-23 19:21:37 +08:00
|
|
|
(a.b,b[0],c)=(1,2,3);
|
|
|
|
```
|
2020-12-19 01:26:15 +08:00
|
|
|
|
2021-06-05 17:15:07 +08:00
|
|
|
These two expressions have the same first set,so LL(1) is useless for this language.
|
2021-01-23 19:21:37 +08:00
|
|
|
|
|
|
|
Maybe in the future i can refactor it to LL(1) with special checks.
|
2020-12-19 01:26:15 +08:00
|
|
|
|
2021-08-01 02:11:27 +08:00
|
|
|
Problems mentioned above have been solved for a long time, but recently i found a new problem here:
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
var f=func(x,y,z){return x+y+z}
|
|
|
|
(a,b,c)=(0,1,2);
|
|
|
|
```
|
|
|
|
|
|
|
|
This will be recognized as this:
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
var f=func(x,y,z){return x+y+z}(a,b,c)
|
|
|
|
=(0,1,2);
|
|
|
|
```
|
|
|
|
|
|
|
|
and causes fatal syntax error.
|
|
|
|
And i tried this program in flightgear nasal console.
|
|
|
|
It also found this is a syntax error.
|
|
|
|
I think this is a serious design fault.
|
|
|
|
To avoid this syntax error, change program like this, just add a semicolon:
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
var f=func(x,y,z){return x+y+z};
|
|
|
|
^ here
|
|
|
|
(a,b,c)=(0,1,2);
|
|
|
|
```
|
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
### version 1.0(last update 2019/10/14)
|
2021-06-05 17:15:07 +08:00
|
|
|
|
|
|
|
First fully functional version of nasal_parser.
|
|
|
|
|
|
|
|
Before version 1.0,i tried many times to create a correct parser.
|
|
|
|
|
|
|
|
Finally i learned LL(1) and LL(k) and wrote a parser for math formulas in version 0.16(last update 2019/9/14).
|
|
|
|
|
|
|
|
In version 0.17(2019/9/15) 0.18(2019/9/18) 0.19(2019/10/1)i was playing the parser happily and after that i wrote version 1.0.
|
|
|
|
|
|
|
|
This project began at 2019/8/31.
|
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
## Abstract Syntax Tree
|
2020-09-13 16:40:23 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
### Version 1.2(last update 2019/10/31)
|
2020-09-13 16:40:23 +08:00
|
|
|
|
2020-12-12 20:13:23 +08:00
|
|
|
The ast has been completed in this version.
|
2020-09-13 16:40:23 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
### Version 2.0(last update 2020/8/31)
|
2020-01-13 14:13:35 +08:00
|
|
|
|
2020-12-12 20:13:23 +08:00
|
|
|
A completed ast-interpreter with unfinished lib functions.
|
2019-10-13 11:01:31 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
### Version 3.0(last update 2020/10/23)
|
2020-06-01 01:28:49 +08:00
|
|
|
|
2020-12-12 20:13:23 +08:00
|
|
|
The ast is refactored and is now easier to read and maintain.
|
2020-09-13 16:40:23 +08:00
|
|
|
|
2020-12-12 20:13:23 +08:00
|
|
|
Ast-interpreter uses new techniques so it can run codes more efficiently.
|
2020-06-01 01:28:49 +08:00
|
|
|
|
2020-12-12 20:13:23 +08:00
|
|
|
Now you can add your own functions as builtin-functions in this interpreter!
|
2020-06-01 01:28:49 +08:00
|
|
|
|
2020-12-12 20:13:23 +08:00
|
|
|
I decide to save the ast interpreter after releasing v4.0. Because it took me a long time to think and write...
|
2020-09-13 16:40:23 +08:00
|
|
|
|
2021-06-07 23:53:43 +08:00
|
|
|
### Version 5.0(last update 2021/3/7)
|
2020-12-19 01:26:15 +08:00
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
I change my mind.
|
|
|
|
AST interpreter leaves me too much things to do.
|
2020-12-19 01:26:15 +08:00
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
If i continue saving this interpreter,
|
|
|
|
it will be harder for me to make the bytecode vm become more efficient.
|
2020-12-19 01:26:15 +08:00
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
## Byte Code Virtual Machine
|
2020-12-12 20:13:23 +08:00
|
|
|
|
2021-06-29 17:18:05 +08:00
|
|
|
### Version 4.0 (last update 2020/12/17)
|
2020-12-12 20:13:23 +08:00
|
|
|
|
|
|
|
I have just finished the first version of byte-code-interpreter.
|
|
|
|
|
|
|
|
This interpreter is still in test.After this test,i will release version 4.0!
|
|
|
|
|
|
|
|
Now i am trying to search hidden bugs in this interpreter.Hope you could help me! :)
|
|
|
|
|
|
|
|
There's an example of byte code below:
|
|
|
|
|
|
|
|
```javascript
|
2021-01-23 19:21:37 +08:00
|
|
|
for(var i=0;i<4000000;i+=1);
|
2020-12-12 20:13:23 +08:00
|
|
|
```
|
2020-09-13 16:40:23 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-01-23 19:21:37 +08:00
|
|
|
.number 0
|
|
|
|
.number 4e+006
|
2020-12-12 20:13:23 +08:00
|
|
|
.number 1
|
2021-01-23 19:21:37 +08:00
|
|
|
.symbol i
|
|
|
|
0x00000000: pzero 0x00000000
|
2021-04-03 23:47:14 +08:00
|
|
|
0x00000001: loadg 0x00000000 (i)
|
|
|
|
0x00000002: callg 0x00000000 (i)
|
2021-01-23 19:21:37 +08:00
|
|
|
0x00000003: pnum 0x00000001 (4e+006)
|
|
|
|
0x00000004: less 0x00000000
|
|
|
|
0x00000005: jf 0x0000000b
|
|
|
|
0x00000006: pone 0x00000000
|
2021-04-03 23:47:14 +08:00
|
|
|
0x00000007: mcallg 0x00000000 (i)
|
2021-01-23 19:21:37 +08:00
|
|
|
0x00000008: addeq 0x00000000
|
|
|
|
0x00000009: pop 0x00000000
|
|
|
|
0x0000000a: jmp 0x00000002
|
|
|
|
0x0000000b: nop 0x00000000
|
2020-12-19 01:26:15 +08:00
|
|
|
```
|
|
|
|
|
2021-06-29 17:18:05 +08:00
|
|
|
### Version 5.0 (last update 2021/3/7)
|
2020-12-19 01:26:15 +08:00
|
|
|
|
|
|
|
I decide to optimize bytecode vm in this version.
|
|
|
|
|
2021-01-23 19:21:37 +08:00
|
|
|
Because it takes more than 1.5s to count i from 0 to 4000000-1.This is not efficient at all!
|
|
|
|
|
|
|
|
2021/1/23 update: Now it can count from 0 to 4000000-1 in 1.5s.
|
|
|
|
|
2021-06-29 17:18:05 +08:00
|
|
|
### Version 6.0 (last update 2021/6/1)
|
2021-04-03 23:47:14 +08:00
|
|
|
|
|
|
|
Use loadg loadl callg calll mcallg mcalll to avoid branches.
|
|
|
|
|
|
|
|
Delete type vm_scop.
|
|
|
|
|
|
|
|
Use const vm_num to avoid frequently new & delete.
|
|
|
|
|
|
|
|
Change garbage collector from reference-counting to mark-sweep.
|
|
|
|
|
|
|
|
Vapp and newf operand use .num to reduce the size of exec_code.
|
|
|
|
|
|
|
|
2021/4/3 update: Now it can count from 0 to 4000000-1 in 0.8s.
|
|
|
|
|
2021-04-19 19:12:41 +08:00
|
|
|
2021/4/19 update: Now it can count from 0 to 4e6-1 in 0.4s.
|
|
|
|
|
|
|
|
In this update i changed global and local scope from unordered_map to vector.
|
|
|
|
|
|
|
|
So the bytecode generator changed a lot.
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
for(var i=0;i<4000000;i+=1);
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-04-19 19:12:41 +08:00
|
|
|
.number 4e+006
|
|
|
|
0x00000000: intg 0x00000001
|
|
|
|
0x00000001: pzero 0x00000000
|
|
|
|
0x00000002: loadg 0x00000000
|
|
|
|
0x00000003: callg 0x00000000
|
|
|
|
0x00000004: pnum 0x00000000 (4e+006)
|
|
|
|
0x00000005: less 0x00000000
|
|
|
|
0x00000006: jf 0x0000000c
|
|
|
|
0x00000007: pone 0x00000000
|
|
|
|
0x00000008: mcallg 0x00000000
|
|
|
|
0x00000009: addeq 0x00000000
|
|
|
|
0x0000000a: pop 0x00000000
|
|
|
|
0x0000000b: jmp 0x00000003
|
|
|
|
0x0000000c: nop 0x00000000
|
|
|
|
```
|
|
|
|
|
2021-06-29 17:18:05 +08:00
|
|
|
### Version 6.5 (last update 2021/6/24)
|
2021-06-05 17:15:07 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
2021/5/31 update:
|
2021-06-05 17:15:07 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Now gc can collect garbage correctly without re-collecting,
|
|
|
|
which will cause fatal error.
|
2021-06-05 17:15:07 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Add builtin_alloc to avoid mark-sweep when running a built-in function,
|
|
|
|
which will mark useful items as useless garbage to collect.
|
2021-06-05 17:15:07 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Better use setsize and assignment to get a big array,
|
|
|
|
append is very slow in this situation.
|
2021-06-05 17:15:07 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
2021/6/3 update:
|
2021-06-05 17:15:07 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Fixed a bug that gc still re-collects garbage,
|
|
|
|
this time i use three mark states to make sure garbage is ready to be collected.
|
|
|
|
|
|
|
|
Change callf to callfv and callfh.
|
|
|
|
And callfv fetches arguments from val_stack directly instead of using vm_vec,
|
|
|
|
a not very efficient way.
|
|
|
|
|
|
|
|
Better use callfv instead of callfh,
|
|
|
|
callfh will fetch a vm_hash from stack and parse it,
|
|
|
|
making this process slow.
|
2021-06-05 17:15:07 +08:00
|
|
|
|
|
|
|
```javascript
|
|
|
|
var f=func(x,y){return x+y;}
|
|
|
|
f(1024,2048);
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-06-05 17:15:07 +08:00
|
|
|
.number 1024
|
|
|
|
.number 2048
|
|
|
|
.symbol x
|
|
|
|
.symbol y
|
|
|
|
0x00000000: intg 0x00000001
|
|
|
|
0x00000001: newf 0x00000007
|
|
|
|
0x00000002: intl 0x00000003
|
|
|
|
0x00000003: offset 0x00000001
|
|
|
|
0x00000004: para 0x00000000 (x)
|
|
|
|
0x00000005: para 0x00000001 (y)
|
|
|
|
0x00000006: jmp 0x0000000b
|
|
|
|
0x00000007: calll 0x00000001
|
|
|
|
0x00000008: calll 0x00000002
|
|
|
|
0x00000009: add 0x00000000
|
|
|
|
0x0000000a: ret 0x00000000
|
|
|
|
0x0000000b: loadg 0x00000000
|
|
|
|
0x0000000c: callg 0x00000000
|
|
|
|
0x0000000d: pnum 0x00000000 (1024)
|
|
|
|
0x0000000e: pnum 0x00000001 (2048)
|
|
|
|
0x0000000f: callfv 0x00000002
|
|
|
|
0x00000010: pop 0x00000000
|
|
|
|
0x00000011: nop 0x00000000
|
|
|
|
```
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
2021/6/21 update: Now gc will not collect nullptr.
|
|
|
|
And the function of assignment is complete,
|
|
|
|
now these kinds of assignment is allowed:
|
2021-06-21 16:46:47 +08:00
|
|
|
|
|
|
|
```javascript
|
|
|
|
var f=func()
|
|
|
|
{
|
|
|
|
var _=[{_:0},{_:1}];
|
|
|
|
return func(x)
|
|
|
|
{
|
|
|
|
return _[x];
|
|
|
|
}
|
|
|
|
}
|
|
|
|
var m=f();
|
|
|
|
m(0)._=m(1)._=10;
|
|
|
|
|
|
|
|
[0,1,2][1:2][0]=0;
|
|
|
|
```
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
In the old version,
|
|
|
|
parser will check this left-value and tells that these kinds of left-value are not allowed(bad lvalue).
|
2021-06-21 16:46:47 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
But now it can work.
|
|
|
|
And you could see its use by reading the code above.
|
|
|
|
To make sure this assignment works correctly,
|
|
|
|
codegen will generate byte code by nasal_codegen::call_gen() instead of nasal_codegen::mcall_gen(),
|
|
|
|
and the last child of the ast will be generated by nasal_codegen::mcall_gen().
|
|
|
|
So the bytecode is totally different now:
|
2021-06-21 16:46:47 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-06-21 16:46:47 +08:00
|
|
|
.number 10
|
|
|
|
.number 2
|
|
|
|
.symbol _
|
|
|
|
.symbol x
|
|
|
|
0x00000000: intg 0x00000002
|
|
|
|
0x00000001: newf 0x00000005
|
|
|
|
0x00000002: intl 0x00000002
|
|
|
|
0x00000003: offset 0x00000001
|
|
|
|
0x00000004: jmp 0x00000017
|
|
|
|
0x00000005: newh 0x00000000
|
|
|
|
0x00000006: pzero 0x00000000
|
|
|
|
0x00000007: happ 0x00000000 (_)
|
|
|
|
0x00000008: newh 0x00000000
|
|
|
|
0x00000009: pone 0x00000000
|
|
|
|
0x0000000a: happ 0x00000000 (_)
|
|
|
|
0x0000000b: newv 0x00000002
|
|
|
|
0x0000000c: loadl 0x00000001
|
|
|
|
0x0000000d: newf 0x00000012
|
|
|
|
0x0000000e: intl 0x00000003
|
|
|
|
0x0000000f: offset 0x00000002
|
|
|
|
0x00000010: para 0x00000001 (x)
|
|
|
|
0x00000011: jmp 0x00000016
|
|
|
|
0x00000012: calll 0x00000001
|
|
|
|
0x00000013: calll 0x00000002
|
|
|
|
0x00000014: callv 0x00000000
|
|
|
|
0x00000015: ret 0x00000000
|
|
|
|
0x00000016: ret 0x00000000
|
|
|
|
0x00000017: loadg 0x00000000
|
|
|
|
0x00000018: callg 0x00000000
|
|
|
|
0x00000019: callfv 0x00000000
|
|
|
|
0x0000001a: loadg 0x00000001
|
|
|
|
0x0000001b: pnum 0x00000000 (10.000000)
|
|
|
|
0x0000001c: callg 0x00000001
|
|
|
|
0x0000001d: pone 0x00000000
|
|
|
|
0x0000001e: callfv 0x00000001
|
|
|
|
0x0000001f: mcallh 0x00000000 (_)
|
|
|
|
0x00000020: meq 0x00000000
|
|
|
|
0x00000021: callg 0x00000001
|
|
|
|
0x00000022: pzero 0x00000000
|
|
|
|
0x00000023: callfv 0x00000001
|
|
|
|
0x00000024: mcallh 0x00000000 (_)
|
|
|
|
0x00000025: meq 0x00000000
|
|
|
|
0x00000026: pop 0x00000000
|
|
|
|
0x00000027: pzero 0x00000000
|
|
|
|
0x00000028: pzero 0x00000000
|
|
|
|
0x00000029: pone 0x00000000
|
|
|
|
0x0000002a: pnum 0x00000001 (2.000000)
|
|
|
|
0x0000002b: newv 0x00000003
|
|
|
|
0x0000002c: slcbeg 0x00000000
|
|
|
|
0x0000002d: pone 0x00000000
|
|
|
|
0x0000002e: pnum 0x00000001 (2.000000)
|
|
|
|
0x0000002f: slc2 0x00000000
|
|
|
|
0x00000030: slcend 0x00000000
|
|
|
|
0x00000031: pzero 0x00000000
|
|
|
|
0x00000032: mcallv 0x00000000
|
|
|
|
0x00000033: meq 0x00000000
|
|
|
|
0x00000034: pop 0x00000000
|
|
|
|
0x00000035: nop 0x00000000
|
|
|
|
```
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
As you could see from the bytecode above,
|
|
|
|
mcall/mcallv/mcallh operands' using frequency will reduce,
|
|
|
|
call/callv/callh/callfv/callfh at the opposite.
|
2021-06-21 16:46:47 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
And because of the new structure of mcall,
|
|
|
|
addr_stack, a stack used to store the memory address,
|
|
|
|
is deleted from nasal_vm,
|
|
|
|
and now nasal_vm use nasal_val** mem_addr to store the memory address.
|
|
|
|
This will not cause fatal errors because the memory address is used __immediately__ after getting it.
|
2021-06-21 16:46:47 +08:00
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
### version 7.0 (last update 2021/10/8)
|
2021-06-26 14:53:10 +08:00
|
|
|
|
|
|
|
2021/6/26 update:
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Instruction dispatch is changed from call-threading to computed-goto(with inline function).
|
|
|
|
After changing the way of instruction dispatch,
|
|
|
|
there is a great improvement in nasal_vm.
|
|
|
|
Now vm can run test/bigloop and test/pi in 0.2s!
|
|
|
|
And vm runs test/fib in 0.8s on linux.
|
|
|
|
You could see the time use data below,
|
|
|
|
in Test data section.
|
2021-06-26 14:53:10 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
This version uses g++ extension "labels as values",
|
|
|
|
which is also supported by clang++.
|
|
|
|
(But i don't know if MSVC supports this)
|
2021-06-26 14:53:10 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
There is also a change in nasal_gc:
|
|
|
|
std::vector global is deleted,
|
|
|
|
now the global values are all stored on stack(from val_stack+0 to val_stack+intg-1).
|
2021-06-26 14:53:10 +08:00
|
|
|
|
2021-06-29 17:18:05 +08:00
|
|
|
2021/6/29 update:
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Add some instructions that execute const values:
|
|
|
|
op_addc,op_subc,op_mulc,op_divc,op_lnkc,op_addeqc,op_subeqc,op_muleqc,op_diveqc,op_lnkeqc.
|
2021-06-29 17:18:05 +08:00
|
|
|
|
|
|
|
Now the bytecode of test/bigloop.nas seems like this:
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-06-29 17:18:05 +08:00
|
|
|
.number 4e+006
|
|
|
|
.number 1
|
|
|
|
0x00000000: intg 0x00000001
|
|
|
|
0x00000001: pzero 0x00000000
|
|
|
|
0x00000002: loadg 0x00000000
|
|
|
|
0x00000003: callg 0x00000000
|
|
|
|
0x00000004: pnum 0x00000000 (4000000)
|
|
|
|
0x00000005: less 0x00000000
|
|
|
|
0x00000006: jf 0x0000000b
|
|
|
|
0x00000007: mcallg 0x00000000
|
|
|
|
0x00000008: addeqc 0x00000001 (1)
|
|
|
|
0x00000009: pop 0x00000000
|
|
|
|
0x0000000a: jmp 0x00000003
|
|
|
|
0x0000000b: nop 0x00000000
|
|
|
|
```
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
And this test file runs in 0.1s after this update.
|
|
|
|
Most of the calculations are accelerated.
|
2021-06-29 17:18:05 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Also, assignment bytecode has changed a lot.
|
|
|
|
Now the first identifier that called in assignment will use op_load to assign,
|
|
|
|
instead of op_meq,op_pop.
|
2021-06-29 17:18:05 +08:00
|
|
|
|
|
|
|
```javascript
|
|
|
|
var (a,b)=(1,2);
|
|
|
|
a=b=0;
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-06-29 17:18:05 +08:00
|
|
|
.number 2
|
|
|
|
0x00000000: intg 0x00000002
|
|
|
|
0x00000001: pone 0x00000000
|
|
|
|
0x00000002: loadg 0x00000000
|
|
|
|
0x00000003: pnum 0x00000000 (2)
|
|
|
|
0x00000004: loadg 0x00000001
|
|
|
|
0x00000005: pzero 0x00000000
|
|
|
|
0x00000006: mcallg 0x00000001
|
|
|
|
0x00000007: meq 0x00000000 (b=2 use meq,pop->a)
|
|
|
|
0x00000008: loadg 0x00000000 (a=b use loadg)
|
|
|
|
0x00000009: nop 0x00000000
|
|
|
|
```
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
### version 8.0 (latest)
|
|
|
|
|
|
|
|
2021/10/8 update:
|
|
|
|
|
|
|
|
In this version vm_nil and vm_num now is not managed by nasal_gc,
|
|
|
|
this will decrease the usage of gc_alloc and increase the efficiency of execution.
|
|
|
|
|
|
|
|
New value type is added: vm_obj.
|
|
|
|
This type is reserved for user to define their own value types.
|
|
|
|
Related API will be added in the future.
|
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
Fully functional closure:
|
|
|
|
Add new operands that get and set upvalues.
|
|
|
|
Delete an old operand 'op_offset'.
|
|
|
|
|
|
|
|
2021/10/13 update:
|
|
|
|
|
|
|
|
The format of output information of bytecodes changes to this:
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
```x86asm
|
2021-10-13 22:59:15 +08:00
|
|
|
0x0000017c: jmp 0x181
|
|
|
|
0x0000017d: calll 0x1
|
|
|
|
0x0000017e: calll 0x1
|
|
|
|
0x0000017f: callfv 0x1
|
|
|
|
0x00000180: ret
|
|
|
|
0x00000181: newf 0x185
|
|
|
|
0x00000182: intl 0x2
|
|
|
|
0x00000183: para 0x29 ("f")
|
|
|
|
0x00000184: jmp 0x19d
|
|
|
|
0x00000185: newf 0x189
|
|
|
|
0x00000186: intl 0x2
|
|
|
|
0x00000187: para 0x1d ("x")
|
|
|
|
0x00000188: jmp 0x19c
|
|
|
|
0x00000189: calll 0x1
|
|
|
|
0x0000018a: lessc 0x12 (2.000000)
|
|
|
|
0x0000018b: jf 0x18e
|
|
|
|
0x0000018c: calll 0x1
|
|
|
|
0x0000018d: ret
|
|
|
|
0x0000018e: upval 0x0[0x1]
|
|
|
|
0x0000018f: upval 0x0[0x1]
|
|
|
|
0x00000190: callfv 0x1
|
|
|
|
0x00000191: calll 0x1
|
|
|
|
0x00000192: subc 0x13 (1.000000)
|
|
|
|
0x00000193: callfv 0x1
|
|
|
|
0x00000194: upval 0x0[0x1]
|
|
|
|
0x00000195: upval 0x0[0x1]
|
|
|
|
0x00000196: callfv 0x1
|
|
|
|
0x00000197: calll 0x1
|
|
|
|
0x00000198: subc 0x12 (2.000000)
|
|
|
|
0x00000199: callfv 0x1
|
|
|
|
0x0000019a: add
|
|
|
|
0x0000019b: ret
|
|
|
|
```
|
|
|
|
|
|
|
|
## Benchmark
|
2021-06-19 00:32:10 +08:00
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
### version 6.5 (i5-8250U windows10 2021/6/19)
|
2021-06-19 00:32:10 +08:00
|
|
|
|
|
|
|
running time and gc time:
|
|
|
|
|
2021-06-20 01:27:01 +08:00
|
|
|
|file|call gc|total time|gc time|
|
2021-06-19 00:32:10 +08:00
|
|
|
|:----|:----|:----|:----|
|
|
|
|
|pi.nas|12000049|0.593s|0.222s|
|
|
|
|
|fib.nas|10573747|2.838s|0.187s|
|
|
|
|
|bp.nas|4419829|1.99s|0.18s|
|
|
|
|
|bigloop.nas|4000000|0.419s|0.039s|
|
|
|
|
|mandelbrot.nas|1044630|0.433s|0.041s|
|
|
|
|
|life.nas|817112|8.557s|0.199s|
|
|
|
|
|ascii-art.nas|45612|0.48s|0.027s|
|
|
|
|
|calc.nas|8089|0.068s|0.006s|
|
|
|
|
|quick_sort.nas|2768|0.107s|0s|
|
|
|
|
|bfs.nas|2471|1.763s|0.003s|
|
|
|
|
|
|
|
|
operands calling frequency:
|
|
|
|
|
2021-06-20 01:27:01 +08:00
|
|
|
|file|1st|2nd|3rd|4th|5th|
|
2021-06-19 00:32:10 +08:00
|
|
|
|:----|:----|:----|:----|:----|:----|
|
|
|
|
|pi.nas|callg|pop|mcallg|pnum|pone|
|
|
|
|
|fib.nas|calll|pnum|callg|less|jf|
|
|
|
|
|bp.nas|calll|callg|pop|callv|addeq|
|
|
|
|
|bigloop.nas|pnum|less|jf|callg|pone|
|
|
|
|
|mandelbrot.nas|callg|mult|loadg|pnum|pop|
|
|
|
|
|life.nas|calll|callv|pnum|jf|callg|
|
|
|
|
|ascii-art.nas|calll|pop|mcalll|callg|callb|
|
|
|
|
|calc.nas|calll|pop|pstr|mcalll|jmp|
|
|
|
|
|quick_sort.nas|calll|pop|jt|jf|less|
|
|
|
|
|bfs.nas|calll|pop|callv|mcalll|jf|
|
|
|
|
|
|
|
|
operands calling total times:
|
|
|
|
|
2021-06-20 01:27:01 +08:00
|
|
|
|file|1st|2nd|3rd|4th|5th|
|
2021-06-19 00:32:10 +08:00
|
|
|
|:----|:----|:----|:----|:----|:----|
|
|
|
|
|pi.nas|6000004|6000003|6000000|4000005|4000002|
|
|
|
|
|fib.nas|17622792|10573704|7049218|7049155|7049155|
|
|
|
|
|bp.nas|7081480|4227268|2764676|2617112|2065441|
|
|
|
|
|bigloop.nas|4000001|4000001|4000001|4000001|4000000|
|
|
|
|
|mandelbrot.nas|1519632|563856|290641|286795|284844|
|
|
|
|
|life.nas|2114371|974244|536413|534794|489743|
|
|
|
|
|ascii-art.nas|37906|22736|22402|18315|18292|
|
|
|
|
|calc.nas|191|124|109|99|87|
|
|
|
|
|quick_sort.nas|16226|5561|4144|3524|2833|
|
|
|
|
|bfs.nas|24707|16297|14606|14269|8672|
|
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
### version 7.0 (i5-8250U ubuntu-WSL on windows10 2021/6/29)
|
2021-06-26 14:53:10 +08:00
|
|
|
|
|
|
|
running time:
|
|
|
|
|
|
|
|
|file|total time|info|
|
|
|
|
|:----|:----|:----|
|
2021-06-29 17:18:05 +08:00
|
|
|
|pi.nas|0.15625s|great improvement|
|
2021-06-26 14:53:10 +08:00
|
|
|
|fib.nas|0.75s|great improvement|
|
2021-06-29 17:18:05 +08:00
|
|
|
|bp.nas|0.4218s(7162 epoch)|good improvement|
|
|
|
|
|bigloop.nas|0.09375s|great improvement|
|
|
|
|
|mandelbrot.nas|0.0312s|great improvement|
|
|
|
|
|life.nas|8.80s(windows) 1.25(ubuntu WSL)|little improvement|
|
2021-06-26 14:53:10 +08:00
|
|
|
|ascii-art.nas|0.015s|little improvement|
|
2021-06-29 17:18:05 +08:00
|
|
|
|calc.nas|0.0468s|little improvement|
|
2021-06-26 14:53:10 +08:00
|
|
|
|quick_sort.nas|0s|great improvement|
|
|
|
|
|bfs.nas|0.0156s|great improvement|
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
## Simple Tutorial
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
Nasal is really easy to learn.
|
|
|
|
Reading this tutorial will not takes you over 15 minutes.
|
|
|
|
You could totally use it after reading this simple tutorial:
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Basic Value Type__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
`vm_none` is error type.
|
|
|
|
This type is used to interrupt the execution of virtual machine and will not be created by user program.
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
`vm_nil` is a null type. It means nothing.
|
2021-01-23 19:21:37 +08:00
|
|
|
```javascript
|
|
|
|
var spc=nil;
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
`vm_num` has 3 formats. Dec, hex and oct. Using IEEE754 double to store.
|
|
|
|
```javascript
|
|
|
|
var n=1;
|
|
|
|
var n=2.71828;
|
|
|
|
var n=2.147e16;
|
|
|
|
var n=1e-10;
|
|
|
|
var n=0x7fffffff;
|
|
|
|
var n=0xAA55;
|
|
|
|
var n=0o170001;
|
|
|
|
```
|
|
|
|
`vm_str` has 3 formats. But the third one is often used to declare a character.
|
|
|
|
```javascript
|
|
|
|
var s='str';
|
|
|
|
var s="another string";
|
|
|
|
var s=`c`;
|
|
|
|
```
|
|
|
|
`vm_vec` has unlimited length and can store all types of values.
|
|
|
|
```javascript
|
|
|
|
var vec=[];
|
|
|
|
var vec=[
|
2021-07-19 17:04:45 +08:00
|
|
|
0,
|
|
|
|
nil,
|
|
|
|
{},
|
|
|
|
[],
|
|
|
|
func(){return 0;}
|
|
|
|
];
|
2021-10-14 13:42:07 +08:00
|
|
|
append(vec,0,1,2);
|
|
|
|
```
|
|
|
|
`vm_hash` is a hashmap that stores values with strings/identifiers as the key.
|
|
|
|
```javascript
|
|
|
|
var hash={
|
2021-01-23 19:21:37 +08:00
|
|
|
member1:nil,
|
|
|
|
member2:'str',
|
|
|
|
'member3':'member\'s name can also be a string constant',
|
2021-02-23 23:33:01 +08:00
|
|
|
"member4":"also this",
|
2021-10-13 22:59:15 +08:00
|
|
|
function:func(){
|
2021-02-23 23:33:01 +08:00
|
|
|
var a=me.member2~me.member3;
|
|
|
|
return a;
|
|
|
|
}
|
2021-01-23 19:21:37 +08:00
|
|
|
};
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
`vm_func` is a function type (in fact it is lambda).
|
|
|
|
```javascript
|
|
|
|
var f=func(x,y,z){
|
|
|
|
return nil;
|
|
|
|
}
|
|
|
|
var f=func{
|
|
|
|
return 1024;
|
2021-01-23 19:21:37 +08:00
|
|
|
}
|
2021-10-14 13:42:07 +08:00
|
|
|
var f=func(x,y,z,default1=1,default2=2){
|
|
|
|
return x+y+z+default1+default2;
|
|
|
|
}
|
|
|
|
var f=func(args...){
|
2021-01-23 19:21:37 +08:00
|
|
|
var sum=0;
|
2021-10-14 13:42:07 +08:00
|
|
|
foreach(var i;args)
|
2021-01-23 19:21:37 +08:00
|
|
|
sum+=i;
|
2021-10-14 13:42:07 +08:00
|
|
|
return sum;
|
2021-01-23 19:21:37 +08:00
|
|
|
}
|
|
|
|
```
|
2021-10-14 13:42:07 +08:00
|
|
|
`vm_obj` is a special type that stores user data.
|
|
|
|
This means you could use other complex C/C++ data types in nasal.
|
|
|
|
This type is used when you are trying to add a new data structure into nasal,
|
|
|
|
so this type is often created by native-function that programmed in C/C++ by library developers.
|
|
|
|
You could see how to write your own native-functions below.
|
|
|
|
```javascript
|
|
|
|
var my_new_obj=func(){
|
|
|
|
return __builtin_my_obj();
|
|
|
|
}
|
|
|
|
var obj=my_new_obj();
|
|
|
|
```
|
|
|
|
|
|
|
|
### __Operators__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
Nasal has basic math operators `+` `-` `*` `/` and a special operator `~` that links two strings together.
|
2021-01-23 19:21:37 +08:00
|
|
|
|
|
|
|
```javascript
|
2021-06-11 15:28:25 +08:00
|
|
|
1+2-1*2/1;
|
2021-01-23 19:21:37 +08:00
|
|
|
'str1'~'str2';
|
2021-02-23 23:33:01 +08:00
|
|
|
(1+2)*(3+4)
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
For conditional expressions, operators `==` `!=` `<` `>` `<=` `>=` are used to compare two values.
|
|
|
|
`and` `or` have the same function as C/C++ `&&` `||`, link comparations together.
|
|
|
|
```javascript
|
2021-01-23 19:21:37 +08:00
|
|
|
1+1 and 0;
|
2021-06-11 15:28:25 +08:00
|
|
|
1<0 or 1>0;
|
|
|
|
1<=0 and 1>=0;
|
|
|
|
1==0 or 1!=0;
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
Unary operators `-` `!` have the same function as C/C++.
|
|
|
|
```javascript
|
2021-01-23 19:21:37 +08:00
|
|
|
-1;
|
|
|
|
!0;
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
Operators `=` `+=` `-=` `*=` `/=` `~=` are used in assignment expressions.
|
|
|
|
```javascript
|
2021-01-23 19:21:37 +08:00
|
|
|
a=b=c=d=1;
|
|
|
|
a+=1;
|
|
|
|
a-=1;
|
|
|
|
a*=1;
|
|
|
|
a/=1;
|
|
|
|
a~='string';
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Definition__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
|
|
|
```javascript
|
|
|
|
var a=1;
|
|
|
|
var (a,b,c)=[0,1,2];
|
|
|
|
var (a,b,c)=(0,1,2);
|
|
|
|
(var a,b,c)=[0,1,2];
|
|
|
|
(var a,b,c)=(0,1,2);
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Multi-Assignment__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
|
|
|
```javascript
|
|
|
|
(a,b[0],c.d)=[0,1,2];
|
|
|
|
(a,b[1],c.e)=(0,1,2);
|
2021-08-01 22:34:02 +08:00
|
|
|
(a,b)=(b,a);
|
2021-01-23 19:21:37 +08:00
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Conditional Expression__
|
|
|
|
|
|
|
|
In nasal there's a new key word `elsif`.
|
|
|
|
It has the same functions as `else if`.
|
2021-01-23 19:21:37 +08:00
|
|
|
|
|
|
|
```javascript
|
2021-08-01 22:34:02 +08:00
|
|
|
if(1){
|
2021-01-23 19:21:37 +08:00
|
|
|
;
|
2021-08-01 22:34:02 +08:00
|
|
|
}elsif(2){
|
2021-01-23 19:21:37 +08:00
|
|
|
;
|
2021-08-01 22:34:02 +08:00
|
|
|
}else if(3){
|
2021-01-23 19:21:37 +08:00
|
|
|
;
|
2021-08-01 22:34:02 +08:00
|
|
|
}else{
|
2021-01-23 19:21:37 +08:00
|
|
|
;
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Loop__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
While loop and for loop is simalar to C/C++.
|
2021-01-23 19:21:37 +08:00
|
|
|
```javascript
|
|
|
|
while(condition)
|
|
|
|
continue;
|
|
|
|
|
|
|
|
for(var i=0;i<10;i+=1)
|
|
|
|
break;
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
Nasal has another two kinds of loops that iterates through a vector:
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
`forindex` will get the index of a vector.
|
|
|
|
```javascript
|
2021-01-23 19:21:37 +08:00
|
|
|
forindex(var i;elem)
|
|
|
|
print(elem[i]);
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
`foreach` will get the element of a vector.
|
|
|
|
```javascript
|
2021-01-23 19:21:37 +08:00
|
|
|
foreach(var i;elem)
|
|
|
|
print(i);
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Subvec__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Use index to search one element in the string will get the ascii number of this character.
|
|
|
|
If you want to get the character,use built-in function chr().
|
2021-02-23 23:33:01 +08:00
|
|
|
|
2021-01-23 19:21:37 +08:00
|
|
|
```javascript
|
|
|
|
a[-1,1,0:2,0:,:3,:,nil:8,3:nil,nil:nil];
|
2021-02-23 23:33:01 +08:00
|
|
|
"hello world"[0];
|
2021-01-23 19:21:37 +08:00
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Special Function Call__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
This is of great use but is not very efficient
|
|
|
|
(because hashmap use string as the key to compare).
|
2021-02-13 13:28:20 +08:00
|
|
|
|
2021-01-23 19:21:37 +08:00
|
|
|
```javascript
|
2021-10-14 13:42:07 +08:00
|
|
|
f(x:0,y:nil,z:[]);
|
2021-01-23 19:21:37 +08:00
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Lambda__
|
2021-06-05 17:15:07 +08:00
|
|
|
|
|
|
|
Also functions have this kind of use:
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
func(x,y){return x+y}(0,1);
|
|
|
|
func(x){return 1/(1+math.exp(-x));}(0.5);
|
|
|
|
```
|
|
|
|
|
2021-10-13 22:59:15 +08:00
|
|
|
There's an interesting test file 'y-combinator.nas',
|
|
|
|
try it for fun:
|
|
|
|
```javascript
|
|
|
|
var fib=func(f){
|
|
|
|
return f(f);
|
|
|
|
}(
|
|
|
|
func(f){
|
|
|
|
return func(x){
|
|
|
|
if(x<2) return x;
|
|
|
|
return f(f)(x-1)+f(f)(x-2);
|
|
|
|
}
|
|
|
|
}
|
|
|
|
);
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Closure__
|
|
|
|
Closure means you could get the variable that is not in the local scope of a function that you called.
|
|
|
|
Here is an example, result is `1`:
|
2021-02-23 23:33:01 +08:00
|
|
|
```javascript
|
2021-10-13 22:59:15 +08:00
|
|
|
var f=func(){
|
2021-02-23 23:33:01 +08:00
|
|
|
var a=1;
|
|
|
|
return func(){return a;};
|
|
|
|
}
|
|
|
|
print(f()());
|
2021-10-14 13:42:07 +08:00
|
|
|
```
|
|
|
|
Using closure makes it easier to OOP.
|
|
|
|
```javascript
|
|
|
|
var student=func(n,a){
|
|
|
|
var (name,age)=(n,a);
|
2021-04-04 23:35:13 +08:00
|
|
|
return {
|
2021-10-14 13:42:07 +08:00
|
|
|
print_info:func() {println(name,' ',age);},
|
|
|
|
set_age: func(a){age=a;},
|
|
|
|
get_age: func() {return age;},
|
|
|
|
set_name: func(n){name=n;},
|
|
|
|
get_name: func() {return name;}
|
2021-04-04 23:35:13 +08:00
|
|
|
};
|
|
|
|
}
|
2021-02-23 23:33:01 +08:00
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Trait__
|
|
|
|
|
|
|
|
Also there's another way to OOP,that is `trait`.
|
|
|
|
|
|
|
|
When a hash has a member named `parents` and the value type is vector,
|
|
|
|
then when you are trying to find a member that is not in this hash,
|
|
|
|
virtual machine will search the member is parents.
|
|
|
|
If there is a hash that has the member, you will get the member's value.
|
|
|
|
|
|
|
|
Using this mechanism, we could OOP like this, the result is `114514`:
|
|
|
|
```javascript
|
|
|
|
var trait={
|
|
|
|
get:func{return me.val;},
|
|
|
|
set:func(x){me.val=x;}
|
|
|
|
};
|
|
|
|
|
|
|
|
var class={
|
|
|
|
new:func(){
|
|
|
|
return {
|
|
|
|
val:nil,
|
|
|
|
parents:[trait]
|
|
|
|
};
|
|
|
|
}
|
|
|
|
};
|
|
|
|
```
|
|
|
|
First virtual machine cannot find member `set` in hash `a`, but in `a.parents` there's a hash `trait` has the member `set`, so we get the `set`.
|
|
|
|
variable `me` points to hash `a`, so we change the `a.val`.
|
|
|
|
And `get` has the same process.
|
|
|
|
```javascript
|
|
|
|
var a=class.new();
|
|
|
|
a.set(114514);
|
|
|
|
println(a.get());
|
|
|
|
```
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
### __Native Functions(This is for library developers)__
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
You could add builtin functions of your own
|
|
|
|
(written in C/C++) to help you calculate things more quickly.
|
|
|
|
(Advanced usage)
|
2021-01-23 19:21:37 +08:00
|
|
|
|
2021-02-23 23:33:01 +08:00
|
|
|
Check built-in functions in lib.nas!
|
2021-10-14 13:42:07 +08:00
|
|
|
You could use this file as the example to learn.
|
2021-02-23 23:33:01 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
If you want to add your own built-in functions,
|
|
|
|
define the function in nasal_builtin.h. (or any other place, but remember to compile it)
|
2021-02-23 23:33:01 +08:00
|
|
|
|
2021-03-01 15:54:58 +08:00
|
|
|
Definition:
|
2021-02-23 23:33:01 +08:00
|
|
|
|
|
|
|
```C++
|
2021-10-08 23:18:26 +08:00
|
|
|
nasal_ref builtin_chr(std::vector<nasal_ref>&,nasal_gc&);
|
2021-02-23 23:33:01 +08:00
|
|
|
```
|
|
|
|
Then complete this function using C++:
|
|
|
|
```C++
|
2021-10-12 18:26:10 +08:00
|
|
|
nasal_ref builtin_print(std::vector<nasal_ref>& local,nasal_gc& gc)
|
2021-02-23 23:33:01 +08:00
|
|
|
{
|
2021-04-19 19:12:41 +08:00
|
|
|
// find value with index begin from 1
|
|
|
|
// because local_scope[0] is reserved for value 'me'
|
2021-10-12 18:26:10 +08:00
|
|
|
nasal_ref vec=local[1];
|
2021-02-23 23:33:01 +08:00
|
|
|
// main process
|
2021-05-04 17:39:24 +08:00
|
|
|
// also check number of arguments and type here
|
2021-10-14 13:42:07 +08:00
|
|
|
// if get an error,use builtin_err
|
2021-10-12 18:26:10 +08:00
|
|
|
for(auto i:vec.vec()->elems)
|
2021-10-08 23:18:26 +08:00
|
|
|
switch(i.type)
|
2021-02-23 23:33:01 +08:00
|
|
|
{
|
2021-10-12 18:26:10 +08:00
|
|
|
case vm_none: std::cout<<"undefined"; break;
|
2021-05-04 17:39:24 +08:00
|
|
|
case vm_nil: std::cout<<"nil"; break;
|
2021-10-08 23:18:26 +08:00
|
|
|
case vm_num: std::cout<<i.num(); break;
|
|
|
|
case vm_str: std::cout<<*i.str(); break;
|
|
|
|
case vm_vec: i.vec()->print(); break;
|
|
|
|
case vm_hash: i.hash()->print(); break;
|
2021-05-04 17:39:24 +08:00
|
|
|
case vm_func: std::cout<<"func(...){...}"; break;
|
2021-10-13 22:59:15 +08:00
|
|
|
case vm_obj: std::cout<<"<object>"; break;
|
2021-02-23 23:33:01 +08:00
|
|
|
}
|
2021-07-19 17:04:45 +08:00
|
|
|
std::cout<<std::flush;
|
2021-02-23 23:33:01 +08:00
|
|
|
// generate return value,use gc::gc_alloc(type) to make a new value
|
2021-10-08 23:18:26 +08:00
|
|
|
// or use reserved reference gc.nil/gc.one/gc.zero
|
|
|
|
return gc.nil;
|
2021-02-23 23:33:01 +08:00
|
|
|
}
|
|
|
|
```
|
2021-10-14 13:42:07 +08:00
|
|
|
After that, register the built-in function's name(in nasal) and the function's pointer in this table:
|
2021-02-23 23:33:01 +08:00
|
|
|
```C++
|
2021-10-14 23:22:28 +08:00
|
|
|
struct func
|
2021-02-23 23:33:01 +08:00
|
|
|
{
|
2021-06-11 15:16:06 +08:00
|
|
|
const char* name;
|
2021-10-08 23:18:26 +08:00
|
|
|
nasal_ref (*func)(std::vector<nasal_ref>&,nasal_gc&);
|
2021-04-04 23:35:13 +08:00
|
|
|
} builtin_func[]=
|
2021-02-23 23:33:01 +08:00
|
|
|
{
|
2021-06-19 00:32:10 +08:00
|
|
|
{"__builtin_print",builtin_print},
|
|
|
|
{nullptr, nullptr }
|
|
|
|
};
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
At last,warp the `__builtin_print` in a nasal file:
|
2021-06-19 00:32:10 +08:00
|
|
|
|
|
|
|
```javascript
|
2021-10-13 22:59:15 +08:00
|
|
|
var print=func(elems...){
|
2021-06-19 00:32:10 +08:00
|
|
|
return __builtin_print(elems);
|
2021-02-23 23:33:01 +08:00
|
|
|
};
|
|
|
|
```
|
2021-10-14 13:42:07 +08:00
|
|
|
In fact the arguments that `__builtin_print` uses is not necessary.
|
|
|
|
So writting it like this is also right:
|
2021-02-23 23:33:01 +08:00
|
|
|
|
|
|
|
```javascript
|
2021-10-13 22:59:15 +08:00
|
|
|
var print=func(elems...){
|
2021-06-19 00:32:10 +08:00
|
|
|
return __builtin_print;
|
2021-02-23 23:33:01 +08:00
|
|
|
};
|
|
|
|
```
|
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
If you don't warp built-in function in a normal nasal function,
|
2021-10-08 23:18:26 +08:00
|
|
|
this built-in function may cause a fault when searching arguments,
|
2021-10-14 13:42:07 +08:00
|
|
|
which will segmentation error.
|
2021-02-23 23:33:01 +08:00
|
|
|
|
2021-10-14 13:42:07 +08:00
|
|
|
Use `import(".nas")` to get the nasal file including your built-in functions,
|
2021-10-08 23:18:26 +08:00
|
|
|
then you could use it.
|
2021-05-31 19:10:59 +08:00
|
|
|
|
|
|
|
version 6.5 update:
|
|
|
|
|
2021-06-26 14:53:10 +08:00
|
|
|
Use nasal_gc::builtin_alloc in builtin function if this function uses alloc more than one time.
|
2021-05-31 19:10:59 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
When running a builtin function,alloc will run more than one time,
|
|
|
|
this may cause mark-sweep in gc_alloc.
|
2021-05-31 19:10:59 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
The value got before will be collected,but stil in use in this builtin function,
|
|
|
|
this is a fatal error.
|
2021-05-31 19:10:59 +08:00
|
|
|
|
|
|
|
So use builtin_alloc in builtin functions like this:
|
|
|
|
|
|
|
|
```C++
|
2021-10-12 18:26:10 +08:00
|
|
|
nasal_ref builtin_keys(std::vector<nasal_ref>& local,nasal_gc& gc)
|
2021-05-31 19:10:59 +08:00
|
|
|
{
|
2021-10-14 13:42:07 +08:00
|
|
|
nasal_ref hash=local[1];
|
|
|
|
if(hash.type!=vm_hash)
|
2021-05-31 19:10:59 +08:00
|
|
|
{
|
|
|
|
builtin_err("keys","\"hash\" must be hash");
|
2021-10-08 23:18:26 +08:00
|
|
|
return nasal_ref(vm_none);
|
2021-05-31 19:10:59 +08:00
|
|
|
}
|
2021-10-08 23:18:26 +08:00
|
|
|
|
|
|
|
// push vector into local scope to avoid being sweeped
|
2021-10-12 18:26:10 +08:00
|
|
|
local.push_back(gc.gc_alloc(vm_vec));
|
|
|
|
std::vector<nasal_ref>& vec=local.back().vec()->elems;
|
2021-10-14 13:42:07 +08:00
|
|
|
for(auto& iter:hash.hash()->elems)
|
2021-05-31 19:10:59 +08:00
|
|
|
{
|
2021-10-14 13:42:07 +08:00
|
|
|
nasal_ref str=gc.builtin_alloc(vm_str);
|
|
|
|
*str.str()=iter.first;
|
|
|
|
vec.push_back(str);
|
2021-05-31 19:10:59 +08:00
|
|
|
}
|
2021-10-12 18:26:10 +08:00
|
|
|
return local.back();
|
2021-05-31 19:10:59 +08:00
|
|
|
}
|
2021-06-05 17:15:07 +08:00
|
|
|
```
|
2021-08-01 01:54:14 +08:00
|
|
|
|
|
|
|
## Difference Between Andy's Nasal Interpreter and This Interpreter
|
|
|
|
|
|
|
|
This interpreter uses more strict syntax to make sure it is easier for you to program and debug.
|
|
|
|
|
|
|
|
In Andy's interpreter:
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
foreach(i;[0,1,2,3])
|
|
|
|
print(i)
|
|
|
|
```
|
|
|
|
|
|
|
|
This program can run normally with output 0 1 2 3.
|
2021-10-08 23:18:26 +08:00
|
|
|
But take a look at the iterator 'i',
|
|
|
|
this symbol is defined in foreach without using keyword 'var'.
|
2021-08-01 01:54:14 +08:00
|
|
|
I think this design will make programmers filling confused.
|
|
|
|
This is ambiguous that programmers maybe difficult to find the 'i' is defined here.
|
|
|
|
Without 'var',programmers may think this 'i' is defined anywhere else.
|
|
|
|
|
|
|
|
So in this new interpreter i use a more strict syntax to force users to use 'var' to define iterator of forindex and foreach.
|
2021-10-08 23:18:26 +08:00
|
|
|
If you forget to add the keyword 'var',
|
|
|
|
and you haven't defined this symbol before,
|
|
|
|
you will get this:
|
2021-08-01 01:54:14 +08:00
|
|
|
|
|
|
|
```javascript
|
2021-08-10 17:55:49 +08:00
|
|
|
[code] <test.nas> line 1: undefined symbol "i".
|
|
|
|
[codegen] in <test.nas>: error(s) occurred,stop.
|
2021-08-01 01:54:14 +08:00
|
|
|
```
|
|
|
|
|
|
|
|
Also there's another difference.
|
|
|
|
In Andy's interpreter:
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
var a=func {print(b);}
|
|
|
|
var b=1;
|
|
|
|
a();
|
|
|
|
```
|
|
|
|
|
|
|
|
This program runs normally with output 1.
|
|
|
|
But in this new interpreter, it will get:
|
|
|
|
|
|
|
|
```javascript
|
2021-08-10 17:55:49 +08:00
|
|
|
[code] <test.nas> line 1: undefined symbol "b".
|
|
|
|
[codegen] in <test.nas>: error(s) occurred,stop.
|
2021-08-01 01:54:14 +08:00
|
|
|
```
|
|
|
|
|
2021-08-11 14:54:17 +08:00
|
|
|
(outdated)This difference is caused by different kinds of ways of lexical analysis.
|
2021-10-08 23:18:26 +08:00
|
|
|
In most script language interpreters,
|
|
|
|
they use dynamic analysis to check if this symbol is defined yet.
|
2021-08-01 01:54:14 +08:00
|
|
|
However, this kind of analysis is at the cost of lower efficiency.
|
2021-10-08 23:18:26 +08:00
|
|
|
To make sure the interpreter runs at higher efficiency,
|
|
|
|
i choose static analysis to manage the memory space of each symbol.
|
2021-08-01 01:54:14 +08:00
|
|
|
By this way, runtime will never need to check if a symbol exists or not.
|
|
|
|
But this causes a difference.
|
2021-10-08 23:18:26 +08:00
|
|
|
You will get an error of 'undefined symbol',
|
|
|
|
instead of nothing happening in most script language interpreters.
|
2021-08-01 01:54:14 +08:00
|
|
|
|
|
|
|
This change is __controversial__ among FGPRC's members.
|
|
|
|
So maybe in the future i will use dynamic analysis again to cater to the habits of senior programmers.
|
2021-08-01 22:34:02 +08:00
|
|
|
|
2021-08-03 18:55:11 +08:00
|
|
|
(2021/8/3 update) __Now i use scanning ast twice to reload symbols.
|
|
|
|
So this difference does not exist from this update.__
|
2021-10-08 23:18:26 +08:00
|
|
|
But a new difference is that if you call a variable before defining it,
|
|
|
|
you'll get nil instead of 'undefined error'.
|
2021-08-03 18:55:11 +08:00
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
In this new interpreter,
|
|
|
|
function doesn't put dynamic arguments into vector 'arg' automatically.
|
|
|
|
So if you use 'arg' without definition,
|
|
|
|
you'll get an error of 'undefined symbol'.
|
2021-08-10 17:55:49 +08:00
|
|
|
|
|
|
|
## Trace Back Info
|
|
|
|
|
2021-10-08 23:18:26 +08:00
|
|
|
Now when the interpreter crashes,
|
|
|
|
it will print trace back information:
|
2021-08-10 17:55:49 +08:00
|
|
|
|
|
|
|
```javascript
|
|
|
|
func()
|
|
|
|
{
|
|
|
|
println("hello");
|
|
|
|
die("error occurred this line");
|
|
|
|
return;
|
|
|
|
}();
|
|
|
|
```
|
|
|
|
|
|
|
|
Function 'die' is used to throw error and crash.
|
|
|
|
|
|
|
|
```javascript
|
|
|
|
hello
|
|
|
|
[vm] error: error occurred this line
|
2021-10-14 13:42:07 +08:00
|
|
|
[vm] error at 0x0000009b: native function error.
|
2021-08-10 17:55:49 +08:00
|
|
|
trace back:
|
2021-10-14 13:42:07 +08:00
|
|
|
0x0000009b: callb 0x22 <__builtin_die> (lib.nas line 85)
|
|
|
|
0x00000182: callfv 0x1 (a.nas line 6)
|
|
|
|
0x00000186: callfv 0x0 (a.nas line 8)
|
2021-08-10 17:55:49 +08:00
|
|
|
vm stack(limit 10):
|
2021-10-14 13:42:07 +08:00
|
|
|
null |
|
|
|
|
func | <0x8b0f50> func{entry=0x9b}
|
|
|
|
func | <0x8b1db0> func{entry=0x17c}
|
|
|
|
num | 57.295780
|
|
|
|
num | 1852.000000
|
|
|
|
num | 1.943800
|
|
|
|
num | 0.000540
|
|
|
|
num | 39.370100
|
|
|
|
num | 3.280800
|
|
|
|
num | 0.453600
|
2021-08-10 17:55:49 +08:00
|
|
|
```
|
|
|
|
|
|
|
|
Here is an example of stack overflow:
|
|
|
|
```javascript
|
|
|
|
func(f){
|
|
|
|
return f(f);
|
|
|
|
}(
|
|
|
|
func(f){
|
|
|
|
f(f);
|
|
|
|
}
|
|
|
|
)();
|
|
|
|
```
|
|
|
|
|
|
|
|
And the trace back info:
|
|
|
|
```javascript
|
|
|
|
[vm] stack overflow
|
|
|
|
trace back:
|
2021-10-14 13:42:07 +08:00
|
|
|
0x0000000f: callfv 0x1 (a.nas line 5)
|
|
|
|
0x0000000f: 4090 same call(s) ...
|
|
|
|
0x00000007: callfv 0x1 (a.nas line 2)
|
|
|
|
0x00000013: callfv 0x1 (a.nas line 3)
|
2021-08-10 17:55:49 +08:00
|
|
|
vm stack(limit 10):
|
2021-10-14 13:42:07 +08:00
|
|
|
func | <0xc511e0> func{entry=0xd}
|
|
|
|
... | 9 same value(s)
|
2021-10-08 23:18:26 +08:00
|
|
|
```
|
2021-10-14 13:42:07 +08:00
|
|
|
|
|
|
|
Error will be thrown if there's a fatal error when executing:
|
|
|
|
```javascript
|
|
|
|
func(){
|
|
|
|
return 0;
|
|
|
|
}()[1];
|
|
|
|
```
|
|
|
|
|
|
|
|
And the trace back info:
|
|
|
|
```javascript
|
|
|
|
[vm] error at 0x00000008: callv: must call a vector/hash/string
|
|
|
|
trace back:
|
|
|
|
0x00000008: callv 0x0 (a.nas line 3)
|
|
|
|
vm stack(limit 10):
|
|
|
|
num | 0.000000
|
|
|
|
```
|