waforth/README.md

131 lines
4.9 KiB
Markdown
Raw Normal View History

2018-05-23 19:47:23 +02:00
# [WAForth](https://el-tramo.be/waforth): Forth Interpreter+Compiler for WebAssembly
2018-05-24 22:43:33 +02:00
WAForth is a bootstrapping Forth interpreter and dynamic compiler for
2018-05-23 19:47:23 +02:00
[WebAssembly](https://webassembly.org). You can see it in a demo
[here](https://el-tramo.be/waforth/).
2020-01-04 19:57:07 +01:00
It is [entirely written in (raw)
WebAssembly](https://github.com/remko/waforth/blob/master/src/waforth.wat), and
the compiler generates WebAssembly code on the fly. The only parts for which it
relies on external (JavaScript) code is to dynamically load modules (since
WebAssembly [doesn't support JIT
yet](https://webassembly.org/docs/future-features/#platform-independent-just-in-time-jit-compilation)),
2019-11-08 22:47:01 +01:00
and the I/O primitives to read and write a character to a screen.
2018-05-23 19:47:23 +02:00
2019-03-12 22:12:29 +01:00
Parts of the implementation were influenced by
2019-11-08 16:19:11 +01:00
[jonesforth](http://git.annexia.org/?p=jonesforth.git;a=summary).
2018-05-23 19:47:23 +02:00
2019-03-15 13:09:02 +01:00
WAForth is still in an experimental stage. It implements most of the [ANS Core
Words](http://lars.nocrew.org/dpans/dpans6.htm#6.1), and passes most of the
[Forth 200x Test Suite](http://www.forth200x.org/documents/html/testsuite.html)
core word tests.
2018-05-23 19:47:23 +02:00
## Install Dependencies
2019-11-08 22:47:01 +01:00
The build uses the [WebAssembly Binary
Toolkit](https://github.com/WebAssembly/wabt) for converting raw WebAssembly
text format into the binary format, and [Yarn](https://yarnpkg.com) for
managing the dependencies of the shell.
2018-05-23 19:47:23 +02:00
2019-11-08 22:47:01 +01:00
brew install wabt yarn
2018-05-23 19:47:23 +02:00
yarn
## Building & Running
To build everything:
make
To run the development server:
2022-04-13 17:02:46 +02:00
make dev
2018-05-23 19:47:23 +02:00
## Testing
2019-03-10 10:37:01 +01:00
The tests are served from `/tests` by the development server.
You can also run the tests in Node.JS by running
make check
2018-05-23 19:47:23 +02:00
## Design
2019-03-09 22:02:46 +01:00
The WAForth core is written as [a single
module](https://github.com/remko/waforth/blob/master/src/waforth.wat) in
WebAssembly's [text
2019-11-11 11:48:29 +01:00
format](https://webassembly.github.io/spec/core/text/index.html).
2018-05-23 19:47:23 +02:00
### The Interpreter
The interpreter runs a loop that processes commands, and switches to and from
compiler mode.
2019-03-09 22:02:46 +01:00
Contrary to some other Forth systems, this system doesn't use direct threading
for executing code. WebAssembly doesn't allow unstructured jumps, let alone
dynamic jumps. Instead, WAForth uses subroutine threading, where each word
is implemented as a single WebAssembly function, and the system uses calls
and indirect calls (see below) to execute words.
2018-05-23 19:47:23 +02:00
### The Compiler
2019-03-09 22:02:46 +01:00
While in compile mode for a word, the compiler generates WebAssembly
instructions in binary format (since there is no assembler infrastructure in
the browser). Since WebAssembly [doesn't support JIT compilation
yet](https://webassembly.org/docs/future-features/#platform-independent-just-in-time-jit-compilation),
a finished word is bundled into a separate binary WebAssembly module, and sent
to the loader, which dynamically loads it and registers it with a shared
[function
table](https://webassembly.github.io/spec/core/valid/modules.html#tables) at
the next offset, which in turn is recorded in the word dictionary.
Because words reside in different modules, all calls to and from the words need
to happen as indirect `call_indirect` calls through the shared function table.
2019-11-08 22:47:01 +01:00
This of course introduces some overhead, although it appears limited.
2019-03-09 22:02:46 +01:00
As WebAssembly doesn't support unstructured jumps, control flow words
(`IF/ELSE/THEN`, `LOOP`, `REPEAT`, ...) can't be implemented in terms of more
basic words, unlike in jonesforth. However, since Forth only requires
structured jumps, the compiler can easily be implemented using the loop and
branch instructions available in WebAssembly.
2018-05-23 19:47:23 +02:00
2018-05-31 21:11:04 +02:00
Finally, the compiler adds minimal debug information about the compiled word in
2019-03-09 22:02:46 +01:00
the [name
section](https://github.com/WebAssembly/design/blob/master/BinaryEncoding.md#name-section),
making it easier for doing some debugging in the browser.
2018-05-31 21:11:04 +02:00
2019-03-09 22:02:46 +01:00
![Debugger view of a compiled
2022-04-17 18:22:36 +02:00
word](doc/debugger.png "Debugger view of a
2019-03-09 22:02:46 +01:00
compiled word")
2018-05-31 21:11:04 +02:00
2018-05-23 19:47:23 +02:00
### The Loader
2019-03-09 22:02:46 +01:00
The loader is a small bit of JavaScript that uses the [WebAssembly JavaScript
API](https://webassembly.github.io/spec/js-api/index.html) to dynamically load
a compiled word (in the form of a WebAssembly module), and ensuring that the
shared function table is large enough for the module to register itself.
2018-05-23 19:47:23 +02:00
### The Shell
2019-03-09 22:02:46 +01:00
The shell is [a JavaScript
class](https://github.com/remko/waforth/blob/master/src/shell/WAForth.js) that
wraps the WebAssembly module, and loads it in the browser. It provides the I/O
primitives to the WebAssembly module to read and write characters to a
terminal, and externally provides a `run()` function to execute a fragment of
Forth code.
2018-05-23 19:47:23 +02:00
2019-03-09 22:02:46 +01:00
To tie everything together into an interactive system, there's a small
console-based interface around this shell to type Forth code, which you can see
in action [here](https://el-tramo.be/waforth/).
2018-05-23 19:47:23 +02:00
![WAForth Console](https://el-tramo.be/waforth/console.gif "WAForth Console")
2019-03-09 22:02:46 +01:00
### Misc notes
- The exposed return stack isn't used. Control flow is kept implicitly in the
code (e.g. through branches, indirect calls, ...). This also means that
control flow can't be influenced by code.