# Fermi: A terminal DSL for Fermi estimation with distributions

This project is a minimalist, calculator-style DSL for Fermi estimation. It can multiply, divide, add and subtract scalars, lognormals and beta distributions, and supports variables and mixtures.

## Usage

```
$ fermi
5M 12M           # number of people living in Chicago
beta 1 200       # fraction of people that have a piano
30 180           # minutes it takes to tune a piano, including travel time
/ 48 52          # weeks a year that piano tuners work for
/ 5 6            # days a week in which piano tuners work
/ 6 8            # hours a day in which piano tuners work
/ 60             # minutes to an hour
=: piano_tuners
```
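Inputs like `5M` above (and `1%` in the help examples further down) rely on numeric suffixes. As a rough sketch of how such tokens could be turned into numbers: the `parseNumber` function and its suffix table are hypothetical, written for illustration, and are not taken from fermi's source.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// parseNumber converts a token such as "5M", "1%" or "2.5" into a float64.
// The suffix table below is an assumption for illustration,
// not fermi's actual parser.
func parseNumber(token string) (float64, error) {
	multipliers := map[string]float64{
		"%": 0.01,
		"K": 1e3,
		"M": 1e6,
		"B": 1e9,
		"T": 1e12,
	}
	for suffix, m := range multipliers {
		if strings.HasSuffix(token, suffix) {
			base, err := strconv.ParseFloat(strings.TrimSuffix(token, suffix), 64)
			if err != nil {
				return 0, fmt.Errorf("invalid number %q: %w", token, err)
			}
			return base * m, nil
		}
	}
	// No known suffix: parse the token as a plain number.
	return strconv.ParseFloat(token, 64)
}

func main() {
	for _, token := range []string{"5M", "12M", "1%", "2.5"} {
		value, err := parseNumber(token)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(token, "=>", value)
	}
}
```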

Here are some real-life examples: [Chance for a Russian male of fighting age of being drafted](https://x.com/NunoSempere/status/1829525844169248912), [did the startup Friend burn too much cash](https://x.com/NunoSempere/status/1818810770932568308), [how much did Nikita Bier make mentoring?](https://x.com/NunoSempere/status/1815169781907042504), [what fraction of North Korea's caloric intake is Russia supporting?](https://x.com/NunoSempere/status/1855666428835140078). In general, as a terminal guy, I've found that having zero startup cost makes creating small Fermi models much cheaper, so I end up making them more often.

## Build instructions 

Install the [go toolchain](https://go.dev/dl/), then:

```
git clone https://git.nunosempere.com/NunoSempere/fermi
cd fermi
make build
./fermi
# sudo make install
# fermi 
```


## Tips & tricks

- It's conceptually clearer to have all the multiplications first and then all the divisions
- For distributions between 0 and 1, consider using a beta distribution
- The default operation is multiplication

If you type "help" (or run fermi -h), you can see a small grammar and some optional command flags:

```
$ fermi -h

1. Grammar:
  Operation | Variable assignment | Special
    Operation:                             operator operand
          operator:                        (empty) | * | / | + | -
          operand:                         scalar | lognormal | beta | variable
            lognormal:                     low high
            beta:                          beta alpha beta
    Variable assignment:                   =: variable_name
    Variable assignment and clear stack:   =. variable_name
    Special commands:
         Comment:                          # this is a comment
         Summary stats:                    stats
         Clear stack:                      clear | c | .
         Print debug info:                 debug | d
         Print help message:               help  | h
         Start additional stack:           operator (
         Return from additional stack      )
         Exit:                             exit  | e
  Examples:
    + 2
    / 2.5
    * 1 10 (interpreted as lognormal)
    + 1 10
    * beta 1 10
    1 10 (multiplication taken as default operation)
    =: x
    .
    1 100
    + x
    # this is a comment
    * 1 12 # this is an operation followed by a comment
    * (
    1 10
    + beta 1 100
    )
    / 1% 
    =. y
    mx x 1 y 2.33
    + mx x 30% y 70%
    exit

2. Command flags:
  -echo
        Specifies whether inputs should be echoed back. Useful if reading from a file
  -f string
        Specifies a file with a model to run
  -n int
        Specifies the number of samples to draw when using samples (default 100000)
  -h    Shows help message

```

### Integrations with Linux utilities

Because the model reads from standard input, you can pipe a model to it:

```
$ cat more/piano-tuners.fermi | fermi
```

In that case, you will probably want to use the echo flag as well:

```
$ cat more/piano-tuners-commented.fermi | fermi -echo
```

You can make a model an executable file by running `$ chmod +x model.fermi` and then adding the following at the top (adjust the path to wherever `fermi` is installed):

```
#!/usr/bin/fermi -f
```

You can save a session to a logfile with tee:

```
fermi | tee -a fermi.log
```

## Different levels of complexity

The mainline code has a bunch of complexity: variables, parentheses, samples, beta distributions, number of samples, mixtures, etc. The simple/ folder has stripped-down versions:

- f_simple.go (370 lines) strips variables and parentheses, but keeps beta distributions, samples, and addition and subtraction.
- f_minimal.go (140 lines) strips everything that isn't lognormal and scalar multiplication and addition, plus a few debug options.

## Roadmap 

Done: 

- [x] Write README
- [x] Add division?
- [x] Read from file?
- [x] Save to file?
- [x] Allow comments?
  - [x] Use a sed filter? 
  - [x] Add proper comment processing
- [x] Add show more info version
- [x] Scalar multiplication and division
- [x] Think how to integrate with squiggle.c to draw samples
  - [x] Copy the time to botec go code
  - [x] Define samplers
  - [x] Call those samplers when operating on distributions that can't be operated on algebraically
- [x] Display output more nicely, with K/M/B/T
- [x] Consider the following: make this into a stack-based DSL, with:
  - [x] Variables that can be saved to and then displayed
  - [x] Other types of distributions, particularly beta distributions? => But then this requires moving to bags of samples. It could still be ~instantaneous though.
  - [x] Added bags of samples to support addition and multiplication of betas and lognormals
- [x] Figure out go syntax for
  - Maps
  - Joint types
  - Enums
- [x] Fix correlation problem, by spinning up a new randomness thing every time some serial computation is done.
- [x] Clean up error code. Right now only needed for division
- [x] Maintain *both* a more complex thing that's more featureful *and* the more simple multiplication of lognormals thing.
- [x] Allow input with K/M/T
- [x] Document parenthesis syntax
- [x] Specify number of samples as a command line option
- [x] Figure out how to make models executable, by adding a #!/bin/bash-style command at the top?
- [x] Make -n flag work
- [x] Add flag to repeat input lines (useful when reading from files)
- [x] Add percentages
- [x] Consider adding an understanding of percentages
- [x] Improve and rationalize error messages a bit
- [x] Add, then document mixture distributions

To (possibly) do:

- [ ] Consider implications of sampling strategy for operating variables.
- [ ] Fix lognormal multiplication and division by 0 or < 0
- [ ] With the -f command line option, the program doesn't read from stdin after finishing reading the file
- [ ] Add functions. Now easier to do with an explicit representation of the stack
- [ ] Think about how to draw a histogram from samples
- [ ] Dump samples to file
- [ ] Represent samples/statistics in some other way
- [ ] Perhaps use qsort rather than full sorting
- [ ] Program into a small device, like a calculator?
- [ ] Units?

Discarded: 

- [ ] ~~Think of some way of calling bc~~