README.adoc: "normand" -> "Normand"
[normand.git] / README.adoc
CommitLineData
d2d06893
MJ
1// SPDX-FileCopyrightText: 2023 Philippe Proulx <eeppeliteloop@gmail.com>
2// SPDX-License-Identifier: CC-BY-SA-4.0
3
bb2f9e9c
PP
4// Show ToC at a specific location for a GitHub rendering
5ifdef::env-github[]
6:toc: macro
7endif::env-github[]
8
9ifndef::env-github[]
71aaa3f7 10:toc: left
bb2f9e9c
PP
11endif::env-github[]
12
13// This is to mimic what GitHub does so that anchors work in an offline
14// rendering too.
15:idprefix:
16:idseparator: -
71aaa3f7 17
bb2f9e9c 18// Other attributes
71aaa3f7
PP
19:py3: Python{nbsp}3
20
bb2f9e9c
PP
21= Normand
22Philippe Proulx
23
df0f8552
PP
24image::normand-logo.png[]
25
71aaa3f7
PP
26[.normal]
27image:https://img.shields.io/pypi/v/normand.svg?label=Latest%20version[link="https://pypi.python.org/pypi/normand"]
28
29[.lead]
30_**Normand**_ is a text-to-binary processor with its own language.
31
32This package offers both a portable {py3} module and a command-line
33tool.
34
2acde24a 35WARNING: This version of Normand is 0.23, meaning both the Normand
71aaa3f7
PP
36language and the module/CLI interface aren't stable.
37
bb2f9e9c
PP
38ifdef::env-github[]
39// ToC location for a GitHub rendering
40toc::[]
41endif::env-github[]
42
71aaa3f7
PP
43== Introduction
44
45The purpose of Normand is to consume human-readable text representing
46bytes and to produce the corresponding binary data.
47
48.Simple bytes input.
49====
50Consider the following Normand input:
51
52----
534f 55 32 bb $167 fe %10100111 a9 $-32
54----
55
56The generated nine bytes are:
57
58----
594f 55 32 bb a7 fe a7 a9 e0
60----
61====
62
63As you can see in the last example, the fundamental unit of the Normand
64language is the _byte_. The order in which you list bytes will be the
65order of the generated data.
66
67The Normand language is more than simple lists of bytes, though. Its
68main features are:
69
70Comments, including a bunch of insignificant symbols which may improve readability::
71+
72Input:
73+
74----
75ff bb %1101:0010 # This is a comment
7678 29 af $192 # This too # 99 $-80
77fe80::6257:18ff:fea3:4229
7860:57:18:a3:42:29
7910839636-5d65-4a68-8e6a-21608ddf7258
80----
81+
82Output:
83+
84----
85ff bb d2 78 29 af c0 99 b0 fe 80 62 57 18 ff fe
86a3 42 29 60 57 18 a3 42 29 10 83 96 36 5d 65 4a
8768 8e 6a 21 60 8d df 72 58
88----
89
90Hexadecimal, decimal, and binary byte constants::
91+
92Input:
93+
94----
95aa bb $247 $-89 %0011_0010 %11.01= 10/10
96----
97+
98Output:
99+
100----
101aa bb f7 a7 32 da
102----
103
7a7b31e8 104Strings::
71aaa3f7
PP
105+
106Input:
107+
108----
109"hello world!" 00
110u16le"stress\nverdict 🤣"
7a7b31e8 111s:latin3{hex(ICITTE)}
71aaa3f7
PP
112----
113+
114Output:
115+
116----
11768 65 6c 6c 6f 20 77 6f 72 6c 64 21 00 73 00 74 ┆ hello world!•s•t
11800 72 00 65 00 73 00 73 00 0a 00 76 00 65 00 72 ┆ •r•e•s•s•••v•e•r
7a7b31e8
PP
11900 64 00 69 00 63 00 74 00 20 00 3e d8 23 dd 30 ┆ •d•i•c•t• •>•#•0
12078 32 66 ┆ x2f
71aaa3f7
PP
121----
122
123Labels: special variables holding the offset where they're defined::
124+
125----
126<beg> b2 52 e3 bc 91 05
127$100 $50 <chair> 33 9f fe
12825 e9 89 8a <end>
129----
130
131Variables::
132+
133----
1345e 65 {tower = 47} c6 7f f2 c4
13544 {hurl = tower - 14} b5 {tower = hurl} 26 2d
136----
137+
138The value of a variable assignment is the evaluation of a valid {py3}
139expression which may include label and variable names.
140
269f6eb3 141Fixed-length number with a given length (8{nbsp}bits to 64{nbsp}bits) and byte order::
71aaa3f7
PP
142+
143Input:
144+
145----
146{strength = 4}
ee724c95
PP
147!be 67 <lbl> 44 $178 [(end - lbl) * 8 + strength : 16] $99 <end>
148!le [-1993 : 32]
56996d34 149[-3.141593 : 64be]
71aaa3f7
PP
150----
151+
152Output:
153+
154----
56996d34
PP
15567 44 b2 00 2c 63 37 f8 ff ff c0 09 21 fb 82 c2
156bd 7f
71aaa3f7
PP
157----
158+
269f6eb3 159The encoded number is the evaluation of a valid {py3} expression which
05f81895
PP
160may include label and variable names.
161
162https://en.wikipedia.org/wiki/LEB128[LEB128] integer::
163+
164Input:
165+
166----
ee724c95
PP
167aa bb cc [-1993 : sleb128] <meow> dd ee ff
168[meow * 199 : uleb128]
05f81895
PP
169----
170+
171Output:
172+
173----
174aa bb cc b7 70 dd ee ff e3 07
175----
176+
177The encoded integer is the evaluation of a valid {py3} expression which
71aaa3f7
PP
178may include label and variable names.
179
27d52a19
PP
180Conditional::
181+
182Input:
183+
184----
185aa bb cc
186
187(
188 "foo"
189
190 !if {ICITTE > 10}
191 "bar"
12b5dbc0
PP
192 !else
193 "fight"
27d52a19
PP
194 !end
195) * 4
196----
197+
198Output:
199+
200----
12b5dbc0
PP
201aa bb cc 66 6f 6f 66 69 67 68 74 66 6f 6f 66 69 ┆ •••foofightfoofi
20267 68 74 66 6f 6f 62 61 72 66 6f 6f 62 61 72 ┆ ghtfoobarfoobar
27d52a19
PP
203----
204
71aaa3f7
PP
205Repetition::
206+
207Input:
208+
209----
2adf4336 210aa bb * 5 cc <zoom> "yeah\0" * {zoom * 3}
e57a18e1
PP
211
212!repeat 3
213 ff ee "juice"
214!end
71aaa3f7
PP
215----
216+
217Output:
218+
219----
2adf4336
PP
220aa bb bb bb bb bb cc 79 65 61 68 00 79 65 61 68 ┆ •••••••yeah•yeah
22100 79 65 61 68 00 79 65 61 68 00 79 65 61 68 00 ┆ •yeah•yeah•yeah•
22279 65 61 68 00 79 65 61 68 00 79 65 61 68 00 79 ┆ yeah•yeah•yeah•y
22365 61 68 00 79 65 61 68 00 79 65 61 68 00 79 65 ┆ eah•yeah•yeah•ye
22461 68 00 79 65 61 68 00 79 65 61 68 00 79 65 61 ┆ ah•yeah•yeah•yea
22568 00 79 65 61 68 00 79 65 61 68 00 79 65 61 68 ┆ h•yeah•yeah•yeah
71aaa3f7 22600 79 65 61 68 00 79 65 61 68 00 79 65 61 68 00 ┆ •yeah•yeah•yeah•
e57a18e1
PP
227ff ee 6a 75 69 63 65 ff ee 6a 75 69 63 65 ff ee ┆ ••juice••juice••
2286a 75 69 63 65 ┆ juice
71aaa3f7
PP
229----
230
676f6189
PP
231Alignment::
232+
233Input:
234+
235----
ee724c95 236!be
676f6189 237
ee724c95
PP
238 [199:32]
239@64 [43:64]
240@16 [-123:16]
241@32~255 [5584:32]
676f6189
PP
242----
243+
244Output:
245+
246----
24700 00 00 c7 00 00 00 00 00 00 00 00 00 00 00 2b
248ff 85 ff ff 00 00 15 d0
249----
71aaa3f7 250
25ca454b
PP
251Filling::
252+
253Input:
254+
255----
ee724c95
PP
256!le
257[0xdeadbeef:32]
258[-1993:16]
259[9:16]
25ca454b 260+0x40
ee724c95 261[ICITTE:8]
25ca454b 262"meow mix"
fc21bb27 263+200~FFh
ee724c95 264[ICITTE:8]
25ca454b
PP
265----
266+
267Output:
268+
269----
270ef be ad de 37 f8 09 00 00 00 00 00 00 00 00 00 ┆ ••••7•••••••••••
27100 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
27200 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
27300 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
27440 6d 65 6f 77 20 6d 69 78 ff ff ff ff ff ff ff ┆ @meow mix•••••••
275ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
276ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
277ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
278ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
279ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
280ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
281ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
282ff ff ff ff ff ff ff ff c8 ┆ •••••••••
283----
284
cd33dfe6
PP
285Transformation::
286+
287Input:
288+
289----
ee724c95 290"end of file @ " [end:8]
cd33dfe6
PP
291
292!transform gzip
293 "this part will be gzipped"
294!end
295
296<end>
297----
298+
299Output:
300+
301----
30265 6e 64 20 6f 66 20 66 69 6c 65 20 40 20 3c 1f ┆ end of file @ <•
3038b 08 00 7b 7b 26 65 02 ff 2b c9 c8 2c 56 28 48 ┆ •••{{&e••+••,V(H
3042c 2a 51 28 cf cc c9 51 48 4a 55 48 af ca 2c 28 ┆ ,*Q(•••QHJUH••,(
30548 4d 01 00 d4 cc 5b 8a 19 00 00 00 ┆ HM••••[•••••
306----
307
71aaa3f7
PP
308Multilevel grouping::
309+
310Input:
311+
312----
313ff ((aa bb "zoom" cc) * 5) * 3 $-34 * 4
314----
315+
316Output:
317+
318----
319ff aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa ┆ •••zoom•••zoom••
320bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb 7a ┆ •zoom•••zoom•••z
3216f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f ┆ oom•••zoom•••zoo
3226d cc aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc ┆ m•••zoom•••zoom•
323aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb ┆ ••zoom•••zoom•••
3247a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb 7a 6f ┆ zoom•••zoom•••zo
3256f 6d cc aa bb 7a 6f 6f 6d cc de de de de ┆ om•••zoom•••••
326----
327
320644e2
PP
328Macros::
329+
330Input:
331+
332----
333!macro hello(world)
334 "hello"
335 !if world " world" !end
336!end
337
338!repeat 17
339 ff ff ff ff
340 m:hello({ICITTE > 15 and ICITTE < 60})
341!end
342----
343+
344Output:
345+
346----
347ff ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c ┆ ••••hello••••hel
3486c 6f ff ff ff ff 68 65 6c 6c 6f 20 77 6f 72 6c ┆ lo••••hello worl
34964 ff ff ff ff 68 65 6c 6c 6f 20 77 6f 72 6c 64 ┆ d••••hello world
350ff ff ff ff 68 65 6c 6c 6f 20 77 6f 72 6c 64 ff ┆ ••••hello world•
351ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c 6c ┆ •••hello••••hell
3526f ff ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 ┆ o••••hello••••he
3536c 6c 6f ff ff ff ff 68 65 6c 6c 6f ff ff ff ff ┆ llo••••hello••••
35468 65 6c 6c 6f ff ff ff ff 68 65 6c 6c 6f ff ff ┆ hello••••hello••
355ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c 6c 6f ┆ ••hello••••hello
356ff ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c ┆ ••••hello••••hel
3576c 6f ff ff ff ff 68 65 6c 6c 6f ┆ lo••••hello
358----
359
71aaa3f7
PP
360Precise error reporting::
361+
362----
363/tmp/meow.normand:10:24 - Expecting a bit (`0` or `1`).
364----
365+
366----
367/tmp/meow.normand:32:6 - Unexpected character `k`.
368----
369+
370----
320644e2 371/tmp/meow.normand:24:19 - Illegal (unknown or unreachable) variable/label name `meow` in expression `(meow - 45) // 8`; the legal names are {`ICITTE`, `mix`, `zoom`}.
71aaa3f7
PP
372----
373+
374----
f5dcb24c
PP
375/tmp/meow.normand:32:19 - While expanding the macro `meow`:
376/tmp/meow.normand:35:5 - While expanding the macro `zzz`:
320644e2 377/tmp/meow.normand:18:9 - Value 315 is outside the 8-bit range when evaluating expression `end - ICITTE`.
71aaa3f7
PP
378----
379
380You can use Normand to track data source files in your favorite VCS
381instead of raw binary files. The binary files that Normand generates can
382be used to test file format decoding, including malformatted data, for
383example, as well as for education.
384
385See <<learn-normand>> to explore all the Normand features.
386
387== Install Normand
388
389Normand requires Python ≥ 3.4.
390
391To install Normand:
392
393----
394$ python3 -m pip install --user normand
395----
396
397See
398https://packaging.python.org/en/latest/tutorials/installing-packages/#installing-to-the-user-site[Installing to the User Site]
399to learn more about a user site installation.
400
401[NOTE]
402====
403Normand has a single module file, `normand.py`, which you can copy as is
af3cf417 404to your project to use it (both the <<python3-api,`normand.parse()`>>
71aaa3f7
PP
405function and the <<command-line-tool,command-line tool>>).
406
407`normand.py` has _no external dependencies_, but if you're using
91df9414
PP
408Python{nbsp}3.4 or Python{nbsp}3.5, you'll need a local copy of the
409standard `typing` module.
71aaa3f7
PP
410====
411
43937a34
PP
412== Design goals
413
414The design goals of Normand are:
415
416Portability::
417 We're making sure `normand.py` works with Python{nbsp}≥{nbsp}3.4 and
418 doesn't have any external dependencies so that you may just copy the
419 module as is to your own project.
420
421Ease of use::
422 The most basic Normand input is a sequence of hexadecimal constants
423 (for example, `4e6f726d616e64`) which produce exactly what you'd
424 expect.
425+
426Most Normand features map to programming language concepts you already
427know and understand: constant integers, literal strings, variables,
428conditionals, repetitions/loops, and the rest.
429
430Concise and readable input::
431 We could have chosen XML or YAML as the input format, but having a
432 DSL here makes a Normand input compact and easy to read, two
433 important traits when using Normand to write tests, for example.
434+
435Compare the following Normand input and some hypothetical XML
436equivalent, for example:
437+
be95456e 438.Actual Normand input.
43937a34
PP
439----
440ff dd 01 ab $192 $-128 %1101:0011
441
ee724c95 442[end:8]
43937a34
PP
443
444{iter = 1}
445
446!if {not something}
447 # five times because xyz
448 !repeat 5
ee724c95 449 "hello world " [iter:8]
43937a34
PP
450 {iter = iter + 1}
451 !end
452!end
453
454<end>
455----
456+
457.Hypothetical Normand XML input.
458[source,xml]
459----
460<?xml version="1.0" encoding="utf-8" ?>
461<group>
462 <byte base="x" val="ff" />
463 <byte base="x" val="dd" />
464 <byte base="x" val="1" />
465 <byte base="x" val="ab" />
466 <byte base="d" val="192" />
467 <byte base="d" val="-128" />
468 <byte base="b" val="11010011" />
469 <fixed-len-num expr="end" len="8" />
470 <var-assign name="iter" expr="1" />
471 <cond expr="not something">
472 <!-- five times because xyz -->
473 <repeat expr="5">
474 <str>hello world </str>
475 <fixed-len-num expr="iter" len="8" />
476 <var-assign name="iter" expr="iter + 1" />
477 </repeat>
478 </cond>
479 <label name="end" />
480</group>
481----
482
71aaa3f7
PP
483== Learn Normand
484
485A Normand text input is a sequence of items which represent a sequence
486of raw bytes.
487
488[[state]] During the processing of items to data, Normand relies on a
489current state:
490
491[%header%autowidth]
492|===
af3cf417 493|State variable |Description |Initial value: <<python3-api,{py3} API>> |Initial value: <<command-line-tool,CLI>>
71aaa3f7
PP
494
495|[[cur-offset]] Current offset
496|
05f81895 497The current offset has an effect on the value of <<label,labels>> and of
269f6eb3 498the special `ICITTE` name in <<fixed-length-number,fixed-length
95e254bd 499number>>, <<leb128-integer,LEB128 integer>>, <<string,string>>,
f63f4a5d 500<<filling,filling>>, <<variable-assignment,variable assignment>>,
27d52a19 501<<conditional-block,conditional block>>, <<repetition-block,repetition
320644e2
PP
502block>>, <<macro-expansion,macro expansion>>, and
503<<post-item-repetition,post-item repetition>> expression evaluation.
71aaa3f7
PP
504
505Each generated byte increments the current offset.
506
507A <<current-offset-setting,current offset setting>> may change the
676f6189
PP
508current offset without generating data.
509
510An <<current-offset-alignment,current offset alignment>> generates
511padding bytes to make the current offset satisfy a given alignment.
71aaa3f7
PP
512|`init_offset` parameter of the `parse()` function.
513|`--offset` option.
514
515|[[cur-bo]] Current byte order
516|
56996d34 517The current byte order can have an effect on the encoding of
269f6eb3 518<<fixed-length-number,fixed-length numbers>>.
71aaa3f7
PP
519
520A <<current-byte-order-setting,current byte order setting>> may change
521the current byte order.
522|`init_byte_order` parameter of the `parse()` function.
523|`--byte-order` option.
524
525|<<label,Labels>>
526|Mapping of label names to integral values.
527|`init_labels` parameter of the `parse()` function.
528|One or more `--label` options.
529
530|<<variable-assignment,Variables>>
27d52a19 531|Mapping of variable names to integral or floating point number values.
71aaa3f7 532|`init_variables` parameter of the `parse()` function.
7a7b31e8 533|One or more `--var` or `--var-str` options.
71aaa3f7
PP
534|===
535
536The available items are:
537
6dd69a2a
PP
538* A <<byte-constant,constant integer>> representing one or more
539 constant bytes.
71aaa3f7 540
7a7b31e8
PP
541* A <<literal-string,literal string>> representing a constant sequence
542 of bytes encoding UTF-8, UTF-16, UTF-32, or Latin-1 to Latin-10 data.
71aaa3f7
PP
543
544* A <<current-byte-order-setting,current byte order setting>> (big or
545 little endian).
546
269f6eb3 547* A <<fixed-length-number,fixed-length number>> (integer or
56996d34
PP
548 floating point), possibly using the <<cur-bo,current byte order>>, and
549 of which the value is the result of a {py3} expression.
05f81895
PP
550
551* An <<leb128-integer,LEB128 integer>> of which the value is the result
552 of a {py3} expression.
71aaa3f7 553
7a7b31e8
PP
554* A <<string,string>> representing a sequence of bytes encoding UTF-8,
555 UTF-16, UTF-32, or Latin-1 to Latin-10 data, and of which the value is
556 the result of a {py3} expression.
557
71aaa3f7
PP
558* A <<current-offset-setting,current offset setting>>.
559
676f6189
PP
560* A <<current-offset-alignment,current offset alignment>>.
561
25ca454b
PP
562* A <<filling,filling>>.
563
71aaa3f7
PP
564* A <<label,label>>, that is, a named constant holding the current
565 offset.
566+
567This is similar to an assembly label.
568
569* A <<variable-assignment,variable assignment>> associating a name to
570 the integral result of an evaluated {py3} expression.
571
572* A <<group,group>>, that is, a scoped sequence of items.
573
27d52a19
PP
574* A <<conditional-block,conditional block>>.
575
e57a18e1
PP
576* A <<repetition-block,repetition block>>.
577
cd33dfe6
PP
578* A <<transformation-block,transformation block>>.
579
320644e2
PP
580* A <<macro-definition-block,macro definition block>>.
581
582* A <<macro-expansion,macro expansion>>.
583
e57a18e1
PP
584Moreover, you can repeat many items above a constant or variable number
585of times with the ``pass:[*]`` operator _after_ the item to repeat. This
586is called a <<post-item-repetition,post-item repetition>>.
71aaa3f7 587
ba11fb1d 588A Normand comment may exist pretty much anywhere between tokens.
71aaa3f7
PP
589
590A comment is anything between two ``pass:[#]`` characters on the same
ba11fb1d
PP
591line, or from ``pass:[#]`` until the end of the line. Whitespaces are
592also considered comments. The following symbols are also considered
593comments around and between items, as well as between hexadecimal
594nibbles and binary bits of <<byte-constant,byte constants>>:
71aaa3f7
PP
595
596----
ee724c95 597& , - . / : ; = ? \ _ |
71aaa3f7
PP
598----
599
600The latter serve to improve readability so that you may write, for
601example, a MAC address or a UUID as is.
602
fc21bb27
PP
603[[const-int]] Many items require a _constant integer_, possibly
604negative, in which case it may start with `-` for a negative integer. A
605positive constant integer is any of:
606
607Decimal::
608 One or mode digits (`0` to `9`).
609
610Hexadecimal::
611 One of:
612+
613* The `0x` or `0X` prefix followed with one or more hexadecimal digits
614 (`0` to `9`, `a` to `f`, or `A` to `F`).
615* One or more hexadecimal digits followed with the `h` or `H` suffix.
616
617Octal::
618 One of:
619+
620* The `0o` or `0O` prefix followed with one or more octal digits
621 (`0` to `7`).
622* One or more octal digits followed with the `o`, `O`, `q`, or `Q`
623 suffix.
624
625Binary::
626 One of:
627+
628* The `0b` or `0B` prefix followed with one or more bits (`0` or `1`).
629* One or more bits followed with the `b` or `B` suffix.
630
ee724c95
PP
631In general, anything between `pass:[{]` and `}` is a {py3} expression.
632
71aaa3f7
PP
633You can test the examples of this section with the `normand`
634<<command-line-tool,command-line tool>> as such:
635
636----
637$ normand file | hexdump -C
638----
639
640where `file` is the name of a file containing the Normand input.
641
642=== Byte constant
643
6dd69a2a 644A _byte constant_ represents one or more constant bytes.
71aaa3f7
PP
645
646A byte constant is:
647
648Hexadecimal form::
6dd69a2a 649 Two consecutive hexadecimal digits representing a single byte.
71aaa3f7
PP
650
651Decimal form::
6dd69a2a 652 One or more digits after the `$` prefix representing a single byte.
71aaa3f7 653
6dd69a2a
PP
654Binary form:: {empty}
655+
656--
657. __**N**__ `%` prefixes (at least one).
658+
659The number of `%` characters is the number of subsequent expected bytes.
660
661. __**N**__{nbsp}×{nbsp}8 bits (`0` or `1`).
662--
71aaa3f7
PP
663
664====
665Input:
666
667----
ee724c95 668ab cd (3d 8F) CC
71aaa3f7
PP
669----
670
671Output:
672
673----
674ab cd 3d 8f cc
675----
676====
677
678====
679Input:
680
681----
682$192 %1100/0011 $ -77
683----
684
685Output:
686
687----
688c0 c3 b3
689----
690====
691
692====
693Input:
694
695----
69658f64689-6316-4d55-8a1a-04cada366172
697fe80::6257:18ff:fea3:4229
698----
699
700Output:
701
702----
70358 f6 46 89 63 16 4d 55 8a 1a 04 ca da 36 61 72 ┆ X•F•c•MU•••••6ar
704fe 80 62 57 18 ff fe a3 42 29 ┆ ••bW••••B)
705----
706====
707
708====
709Input:
710
711----
712%01110011 %01100001 %01101100 %01110101 %01110100
6dd69a2a 713%%%1101:0010 11111111 #A#11 #B#00 #C#011 #D#1
71aaa3f7
PP
714----
715
716Output:
717
718----
6dd69a2a 71973 61 6c 75 74 d2 ff c7 ┆ salut•••
71aaa3f7
PP
720----
721====
722
723=== Literal string
724
7a7b31e8
PP
725A _literal string_ represents the encoded bytes of a literal string
726using the UTF-8, UTF-16, UTF-32, or Latin-1 to Latin-10 encoding.
71aaa3f7
PP
727
728The string to encode isn't implicitly null-terminated: use `\0` at the
729end of the string to add a null character.
730
731A literal string is:
732
7a7b31e8
PP
733. **Optional**: one of the following encodings instead of the default
734 UTF-8:
71aaa3f7
PP
735+
736--
737[horizontal]
7a7b31e8
PP
738`s:u8`::
739`u8`::
740 UTF-8.
741
742`s:u16be`::
743`u16be`::
744 UTF-16BE.
745
746`s:u16le`::
747`u16le`::
748 UTF-16LE.
749
750`s:u32be`::
751`u32be`::
752 UTF-32BE.
753
754`s:u32le`::
755`u32le`::
756 UTF-32LE.
757
758`s:latin1`::
759 ISO/IEC 8859-1.
760
761`s:latin2`::
762 ISO/IEC 8859-2.
763
764`s:latin3`::
765 ISO/IEC 8859-3.
766
767`s:latin4`::
768 ISO/IEC 8859-4.
769
770`s:latin5`::
771 ISO/IEC 8859-9.
772
773`s:latin6`::
774 ISO/IEC 8859-10.
775
776`s:latin7`::
777 ISO/IEC 8859-13.
778
779`s:latin8`::
780 ISO/IEC 8859-14.
781
782`s:latin9`::
783 ISO/IEC 8859-15.
784
785`s:latin10`::
786 ISO/IEC 8859-16.
71aaa3f7
PP
787--
788
789. The ``pass:["]`` prefix.
790
791. A sequence of zero or more characters, possibly containing escape
792 sequences.
793+
794An escape sequence is the ``\`` character followed by one of:
795+
796--
797[horizontal]
798`0`:: Null (U+0000)
799`a`:: Alert (U+0007)
800`b`:: Backspace (U+0008)
801`e`:: Escape (U+001B)
802`f`:: Form feed (U+000C)
803`n`:: End of line (U+000A)
804`r`:: Carriage return (U+000D)
805`t`:: Character tabulation (U+0009)
806`v`:: Line tabulation (U+000B)
807``\``:: Reverse solidus (U+005C)
808``pass:["]``:: Quotation mark (U+0022)
809--
810
811. The ``pass:["]`` suffix.
812
813====
814Input:
815
816----
817"coucou tout le monde!"
818----
819
820Output:
821
822----
82363 6f 75 63 6f 75 20 74 6f 75 74 20 6c 65 20 6d ┆ coucou tout le m
8246f 6e 64 65 21 ┆ onde!
825----
826====
827
828====
829Input:
830
831----
832u16le"I am not young enough to know everything."
833----
834
835Output:
836
837----
83849 00 20 00 61 00 6d 00 20 00 6e 00 6f 00 74 00 ┆ I• •a•m• •n•o•t•
83920 00 79 00 6f 00 75 00 6e 00 67 00 20 00 65 00 ┆ •y•o•u•n•g• •e•
8406e 00 6f 00 75 00 67 00 68 00 20 00 74 00 6f 00 ┆ n•o•u•g•h• •t•o•
84120 00 6b 00 6e 00 6f 00 77 00 20 00 65 00 76 00 ┆ •k•n•o•w• •e•v•
84265 00 72 00 79 00 74 00 68 00 69 00 6e 00 67 00 ┆ e•r•y•t•h•i•n•g•
8432e 00 ┆ .•
844----
845====
846
847====
848Input:
849
850----
7a7b31e8 851s:u32be "\"illusion is the first\nof all pleasures\" 🦉"
71aaa3f7
PP
852----
853
854Output:
855
856----
85700 00 00 22 00 00 00 69 00 00 00 6c 00 00 00 6c ┆ •••"•••i•••l•••l
85800 00 00 75 00 00 00 73 00 00 00 69 00 00 00 6f ┆ •••u•••s•••i•••o
85900 00 00 6e 00 00 00 20 00 00 00 69 00 00 00 73 ┆ •••n••• •••i•••s
86000 00 00 20 00 00 00 74 00 00 00 68 00 00 00 65 ┆ ••• •••t•••h•••e
86100 00 00 20 00 00 00 66 00 00 00 69 00 00 00 72 ┆ ••• •••f•••i•••r
86200 00 00 73 00 00 00 74 00 00 00 0a 00 00 00 6f ┆ •••s•••t•••••••o
86300 00 00 66 00 00 00 20 00 00 00 61 00 00 00 6c ┆ •••f••• •••a•••l
86400 00 00 6c 00 00 00 20 00 00 00 70 00 00 00 6c ┆ •••l••• •••p•••l
86500 00 00 65 00 00 00 61 00 00 00 73 00 00 00 75 ┆ •••e•••a•••s•••u
86600 00 00 72 00 00 00 65 00 00 00 73 00 00 00 22 ┆ •••r•••e•••s•••"
86700 00 00 20 00 01 f9 89 ┆ ••• ••••
868----
869====
870
7a7b31e8
PP
871====
872Input:
873
874----
875s:latin1 "Paul Piché"
876----
877
878Output:
879
880----
88150 61 75 6c 20 50 69 63 68 e9 ┆ Paul Pich•
882----
883====
884
71aaa3f7
PP
885=== Current byte order setting
886
887This special item sets the <<cur-bo,_current byte order_>>.
888
889The two accepted forms are:
890
891[horizontal]
ee724c95
PP
892`!be`:: Set the current byte order to big endian.
893`!le`:: Set the current byte order to little endian.
71aaa3f7 894
269f6eb3 895=== Fixed-length number
71aaa3f7 896
269f6eb3
PP
897A _fixed-length number_ represents a fixed number of bytes encoding
898either:
899
900* An unsigned or signed integer (two's complement).
901+
902The available lengths are 8, 16, 24, 32, 40, 48, 56, and 64.
903
904* A floating point number
b87a3aa2 905 (https://standards.ieee.org/standard/754-2008.html[IEEE{nbsp}754-2008]).
269f6eb3 906+
9403947b 907The available lengths are 32 (_binary32_) and 64 (_binary64_).
71aaa3f7 908
56996d34
PP
909The value is the result of evaluating a {py3} expression.
910
911The byte order to use to encode the value is either directly specified
912or is the <<cur-bo,current byte order>>.
269f6eb3
PP
913
914A fixed-length number is:
71aaa3f7 915
ee724c95 916. The `[` prefix.
71aaa3f7
PP
917
918. A valid {py3} expression.
05f81895 919+
269f6eb3 920For a fixed-length number at some source location{nbsp}__**L**__, this
05f81895
PP
921expression may contain the name of any accessible <<label,label>> (not
922within a nested group), including the name of a label defined
6740649d
PP
923after{nbsp}__**L**__ (except within a
924<<transformation-block,transformation block>>), as well as the name of
925any <<variable-assignment,variable>> known at{nbsp}__**L**__.
05f81895 926+
269f6eb3
PP
927The value of the special name `ICITTE` (`int` type) in this expression
928is the <<cur-offset,current offset>> (before encoding the number).
71aaa3f7
PP
929
930. The `:` character.
931
269f6eb3
PP
932. An encoding length in bits amongst:
933+
934--
27d52a19 935The expression evaluates to an `int` or `bool` value::
269f6eb3 936 `8`, `16`, `24`, `32`, `40`, `48`, `56`, and `64`.
27d52a19
PP
937+
938NOTE: Normand automatically converts a `bool` value to `int`.
269f6eb3
PP
939
940The expression evaluates to a `float` value::
941 `32` and `64`.
942--
71aaa3f7 943
56996d34
PP
944. **Optional**: a suffix of the previous encoding length, without
945 any whitespace, amongst:
946+
947--
948[horizontal]
949`be`:: Encode in big endian.
950`le`:: Encode in little endian.
951--
952+
953Without this suffix, the encoding byte order is the <<cur-bo,current
954byte order>> which must be defined if the encoding length is greater
955than eight.
956
ee724c95 957. The `]` suffix.
71aaa3f7
PP
958
959====
960Input:
961
962----
56996d34
PP
963[345:16le]
964[-0xabcd:32be]
71aaa3f7
PP
965----
966
967Output:
968
969----
97059 01 ff ff 54 33
971----
972====
973
974====
975Input:
976
977----
ee724c95 978!be
71aaa3f7
PP
979
980# String length in bits
ee724c95 981[8 * (str_end - str_beg) : 16]
71aaa3f7
PP
982
983# String
984<str_beg>
985 "hello world!"
986<str_end>
987----
988
989Output:
990
991----
99200 60 68 65 6c 6c 6f 20 77 6f 72 6c 64 21 ┆ •`hello world!
993----
994====
995
996====
997Input:
998
999----
ee724c95 1000[20 - ICITTE : 8] * 10
71aaa3f7
PP
1001----
1002
1003Output:
1004
1005----
100614 13 12 11 10 0f 0e 0d 0c 0b
1007----
1008====
1009
269f6eb3
PP
1010====
1011Input:
1012
1013----
56996d34 1014[2 * 0.0529 : 32le]
269f6eb3
PP
1015----
1016
1017Output:
1018
1019----
1020ac ad d8 3d
1021----
1022====
1023
05f81895
PP
1024=== LEB128 integer
1025
1026An _LEB128 integer_ represents a variable number of bytes encoding an
1027unsigned or signed integer which is the result of evaluating a {py3}
1028expression following the https://en.wikipedia.org/wiki/LEB128[LEB128]
1029format.
1030
1031An LEB128 integer is:
1032
ee724c95 1033. The `[` prefix.
05f81895 1034
27d52a19
PP
1035. A valid {py3} expression of which the evaluation result type
1036 is `int` or `bool` (automatically converted to `int`).
05f81895
PP
1037+
1038For an LEB128 integer at some source location{nbsp}__**L**__, this
1039expression may contain:
1040+
1041--
fc21bb27
PP
1042* The name of any <<label,label>> defined before{nbsp}__**L**__
1043 which isn't within a nested group.
320644e2
PP
1044* The name of any <<variable-assignment,variable>> known
1045 at{nbsp}__**L**__.
05f81895
PP
1046--
1047+
269f6eb3
PP
1048The value of the special name `ICITTE` (`int` type) in this expression
1049is the <<cur-offset,current offset>> (before encoding the integer).
05f81895
PP
1050
1051. The `:` character.
1052
1053. One of:
1054+
1055--
1056[horizontal]
1057`uleb128`:: Use the unsigned LEB128 format.
1058`sleb128`:: Use the signed LEB128 format.
1059--
1060
ee724c95 1061. The `]` suffix.
05f81895
PP
1062
1063====
1064Input:
1065
1066----
ee724c95 1067[624485 : uleb128]
05f81895
PP
1068----
1069
1070Output:
1071
1072----
1073e5 8e 26
1074----
1075====
1076
1077====
1078Input:
1079
1080----
1081aa bb cc dd
1082<meow>
1083ee ff
ee724c95 1084[-981238311 + (meow * -23) : sleb128]
05f81895
PP
1085"hello"
1086----
1087
c2b79cf6
PP
1088Output:
1089
05f81895
PP
1090----
1091aa bb cc dd ee ff fd fa 8d ac 7c 68 65 6c 6c 6f ┆ ••••••••••|hello
1092----
1093====
1094
7a7b31e8
PP
1095=== String
1096
1097A _string_ represents a variable number of bytes encoding a string which
1098is the result of evaluating a {py3} expression using the UTF-8, UTF-16,
1099UTF-32, or Latin-1 to Latin-10 encoding.
1100
1101A string has two possible forms:
1102
1103Encoding prefix form:: {empty}
1104+
1105. An encoding amongst:
1106+
1107--
1108[horizontal]
1109`s:u8`::
1110`u8`::
1111 UTF-8.
1112
1113`s:u16be`::
1114`u16be`::
1115 UTF-16BE.
1116
1117`s:u16le`::
1118`u16le`::
1119 UTF-16LE.
1120
1121`s:u32be`::
1122`u32be`::
1123 UTF-32BE.
1124
1125`s:u32le`::
1126`u32le`::
1127 UTF-32LE.
1128
1129`s:latin1`::
1130 ISO/IEC 8859-1.
1131
1132`s:latin2`::
1133 ISO/IEC 8859-2.
1134
1135`s:latin3`::
1136 ISO/IEC 8859-3.
1137
1138`s:latin4`::
1139 ISO/IEC 8859-4.
1140
1141`s:latin5`::
1142 ISO/IEC 8859-9.
1143
1144`s:latin6`::
1145 ISO/IEC 8859-10.
1146
1147`s:latin7`::
1148 ISO/IEC 8859-13.
1149
1150`s:latin8`::
1151 ISO/IEC 8859-14.
1152
1153`s:latin9`::
1154 ISO/IEC 8859-15.
1155
1156`s:latin10`::
1157 ISO/IEC 8859-16.
1158--
1159
1160. The ``pass:[{]`` prefix.
1161
1162. A valid {py3} expression of which the evaluation result type
1163 is `bool`, `int`, `float`, or `str` (the first three automatically
1164 converted to `str`).
1165+
1166For a string at some source location{nbsp}__**L**__, this expression may
1167contain:
1168+
1169--
1170* The name of any <<label,label>> defined before{nbsp}__**L**__
1171 which isn't within a nested group.
1172* The name of any <<variable-assignment,variable>> known
1173 at{nbsp}__**L**__.
1174--
1175+
1176The value of the special name `ICITTE` (`int` type) in this expression
1177is the <<cur-offset,current offset>> (before encoding the string).
1178
1179. The `}` suffix.
1180
1181Encoding suffix form:: {empty}
1182+
ee724c95 1183. The `[` prefix.
7a7b31e8
PP
1184
1185. A valid {py3} expression of which the evaluation result type
1186 is `bool`, `int`, `float`, or `str` (the first three automatically
1187 converted to `str`).
1188+
1189For a string at some source location{nbsp}__**L**__, this expression may
1190contain:
1191+
1192--
1193* The name of any <<label,label>> defined before{nbsp}__**L**__
1194 which isn't within a nested group.
1195* The name of any <<variable-assignment,variable>> known
1196 at{nbsp}__**L**__.
1197--
1198+
1199The value of the special name `ICITTE` (`int` type) in this expression
1200is the <<cur-offset,current offset>> (before encoding the string).
1201
1202. The `:` character.
1203
1204. A string encoding amongst:
1205+
1206--
1207[horizontal]
1208`s:u8`::
1209 UTF-8.
1210
1211`s:u16be`::
1212 UTF-16BE.
1213
1214`s:u16le`::
1215 UTF-16LE.
1216
1217`s:u32be`::
1218 UTF-32BE.
1219
1220`s:u32le`::
1221 UTF-32LE.
1222
1223`s:latin1`::
1224 ISO/IEC 8859-1.
1225
1226`s:latin2`::
1227 ISO/IEC 8859-2.
1228
1229`s:latin3`::
1230 ISO/IEC 8859-3.
1231
1232`s:latin4`::
1233 ISO/IEC 8859-4.
1234
1235`s:latin5`::
1236 ISO/IEC 8859-9.
1237
1238`s:latin6`::
1239 ISO/IEC 8859-10.
1240
1241`s:latin7`::
1242 ISO/IEC 8859-13.
1243
1244`s:latin8`::
1245 ISO/IEC 8859-14.
1246
1247`s:latin9`::
1248 ISO/IEC 8859-15.
1249
1250`s:latin10`::
1251 ISO/IEC 8859-16.
1252--
1253
ee724c95 1254. The `]` suffix.
7a7b31e8
PP
1255
1256====
1257Input:
1258
1259----
1260{iter = 1}
1261
1262!repeat 10
ee724c95 1263 u8{iter} " "
7a7b31e8
PP
1264 {iter = iter + 1}
1265!end
1266----
1267
1268Output:
1269
1270----
127131 20 32 20 33 20 34 20 35 20 36 20 37 20 38 20 ┆ 1 2 3 4 5 6 7 8
127239 20 31 30 20 ┆ 9 10
1273----
1274====
1275
1276====
1277Input:
1278
1279----
1280{meow = 'salut jérémie'}
ee724c95 1281[meow.upper() : s:latin1]
7a7b31e8
PP
1282----
1283
1284Output:
1285
1286----
128753 41 4c 55 54 20 4a c9 52 c9 4d 49 45 ┆ SALUT J•R•MIE
1288----
1289====
1290
71aaa3f7
PP
1291=== Current offset setting
1292
1293This special item sets the <<cur-offset,_current offset_>>.
1294
1295A current offset setting is:
1296
1297. The `<` prefix.
1298
fc21bb27
PP
1299. A <<const-int,positive constant integer>> which is the new current
1300 offset.
71aaa3f7
PP
1301
1302. The `>` suffix.
1303
1304====
1305Input:
1306
1307----
ee724c95
PP
1308 [ICITTE : 8] * 8
1309<0x61> [ICITTE : 8] * 8
71aaa3f7
PP
1310----
1311
1312Output:
1313
1314----
131500 01 02 03 04 05 06 07 61 62 63 64 65 66 67 68 ┆ ••••••••abcdefgh
1316----
1317====
1318
1319====
1320Input:
1321
1322----
1323aa bb cc dd <meow> ee ff
1324<12> 11 22 33 <mix> 44 55
ee724c95 1325[meow : 8] [mix : 8]
71aaa3f7
PP
1326----
1327
1328Output:
1329
1330----
1331aa bb cc dd ee ff 11 22 33 44 55 04 0f ┆ •••••••"3DU••
1332----
1333====
1334
676f6189
PP
1335=== Current offset alignment
1336
00deb9fa 1337A _current offset alignment_ represents zero or more padding bytes to
676f6189
PP
1338make the <<cur-offset,current offset>> meet a given
1339https://en.wikipedia.org/wiki/Data_structure_alignment[alignment] value.
1340
1341More specifically, for an alignment value of{nbsp}__**N**__{nbsp}bits,
1342a current offset alignment represents the required padding bytes until
1343the current offset is a multiple of __**N**__{nbsp}/{nbsp}8.
1344
1345A current offset alignment is:
1346
1347. The `@` prefix.
1348
fc21bb27
PP
1349. A <<const-int,positive constant integer>> which is the alignment value
1350 in _bits_.
676f6189
PP
1351+
1352This value must be greater than zero and a multiple of{nbsp}8.
1353
1354. **Optional**:
1355+
1356--
1357. The ``pass:[~]`` prefix.
fc21bb27
PP
1358. A <<const-int,positive constant integer>> which is the value of the
1359 byte to use as padding to align the <<cur-offset,current offset>>.
676f6189
PP
1360--
1361+
1362Without this section, the padding byte value is zero.
1363
1364====
1365Input:
1366
1367----
136811 22 (@32 aa bb cc) * 3
1369----
1370
1371Output:
1372
1373----
137411 22 00 00 aa bb cc 00 aa bb cc 00 aa bb cc
1375----
1376====
1377
1378====
1379Input:
1380
1381----
ee724c95 1382!le
676f6189 138377 88
ee724c95 1384@32~0xcc [-893.5:32]
676f6189
PP
1385@128~0x55 "meow"
1386----
1387
1388Output:
1389
1390----
139177 88 cc cc 00 60 5f c4 55 55 55 55 55 55 55 55 ┆ w••••`_•UUUUUUUU
13926d 65 6f 77 ┆ meow
1393----
1394====
1395
1396====
1397Input:
1398
1399----
1400aa bb cc <29> @64~255 "zoom"
1401----
1402
1403Output:
1404
1405----
1406aa bb cc ff ff ff 7a 6f 6f 6d ┆ ••••••zoom
1407----
1408====
1409
25ca454b
PP
1410=== Filling
1411
1412A _filling_ represents zero or more padding bytes to make the
1413<<cur-offset,current offset>> reach a given value.
1414
1415A filling is:
1416
1417. The ``pass:[+]`` prefix.
1418
1419. One of:
1420
fc21bb27
PP
1421** A <<const-int,positive constant integer>> which is the current offset
1422 target.
25ca454b
PP
1423
1424** The ``pass:[{]`` prefix, a valid {py3} expression of which the
1425 evaluation result type is `int` or `bool` (automatically converted to
ee724c95 1426 `int`), and the `}` suffix.
25ca454b
PP
1427+
1428For a filling at some source location{nbsp}__**L**__, this expression
1429may contain:
1430+
1431--
1432* The name of any <<label,label>> defined before{nbsp}__**L**__
1433 which isn't within a nested group.
1434* The name of any <<variable-assignment,variable>> known
1435 at{nbsp}__**L**__.
1436--
1437+
1438The value of the special name `ICITTE` (`int` type) in this expression
1439is the <<cur-offset,current offset>> (before handling the items to
1440repeat).
1441
1442** A valid {py3} name.
1443+
1444For the name `__NAME__`, this is equivalent to the
ee724c95 1445`pass:[{]__NAME__}` form above.
25ca454b
PP
1446
1447+
1448This value must be greater than or equal to the current offset where
1449it's used.
1450
1451. **Optional**:
1452+
1453--
1454. The ``pass:[~]`` prefix.
fc21bb27
PP
1455. A <<const-int,positive constant integer>> which is the value of the
1456 byte to use as padding to reach the current offset target.
25ca454b
PP
1457--
1458+
1459Without this section, the padding byte value is zero.
1460
1461====
1462Input:
1463
1464----
1465aa bb cc dd
1466+0x40
1467"hello world"
1468----
1469
1470Output:
1471
1472----
1473aa bb cc dd 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
147400 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
147500 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
147600 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
147768 65 6c 6c 6f 20 77 6f 72 6c 64 ┆ hello world
1478----
1479====
1480
1481====
1482Input:
1483
1484----
1485!macro part(iter, fill)
ee724c95 1486 <0> "particular security " [ord('0') + iter : 8] +fill~0x80
25ca454b
PP
1487!end
1488
1489{iter = 1}
1490
1491!repeat 5
1492 m:part(iter, {32 + 4 * iter})
1493 {iter = iter + 1}
1494!end
1495----
1496
1497Output:
1498
1499----
150070 61 72 74 69 63 75 6c 61 72 20 73 65 63 75 72 ┆ particular secur
150169 74 79 20 31 80 80 80 80 80 80 80 80 80 80 80 ┆ ity 1•••••••••••
150280 80 80 80 70 61 72 74 69 63 75 6c 61 72 20 73 ┆ ••••particular s
150365 63 75 72 69 74 79 20 32 80 80 80 80 80 80 80 ┆ ecurity 2•••••••
150480 80 80 80 80 80 80 80 80 80 80 80 70 61 72 74 ┆ ••••••••••••part
150569 63 75 6c 61 72 20 73 65 63 75 72 69 74 79 20 ┆ icular security
150633 80 80 80 80 80 80 80 80 80 80 80 80 80 80 80 ┆ 3•••••••••••••••
150780 80 80 80 80 80 80 80 70 61 72 74 69 63 75 6c ┆ ••••••••particul
150861 72 20 73 65 63 75 72 69 74 79 20 34 80 80 80 ┆ ar security 4•••
150980 80 80 80 80 80 80 80 80 80 80 80 80 80 80 80 ┆ ••••••••••••••••
151080 80 80 80 80 80 80 80 70 61 72 74 69 63 75 6c ┆ ••••••••particul
151161 72 20 73 65 63 75 72 69 74 79 20 35 80 80 80 ┆ ar security 5•••
151280 80 80 80 80 80 80 80 80 80 80 80 80 80 80 80 ┆ ••••••••••••••••
151380 80 80 80 80 80 80 80 80 80 80 80 ┆ ••••••••••••
1514----
1515====
1516
71aaa3f7
PP
1517=== Label
1518
1519A _label_ associates a name to the <<cur-offset,current offset>>.
1520
1521All the labels of a whole Normand input must have unique names.
1522
05f81895 1523A label must not share the name of a <<variable-assignment,variable>>
71aaa3f7
PP
1524name.
1525
71aaa3f7
PP
1526A label is:
1527
1528. The `<` prefix.
1529
27d52a19 1530. A valid {py3} name which is not `ICITTE`.
71aaa3f7
PP
1531
1532. The `>` suffix.
1533
1534=== Variable assignment
1535
1536A _variable assignment_ associates a name to the integral result of an
1537evaluated {py3} expression.
1538
05f81895 1539A variable assignment is:
71aaa3f7
PP
1540
1541. The ``pass:[{]`` prefix.
1542
27d52a19 1543. A valid {py3} name which is not `ICITTE`.
71aaa3f7
PP
1544
1545. The `=` character.
1546
7a7b31e8
PP
1547. A valid {py3} expression of which the evaluation result type is `int`,
1548 `float`, or `bool` (automatically converted to `int`), or `str`.
05f81895
PP
1549+
1550For a variable assignment at some source location{nbsp}__**L**__, this
320644e2
PP
1551expression may contain:
1552+
1553--
1554* The name of any <<label,label>> defined before{nbsp}__**L**__
1555 which isn't within a nested group.
1556* The name of any <<variable-assignment,variable>> known
1557 at{nbsp}__**L**__.
1558--
05f81895 1559+
269f6eb3
PP
1560The value of the special name `ICITTE` (`int` type) in this expression
1561is the <<cur-offset,current offset>>.
71aaa3f7
PP
1562
1563. The `}` suffix.
1564
1565====
1566Input:
1567
1568----
ee724c95
PP
1569{mix = 101} !le
1570{meow = 42} 11 22 [meow:8] 33 {meow = ICITTE + 17}
1571"yooo" [meow + mix : 16]
71aaa3f7
PP
1572----
1573
1574Output:
1575
1576----
157711 22 2a 33 79 6f 6f 6f 7a 00 ┆ •"*3yoooz•
1578----
1579====
1580
1581=== Group
1582
1583A _group_ is a scoped sequence of items.
1584
1585The <<label,labels>> within a group aren't visible outside of it.
1586
e57a18e1
PP
1587The main purpose of a group is to <<post-item-repetition,repeat>> more
1588than a single item and to isolate labels.
71aaa3f7
PP
1589
1590A group is:
1591
261c5ecf 1592. The `(`, `!group`, or `!g` opening.
71aaa3f7 1593
cd33dfe6 1594. Zero or more items except, recursively, a macro definition block.
71aaa3f7 1595
261c5ecf
PP
1596. Depending on the group opening:
1597+
1598--
1599`(`::
1600 The `)` closing.
1601
1602`!group`::
1603`!g`::
1604 The `!end` closing.
1605--
71aaa3f7
PP
1606
1607====
1608Input:
1609
1610----
1611((aa bb cc) dd () ee) "leclerc"
1612----
1613
1614Output:
1615
1616----
1617aa bb cc dd ee 6c 65 63 6c 65 72 63 ┆ •••••leclerc
1618----
1619====
1620
1621====
1622Input:
1623
1624----
261c5ecf
PP
1625!group
1626 (aa bb cc) * 3 dd ee
1627!end * 5
71aaa3f7
PP
1628----
1629
1630Output:
1631
1632----
1633aa bb cc aa bb cc aa bb cc dd ee aa bb cc aa bb
1634cc aa bb cc dd ee aa bb cc aa bb cc aa bb cc dd
1635ee aa bb cc aa bb cc aa bb cc dd ee aa bb cc aa
1636bb cc aa bb cc dd ee
1637----
1638====
1639
1640====
1641Input:
1642
1643----
ee724c95 1644!be
71aaa3f7
PP
1645(
1646 <str_beg> u16le"sébastien diaz" <str_end>
ee724c95
PP
1647 [ICITTE - str_beg : 8]
1648 [(end - str_beg) * 5 : 24]
71aaa3f7
PP
1649) * 3
1650<end>
1651----
1652
1653Output:
1654
1655----
165673 00 e9 00 62 00 61 00 73 00 74 00 69 00 65 00 ┆ s•••b•a•s•t•i•e•
16576e 00 20 00 64 00 69 00 61 00 7a 00 1c 00 01 e0 ┆ n• •d•i•a•z•••••
165873 00 e9 00 62 00 61 00 73 00 74 00 69 00 65 00 ┆ s•••b•a•s•t•i•e•
16596e 00 20 00 64 00 69 00 61 00 7a 00 1c 00 01 40 ┆ n• •d•i•a•z••••@
166073 00 e9 00 62 00 61 00 73 00 74 00 69 00 65 00 ┆ s•••b•a•s•t•i•e•
16616e 00 20 00 64 00 69 00 61 00 7a 00 1c 00 00 a0 ┆ n• •d•i•a•z•••••
1662----
1663====
1664
27d52a19
PP
1665=== Conditional block
1666
12b5dbc0
PP
1667A _conditional block_ represents either the bytes of zero or more items
1668if some expression is true, or the bytes of zero or more other items if
1669it's false.
27d52a19
PP
1670
1671A conditional block is:
1672
261c5ecf 1673. The `!if` opening.
27d52a19
PP
1674
1675. One of:
1676
1677** The ``pass:[{]`` prefix, a valid {py3} expression of which the
1678 evaluation result type is `int` or `bool` (automatically converted to
ee724c95 1679 `int`), and the `}` suffix.
27d52a19 1680+
320644e2
PP
1681For a conditional block at some source location{nbsp}__**L**__, this
1682expression may contain:
27d52a19
PP
1683+
1684--
1685* The name of any <<label,label>> defined before{nbsp}__**L**__
1686 which isn't within a nested group.
1687* The name of any <<variable-assignment,variable>> known
320644e2 1688 at{nbsp}__**L**__.
27d52a19
PP
1689--
1690+
1691The value of the special name `ICITTE` (`int` type) in this expression
1692is the <<cur-offset,current offset>> (before handling the contained
1693items).
1694
1695** A valid {py3} name.
1696+
1697For the name `__NAME__`, this is equivalent to the
ee724c95 1698`pass:[{]__NAME__}` form above.
27d52a19 1699
cd33dfe6
PP
1700. Zero or more items to be handled when the condition is true
1701 except, recursively, a macro definition block.
12b5dbc0
PP
1702
1703. **Optional**:
1704
1705.. The `!else` opening.
cd33dfe6
PP
1706.. Zero or more items to be handled when the condition is false
1707 except, recursively, a macro definition block
27d52a19 1708
261c5ecf 1709. The `!end` closing.
27d52a19
PP
1710
1711====
1712Input:
1713
1714----
1715{at = 1}
1716{rep_count = 9}
1717
1718!repeat rep_count
1719 "meow "
1720
1721 !if {ICITTE > 25}
1722 "mix"
12b5dbc0
PP
1723 !else
1724 "zoom"
27d52a19
PP
1725 !end
1726
12b5dbc0
PP
1727 !if {at < rep_count} 20 !end
1728
27d52a19
PP
1729 {at = at + 1}
1730!end
1731----
1732
1733Output:
1734
1735----
12b5dbc0
PP
17366d 65 6f 77 20 7a 6f 6f 6d 20 6d 65 6f 77 20 7a ┆ meow zoom meow z
17376f 6f 6d 20 6d 65 6f 77 20 7a 6f 6f 6d 20 6d 65 ┆ oom meow zoom me
17386f 77 20 6d 69 78 20 6d 65 6f 77 20 6d 69 78 20 ┆ ow mix meow mix
17396d 65 6f 77 20 6d 69 78 20 6d 65 6f 77 20 6d 69 ┆ meow mix meow mi
27d52a19 174078 20 6d 65 6f 77 20 6d 69 78 20 6d 65 6f 77 20 ┆ x meow mix meow
12b5dbc0 17416d 69 78 ┆ mix
27d52a19
PP
1742----
1743====
1744
1745====
1746Input:
1747
1748----
1749<str_beg>
1750u16le"meow mix!"
1751<str_end>
1752
1753!if {str_end - str_beg > 10}
1754 " BIG"
1755!end
1756----
1757
1758Output:
1759
1760----
17616d 00 65 00 6f 00 77 00 20 00 6d 00 69 00 78 00 ┆ m•e•o•w• •m•i•x•
176221 00 20 42 49 47 ┆ !• BIG
1763----
1764====
1765
e57a18e1 1766=== Repetition block
71aaa3f7 1767
e57a18e1
PP
1768A _repetition block_ represents the bytes of one or more items repeated
1769a given number of times.
676f6189 1770
e57a18e1 1771A repetition block is:
71aaa3f7 1772
261c5ecf 1773. The `!repeat` or `!r` opening.
71aaa3f7 1774
2adf4336
PP
1775. One of:
1776
fc21bb27
PP
1777** A <<const-int,positive constant integer>> which is the number of
1778 times to repeat the previous item.
2adf4336 1779
27d52a19
PP
1780** The ``pass:[{]`` prefix, a valid {py3} expression of which the
1781 evaluation result type is `int` or `bool` (automatically converted to
ee724c95 1782 `int`), and the `}` suffix.
05f81895 1783+
320644e2
PP
1784For a repetition block at some source location{nbsp}__**L**__, this
1785expression may contain:
05f81895
PP
1786+
1787--
27d52a19
PP
1788* The name of any <<label,label>> defined before{nbsp}__**L**__
1789 which isn't within a nested group.
05f81895 1790* The name of any <<variable-assignment,variable>> known
320644e2 1791 at{nbsp}__**L**__.
05f81895
PP
1792--
1793+
e57a18e1
PP
1794The value of the special name `ICITTE` (`int` type) in this expression
1795is the <<cur-offset,current offset>> (before handling the items to
1796repeat).
1797
1798** A valid {py3} name.
1799+
1800For the name `__NAME__`, this is equivalent to the
ee724c95 1801`pass:[{]__NAME__}` form above.
e57a18e1 1802
cd33dfe6 1803. Zero or more items except, recursively, a macro definition block.
e57a18e1 1804
261c5ecf 1805. The `!end` closing.
e57a18e1
PP
1806
1807You may also use a <<post-item-repetition,post-item repetition>> after
1808some items. The form ``!repeat{nbsp}__X__{nbsp}__ITEMS__{nbsp}!end``
1809is equivalent to ``(__ITEMS__){nbsp}pass:[*]{nbsp}__X__``.
71aaa3f7
PP
1810
1811====
1812Input:
1813
1814----
fc21bb27 1815!repeat 0o400
ee724c95 1816 [end - ICITTE - 1 : 8]
e57a18e1
PP
1817!end
1818
1819<end>
71aaa3f7
PP
1820----
1821
1822Output:
1823
1824----
1825ff fe fd fc fb fa f9 f8 f7 f6 f5 f4 f3 f2 f1 f0 ┆ ••••••••••••••••
1826ef ee ed ec eb ea e9 e8 e7 e6 e5 e4 e3 e2 e1 e0 ┆ ••••••••••••••••
1827df de dd dc db da d9 d8 d7 d6 d5 d4 d3 d2 d1 d0 ┆ ••••••••••••••••
1828cf ce cd cc cb ca c9 c8 c7 c6 c5 c4 c3 c2 c1 c0 ┆ ••••••••••••••••
1829bf be bd bc bb ba b9 b8 b7 b6 b5 b4 b3 b2 b1 b0 ┆ ••••••••••••••••
1830af ae ad ac ab aa a9 a8 a7 a6 a5 a4 a3 a2 a1 a0 ┆ ••••••••••••••••
18319f 9e 9d 9c 9b 9a 99 98 97 96 95 94 93 92 91 90 ┆ ••••••••••••••••
18328f 8e 8d 8c 8b 8a 89 88 87 86 85 84 83 82 81 80 ┆ ••••••••••••••••
18337f 7e 7d 7c 7b 7a 79 78 77 76 75 74 73 72 71 70 ┆ •~}|{zyxwvutsrqp
18346f 6e 6d 6c 6b 6a 69 68 67 66 65 64 63 62 61 60 ┆ onmlkjihgfedcba`
18355f 5e 5d 5c 5b 5a 59 58 57 56 55 54 53 52 51 50 ┆ _^]\[ZYXWVUTSRQP
18364f 4e 4d 4c 4b 4a 49 48 47 46 45 44 43 42 41 40 ┆ ONMLKJIHGFEDCBA@
18373f 3e 3d 3c 3b 3a 39 38 37 36 35 34 33 32 31 30 ┆ ?>=<;:9876543210
18382f 2e 2d 2c 2b 2a 29 28 27 26 25 24 23 22 21 20 ┆ /.-,+*)('&%$#"!
18391f 1e 1d 1c 1b 1a 19 18 17 16 15 14 13 12 11 10 ┆ ••••••••••••••••
18400f 0e 0d 0c 0b 0a 09 08 07 06 05 04 03 02 01 00 ┆ ••••••••••••••••
1841----
1842====
1843
2adf4336
PP
1844====
1845Input:
1846
1847----
1848{times = 1}
e57a18e1 1849
2adf4336 1850aa bb cc dd
e57a18e1
PP
1851
1852!repeat 3
2adf4336 1853 <here>
e57a18e1
PP
1854
1855 !repeat {here + 1}
1856 ee ff
1857 !end
1858
1859 11 22 !repeat times 33 !end
1860
2adf4336 1861 {times = times + 1}
e57a18e1
PP
1862!end
1863
2adf4336
PP
1864"coucou!"
1865----
1866
1867Output:
1868
1869----
1870aa bb cc dd ee ff ee ff ee ff ee ff ee ff 11 22 ┆ •••••••••••••••"
187133 ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ 3•••••••••••••••
1872ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1873ff ee ff ee ff 11 22 33 33 ee ff ee ff ee ff ee ┆ ••••••"33•••••••
1874ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1875ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1876ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1877ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1878ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1879ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1880ff ee ff ee ff ee ff ee ff ee ff ee ff 11 22 33 ┆ ••••••••••••••"3
188133 33 63 6f 75 63 6f 75 21 ┆ 33coucou!
1882----
1883====
1884
cd33dfe6
PP
1885=== Transformation block
1886
1887A _transformation block_ represents the bytes of one or more items
1888transformed into other bytes by a function.
1889
1890As of this version, Normand only offers a predetermined set of
1891transformation functions.
1892
1893An encoded block is:
1894
1895. The `!transform` or `!t` opening.
1896
1897. A transformation function name amongst:
1898+
1899--
1900[horizontal]
1901`base64`::
1902`b64`::
1903 Standard https://datatracker.ietf.org/doc/html/rfc4648.html#section-4[Base64].
1904
1905`base64u`::
1906`b64u`::
1907 URL-safe Base64, using `-` instead of `pass:[+]` and `_` instead of
1908 `/`.
1909
1910`base32`::
1911`b32`::
1912 Standard https://datatracker.ietf.org/doc/html/rfc4648.html#section-6[Base32].
1913
1914`base16`::
1915`b16`::
1916 Standard https://datatracker.ietf.org/doc/html/rfc4648.html#section-8[Base16].
1917
1918`ascii85`::
1919`a85`::
1920 https://en.wikipedia.org/wiki/Ascii85[Ascii85] without padding.
1921
1922`ascii85p`::
1923`a85p`::
1924 Ascii85 with padding.
1925
1926`base85`::
1927`b85`::
1928 https://en.wikipedia.org/wiki/Ascii85[Base85] (like Git-style binary
1929 diffs) without padding.
1930
1931`base85p`::
1932`b85p`::
1933 Base85 with padding.
1934
1935`quopri`::
1936`qp`::
1937 MIME
1938 https://datatracker.ietf.org/doc/html/rfc2045#section-6.7[quoted-printable]
1939 without quoted whitespaces.
1940
1941`quoprit`::
1942`qpt`::
1943 MIME quoted-printable with quoted whitespaces.
1944
1945`gzip`::
1946`gz`::
1947 https://en.wikipedia.org/wiki/Gzip[gzip].
1948
1949`bzip2`::
1950`bz2`::
1951 https://en.wikipedia.org/wiki/Bzip2[bzip2].
1952--
1953
1954. Zero or more items except, recursively, a macro definition block.
1955+
1956Any {py3} expression within any of those items may not refer to a future
1957<<label,label>>.
1958+
1959The value of the special name `ICITTE` in any {py3} expression within
1960any of those items is the <<cur-offset,current offset>> _before_ Normand
1961applies the transformation function. Therefore, labels defined within
1962those items also have the current offset value _before_ Normand applies
1963the transformation function.
1964
1965. The `!end` closing.
1966
1967The <<cur-offset,current offset>> after having handled the last item of
1968a transformation block is the value of the current offset before
1969handling the first item plus the size of the generated (transformed)
1970bytes. In other words, <<current-offset-setting,current offset
1971settings>> within the items of the block have no impact outside said
1972block.
1973
1974====
1975Input:
1976
1977----
1978aa bb cc dd
1979
ee724c95 1980"size of compressed section: " [end - start : 8]
cd33dfe6
PP
1981
1982<start>
1983
1984!transform bzip2
1985 "this will be compressed!"
1986 89*100 00*5000
1987!end
1988
1989<end>
1990
1991"yes!"
1992----
1993
1994Output:
1995
1996----
1997aa bb cc dd 73 69 7a 65 20 6f 66 20 63 6f 6d 70 ┆ ••••size of comp
199872 65 73 73 65 64 20 73 65 63 74 69 6f 6e 3a 20 ┆ ressed section:
199952 42 5a 68 39 31 41 59 26 53 59 68 e1 8c fc 00 ┆ RBZh91AY&SYh••••
200000 33 d1 e0 c0 00 60 00 5e 66 dc 80 00 20 00 80 ┆ •3••••`•^f••• ••
200100 08 20 00 31 40 d3 43 23 26 20 ca 87 a9 a1 e8 ┆ •• •1@•C#& •••••
200218 29 44 80 9c 80 49 bf cc b3 e8 45 ed e2 76 ad ┆ •)D•••I••••E••v•
20030f 12 8b 8a d6 cd 40 04 7e 2e e4 8a 70 a1 20 d1 ┆ ••••••@•~.••p• •
2004c3 19 f8 79 65 73 21 ┆ •••yes!
2005----
2006====
2007
2008====
2009Input:
2010
2011----
201288*16
2013
2014!t a85
2015 "I am determined to be cheerful and happy in whatever situation "
2016 "I may find myself. For I have learned that the greater part of "
2017 "our misery or unhappiness is determined not by our circumstance "
2018 "but by our disposition."
2019!end
2020
2021@128~99h
2022
ee724c95 2023!t qp <beg> [ICITTE - beg : 8] * 50 !end
cd33dfe6
PP
2024----
2025
2026Output:
2027
2028----
202988 88 88 88 88 88 88 88 88 88 88 88 88 88 88 88 ┆ ••••••••••••••••
203038 4b 5f 47 59 2b 43 6f 26 2a 41 54 44 58 25 44 ┆ 8K_GY+Co&*ATDX%D
203149 6d 3f 24 46 44 69 3a 32 41 4b 59 4a 72 41 53 ┆ Im?$FDi:2AKYJrAS
203223 6d 6f 46 5f 69 31 2f 44 49 61 6c 27 40 3b 70 ┆ #moF_i1/DIal'@;p
203331 32 2b 44 47 5e 39 47 41 28 45 2c 41 54 68 58 ┆ 12+DG^9GA(E,AThX
20342a 2b 45 4d 37 3d 46 5e 5d 42 2b 44 66 2d 5b 68 ┆ *+EM7=F^]B+Df-[h
20352b 44 6b 50 34 2b 44 2c 3e 2a 41 30 3e 60 37 46 ┆ +DkP4+D,>*A0>`7F
203628 4b 30 22 2f 67 2a 57 25 45 5a 64 70 72 42 4f ┆ (K0"/g*W%EZdprBO
203751 27 71 2b 44 62 55 74 45 63 2c 48 21 2b 45 56 ┆ Q'q+DbUtEc,H!+EV
20383a 2a 46 3c 47 5b 3d 41 4b 59 57 2b 41 52 54 5b ┆ :*F<G[=AKYW+ART[
20396c 45 5a 66 3d 30 45 63 60 46 42 41 66 75 23 37 ┆ lEZf=0Ec`FBAfu#7
204045 5a 66 34 35 46 28 4b 42 3b 2b 45 29 39 43 46 ┆ EZf45F(KB;+E)9CF
204160 28 6c 24 45 2c 5d 4e 2f 41 54 4d 6f 38 42 6c ┆ `(l$E,]N/ATMo8Bl
204262 44 2d 41 54 56 4c 28 44 2f 21 6d 21 41 30 3e ┆ bD-ATVL(D/!m!A0>
204363 2e 46 3c 47 25 3c 2b 45 29 43 43 2b 43 66 2c ┆ c.F<G%<+E)CC+Cf,
20442b 40 73 29 58 30 46 43 42 26 73 41 4b 59 48 29 ┆ +@s)X0FCB&sAKYH)
204546 3c 47 25 3c 2b 45 29 43 43 2b 43 6f 32 2d 45 ┆ F<G%<+E)CC+Co2-E
20462c 54 66 33 46 44 35 5a 32 2f 63 99 99 99 99 99 ┆ ,Tf3FD5Z2/c•••••
20473d 30 30 3d 30 31 3d 30 32 3d 30 33 3d 30 34 3d ┆ =00=01=02=03=04=
204830 35 3d 30 36 3d 30 37 3d 30 38 3d 30 39 0a 3d ┆ 05=06=07=08=09•=
204930 42 3d 30 43 0d 3d 30 45 3d 30 46 3d 31 30 3d ┆ 0B=0C•=0E=0F=10=
205031 31 3d 31 32 3d 31 33 3d 31 34 3d 31 35 3d 31 ┆ 11=12=13=14=15=1
205136 3d 31 37 3d 31 38 3d 31 39 3d 31 41 3d 31 42 ┆ 6=17=18=19=1A=1B
20523d 31 43 3d 31 44 3d 31 45 3d 31 46 20 21 22 23 ┆ =1C=1D=1E=1F !"#
205324 25 26 27 28 29 2a 2b 2c 2d 3d 0a 2e 2f 30 31 ┆ $%&'()*+,-=•./01
2054----
2055====
2056
320644e2
PP
2057=== Macro definition block
2058
2059A _macro definition block_ associates a name and parameter names to
2060a group of items.
2061
2062A macro definition block doesn't lead to generated bytes itself: a
2063<<macro-expansion,macro expansion>> does so.
2064
2065A macro definition may only exist at the root level, that is, not within
2066a <<group,group>>, a <<repetition-block,repetition block>>, a
2067<<conditional-block,conditional block>>, or another
2068<<macro-definition-block,macro definition block>>.
2069
2070All macro definitions must have unique names.
2071
2072A macro definition is:
2073
2074. The `!macro` or `!m` opening.
2075
2076. A valid {py3} name (the macro name).
2077
2078. The `(` parameter name list prefix.
2079
2080. A comma-separated list of zero or more unique parameter names,
2081 each one being a valid {py3} name.
2082
2083. The `)` parameter name list suffix.
2084
2085. Zero or more items except, recursively, a macro definition block.
2086
2087. The `!end` closing.
2088
2089====
2090----
2091!macro bake()
ee724c95 2092 !le [ICITTE * 8 : 16]
320644e2
PP
2093 u16le"predict explode"
2094!end
2095----
2096====
2097
2098====
2099----
2100!macro nail(rep, with_extra, val)
2101 {iter = 1}
2102
2103 !repeat rep
ee724c95
PP
2104 [val + iter : uleb128]
2105 [0xdeadbeef : 32]
320644e2
PP
2106 {iter = iter + 1}
2107 !end
2108
2109 !if with_extra
2110 "meow mix\0"
2111 !end
2112!end
2113----
2114====
2115
2116=== Macro expansion
2117
2118A _macro expansion_ expands the items of a defined
2119<<macro-definition-block,macro>>.
2120
2121The macro to expand must be defined _before_ the expansion.
2122
2123The <<state,state>> before handling the first item of the chosen macro
2124is:
2125
2126<<cur-offset,Current offset>>::
2127 Unchanged.
2128
2129<<cur-bo,Current byte order>>::
2130 Unchanged.
2131
2132Variables::
2133 The only available variables initially are the macro parameters.
2134
2135Labels::
2136 None.
2137
2138The state after having handled the last item of the chosen macro is:
2139
2140Current offset::
2141 The one before handling the first item of the macro plus the size
2142 of the generated data of the macro expansion.
2143+
2144IMPORTANT: This means <<current-offset-setting,current offset setting>>
2145items within the expanded macro don't impact the final current offset.
2146
2147Current byte order::
2148 The one before handling the first item of the macro.
2149
2150Variables::
2151 The ones before handling the first item of the macro.
2152
2153Labels::
2154 The ones before handling the first item of the macro.
2155
2156A macro expansion is:
2157
2158. The `m:` prefix.
2159
2160. A valid {py3} name (the name of the macro to expand).
2161
2162. The `(` parameter value list prefix.
2163
2164. A comma-separated list of zero or more unique parameter values.
2165+
2166The number of parameter values must match the number of parameter
2167names of the definition of the chosen macro.
2168+
2169A parameter value is one of:
2170+
2171--
fc21bb27 2172* A <<const-int,constant integer>>, possibly negative.
320644e2 2173
dbd84e74
PP
2174* A constant floating point number.
2175
320644e2
PP
2176* The ``pass:[{]`` prefix, a valid {py3} expression of which the
2177 evaluation result type is `int` or `bool` (automatically converted to
ee724c95 2178 `int`), and the `}` suffix.
320644e2
PP
2179+
2180For a macro expansion at some source location{nbsp}__**L**__, this
2181expression may contain:
2182
2183** The name of any <<label,label>> defined before{nbsp}__**L**__
2184 which isn't within a nested group.
2185** The name of any <<variable-assignment,variable>> known
2186 at{nbsp}__**L**__.
2187
2188+
2189The value of the special name `ICITTE` (`int` type) in this expression
2190is the <<cur-offset,current offset>> (before handling the items of the
2191chosen macro).
2192
2193* A valid {py3} name.
2194+
2195For the name `__NAME__`, this is equivalent to the
2196`pass:[{]__NAME__pass:[}]` form above.
2197--
2198
2199. The `)` parameter value list suffix.
2200
2201====
2202Input:
2203
2204----
2205!macro bake()
ee724c95 2206 !le [ICITTE * 8 : 16]
320644e2
PP
2207 u16le"predict explode"
2208!end
2209
2210"hello [" m:bake() "] world"
2211
2212m:bake() * 5
2213----
2214
2215Output:
2216
2217----
221868 65 6c 6c 6f 20 5b 38 00 70 00 72 00 65 00 64 ┆ hello [8•p•r•e•d
221900 69 00 63 00 74 00 20 00 65 00 78 00 70 00 6c ┆ •i•c•t• •e•x•p•l
222000 6f 00 64 00 65 00 5d 20 77 6f 72 6c 64 70 01 ┆ •o•d•e•] worldp•
222170 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
222265 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 02 ┆ e•x•p•l•o•d•e•p•
222370 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
222465 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 03 ┆ e•x•p•l•o•d•e•p•
222570 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
222665 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 04 ┆ e•x•p•l•o•d•e•p•
222770 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
222865 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 05 ┆ e•x•p•l•o•d•e•p•
222970 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
223065 00 78 00 70 00 6c 00 6f 00 64 00 65 00 ┆ e•x•p•l•o•d•e•
2231----
2232====
2233
2234====
2235Input:
2236
2237----
2238!macro A(val, is_be)
ee724c95 2239 !le
320644e2
PP
2240
2241 !if is_be
ee724c95 2242 !be
320644e2
PP
2243 !end
2244
ee724c95 2245 [val : 16]
320644e2
PP
2246!end
2247
2248!macro B(rep, is_be)
2249 {iter = 1}
2250
2251 !repeat rep
2252 m:A({iter * 3}, is_be)
2253 {iter = iter + 1}
2254 !end
2255!end
2256
2257m:B(5, 1)
2258m:B(3, 0)
2259----
2260
2261Output:
2262
2263----
226400 03 00 06 00 09 00 0c 00 0f 03 00 06 00 09 00
2265----
2266====
2267
dbd84e74
PP
2268====
2269Input:
2270
2271----
ee724c95 2272!macro flt32be(val) !be [val : 32] !end
dbd84e74
PP
2273
2274"CHEETOS"
2275m:flt32be(-42.17)
2276m:flt32be(56.23e-4)
2277----
2278
2279Output:
2280
2281----
228243 48 45 45 54 4f 53 c2 28 ae 14 3b b8 41 25 ┆ CHEETOS•(••;•A%
2283----
2284====
2285
e57a18e1
PP
2286=== Post-item repetition
2287
2288A _post-item repetition_ represents the bytes of an item repeated a
2289given number of times.
2290
2291A post-item repetition is:
2292
27d52a19 2293. One of those items:
e57a18e1 2294
27d52a19
PP
2295** A <<byte-constant,byte constant>>.
2296** A <<literal-string,literal string>>.
2297** A <<fixed-length-number,fixed-length number>>.
2298** An <<leb128-integer,LEB128 integer>>.
7a7b31e8 2299** A <<string,string>>.
320644e2 2300** A <<macro-expansion,macro-expansion>>.
cd33dfe6 2301** A <<transformation-block,transformation block>>.
27d52a19 2302** A <<group,group>>.
e57a18e1
PP
2303
2304. The ``pass:[*]`` character.
2305
2306. One of:
2307
2308** A positive integer (hexadecimal starting with `0x` or `0X` accepted)
2309 which is the number of times to repeat the previous item.
2310
27d52a19
PP
2311** The ``pass:[{]`` prefix, a valid {py3} expression of which the
2312 evaluation result type is `int` or `bool` (automatically converted to
ee724c95 2313 `int`), and the `}` suffix.
e57a18e1 2314+
320644e2
PP
2315For a post-item repetition at some source location{nbsp}__**L**__, this
2316expression may contain:
e57a18e1
PP
2317+
2318--
27d52a19
PP
2319* The name of any <<label,label>> defined before{nbsp}__**L**__
2320 which isn't within a nested group and
2321 which isn't part of the repeated item.
e57a18e1
PP
2322* The name of any <<variable-assignment,variable>> known
2323 at{nbsp}__**L**__, which isn't part of its repeated item, and which
320644e2 2324 doesn't.
e57a18e1
PP
2325--
2326+
2327The value of the special name `ICITTE` (`int` type) in this expression
2328is the <<cur-offset,current offset>> (before handling the items to
2329repeat).
2330
2331** A valid {py3} name.
2332+
2333For the name `__NAME__`, this is equivalent to the
2334`pass:[{]__NAME__pass:[}]` form above.
2335
2336You may also use a <<repetition-block,repetition block>>. The form
2337``__ITEM__{nbsp}pass:[*]{nbsp}__X__`` is equivalent to
2338``!repeat{nbsp}__X__{nbsp}__ITEM__{nbsp}!end``.
2339
2340====
2341Input:
2342
2343----
ee724c95 2344[end - ICITTE - 1 : 8] * 0x100 <end>
e57a18e1
PP
2345----
2346
2347Output:
2348
2349----
2350ff fe fd fc fb fa f9 f8 f7 f6 f5 f4 f3 f2 f1 f0 ┆ ••••••••••••••••
2351ef ee ed ec eb ea e9 e8 e7 e6 e5 e4 e3 e2 e1 e0 ┆ ••••••••••••••••
2352df de dd dc db da d9 d8 d7 d6 d5 d4 d3 d2 d1 d0 ┆ ••••••••••••••••
2353cf ce cd cc cb ca c9 c8 c7 c6 c5 c4 c3 c2 c1 c0 ┆ ••••••••••••••••
2354bf be bd bc bb ba b9 b8 b7 b6 b5 b4 b3 b2 b1 b0 ┆ ••••••••••••••••
2355af ae ad ac ab aa a9 a8 a7 a6 a5 a4 a3 a2 a1 a0 ┆ ••••••••••••••••
23569f 9e 9d 9c 9b 9a 99 98 97 96 95 94 93 92 91 90 ┆ ••••••••••••••••
23578f 8e 8d 8c 8b 8a 89 88 87 86 85 84 83 82 81 80 ┆ ••••••••••••••••
23587f 7e 7d 7c 7b 7a 79 78 77 76 75 74 73 72 71 70 ┆ •~}|{zyxwvutsrqp
23596f 6e 6d 6c 6b 6a 69 68 67 66 65 64 63 62 61 60 ┆ onmlkjihgfedcba`
23605f 5e 5d 5c 5b 5a 59 58 57 56 55 54 53 52 51 50 ┆ _^]\[ZYXWVUTSRQP
23614f 4e 4d 4c 4b 4a 49 48 47 46 45 44 43 42 41 40 ┆ ONMLKJIHGFEDCBA@
23623f 3e 3d 3c 3b 3a 39 38 37 36 35 34 33 32 31 30 ┆ ?>=<;:9876543210
23632f 2e 2d 2c 2b 2a 29 28 27 26 25 24 23 22 21 20 ┆ /.-,+*)('&%$#"!
23641f 1e 1d 1c 1b 1a 19 18 17 16 15 14 13 12 11 10 ┆ ••••••••••••••••
23650f 0e 0d 0c 0b 0a 09 08 07 06 05 04 03 02 01 00 ┆ ••••••••••••••••
2366----
2367====
2368
2369====
2370Input:
2371
2372----
2373{times = 1}
2374aa bb cc dd
2375(
2376 <here>
2377 (ee ff) * {here + 1}
2378 11 22 33 * {times}
2379 {times = times + 1}
2380) * 3
2381"coucou!"
2382----
2383
2384Output:
2385
2386----
2387aa bb cc dd ee ff ee ff ee ff ee ff ee ff 11 22 ┆ •••••••••••••••"
238833 ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ 3•••••••••••••••
2389ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2390ff ee ff ee ff 11 22 33 33 ee ff ee ff ee ff ee ┆ ••••••"33•••••••
2391ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2392ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2393ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2394ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2395ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2396ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2397ff ee ff ee ff ee ff ee ff ee ff ee ff 11 22 33 ┆ ••••••••••••••"3
239833 33 63 6f 75 63 6f 75 21 ┆ 33coucou!
2399----
2400====
2401
71aaa3f7
PP
2402== Command-line tool
2403
2404If you <<install-normand,installed>> the `normand` package, then you
2405can use the `normand` command-line tool:
2406
2407----
2408$ normand <<< '"ma gang de malades"' | hexdump -C
2409----
2410
2411----
241200000000 6d 61 20 67 61 6e 67 20 64 65 20 6d 61 6c 61 64 |ma gang de malad|
241300000010 65 73 |es|
2414----
2415
2416If you copy the `normand.py` module to your own project, then you can
2417run the module itself:
2418
2419----
2420$ python3 -m normand <<< '"ma gang de malades"' | hexdump -C
2421----
2422
2423----
242400000000 6d 61 20 67 61 6e 67 20 64 65 20 6d 61 6c 61 64 |ma gang de malad|
242500000010 65 73 |es|
2426----
2427
2428Without a path argument, the `normand` tool reads from the standard
2429input.
2430
2431The `normand` tool prints the generated binary data to the standard
2432output.
2433
2434Various options control the initial <<state,state>> of the processor:
2435use the `--help` option to learn more.
2436
2437== {py3} API
2438
e57a18e1 2439The whole `normand` package/module public API is:
71aaa3f7
PP
2440
2441[source,python]
2442----
e57a18e1 2443# Byte order.
71aaa3f7
PP
2444class ByteOrder(enum.Enum):
2445 # Big endian.
2446 BE = ...
2447
2448 # Little endian.
2449 LE = ...
2450
2451
e57a18e1
PP
2452# Text location.
2453class TextLocation:
71aaa3f7
PP
2454 # Line number.
2455 @property
2456 def line_no(self) -> int:
2457 ...
2458
2459 # Column number.
2460 @property
2461 def col_no(self) -> int:
2462 ...
2463
2464
f5dcb24c
PP
2465# Parsing error message.
2466class ParseErrorMessage:
2467 # Message text.
2468 @property
2469 def text(self):
2470 ...
2471
2472 # Source text location.
2473 @property
2474 def text_location(self):
2475 ...
2476
2477
e57a18e1 2478# Parsing error.
71aaa3f7 2479class ParseError(RuntimeError):
f5dcb24c
PP
2480 # Parsing error messages.
2481 #
2482 # The first message is the most _specific_ one.
71aaa3f7 2483 @property
f5dcb24c 2484 def messages(self):
71aaa3f7
PP
2485 ...
2486
2487
e57a18e1
PP
2488# Variables dictionary type (for type hints).
2489VariablesT = typing.Dict[str, typing.Union[int, float]]
2490
2491
2492# Labels dictionary type (for type hints).
2493LabelsT = typing.Dict[str, int]
1b8aa84a
PP
2494
2495
e57a18e1 2496# Parsing result.
71aaa3f7
PP
2497class ParseResult:
2498 # Generated data.
2499 @property
2500 def data(self) -> bytearray:
2501 ...
2502
2503 # Updated variable values.
2504 @property
1b8aa84a 2505 def variables(self) -> SymbolsT:
71aaa3f7
PP
2506 ...
2507
2508 # Updated main group label values.
2509 @property
1b8aa84a 2510 def labels(self) -> SymbolsT:
71aaa3f7
PP
2511 ...
2512
2513 # Final offset.
2514 @property
2515 def offset(self) -> int:
2516 ...
2517
2518 # Final byte order.
2519 @property
1b8aa84a 2520 def byte_order(self) -> typing.Optional[ByteOrder]:
71aaa3f7
PP
2521 ...
2522
1b8aa84a 2523
e57a18e1
PP
2524# Parses the `normand` input using the initial state defined by
2525# `init_variables`, `init_labels`, `init_offset`, and `init_byte_order`,
2526# and returns the corresponding parsing result.
71aaa3f7 2527def parse(normand: str,
1b8aa84a
PP
2528 init_variables: typing.Optional[SymbolsT] = None,
2529 init_labels: typing.Optional[SymbolsT] = None,
71aaa3f7
PP
2530 init_offset: int = 0,
2531 init_byte_order: typing.Optional[ByteOrder] = None) -> ParseResult:
2532 ...
2533----
2534
2535The `normand` parameter is the actual <<learn-normand,Normand input>>
2536while the other parameters control the initial <<state,state>>.
2537
2538The `parse()` function raises a `ParseError` instance should it fail to
2539parse the `normand` string for any reason.
bf8f3b38
PP
2540
2541== Development
2542
2543Normand is a https://python-poetry.org/[Poetry] project.
2544
2545To develop it, install it through Poetry and enter the virtual
2546environment:
2547
2548----
2549$ poetry install
2550$ poetry shell
2551$ normand <<< '"lol" * 10 0a'
2552----
2553
2554`normand.py` is processed by:
2555
2556* https://microsoft.github.io/pyright/[Pyright]
2557* https://github.com/psf/black[Black]
2558* https://pycqa.github.io/isort/[isort]
2559
d2d06893
MJ
2560Licensing and copyright follows the
2561https://reuse.software/tutorial/[REUSE] specification and is checked
2562with the https://github.com/fsfe/reuse-tool[reuse tool].
2563
bf8f3b38
PP
2564=== Testing
2565
2566Use https://docs.pytest.org/[pytest] to test Normand once the package is
2567part of your virtual environment, for example:
2568
2569----
2570$ poetry install
2571$ poetry run pip3 install pytest
2572$ poetry run pytest
2573----
2574
2575The `pytest` project is currently not a development dependency in
2576`pyproject.toml` due to backward compatibiliy issues with
2577Python{nbsp}3.4.
2578
2579In the `tests` directory, each `*.nt` file is a test. The file name
2580prefix indicates what it's meant to test:
2581
2582`pass-`::
2583 Everything above the `---` line is the valid Normand input
2584 to test.
2585+
2586Everything below the `---` line is the expected data
2587(whitespace-separated hexadecimal bytes).
2588
2589`fail-`::
2590 Everything above the `---` line is the invalid Normand input
2591 to test.
2592+
2593Everything below the `---` line is the expected error message having
2594this form:
2595+
2596----
2597LINE:COL - MESSAGE
2598----
2599
2600=== Contributing
2601
2602Normand uses https://review.lttng.org/admin/repos/normand,general[Gerrit]
2603for code review.
2604
2605To report a bug, https://github.com/efficios/normand/issues/new[create a
2606GitHub issue].
This page took 0.127001 seconds and 4 git commands to generate.