Add transformation block support
[normand.git] / README.adoc
CommitLineData
bb2f9e9c
PP
1// Show ToC at a specific location for a GitHub rendering
2ifdef::env-github[]
3:toc: macro
4endif::env-github[]
5
6ifndef::env-github[]
71aaa3f7 7:toc: left
bb2f9e9c
PP
8endif::env-github[]
9
10// This is to mimic what GitHub does so that anchors work in an offline
11// rendering too.
12:idprefix:
13:idseparator: -
71aaa3f7 14
bb2f9e9c 15// Other attributes
71aaa3f7
PP
16:py3: Python{nbsp}3
17
bb2f9e9c
PP
18= Normand
19Philippe Proulx
20
df0f8552
PP
21image::normand-logo.png[]
22
71aaa3f7
PP
23[.normal]
24image:https://img.shields.io/pypi/v/normand.svg?label=Latest%20version[link="https://pypi.python.org/pypi/normand"]
25
26[.lead]
27_**Normand**_ is a text-to-binary processor with its own language.
28
29This package offers both a portable {py3} module and a command-line
30tool.
31
cd33dfe6 32WARNING: This version of Normand is 0.21, meaning both the Normand
71aaa3f7
PP
33language and the module/CLI interface aren't stable.
34
bb2f9e9c
PP
35ifdef::env-github[]
36// ToC location for a GitHub rendering
37toc::[]
38endif::env-github[]
39
71aaa3f7
PP
40== Introduction
41
42The purpose of Normand is to consume human-readable text representing
43bytes and to produce the corresponding binary data.
44
45.Simple bytes input.
46====
47Consider the following Normand input:
48
49----
504f 55 32 bb $167 fe %10100111 a9 $-32
51----
52
53The generated nine bytes are:
54
55----
564f 55 32 bb a7 fe a7 a9 e0
57----
58====
59
60As you can see in the last example, the fundamental unit of the Normand
61language is the _byte_. The order in which you list bytes will be the
62order of the generated data.
63
64The Normand language is more than simple lists of bytes, though. Its
65main features are:
66
67Comments, including a bunch of insignificant symbols which may improve readability::
68+
69Input:
70+
71----
72ff bb %1101:0010 # This is a comment
7378 29 af $192 # This too # 99 $-80
74fe80::6257:18ff:fea3:4229
7560:57:18:a3:42:29
7610839636-5d65-4a68-8e6a-21608ddf7258
77----
78+
79Output:
80+
81----
82ff bb d2 78 29 af c0 99 b0 fe 80 62 57 18 ff fe
83a3 42 29 60 57 18 a3 42 29 10 83 96 36 5d 65 4a
8468 8e 6a 21 60 8d df 72 58
85----
86
87Hexadecimal, decimal, and binary byte constants::
88+
89Input:
90+
91----
92aa bb $247 $-89 %0011_0010 %11.01= 10/10
93----
94+
95Output:
96+
97----
98aa bb f7 a7 32 da
99----
100
7a7b31e8 101Strings::
71aaa3f7
PP
102+
103Input:
104+
105----
106"hello world!" 00
107u16le"stress\nverdict 🤣"
7a7b31e8 108s:latin3{hex(ICITTE)}
71aaa3f7
PP
109----
110+
111Output:
112+
113----
11468 65 6c 6c 6f 20 77 6f 72 6c 64 21 00 73 00 74 ┆ hello world!•s•t
11500 72 00 65 00 73 00 73 00 0a 00 76 00 65 00 72 ┆ •r•e•s•s•••v•e•r
7a7b31e8
PP
11600 64 00 69 00 63 00 74 00 20 00 3e d8 23 dd 30 ┆ •d•i•c•t• •>•#•0
11778 32 66 ┆ x2f
71aaa3f7
PP
118----
119
120Labels: special variables holding the offset where they're defined::
121+
122----
123<beg> b2 52 e3 bc 91 05
124$100 $50 <chair> 33 9f fe
12525 e9 89 8a <end>
126----
127
128Variables::
129+
130----
1315e 65 {tower = 47} c6 7f f2 c4
13244 {hurl = tower - 14} b5 {tower = hurl} 26 2d
133----
134+
135The value of a variable assignment is the evaluation of a valid {py3}
136expression which may include label and variable names.
137
269f6eb3 138Fixed-length number with a given length (8{nbsp}bits to 64{nbsp}bits) and byte order::
71aaa3f7
PP
139+
140Input:
141+
142----
143{strength = 4}
144{be} 67 <lbl> 44 $178 {(end - lbl) * 8 + strength : 16} $99 <end>
145{le} {-1993 : 32}
269f6eb3 146{-3.141593 : 64}
71aaa3f7
PP
147----
148+
149Output:
150+
151----
269f6eb3
PP
15267 44 b2 00 2c 63 37 f8 ff ff 7f bd c2 82 fb 21
15309 c0
71aaa3f7
PP
154----
155+
269f6eb3 156The encoded number is the evaluation of a valid {py3} expression which
05f81895
PP
157may include label and variable names.
158
159https://en.wikipedia.org/wiki/LEB128[LEB128] integer::
160+
161Input:
162+
163----
164aa bb cc {-1993 : sleb128} <meow> dd ee ff
165{meow * 199 : uleb128}
166----
167+
168Output:
169+
170----
171aa bb cc b7 70 dd ee ff e3 07
172----
173+
174The encoded integer is the evaluation of a valid {py3} expression which
71aaa3f7
PP
175may include label and variable names.
176
27d52a19
PP
177Conditional::
178+
179Input:
180+
181----
182aa bb cc
183
184(
185 "foo"
186
187 !if {ICITTE > 10}
188 "bar"
12b5dbc0
PP
189 !else
190 "fight"
27d52a19
PP
191 !end
192) * 4
193----
194+
195Output:
196+
197----
12b5dbc0
PP
198aa bb cc 66 6f 6f 66 69 67 68 74 66 6f 6f 66 69 ┆ •••foofightfoofi
19967 68 74 66 6f 6f 62 61 72 66 6f 6f 62 61 72 ┆ ghtfoobarfoobar
27d52a19
PP
200----
201
71aaa3f7
PP
202Repetition::
203+
204Input:
205+
206----
2adf4336 207aa bb * 5 cc <zoom> "yeah\0" * {zoom * 3}
e57a18e1
PP
208
209!repeat 3
210 ff ee "juice"
211!end
71aaa3f7
PP
212----
213+
214Output:
215+
216----
2adf4336
PP
217aa bb bb bb bb bb cc 79 65 61 68 00 79 65 61 68 ┆ •••••••yeah•yeah
21800 79 65 61 68 00 79 65 61 68 00 79 65 61 68 00 ┆ •yeah•yeah•yeah•
21979 65 61 68 00 79 65 61 68 00 79 65 61 68 00 79 ┆ yeah•yeah•yeah•y
22065 61 68 00 79 65 61 68 00 79 65 61 68 00 79 65 ┆ eah•yeah•yeah•ye
22161 68 00 79 65 61 68 00 79 65 61 68 00 79 65 61 ┆ ah•yeah•yeah•yea
22268 00 79 65 61 68 00 79 65 61 68 00 79 65 61 68 ┆ h•yeah•yeah•yeah
71aaa3f7 22300 79 65 61 68 00 79 65 61 68 00 79 65 61 68 00 ┆ •yeah•yeah•yeah•
e57a18e1
PP
224ff ee 6a 75 69 63 65 ff ee 6a 75 69 63 65 ff ee ┆ ••juice••juice••
2256a 75 69 63 65 ┆ juice
71aaa3f7
PP
226----
227
676f6189
PP
228Alignment::
229+
230Input:
231+
232----
233{be}
234
235 {199:32}
236@64 {43:64}
237@16 {-123:16}
238@32~255 {5584:32}
239----
240+
241Output:
242+
243----
24400 00 00 c7 00 00 00 00 00 00 00 00 00 00 00 2b
245ff 85 ff ff 00 00 15 d0
246----
71aaa3f7 247
25ca454b
PP
248Filling::
249+
250Input:
251+
252----
253{le}
254{0xdeadbeef:32}
255{-1993:16}
256{9:16}
257+0x40
258{ICITTE:8}
259"meow mix"
fc21bb27 260+200~FFh
25ca454b
PP
261{ICITTE:8}
262----
263+
264Output:
265+
266----
267ef be ad de 37 f8 09 00 00 00 00 00 00 00 00 00 ┆ ••••7•••••••••••
26800 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
26900 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
27000 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
27140 6d 65 6f 77 20 6d 69 78 ff ff ff ff ff ff ff ┆ @meow mix•••••••
272ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
273ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
274ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
275ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
276ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
277ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
278ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ┆ ••••••••••••••••
279ff ff ff ff ff ff ff ff c8 ┆ •••••••••
280----
281
cd33dfe6
PP
282Transformation::
283+
284Input:
285+
286----
287"end of file @ " {end:8}
288
289!transform gzip
290 "this part will be gzipped"
291!end
292
293<end>
294----
295+
296Output:
297+
298----
29965 6e 64 20 6f 66 20 66 69 6c 65 20 40 20 3c 1f ┆ end of file @ <•
3008b 08 00 7b 7b 26 65 02 ff 2b c9 c8 2c 56 28 48 ┆ •••{{&e••+••,V(H
3012c 2a 51 28 cf cc c9 51 48 4a 55 48 af ca 2c 28 ┆ ,*Q(•••QHJUH••,(
30248 4d 01 00 d4 cc 5b 8a 19 00 00 00 ┆ HM••••[•••••
303----
304
71aaa3f7
PP
305Multilevel grouping::
306+
307Input:
308+
309----
310ff ((aa bb "zoom" cc) * 5) * 3 $-34 * 4
311----
312+
313Output:
314+
315----
316ff aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa ┆ •••zoom•••zoom••
317bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb 7a ┆ •zoom•••zoom•••z
3186f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f ┆ oom•••zoom•••zoo
3196d cc aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc ┆ m•••zoom•••zoom•
320aa bb 7a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb ┆ ••zoom•••zoom•••
3217a 6f 6f 6d cc aa bb 7a 6f 6f 6d cc aa bb 7a 6f ┆ zoom•••zoom•••zo
3226f 6d cc aa bb 7a 6f 6f 6d cc de de de de ┆ om•••zoom•••••
323----
324
320644e2
PP
325Macros::
326+
327Input:
328+
329----
330!macro hello(world)
331 "hello"
332 !if world " world" !end
333!end
334
335!repeat 17
336 ff ff ff ff
337 m:hello({ICITTE > 15 and ICITTE < 60})
338!end
339----
340+
341Output:
342+
343----
344ff ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c ┆ ••••hello••••hel
3456c 6f ff ff ff ff 68 65 6c 6c 6f 20 77 6f 72 6c ┆ lo••••hello worl
34664 ff ff ff ff 68 65 6c 6c 6f 20 77 6f 72 6c 64 ┆ d••••hello world
347ff ff ff ff 68 65 6c 6c 6f 20 77 6f 72 6c 64 ff ┆ ••••hello world•
348ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c 6c ┆ •••hello••••hell
3496f ff ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 ┆ o••••hello••••he
3506c 6c 6f ff ff ff ff 68 65 6c 6c 6f ff ff ff ff ┆ llo••••hello••••
35168 65 6c 6c 6f ff ff ff ff 68 65 6c 6c 6f ff ff ┆ hello••••hello••
352ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c 6c 6f ┆ ••hello••••hello
353ff ff ff ff 68 65 6c 6c 6f ff ff ff ff 68 65 6c ┆ ••••hello••••hel
3546c 6f ff ff ff ff 68 65 6c 6c 6f ┆ lo••••hello
355----
356
71aaa3f7
PP
357Precise error reporting::
358+
359----
360/tmp/meow.normand:10:24 - Expecting a bit (`0` or `1`).
361----
362+
363----
364/tmp/meow.normand:32:6 - Unexpected character `k`.
365----
366+
367----
320644e2 368/tmp/meow.normand:24:19 - Illegal (unknown or unreachable) variable/label name `meow` in expression `(meow - 45) // 8`; the legal names are {`ICITTE`, `mix`, `zoom`}.
71aaa3f7
PP
369----
370+
371----
f5dcb24c
PP
372/tmp/meow.normand:32:19 - While expanding the macro `meow`:
373/tmp/meow.normand:35:5 - While expanding the macro `zzz`:
320644e2 374/tmp/meow.normand:18:9 - Value 315 is outside the 8-bit range when evaluating expression `end - ICITTE`.
71aaa3f7
PP
375----
376
377You can use Normand to track data source files in your favorite VCS
378instead of raw binary files. The binary files that Normand generates can
379be used to test file format decoding, including malformatted data, for
380example, as well as for education.
381
382See <<learn-normand>> to explore all the Normand features.
383
384== Install Normand
385
386Normand requires Python ≥ 3.4.
387
388To install Normand:
389
390----
391$ python3 -m pip install --user normand
392----
393
394See
395https://packaging.python.org/en/latest/tutorials/installing-packages/#installing-to-the-user-site[Installing to the User Site]
396to learn more about a user site installation.
397
398[NOTE]
399====
400Normand has a single module file, `normand.py`, which you can copy as is
af3cf417 401to your project to use it (both the <<python3-api,`normand.parse()`>>
71aaa3f7
PP
402function and the <<command-line-tool,command-line tool>>).
403
404`normand.py` has _no external dependencies_, but if you're using
405Python{nbsp}3.4, you'll need a local copy of the standard `typing`
406module.
407====
408
43937a34
PP
409== Design goals
410
411The design goals of Normand are:
412
413Portability::
414 We're making sure `normand.py` works with Python{nbsp}≥{nbsp}3.4 and
415 doesn't have any external dependencies so that you may just copy the
416 module as is to your own project.
417
418Ease of use::
419 The most basic Normand input is a sequence of hexadecimal constants
420 (for example, `4e6f726d616e64`) which produce exactly what you'd
421 expect.
422+
423Most Normand features map to programming language concepts you already
424know and understand: constant integers, literal strings, variables,
425conditionals, repetitions/loops, and the rest.
426
427Concise and readable input::
428 We could have chosen XML or YAML as the input format, but having a
429 DSL here makes a Normand input compact and easy to read, two
430 important traits when using Normand to write tests, for example.
431+
432Compare the following Normand input and some hypothetical XML
433equivalent, for example:
434+
435.Actual normand input.
436----
437ff dd 01 ab $192 $-128 %1101:0011
438
439{end:8}
440
441{iter = 1}
442
443!if {not something}
444 # five times because xyz
445 !repeat 5
446 "hello world " {iter:8}
447 {iter = iter + 1}
448 !end
449!end
450
451<end>
452----
453+
454.Hypothetical Normand XML input.
455[source,xml]
456----
457<?xml version="1.0" encoding="utf-8" ?>
458<group>
459 <byte base="x" val="ff" />
460 <byte base="x" val="dd" />
461 <byte base="x" val="1" />
462 <byte base="x" val="ab" />
463 <byte base="d" val="192" />
464 <byte base="d" val="-128" />
465 <byte base="b" val="11010011" />
466 <fixed-len-num expr="end" len="8" />
467 <var-assign name="iter" expr="1" />
468 <cond expr="not something">
469 <!-- five times because xyz -->
470 <repeat expr="5">
471 <str>hello world </str>
472 <fixed-len-num expr="iter" len="8" />
473 <var-assign name="iter" expr="iter + 1" />
474 </repeat>
475 </cond>
476 <label name="end" />
477</group>
478----
479
71aaa3f7
PP
480== Learn Normand
481
482A Normand text input is a sequence of items which represent a sequence
483of raw bytes.
484
485[[state]] During the processing of items to data, Normand relies on a
486current state:
487
488[%header%autowidth]
489|===
af3cf417 490|State variable |Description |Initial value: <<python3-api,{py3} API>> |Initial value: <<command-line-tool,CLI>>
71aaa3f7
PP
491
492|[[cur-offset]] Current offset
493|
05f81895 494The current offset has an effect on the value of <<label,labels>> and of
269f6eb3 495the special `ICITTE` name in <<fixed-length-number,fixed-length
7a7b31e8 496number>>, <<leb-128-integer,LEB128 integer>>, <<string,string>>,
f63f4a5d 497<<filling,filling>>, <<variable-assignment,variable assignment>>,
27d52a19 498<<conditional-block,conditional block>>, <<repetition-block,repetition
320644e2
PP
499block>>, <<macro-expansion,macro expansion>>, and
500<<post-item-repetition,post-item repetition>> expression evaluation.
71aaa3f7
PP
501
502Each generated byte increments the current offset.
503
504A <<current-offset-setting,current offset setting>> may change the
676f6189
PP
505current offset without generating data.
506
507An <<current-offset-alignment,current offset alignment>> generates
508padding bytes to make the current offset satisfy a given alignment.
71aaa3f7
PP
509|`init_offset` parameter of the `parse()` function.
510|`--offset` option.
511
512|[[cur-bo]] Current byte order
513|
05f81895 514The current byte order has an effect on the encoding of
269f6eb3 515<<fixed-length-number,fixed-length numbers>>.
71aaa3f7
PP
516
517A <<current-byte-order-setting,current byte order setting>> may change
518the current byte order.
519|`init_byte_order` parameter of the `parse()` function.
520|`--byte-order` option.
521
522|<<label,Labels>>
523|Mapping of label names to integral values.
524|`init_labels` parameter of the `parse()` function.
525|One or more `--label` options.
526
527|<<variable-assignment,Variables>>
27d52a19 528|Mapping of variable names to integral or floating point number values.
71aaa3f7 529|`init_variables` parameter of the `parse()` function.
7a7b31e8 530|One or more `--var` or `--var-str` options.
71aaa3f7
PP
531|===
532
533The available items are:
534
6dd69a2a
PP
535* A <<byte-constant,constant integer>> representing one or more
536 constant bytes.
71aaa3f7 537
7a7b31e8
PP
538* A <<literal-string,literal string>> representing a constant sequence
539 of bytes encoding UTF-8, UTF-16, UTF-32, or Latin-1 to Latin-10 data.
71aaa3f7
PP
540
541* A <<current-byte-order-setting,current byte order setting>> (big or
542 little endian).
543
269f6eb3
PP
544* A <<fixed-length-number,fixed-length number>> (integer or
545 floating point) using the <<cur-bo,current byte order>> and of which
546 the value is the result of a {py3} expression.
05f81895
PP
547
548* An <<leb128-integer,LEB128 integer>> of which the value is the result
549 of a {py3} expression.
71aaa3f7 550
7a7b31e8
PP
551* A <<string,string>> representing a sequence of bytes encoding UTF-8,
552 UTF-16, UTF-32, or Latin-1 to Latin-10 data, and of which the value is
553 the result of a {py3} expression.
554
71aaa3f7
PP
555* A <<current-offset-setting,current offset setting>>.
556
676f6189
PP
557* A <<current-offset-alignment,current offset alignment>>.
558
25ca454b
PP
559* A <<filling,filling>>.
560
71aaa3f7
PP
561* A <<label,label>>, that is, a named constant holding the current
562 offset.
563+
564This is similar to an assembly label.
565
566* A <<variable-assignment,variable assignment>> associating a name to
567 the integral result of an evaluated {py3} expression.
568
569* A <<group,group>>, that is, a scoped sequence of items.
570
27d52a19
PP
571* A <<conditional-block,conditional block>>.
572
e57a18e1
PP
573* A <<repetition-block,repetition block>>.
574
cd33dfe6
PP
575* A <<transformation-block,transformation block>>.
576
320644e2
PP
577* A <<macro-definition-block,macro definition block>>.
578
579* A <<macro-expansion,macro expansion>>.
580
e57a18e1
PP
581Moreover, you can repeat many items above a constant or variable number
582of times with the ``pass:[*]`` operator _after_ the item to repeat. This
583is called a <<post-item-repetition,post-item repetition>>.
71aaa3f7 584
ba11fb1d 585A Normand comment may exist pretty much anywhere between tokens.
71aaa3f7
PP
586
587A comment is anything between two ``pass:[#]`` characters on the same
ba11fb1d
PP
588line, or from ``pass:[#]`` until the end of the line. Whitespaces are
589also considered comments. The following symbols are also considered
590comments around and between items, as well as between hexadecimal
591nibbles and binary bits of <<byte-constant,byte constants>>:
71aaa3f7
PP
592
593----
25ca454b 594/ \ ? & : ; . , [ ] _ = | -
71aaa3f7
PP
595----
596
597The latter serve to improve readability so that you may write, for
598example, a MAC address or a UUID as is.
599
fc21bb27
PP
600[[const-int]] Many items require a _constant integer_, possibly
601negative, in which case it may start with `-` for a negative integer. A
602positive constant integer is any of:
603
604Decimal::
605 One or mode digits (`0` to `9`).
606
607Hexadecimal::
608 One of:
609+
610* The `0x` or `0X` prefix followed with one or more hexadecimal digits
611 (`0` to `9`, `a` to `f`, or `A` to `F`).
612* One or more hexadecimal digits followed with the `h` or `H` suffix.
613
614Octal::
615 One of:
616+
617* The `0o` or `0O` prefix followed with one or more octal digits
618 (`0` to `7`).
619* One or more octal digits followed with the `o`, `O`, `q`, or `Q`
620 suffix.
621
622Binary::
623 One of:
624+
625* The `0b` or `0B` prefix followed with one or more bits (`0` or `1`).
626* One or more bits followed with the `b` or `B` suffix.
627
71aaa3f7
PP
628You can test the examples of this section with the `normand`
629<<command-line-tool,command-line tool>> as such:
630
631----
632$ normand file | hexdump -C
633----
634
635where `file` is the name of a file containing the Normand input.
636
637=== Byte constant
638
6dd69a2a 639A _byte constant_ represents one or more constant bytes.
71aaa3f7
PP
640
641A byte constant is:
642
643Hexadecimal form::
6dd69a2a 644 Two consecutive hexadecimal digits representing a single byte.
71aaa3f7
PP
645
646Decimal form::
6dd69a2a 647 One or more digits after the `$` prefix representing a single byte.
71aaa3f7 648
6dd69a2a
PP
649Binary form:: {empty}
650+
651--
652. __**N**__ `%` prefixes (at least one).
653+
654The number of `%` characters is the number of subsequent expected bytes.
655
656. __**N**__{nbsp}×{nbsp}8 bits (`0` or `1`).
657--
71aaa3f7
PP
658
659====
660Input:
661
662----
663ab cd [3d 8F] CC
664----
665
666Output:
667
668----
669ab cd 3d 8f cc
670----
671====
672
673====
674Input:
675
676----
677$192 %1100/0011 $ -77
678----
679
680Output:
681
682----
683c0 c3 b3
684----
685====
686
687====
688Input:
689
690----
69158f64689-6316-4d55-8a1a-04cada366172
692fe80::6257:18ff:fea3:4229
693----
694
695Output:
696
697----
69858 f6 46 89 63 16 4d 55 8a 1a 04 ca da 36 61 72 ┆ X•F•c•MU•••••6ar
699fe 80 62 57 18 ff fe a3 42 29 ┆ ••bW••••B)
700----
701====
702
703====
704Input:
705
706----
707%01110011 %01100001 %01101100 %01110101 %01110100
6dd69a2a 708%%%1101:0010 11111111 #A#11 #B#00 #C#011 #D#1
71aaa3f7
PP
709----
710
711Output:
712
713----
6dd69a2a 71473 61 6c 75 74 d2 ff c7 ┆ salut•••
71aaa3f7
PP
715----
716====
717
718=== Literal string
719
7a7b31e8
PP
720A _literal string_ represents the encoded bytes of a literal string
721using the UTF-8, UTF-16, UTF-32, or Latin-1 to Latin-10 encoding.
71aaa3f7
PP
722
723The string to encode isn't implicitly null-terminated: use `\0` at the
724end of the string to add a null character.
725
726A literal string is:
727
7a7b31e8
PP
728. **Optional**: one of the following encodings instead of the default
729 UTF-8:
71aaa3f7
PP
730+
731--
732[horizontal]
7a7b31e8
PP
733`s:u8`::
734`u8`::
735 UTF-8.
736
737`s:u16be`::
738`u16be`::
739 UTF-16BE.
740
741`s:u16le`::
742`u16le`::
743 UTF-16LE.
744
745`s:u32be`::
746`u32be`::
747 UTF-32BE.
748
749`s:u32le`::
750`u32le`::
751 UTF-32LE.
752
753`s:latin1`::
754 ISO/IEC 8859-1.
755
756`s:latin2`::
757 ISO/IEC 8859-2.
758
759`s:latin3`::
760 ISO/IEC 8859-3.
761
762`s:latin4`::
763 ISO/IEC 8859-4.
764
765`s:latin5`::
766 ISO/IEC 8859-9.
767
768`s:latin6`::
769 ISO/IEC 8859-10.
770
771`s:latin7`::
772 ISO/IEC 8859-13.
773
774`s:latin8`::
775 ISO/IEC 8859-14.
776
777`s:latin9`::
778 ISO/IEC 8859-15.
779
780`s:latin10`::
781 ISO/IEC 8859-16.
71aaa3f7
PP
782--
783
784. The ``pass:["]`` prefix.
785
786. A sequence of zero or more characters, possibly containing escape
787 sequences.
788+
789An escape sequence is the ``\`` character followed by one of:
790+
791--
792[horizontal]
793`0`:: Null (U+0000)
794`a`:: Alert (U+0007)
795`b`:: Backspace (U+0008)
796`e`:: Escape (U+001B)
797`f`:: Form feed (U+000C)
798`n`:: End of line (U+000A)
799`r`:: Carriage return (U+000D)
800`t`:: Character tabulation (U+0009)
801`v`:: Line tabulation (U+000B)
802``\``:: Reverse solidus (U+005C)
803``pass:["]``:: Quotation mark (U+0022)
804--
805
806. The ``pass:["]`` suffix.
807
808====
809Input:
810
811----
812"coucou tout le monde!"
813----
814
815Output:
816
817----
81863 6f 75 63 6f 75 20 74 6f 75 74 20 6c 65 20 6d ┆ coucou tout le m
8196f 6e 64 65 21 ┆ onde!
820----
821====
822
823====
824Input:
825
826----
827u16le"I am not young enough to know everything."
828----
829
830Output:
831
832----
83349 00 20 00 61 00 6d 00 20 00 6e 00 6f 00 74 00 ┆ I• •a•m• •n•o•t•
83420 00 79 00 6f 00 75 00 6e 00 67 00 20 00 65 00 ┆ •y•o•u•n•g• •e•
8356e 00 6f 00 75 00 67 00 68 00 20 00 74 00 6f 00 ┆ n•o•u•g•h• •t•o•
83620 00 6b 00 6e 00 6f 00 77 00 20 00 65 00 76 00 ┆ •k•n•o•w• •e•v•
83765 00 72 00 79 00 74 00 68 00 69 00 6e 00 67 00 ┆ e•r•y•t•h•i•n•g•
8382e 00 ┆ .•
839----
840====
841
842====
843Input:
844
845----
7a7b31e8 846s:u32be "\"illusion is the first\nof all pleasures\" 🦉"
71aaa3f7
PP
847----
848
849Output:
850
851----
85200 00 00 22 00 00 00 69 00 00 00 6c 00 00 00 6c ┆ •••"•••i•••l•••l
85300 00 00 75 00 00 00 73 00 00 00 69 00 00 00 6f ┆ •••u•••s•••i•••o
85400 00 00 6e 00 00 00 20 00 00 00 69 00 00 00 73 ┆ •••n••• •••i•••s
85500 00 00 20 00 00 00 74 00 00 00 68 00 00 00 65 ┆ ••• •••t•••h•••e
85600 00 00 20 00 00 00 66 00 00 00 69 00 00 00 72 ┆ ••• •••f•••i•••r
85700 00 00 73 00 00 00 74 00 00 00 0a 00 00 00 6f ┆ •••s•••t•••••••o
85800 00 00 66 00 00 00 20 00 00 00 61 00 00 00 6c ┆ •••f••• •••a•••l
85900 00 00 6c 00 00 00 20 00 00 00 70 00 00 00 6c ┆ •••l••• •••p•••l
86000 00 00 65 00 00 00 61 00 00 00 73 00 00 00 75 ┆ •••e•••a•••s•••u
86100 00 00 72 00 00 00 65 00 00 00 73 00 00 00 22 ┆ •••r•••e•••s•••"
86200 00 00 20 00 01 f9 89 ┆ ••• ••••
863----
864====
865
7a7b31e8
PP
866====
867Input:
868
869----
870s:latin1 "Paul Piché"
871----
872
873Output:
874
875----
87650 61 75 6c 20 50 69 63 68 e9 ┆ Paul Pich•
877----
878====
879
71aaa3f7
PP
880=== Current byte order setting
881
882This special item sets the <<cur-bo,_current byte order_>>.
883
884The two accepted forms are:
885
886[horizontal]
887``pass:[{be}]``:: Set the current byte order to big endian.
888``pass:[{le}]``:: Set the current byte order to little endian.
889
269f6eb3 890=== Fixed-length number
71aaa3f7 891
269f6eb3
PP
892A _fixed-length number_ represents a fixed number of bytes encoding
893either:
894
895* An unsigned or signed integer (two's complement).
896+
897The available lengths are 8, 16, 24, 32, 40, 48, 56, and 64.
898
899* A floating point number
b87a3aa2 900 (https://standards.ieee.org/standard/754-2008.html[IEEE{nbsp}754-2008]).
269f6eb3
PP
901+
902The available length are 32 (_binary32_) and 64 (_binary64_).
71aaa3f7 903
269f6eb3
PP
904The value is the result of evaluating a {py3} expression using the
905<<cur-bo,current byte order>>.
906
907A fixed-length number is:
71aaa3f7
PP
908
909. The ``pass:[{]`` prefix.
910
911. A valid {py3} expression.
05f81895 912+
269f6eb3 913For a fixed-length number at some source location{nbsp}__**L**__, this
05f81895
PP
914expression may contain the name of any accessible <<label,label>> (not
915within a nested group), including the name of a label defined
cd33dfe6
PP
916after{nbsp}__**L**__ (except within an <<encoded-block,encoded block>>),
917as well as the name of any <<variable-assignment,variable>> known
918at{nbsp}__**L**__.
05f81895 919+
269f6eb3
PP
920The value of the special name `ICITTE` (`int` type) in this expression
921is the <<cur-offset,current offset>> (before encoding the number).
71aaa3f7
PP
922
923. The `:` character.
924
269f6eb3
PP
925. An encoding length in bits amongst:
926+
927--
27d52a19 928The expression evaluates to an `int` or `bool` value::
269f6eb3 929 `8`, `16`, `24`, `32`, `40`, `48`, `56`, and `64`.
27d52a19
PP
930+
931NOTE: Normand automatically converts a `bool` value to `int`.
269f6eb3
PP
932
933The expression evaluates to a `float` value::
934 `32` and `64`.
935--
71aaa3f7
PP
936
937. The `}` suffix.
938
939====
940Input:
941
942----
943{le} {345:16}
944{be} {-0xabcd:32}
945----
946
947Output:
948
949----
95059 01 ff ff 54 33
951----
952====
953
954====
955Input:
956
957----
958{be}
959
960# String length in bits
961{8 * (str_end - str_beg) : 16}
962
963# String
964<str_beg>
965 "hello world!"
966<str_end>
967----
968
969Output:
970
971----
97200 60 68 65 6c 6c 6f 20 77 6f 72 6c 64 21 ┆ •`hello world!
973----
974====
975
976====
977Input:
978
979----
980{20 - ICITTE : 8} * 10
981----
982
983Output:
984
985----
98614 13 12 11 10 0f 0e 0d 0c 0b
987----
988====
989
269f6eb3
PP
990====
991Input:
992
993----
994{le}
995{2 * 0.0529 : 32}
996----
997
998Output:
999
1000----
1001ac ad d8 3d
1002----
1003====
1004
05f81895
PP
1005=== LEB128 integer
1006
1007An _LEB128 integer_ represents a variable number of bytes encoding an
1008unsigned or signed integer which is the result of evaluating a {py3}
1009expression following the https://en.wikipedia.org/wiki/LEB128[LEB128]
1010format.
1011
1012An LEB128 integer is:
1013
1014. The ``pass:[{]`` prefix.
1015
27d52a19
PP
1016. A valid {py3} expression of which the evaluation result type
1017 is `int` or `bool` (automatically converted to `int`).
05f81895
PP
1018+
1019For an LEB128 integer at some source location{nbsp}__**L**__, this
1020expression may contain:
1021+
1022--
fc21bb27
PP
1023* The name of any <<label,label>> defined before{nbsp}__**L**__
1024 which isn't within a nested group.
320644e2
PP
1025* The name of any <<variable-assignment,variable>> known
1026 at{nbsp}__**L**__.
05f81895
PP
1027--
1028+
269f6eb3
PP
1029The value of the special name `ICITTE` (`int` type) in this expression
1030is the <<cur-offset,current offset>> (before encoding the integer).
05f81895
PP
1031
1032. The `:` character.
1033
1034. One of:
1035+
1036--
1037[horizontal]
1038`uleb128`:: Use the unsigned LEB128 format.
1039`sleb128`:: Use the signed LEB128 format.
1040--
1041
1042. The `}` suffix.
1043
1044====
1045Input:
1046
1047----
1048{624485 : uleb128}
1049----
1050
1051Output:
1052
1053----
1054e5 8e 26
1055----
1056====
1057
1058====
1059Input:
1060
1061----
1062aa bb cc dd
1063<meow>
1064ee ff
1065{-981238311 + (meow * -23) : sleb128}
1066"hello"
1067----
1068
c2b79cf6
PP
1069Output:
1070
05f81895
PP
1071----
1072aa bb cc dd ee ff fd fa 8d ac 7c 68 65 6c 6c 6f ┆ ••••••••••|hello
1073----
1074====
1075
7a7b31e8
PP
1076=== String
1077
1078A _string_ represents a variable number of bytes encoding a string which
1079is the result of evaluating a {py3} expression using the UTF-8, UTF-16,
1080UTF-32, or Latin-1 to Latin-10 encoding.
1081
1082A string has two possible forms:
1083
1084Encoding prefix form:: {empty}
1085+
1086. An encoding amongst:
1087+
1088--
1089[horizontal]
1090`s:u8`::
1091`u8`::
1092 UTF-8.
1093
1094`s:u16be`::
1095`u16be`::
1096 UTF-16BE.
1097
1098`s:u16le`::
1099`u16le`::
1100 UTF-16LE.
1101
1102`s:u32be`::
1103`u32be`::
1104 UTF-32BE.
1105
1106`s:u32le`::
1107`u32le`::
1108 UTF-32LE.
1109
1110`s:latin1`::
1111 ISO/IEC 8859-1.
1112
1113`s:latin2`::
1114 ISO/IEC 8859-2.
1115
1116`s:latin3`::
1117 ISO/IEC 8859-3.
1118
1119`s:latin4`::
1120 ISO/IEC 8859-4.
1121
1122`s:latin5`::
1123 ISO/IEC 8859-9.
1124
1125`s:latin6`::
1126 ISO/IEC 8859-10.
1127
1128`s:latin7`::
1129 ISO/IEC 8859-13.
1130
1131`s:latin8`::
1132 ISO/IEC 8859-14.
1133
1134`s:latin9`::
1135 ISO/IEC 8859-15.
1136
1137`s:latin10`::
1138 ISO/IEC 8859-16.
1139--
1140
1141. The ``pass:[{]`` prefix.
1142
1143. A valid {py3} expression of which the evaluation result type
1144 is `bool`, `int`, `float`, or `str` (the first three automatically
1145 converted to `str`).
1146+
1147For a string at some source location{nbsp}__**L**__, this expression may
1148contain:
1149+
1150--
1151* The name of any <<label,label>> defined before{nbsp}__**L**__
1152 which isn't within a nested group.
1153* The name of any <<variable-assignment,variable>> known
1154 at{nbsp}__**L**__.
1155--
1156+
1157The value of the special name `ICITTE` (`int` type) in this expression
1158is the <<cur-offset,current offset>> (before encoding the string).
1159
1160. The `}` suffix.
1161
1162Encoding suffix form:: {empty}
1163+
1164. The ``pass:[{]`` prefix.
1165
1166. A valid {py3} expression of which the evaluation result type
1167 is `bool`, `int`, `float`, or `str` (the first three automatically
1168 converted to `str`).
1169+
1170For a string at some source location{nbsp}__**L**__, this expression may
1171contain:
1172+
1173--
1174* The name of any <<label,label>> defined before{nbsp}__**L**__
1175 which isn't within a nested group.
1176* The name of any <<variable-assignment,variable>> known
1177 at{nbsp}__**L**__.
1178--
1179+
1180The value of the special name `ICITTE` (`int` type) in this expression
1181is the <<cur-offset,current offset>> (before encoding the string).
1182
1183. The `:` character.
1184
1185. A string encoding amongst:
1186+
1187--
1188[horizontal]
1189`s:u8`::
1190 UTF-8.
1191
1192`s:u16be`::
1193 UTF-16BE.
1194
1195`s:u16le`::
1196 UTF-16LE.
1197
1198`s:u32be`::
1199 UTF-32BE.
1200
1201`s:u32le`::
1202 UTF-32LE.
1203
1204`s:latin1`::
1205 ISO/IEC 8859-1.
1206
1207`s:latin2`::
1208 ISO/IEC 8859-2.
1209
1210`s:latin3`::
1211 ISO/IEC 8859-3.
1212
1213`s:latin4`::
1214 ISO/IEC 8859-4.
1215
1216`s:latin5`::
1217 ISO/IEC 8859-9.
1218
1219`s:latin6`::
1220 ISO/IEC 8859-10.
1221
1222`s:latin7`::
1223 ISO/IEC 8859-13.
1224
1225`s:latin8`::
1226 ISO/IEC 8859-14.
1227
1228`s:latin9`::
1229 ISO/IEC 8859-15.
1230
1231`s:latin10`::
1232 ISO/IEC 8859-16.
1233--
1234
1235. The `}` suffix.
1236
1237====
1238Input:
1239
1240----
1241{iter = 1}
1242
1243!repeat 10
1244 {iter : s:u8} " "
1245 {iter = iter + 1}
1246!end
1247----
1248
1249Output:
1250
1251----
125231 20 32 20 33 20 34 20 35 20 36 20 37 20 38 20 ┆ 1 2 3 4 5 6 7 8
125339 20 31 30 20 ┆ 9 10
1254----
1255====
1256
1257====
1258Input:
1259
1260----
1261{meow = 'salut jérémie'}
1262{meow.upper() : s:latin1}
1263----
1264
1265Output:
1266
1267----
126853 41 4c 55 54 20 4a c9 52 c9 4d 49 45 ┆ SALUT J•R•MIE
1269----
1270====
1271
71aaa3f7
PP
1272=== Current offset setting
1273
1274This special item sets the <<cur-offset,_current offset_>>.
1275
1276A current offset setting is:
1277
1278. The `<` prefix.
1279
fc21bb27
PP
1280. A <<const-int,positive constant integer>> which is the new current
1281 offset.
71aaa3f7
PP
1282
1283. The `>` suffix.
1284
1285====
1286Input:
1287
1288----
1289 {ICITTE : 8} * 8
1290<0x61> {ICITTE : 8} * 8
1291----
1292
1293Output:
1294
1295----
129600 01 02 03 04 05 06 07 61 62 63 64 65 66 67 68 ┆ ••••••••abcdefgh
1297----
1298====
1299
1300====
1301Input:
1302
1303----
1304aa bb cc dd <meow> ee ff
1305<12> 11 22 33 <mix> 44 55
1306{meow : 8} {mix : 8}
1307----
1308
1309Output:
1310
1311----
1312aa bb cc dd ee ff 11 22 33 44 55 04 0f ┆ •••••••"3DU••
1313----
1314====
1315
676f6189
PP
1316=== Current offset alignment
1317
00deb9fa 1318A _current offset alignment_ represents zero or more padding bytes to
676f6189
PP
1319make the <<cur-offset,current offset>> meet a given
1320https://en.wikipedia.org/wiki/Data_structure_alignment[alignment] value.
1321
1322More specifically, for an alignment value of{nbsp}__**N**__{nbsp}bits,
1323a current offset alignment represents the required padding bytes until
1324the current offset is a multiple of __**N**__{nbsp}/{nbsp}8.
1325
1326A current offset alignment is:
1327
1328. The `@` prefix.
1329
fc21bb27
PP
1330. A <<const-int,positive constant integer>> which is the alignment value
1331 in _bits_.
676f6189
PP
1332+
1333This value must be greater than zero and a multiple of{nbsp}8.
1334
1335. **Optional**:
1336+
1337--
1338. The ``pass:[~]`` prefix.
fc21bb27
PP
1339. A <<const-int,positive constant integer>> which is the value of the
1340 byte to use as padding to align the <<cur-offset,current offset>>.
676f6189
PP
1341--
1342+
1343Without this section, the padding byte value is zero.
1344
1345====
1346Input:
1347
1348----
134911 22 (@32 aa bb cc) * 3
1350----
1351
1352Output:
1353
1354----
135511 22 00 00 aa bb cc 00 aa bb cc 00 aa bb cc
1356----
1357====
1358
1359====
1360Input:
1361
1362----
1363{le}
136477 88
1365@32~0xcc {-893.5:32}
1366@128~0x55 "meow"
1367----
1368
1369Output:
1370
1371----
137277 88 cc cc 00 60 5f c4 55 55 55 55 55 55 55 55 ┆ w••••`_•UUUUUUUU
13736d 65 6f 77 ┆ meow
1374----
1375====
1376
1377====
1378Input:
1379
1380----
1381aa bb cc <29> @64~255 "zoom"
1382----
1383
1384Output:
1385
1386----
1387aa bb cc ff ff ff 7a 6f 6f 6d ┆ ••••••zoom
1388----
1389====
1390
25ca454b
PP
1391=== Filling
1392
1393A _filling_ represents zero or more padding bytes to make the
1394<<cur-offset,current offset>> reach a given value.
1395
1396A filling is:
1397
1398. The ``pass:[+]`` prefix.
1399
1400. One of:
1401
fc21bb27
PP
1402** A <<const-int,positive constant integer>> which is the current offset
1403 target.
25ca454b
PP
1404
1405** The ``pass:[{]`` prefix, a valid {py3} expression of which the
1406 evaluation result type is `int` or `bool` (automatically converted to
1407 `int`), and the ``pass:[}]`` suffix.
1408+
1409For a filling at some source location{nbsp}__**L**__, this expression
1410may contain:
1411+
1412--
1413* The name of any <<label,label>> defined before{nbsp}__**L**__
1414 which isn't within a nested group.
1415* The name of any <<variable-assignment,variable>> known
1416 at{nbsp}__**L**__.
1417--
1418+
1419The value of the special name `ICITTE` (`int` type) in this expression
1420is the <<cur-offset,current offset>> (before handling the items to
1421repeat).
1422
1423** A valid {py3} name.
1424+
1425For the name `__NAME__`, this is equivalent to the
1426`pass:[{]__NAME__pass:[}]` form above.
1427
1428+
1429This value must be greater than or equal to the current offset where
1430it's used.
1431
1432. **Optional**:
1433+
1434--
1435. The ``pass:[~]`` prefix.
fc21bb27
PP
1436. A <<const-int,positive constant integer>> which is the value of the
1437 byte to use as padding to reach the current offset target.
25ca454b
PP
1438--
1439+
1440Without this section, the padding byte value is zero.
1441
1442====
1443Input:
1444
1445----
1446aa bb cc dd
1447+0x40
1448"hello world"
1449----
1450
1451Output:
1452
1453----
1454aa bb cc dd 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
145500 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
145600 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
145700 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ┆ ••••••••••••••••
145868 65 6c 6c 6f 20 77 6f 72 6c 64 ┆ hello world
1459----
1460====
1461
1462====
1463Input:
1464
1465----
1466!macro part(iter, fill)
1467 <0> "particular security " {ord('0') + iter : 8} +fill~0x80
1468!end
1469
1470{iter = 1}
1471
1472!repeat 5
1473 m:part(iter, {32 + 4 * iter})
1474 {iter = iter + 1}
1475!end
1476----
1477
1478Output:
1479
1480----
148170 61 72 74 69 63 75 6c 61 72 20 73 65 63 75 72 ┆ particular secur
148269 74 79 20 31 80 80 80 80 80 80 80 80 80 80 80 ┆ ity 1•••••••••••
148380 80 80 80 70 61 72 74 69 63 75 6c 61 72 20 73 ┆ ••••particular s
148465 63 75 72 69 74 79 20 32 80 80 80 80 80 80 80 ┆ ecurity 2•••••••
148580 80 80 80 80 80 80 80 80 80 80 80 70 61 72 74 ┆ ••••••••••••part
148669 63 75 6c 61 72 20 73 65 63 75 72 69 74 79 20 ┆ icular security
148733 80 80 80 80 80 80 80 80 80 80 80 80 80 80 80 ┆ 3•••••••••••••••
148880 80 80 80 80 80 80 80 70 61 72 74 69 63 75 6c ┆ ••••••••particul
148961 72 20 73 65 63 75 72 69 74 79 20 34 80 80 80 ┆ ar security 4•••
149080 80 80 80 80 80 80 80 80 80 80 80 80 80 80 80 ┆ ••••••••••••••••
149180 80 80 80 80 80 80 80 70 61 72 74 69 63 75 6c ┆ ••••••••particul
149261 72 20 73 65 63 75 72 69 74 79 20 35 80 80 80 ┆ ar security 5•••
149380 80 80 80 80 80 80 80 80 80 80 80 80 80 80 80 ┆ ••••••••••••••••
149480 80 80 80 80 80 80 80 80 80 80 80 ┆ ••••••••••••
1495----
1496====
1497
71aaa3f7
PP
1498=== Label
1499
1500A _label_ associates a name to the <<cur-offset,current offset>>.
1501
1502All the labels of a whole Normand input must have unique names.
1503
05f81895 1504A label must not share the name of a <<variable-assignment,variable>>
71aaa3f7
PP
1505name.
1506
71aaa3f7
PP
1507A label is:
1508
1509. The `<` prefix.
1510
27d52a19 1511. A valid {py3} name which is not `ICITTE`.
71aaa3f7
PP
1512
1513. The `>` suffix.
1514
1515=== Variable assignment
1516
1517A _variable assignment_ associates a name to the integral result of an
1518evaluated {py3} expression.
1519
05f81895 1520A variable assignment is:
71aaa3f7
PP
1521
1522. The ``pass:[{]`` prefix.
1523
27d52a19 1524. A valid {py3} name which is not `ICITTE`.
71aaa3f7
PP
1525
1526. The `=` character.
1527
7a7b31e8
PP
1528. A valid {py3} expression of which the evaluation result type is `int`,
1529 `float`, or `bool` (automatically converted to `int`), or `str`.
05f81895
PP
1530+
1531For a variable assignment at some source location{nbsp}__**L**__, this
320644e2
PP
1532expression may contain:
1533+
1534--
1535* The name of any <<label,label>> defined before{nbsp}__**L**__
1536 which isn't within a nested group.
1537* The name of any <<variable-assignment,variable>> known
1538 at{nbsp}__**L**__.
1539--
05f81895 1540+
269f6eb3
PP
1541The value of the special name `ICITTE` (`int` type) in this expression
1542is the <<cur-offset,current offset>>.
71aaa3f7
PP
1543
1544. The `}` suffix.
1545
1546====
1547Input:
1548
1549----
1550{mix = 101} {le}
1551{meow = 42} 11 22 {meow:8} 33 {meow = ICITTE + 17}
1552"yooo" {meow + mix : 16}
1553----
1554
1555Output:
1556
1557----
155811 22 2a 33 79 6f 6f 6f 7a 00 ┆ •"*3yoooz•
1559----
1560====
1561
1562=== Group
1563
1564A _group_ is a scoped sequence of items.
1565
1566The <<label,labels>> within a group aren't visible outside of it.
1567
e57a18e1
PP
1568The main purpose of a group is to <<post-item-repetition,repeat>> more
1569than a single item and to isolate labels.
71aaa3f7
PP
1570
1571A group is:
1572
261c5ecf 1573. The `(`, `!group`, or `!g` opening.
71aaa3f7 1574
cd33dfe6 1575. Zero or more items except, recursively, a macro definition block.
71aaa3f7 1576
261c5ecf
PP
1577. Depending on the group opening:
1578+
1579--
1580`(`::
1581 The `)` closing.
1582
1583`!group`::
1584`!g`::
1585 The `!end` closing.
1586--
71aaa3f7
PP
1587
1588====
1589Input:
1590
1591----
1592((aa bb cc) dd () ee) "leclerc"
1593----
1594
1595Output:
1596
1597----
1598aa bb cc dd ee 6c 65 63 6c 65 72 63 ┆ •••••leclerc
1599----
1600====
1601
1602====
1603Input:
1604
1605----
261c5ecf
PP
1606!group
1607 (aa bb cc) * 3 dd ee
1608!end * 5
71aaa3f7
PP
1609----
1610
1611Output:
1612
1613----
1614aa bb cc aa bb cc aa bb cc dd ee aa bb cc aa bb
1615cc aa bb cc dd ee aa bb cc aa bb cc aa bb cc dd
1616ee aa bb cc aa bb cc aa bb cc dd ee aa bb cc aa
1617bb cc aa bb cc dd ee
1618----
1619====
1620
1621====
1622Input:
1623
1624----
1625{be}
1626(
1627 <str_beg> u16le"sébastien diaz" <str_end>
1628 {ICITTE - str_beg : 8}
1629 {(end - str_beg) * 5 : 24}
1630) * 3
1631<end>
1632----
1633
1634Output:
1635
1636----
163773 00 e9 00 62 00 61 00 73 00 74 00 69 00 65 00 ┆ s•••b•a•s•t•i•e•
16386e 00 20 00 64 00 69 00 61 00 7a 00 1c 00 01 e0 ┆ n• •d•i•a•z•••••
163973 00 e9 00 62 00 61 00 73 00 74 00 69 00 65 00 ┆ s•••b•a•s•t•i•e•
16406e 00 20 00 64 00 69 00 61 00 7a 00 1c 00 01 40 ┆ n• •d•i•a•z••••@
164173 00 e9 00 62 00 61 00 73 00 74 00 69 00 65 00 ┆ s•••b•a•s•t•i•e•
16426e 00 20 00 64 00 69 00 61 00 7a 00 1c 00 00 a0 ┆ n• •d•i•a•z•••••
1643----
1644====
1645
27d52a19
PP
1646=== Conditional block
1647
12b5dbc0
PP
1648A _conditional block_ represents either the bytes of zero or more items
1649if some expression is true, or the bytes of zero or more other items if
1650it's false.
27d52a19
PP
1651
1652A conditional block is:
1653
261c5ecf 1654. The `!if` opening.
27d52a19
PP
1655
1656. One of:
1657
1658** The ``pass:[{]`` prefix, a valid {py3} expression of which the
1659 evaluation result type is `int` or `bool` (automatically converted to
1660 `int`), and the ``pass:[}]`` suffix.
1661+
320644e2
PP
1662For a conditional block at some source location{nbsp}__**L**__, this
1663expression may contain:
27d52a19
PP
1664+
1665--
1666* The name of any <<label,label>> defined before{nbsp}__**L**__
1667 which isn't within a nested group.
1668* The name of any <<variable-assignment,variable>> known
320644e2 1669 at{nbsp}__**L**__.
27d52a19
PP
1670--
1671+
1672The value of the special name `ICITTE` (`int` type) in this expression
1673is the <<cur-offset,current offset>> (before handling the contained
1674items).
1675
1676** A valid {py3} name.
1677+
1678For the name `__NAME__`, this is equivalent to the
1679`pass:[{]__NAME__pass:[}]` form above.
1680
cd33dfe6
PP
1681. Zero or more items to be handled when the condition is true
1682 except, recursively, a macro definition block.
12b5dbc0
PP
1683
1684. **Optional**:
1685
1686.. The `!else` opening.
cd33dfe6
PP
1687.. Zero or more items to be handled when the condition is false
1688 except, recursively, a macro definition block
27d52a19 1689
261c5ecf 1690. The `!end` closing.
27d52a19
PP
1691
1692====
1693Input:
1694
1695----
1696{at = 1}
1697{rep_count = 9}
1698
1699!repeat rep_count
1700 "meow "
1701
1702 !if {ICITTE > 25}
1703 "mix"
12b5dbc0
PP
1704 !else
1705 "zoom"
27d52a19
PP
1706 !end
1707
12b5dbc0
PP
1708 !if {at < rep_count} 20 !end
1709
27d52a19
PP
1710 {at = at + 1}
1711!end
1712----
1713
1714Output:
1715
1716----
12b5dbc0
PP
17176d 65 6f 77 20 7a 6f 6f 6d 20 6d 65 6f 77 20 7a ┆ meow zoom meow z
17186f 6f 6d 20 6d 65 6f 77 20 7a 6f 6f 6d 20 6d 65 ┆ oom meow zoom me
17196f 77 20 6d 69 78 20 6d 65 6f 77 20 6d 69 78 20 ┆ ow mix meow mix
17206d 65 6f 77 20 6d 69 78 20 6d 65 6f 77 20 6d 69 ┆ meow mix meow mi
27d52a19 172178 20 6d 65 6f 77 20 6d 69 78 20 6d 65 6f 77 20 ┆ x meow mix meow
12b5dbc0 17226d 69 78 ┆ mix
27d52a19
PP
1723----
1724====
1725
1726====
1727Input:
1728
1729----
1730<str_beg>
1731u16le"meow mix!"
1732<str_end>
1733
1734!if {str_end - str_beg > 10}
1735 " BIG"
1736!end
1737----
1738
1739Output:
1740
1741----
17426d 00 65 00 6f 00 77 00 20 00 6d 00 69 00 78 00 ┆ m•e•o•w• •m•i•x•
174321 00 20 42 49 47 ┆ !• BIG
1744----
1745====
1746
e57a18e1 1747=== Repetition block
71aaa3f7 1748
e57a18e1
PP
1749A _repetition block_ represents the bytes of one or more items repeated
1750a given number of times.
676f6189 1751
e57a18e1 1752A repetition block is:
71aaa3f7 1753
261c5ecf 1754. The `!repeat` or `!r` opening.
71aaa3f7 1755
2adf4336
PP
1756. One of:
1757
fc21bb27
PP
1758** A <<const-int,positive constant integer>> which is the number of
1759 times to repeat the previous item.
2adf4336 1760
27d52a19
PP
1761** The ``pass:[{]`` prefix, a valid {py3} expression of which the
1762 evaluation result type is `int` or `bool` (automatically converted to
1763 `int`), and the ``pass:[}]`` suffix.
05f81895 1764+
320644e2
PP
1765For a repetition block at some source location{nbsp}__**L**__, this
1766expression may contain:
05f81895
PP
1767+
1768--
27d52a19
PP
1769* The name of any <<label,label>> defined before{nbsp}__**L**__
1770 which isn't within a nested group.
05f81895 1771* The name of any <<variable-assignment,variable>> known
320644e2 1772 at{nbsp}__**L**__.
05f81895
PP
1773--
1774+
e57a18e1
PP
1775The value of the special name `ICITTE` (`int` type) in this expression
1776is the <<cur-offset,current offset>> (before handling the items to
1777repeat).
1778
1779** A valid {py3} name.
1780+
1781For the name `__NAME__`, this is equivalent to the
1782`pass:[{]__NAME__pass:[}]` form above.
1783
cd33dfe6 1784. Zero or more items except, recursively, a macro definition block.
e57a18e1 1785
261c5ecf 1786. The `!end` closing.
e57a18e1
PP
1787
1788You may also use a <<post-item-repetition,post-item repetition>> after
1789some items. The form ``!repeat{nbsp}__X__{nbsp}__ITEMS__{nbsp}!end``
1790is equivalent to ``(__ITEMS__){nbsp}pass:[*]{nbsp}__X__``.
71aaa3f7
PP
1791
1792====
1793Input:
1794
1795----
fc21bb27 1796!repeat 0o400
e57a18e1
PP
1797 {end - ICITTE - 1 : 8}
1798!end
1799
1800<end>
71aaa3f7
PP
1801----
1802
1803Output:
1804
1805----
1806ff fe fd fc fb fa f9 f8 f7 f6 f5 f4 f3 f2 f1 f0 ┆ ••••••••••••••••
1807ef ee ed ec eb ea e9 e8 e7 e6 e5 e4 e3 e2 e1 e0 ┆ ••••••••••••••••
1808df de dd dc db da d9 d8 d7 d6 d5 d4 d3 d2 d1 d0 ┆ ••••••••••••••••
1809cf ce cd cc cb ca c9 c8 c7 c6 c5 c4 c3 c2 c1 c0 ┆ ••••••••••••••••
1810bf be bd bc bb ba b9 b8 b7 b6 b5 b4 b3 b2 b1 b0 ┆ ••••••••••••••••
1811af ae ad ac ab aa a9 a8 a7 a6 a5 a4 a3 a2 a1 a0 ┆ ••••••••••••••••
18129f 9e 9d 9c 9b 9a 99 98 97 96 95 94 93 92 91 90 ┆ ••••••••••••••••
18138f 8e 8d 8c 8b 8a 89 88 87 86 85 84 83 82 81 80 ┆ ••••••••••••••••
18147f 7e 7d 7c 7b 7a 79 78 77 76 75 74 73 72 71 70 ┆ •~}|{zyxwvutsrqp
18156f 6e 6d 6c 6b 6a 69 68 67 66 65 64 63 62 61 60 ┆ onmlkjihgfedcba`
18165f 5e 5d 5c 5b 5a 59 58 57 56 55 54 53 52 51 50 ┆ _^]\[ZYXWVUTSRQP
18174f 4e 4d 4c 4b 4a 49 48 47 46 45 44 43 42 41 40 ┆ ONMLKJIHGFEDCBA@
18183f 3e 3d 3c 3b 3a 39 38 37 36 35 34 33 32 31 30 ┆ ?>=<;:9876543210
18192f 2e 2d 2c 2b 2a 29 28 27 26 25 24 23 22 21 20 ┆ /.-,+*)('&%$#"!
18201f 1e 1d 1c 1b 1a 19 18 17 16 15 14 13 12 11 10 ┆ ••••••••••••••••
18210f 0e 0d 0c 0b 0a 09 08 07 06 05 04 03 02 01 00 ┆ ••••••••••••••••
1822----
1823====
1824
2adf4336
PP
1825====
1826Input:
1827
1828----
1829{times = 1}
e57a18e1 1830
2adf4336 1831aa bb cc dd
e57a18e1
PP
1832
1833!repeat 3
2adf4336 1834 <here>
e57a18e1
PP
1835
1836 !repeat {here + 1}
1837 ee ff
1838 !end
1839
1840 11 22 !repeat times 33 !end
1841
2adf4336 1842 {times = times + 1}
e57a18e1
PP
1843!end
1844
2adf4336
PP
1845"coucou!"
1846----
1847
1848Output:
1849
1850----
1851aa bb cc dd ee ff ee ff ee ff ee ff ee ff 11 22 ┆ •••••••••••••••"
185233 ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ 3•••••••••••••••
1853ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1854ff ee ff ee ff 11 22 33 33 ee ff ee ff ee ff ee ┆ ••••••"33•••••••
1855ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1856ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1857ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1858ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1859ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1860ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
1861ff ee ff ee ff ee ff ee ff ee ff ee ff 11 22 33 ┆ ••••••••••••••"3
186233 33 63 6f 75 63 6f 75 21 ┆ 33coucou!
1863----
1864====
1865
cd33dfe6
PP
1866=== Transformation block
1867
1868A _transformation block_ represents the bytes of one or more items
1869transformed into other bytes by a function.
1870
1871As of this version, Normand only offers a predetermined set of
1872transformation functions.
1873
1874An encoded block is:
1875
1876. The `!transform` or `!t` opening.
1877
1878. A transformation function name amongst:
1879+
1880--
1881[horizontal]
1882`base64`::
1883`b64`::
1884 Standard https://datatracker.ietf.org/doc/html/rfc4648.html#section-4[Base64].
1885
1886`base64u`::
1887`b64u`::
1888 URL-safe Base64, using `-` instead of `pass:[+]` and `_` instead of
1889 `/`.
1890
1891`base32`::
1892`b32`::
1893 Standard https://datatracker.ietf.org/doc/html/rfc4648.html#section-6[Base32].
1894
1895`base16`::
1896`b16`::
1897 Standard https://datatracker.ietf.org/doc/html/rfc4648.html#section-8[Base16].
1898
1899`ascii85`::
1900`a85`::
1901 https://en.wikipedia.org/wiki/Ascii85[Ascii85] without padding.
1902
1903`ascii85p`::
1904`a85p`::
1905 Ascii85 with padding.
1906
1907`base85`::
1908`b85`::
1909 https://en.wikipedia.org/wiki/Ascii85[Base85] (like Git-style binary
1910 diffs) without padding.
1911
1912`base85p`::
1913`b85p`::
1914 Base85 with padding.
1915
1916`quopri`::
1917`qp`::
1918 MIME
1919 https://datatracker.ietf.org/doc/html/rfc2045#section-6.7[quoted-printable]
1920 without quoted whitespaces.
1921
1922`quoprit`::
1923`qpt`::
1924 MIME quoted-printable with quoted whitespaces.
1925
1926`gzip`::
1927`gz`::
1928 https://en.wikipedia.org/wiki/Gzip[gzip].
1929
1930`bzip2`::
1931`bz2`::
1932 https://en.wikipedia.org/wiki/Bzip2[bzip2].
1933--
1934
1935. Zero or more items except, recursively, a macro definition block.
1936+
1937Any {py3} expression within any of those items may not refer to a future
1938<<label,label>>.
1939+
1940The value of the special name `ICITTE` in any {py3} expression within
1941any of those items is the <<cur-offset,current offset>> _before_ Normand
1942applies the transformation function. Therefore, labels defined within
1943those items also have the current offset value _before_ Normand applies
1944the transformation function.
1945
1946. The `!end` closing.
1947
1948The <<cur-offset,current offset>> after having handled the last item of
1949a transformation block is the value of the current offset before
1950handling the first item plus the size of the generated (transformed)
1951bytes. In other words, <<current-offset-setting,current offset
1952settings>> within the items of the block have no impact outside said
1953block.
1954
1955====
1956Input:
1957
1958----
1959aa bb cc dd
1960
1961"size of compressed section: " {end - start : 8}
1962
1963<start>
1964
1965!transform bzip2
1966 "this will be compressed!"
1967 89*100 00*5000
1968!end
1969
1970<end>
1971
1972"yes!"
1973----
1974
1975Output:
1976
1977----
1978aa bb cc dd 73 69 7a 65 20 6f 66 20 63 6f 6d 70 ┆ ••••size of comp
197972 65 73 73 65 64 20 73 65 63 74 69 6f 6e 3a 20 ┆ ressed section:
198052 42 5a 68 39 31 41 59 26 53 59 68 e1 8c fc 00 ┆ RBZh91AY&SYh••••
198100 33 d1 e0 c0 00 60 00 5e 66 dc 80 00 20 00 80 ┆ •3••••`•^f••• ••
198200 08 20 00 31 40 d3 43 23 26 20 ca 87 a9 a1 e8 ┆ •• •1@•C#& •••••
198318 29 44 80 9c 80 49 bf cc b3 e8 45 ed e2 76 ad ┆ •)D•••I••••E••v•
19840f 12 8b 8a d6 cd 40 04 7e 2e e4 8a 70 a1 20 d1 ┆ ••••••@•~.••p• •
1985c3 19 f8 79 65 73 21 ┆ •••yes!
1986----
1987====
1988
1989====
1990Input:
1991
1992----
199388*16
1994
1995!t a85
1996 "I am determined to be cheerful and happy in whatever situation "
1997 "I may find myself. For I have learned that the greater part of "
1998 "our misery or unhappiness is determined not by our circumstance "
1999 "but by our disposition."
2000!end
2001
2002@128~99h
2003
2004!t qp <beg> {ICITTE - beg : 8} * 50 !end
2005----
2006
2007Output:
2008
2009----
201088 88 88 88 88 88 88 88 88 88 88 88 88 88 88 88 ┆ ••••••••••••••••
201138 4b 5f 47 59 2b 43 6f 26 2a 41 54 44 58 25 44 ┆ 8K_GY+Co&*ATDX%D
201249 6d 3f 24 46 44 69 3a 32 41 4b 59 4a 72 41 53 ┆ Im?$FDi:2AKYJrAS
201323 6d 6f 46 5f 69 31 2f 44 49 61 6c 27 40 3b 70 ┆ #moF_i1/DIal'@;p
201431 32 2b 44 47 5e 39 47 41 28 45 2c 41 54 68 58 ┆ 12+DG^9GA(E,AThX
20152a 2b 45 4d 37 3d 46 5e 5d 42 2b 44 66 2d 5b 68 ┆ *+EM7=F^]B+Df-[h
20162b 44 6b 50 34 2b 44 2c 3e 2a 41 30 3e 60 37 46 ┆ +DkP4+D,>*A0>`7F
201728 4b 30 22 2f 67 2a 57 25 45 5a 64 70 72 42 4f ┆ (K0"/g*W%EZdprBO
201851 27 71 2b 44 62 55 74 45 63 2c 48 21 2b 45 56 ┆ Q'q+DbUtEc,H!+EV
20193a 2a 46 3c 47 5b 3d 41 4b 59 57 2b 41 52 54 5b ┆ :*F<G[=AKYW+ART[
20206c 45 5a 66 3d 30 45 63 60 46 42 41 66 75 23 37 ┆ lEZf=0Ec`FBAfu#7
202145 5a 66 34 35 46 28 4b 42 3b 2b 45 29 39 43 46 ┆ EZf45F(KB;+E)9CF
202260 28 6c 24 45 2c 5d 4e 2f 41 54 4d 6f 38 42 6c ┆ `(l$E,]N/ATMo8Bl
202362 44 2d 41 54 56 4c 28 44 2f 21 6d 21 41 30 3e ┆ bD-ATVL(D/!m!A0>
202463 2e 46 3c 47 25 3c 2b 45 29 43 43 2b 43 66 2c ┆ c.F<G%<+E)CC+Cf,
20252b 40 73 29 58 30 46 43 42 26 73 41 4b 59 48 29 ┆ +@s)X0FCB&sAKYH)
202646 3c 47 25 3c 2b 45 29 43 43 2b 43 6f 32 2d 45 ┆ F<G%<+E)CC+Co2-E
20272c 54 66 33 46 44 35 5a 32 2f 63 99 99 99 99 99 ┆ ,Tf3FD5Z2/c•••••
20283d 30 30 3d 30 31 3d 30 32 3d 30 33 3d 30 34 3d ┆ =00=01=02=03=04=
202930 35 3d 30 36 3d 30 37 3d 30 38 3d 30 39 0a 3d ┆ 05=06=07=08=09•=
203030 42 3d 30 43 0d 3d 30 45 3d 30 46 3d 31 30 3d ┆ 0B=0C•=0E=0F=10=
203131 31 3d 31 32 3d 31 33 3d 31 34 3d 31 35 3d 31 ┆ 11=12=13=14=15=1
203236 3d 31 37 3d 31 38 3d 31 39 3d 31 41 3d 31 42 ┆ 6=17=18=19=1A=1B
20333d 31 43 3d 31 44 3d 31 45 3d 31 46 20 21 22 23 ┆ =1C=1D=1E=1F !"#
203424 25 26 27 28 29 2a 2b 2c 2d 3d 0a 2e 2f 30 31 ┆ $%&'()*+,-=•./01
2035----
2036====
2037
320644e2
PP
2038=== Macro definition block
2039
2040A _macro definition block_ associates a name and parameter names to
2041a group of items.
2042
2043A macro definition block doesn't lead to generated bytes itself: a
2044<<macro-expansion,macro expansion>> does so.
2045
2046A macro definition may only exist at the root level, that is, not within
2047a <<group,group>>, a <<repetition-block,repetition block>>, a
2048<<conditional-block,conditional block>>, or another
2049<<macro-definition-block,macro definition block>>.
2050
2051All macro definitions must have unique names.
2052
2053A macro definition is:
2054
2055. The `!macro` or `!m` opening.
2056
2057. A valid {py3} name (the macro name).
2058
2059. The `(` parameter name list prefix.
2060
2061. A comma-separated list of zero or more unique parameter names,
2062 each one being a valid {py3} name.
2063
2064. The `)` parameter name list suffix.
2065
2066. Zero or more items except, recursively, a macro definition block.
2067
2068. The `!end` closing.
2069
2070====
2071----
2072!macro bake()
2073 {le} {ICITTE * 8 : 16}
2074 u16le"predict explode"
2075!end
2076----
2077====
2078
2079====
2080----
2081!macro nail(rep, with_extra, val)
2082 {iter = 1}
2083
2084 !repeat rep
2085 {val + iter : uleb128}
2086 {0xdeadbeef : 32}
2087 {iter = iter + 1}
2088 !end
2089
2090 !if with_extra
2091 "meow mix\0"
2092 !end
2093!end
2094----
2095====
2096
2097=== Macro expansion
2098
2099A _macro expansion_ expands the items of a defined
2100<<macro-definition-block,macro>>.
2101
2102The macro to expand must be defined _before_ the expansion.
2103
2104The <<state,state>> before handling the first item of the chosen macro
2105is:
2106
2107<<cur-offset,Current offset>>::
2108 Unchanged.
2109
2110<<cur-bo,Current byte order>>::
2111 Unchanged.
2112
2113Variables::
2114 The only available variables initially are the macro parameters.
2115
2116Labels::
2117 None.
2118
2119The state after having handled the last item of the chosen macro is:
2120
2121Current offset::
2122 The one before handling the first item of the macro plus the size
2123 of the generated data of the macro expansion.
2124+
2125IMPORTANT: This means <<current-offset-setting,current offset setting>>
2126items within the expanded macro don't impact the final current offset.
2127
2128Current byte order::
2129 The one before handling the first item of the macro.
2130
2131Variables::
2132 The ones before handling the first item of the macro.
2133
2134Labels::
2135 The ones before handling the first item of the macro.
2136
2137A macro expansion is:
2138
2139. The `m:` prefix.
2140
2141. A valid {py3} name (the name of the macro to expand).
2142
2143. The `(` parameter value list prefix.
2144
2145. A comma-separated list of zero or more unique parameter values.
2146+
2147The number of parameter values must match the number of parameter
2148names of the definition of the chosen macro.
2149+
2150A parameter value is one of:
2151+
2152--
fc21bb27 2153* A <<const-int,constant integer>>, possibly negative.
320644e2 2154
dbd84e74
PP
2155* A constant floating point number.
2156
320644e2
PP
2157* The ``pass:[{]`` prefix, a valid {py3} expression of which the
2158 evaluation result type is `int` or `bool` (automatically converted to
2159 `int`), and the ``pass:[}]`` suffix.
2160+
2161For a macro expansion at some source location{nbsp}__**L**__, this
2162expression may contain:
2163
2164** The name of any <<label,label>> defined before{nbsp}__**L**__
2165 which isn't within a nested group.
2166** The name of any <<variable-assignment,variable>> known
2167 at{nbsp}__**L**__.
2168
2169+
2170The value of the special name `ICITTE` (`int` type) in this expression
2171is the <<cur-offset,current offset>> (before handling the items of the
2172chosen macro).
2173
2174* A valid {py3} name.
2175+
2176For the name `__NAME__`, this is equivalent to the
2177`pass:[{]__NAME__pass:[}]` form above.
2178--
2179
2180. The `)` parameter value list suffix.
2181
2182====
2183Input:
2184
2185----
2186!macro bake()
2187 {le} {ICITTE * 8 : 16}
2188 u16le"predict explode"
2189!end
2190
2191"hello [" m:bake() "] world"
2192
2193m:bake() * 5
2194----
2195
2196Output:
2197
2198----
219968 65 6c 6c 6f 20 5b 38 00 70 00 72 00 65 00 64 ┆ hello [8•p•r•e•d
220000 69 00 63 00 74 00 20 00 65 00 78 00 70 00 6c ┆ •i•c•t• •e•x•p•l
220100 6f 00 64 00 65 00 5d 20 77 6f 72 6c 64 70 01 ┆ •o•d•e•] worldp•
220270 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
220365 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 02 ┆ e•x•p•l•o•d•e•p•
220470 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
220565 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 03 ┆ e•x•p•l•o•d•e•p•
220670 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
220765 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 04 ┆ e•x•p•l•o•d•e•p•
220870 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
220965 00 78 00 70 00 6c 00 6f 00 64 00 65 00 70 05 ┆ e•x•p•l•o•d•e•p•
221070 00 72 00 65 00 64 00 69 00 63 00 74 00 20 00 ┆ p•r•e•d•i•c•t• •
221165 00 78 00 70 00 6c 00 6f 00 64 00 65 00 ┆ e•x•p•l•o•d•e•
2212----
2213====
2214
2215====
2216Input:
2217
2218----
2219!macro A(val, is_be)
2220 {le}
2221
2222 !if is_be
2223 {be}
2224 !end
2225
2226 {val : 16}
2227!end
2228
2229!macro B(rep, is_be)
2230 {iter = 1}
2231
2232 !repeat rep
2233 m:A({iter * 3}, is_be)
2234 {iter = iter + 1}
2235 !end
2236!end
2237
2238m:B(5, 1)
2239m:B(3, 0)
2240----
2241
2242Output:
2243
2244----
224500 03 00 06 00 09 00 0c 00 0f 03 00 06 00 09 00
2246----
2247====
2248
dbd84e74
PP
2249====
2250Input:
2251
2252----
2253!macro flt32be(val) {be} {val : 32} !end
2254
2255"CHEETOS"
2256m:flt32be(-42.17)
2257m:flt32be(56.23e-4)
2258----
2259
2260Output:
2261
2262----
226343 48 45 45 54 4f 53 c2 28 ae 14 3b b8 41 25 ┆ CHEETOS•(••;•A%
2264----
2265====
2266
e57a18e1
PP
2267=== Post-item repetition
2268
2269A _post-item repetition_ represents the bytes of an item repeated a
2270given number of times.
2271
2272A post-item repetition is:
2273
27d52a19 2274. One of those items:
e57a18e1 2275
27d52a19
PP
2276** A <<byte-constant,byte constant>>.
2277** A <<literal-string,literal string>>.
2278** A <<fixed-length-number,fixed-length number>>.
2279** An <<leb128-integer,LEB128 integer>>.
7a7b31e8 2280** A <<string,string>>.
320644e2 2281** A <<macro-expansion,macro-expansion>>.
cd33dfe6 2282** A <<transformation-block,transformation block>>.
27d52a19 2283** A <<group,group>>.
e57a18e1
PP
2284
2285. The ``pass:[*]`` character.
2286
2287. One of:
2288
2289** A positive integer (hexadecimal starting with `0x` or `0X` accepted)
2290 which is the number of times to repeat the previous item.
2291
27d52a19
PP
2292** The ``pass:[{]`` prefix, a valid {py3} expression of which the
2293 evaluation result type is `int` or `bool` (automatically converted to
2294 `int`), and the ``pass:[}]`` suffix.
e57a18e1 2295+
320644e2
PP
2296For a post-item repetition at some source location{nbsp}__**L**__, this
2297expression may contain:
e57a18e1
PP
2298+
2299--
27d52a19
PP
2300* The name of any <<label,label>> defined before{nbsp}__**L**__
2301 which isn't within a nested group and
2302 which isn't part of the repeated item.
e57a18e1
PP
2303* The name of any <<variable-assignment,variable>> known
2304 at{nbsp}__**L**__, which isn't part of its repeated item, and which
320644e2 2305 doesn't.
e57a18e1
PP
2306--
2307+
2308The value of the special name `ICITTE` (`int` type) in this expression
2309is the <<cur-offset,current offset>> (before handling the items to
2310repeat).
2311
2312** A valid {py3} name.
2313+
2314For the name `__NAME__`, this is equivalent to the
2315`pass:[{]__NAME__pass:[}]` form above.
2316
2317You may also use a <<repetition-block,repetition block>>. The form
2318``__ITEM__{nbsp}pass:[*]{nbsp}__X__`` is equivalent to
2319``!repeat{nbsp}__X__{nbsp}__ITEM__{nbsp}!end``.
2320
2321====
2322Input:
2323
2324----
2325{end - ICITTE - 1 : 8} * 0x100 <end>
2326----
2327
2328Output:
2329
2330----
2331ff fe fd fc fb fa f9 f8 f7 f6 f5 f4 f3 f2 f1 f0 ┆ ••••••••••••••••
2332ef ee ed ec eb ea e9 e8 e7 e6 e5 e4 e3 e2 e1 e0 ┆ ••••••••••••••••
2333df de dd dc db da d9 d8 d7 d6 d5 d4 d3 d2 d1 d0 ┆ ••••••••••••••••
2334cf ce cd cc cb ca c9 c8 c7 c6 c5 c4 c3 c2 c1 c0 ┆ ••••••••••••••••
2335bf be bd bc bb ba b9 b8 b7 b6 b5 b4 b3 b2 b1 b0 ┆ ••••••••••••••••
2336af ae ad ac ab aa a9 a8 a7 a6 a5 a4 a3 a2 a1 a0 ┆ ••••••••••••••••
23379f 9e 9d 9c 9b 9a 99 98 97 96 95 94 93 92 91 90 ┆ ••••••••••••••••
23388f 8e 8d 8c 8b 8a 89 88 87 86 85 84 83 82 81 80 ┆ ••••••••••••••••
23397f 7e 7d 7c 7b 7a 79 78 77 76 75 74 73 72 71 70 ┆ •~}|{zyxwvutsrqp
23406f 6e 6d 6c 6b 6a 69 68 67 66 65 64 63 62 61 60 ┆ onmlkjihgfedcba`
23415f 5e 5d 5c 5b 5a 59 58 57 56 55 54 53 52 51 50 ┆ _^]\[ZYXWVUTSRQP
23424f 4e 4d 4c 4b 4a 49 48 47 46 45 44 43 42 41 40 ┆ ONMLKJIHGFEDCBA@
23433f 3e 3d 3c 3b 3a 39 38 37 36 35 34 33 32 31 30 ┆ ?>=<;:9876543210
23442f 2e 2d 2c 2b 2a 29 28 27 26 25 24 23 22 21 20 ┆ /.-,+*)('&%$#"!
23451f 1e 1d 1c 1b 1a 19 18 17 16 15 14 13 12 11 10 ┆ ••••••••••••••••
23460f 0e 0d 0c 0b 0a 09 08 07 06 05 04 03 02 01 00 ┆ ••••••••••••••••
2347----
2348====
2349
2350====
2351Input:
2352
2353----
2354{times = 1}
2355aa bb cc dd
2356(
2357 <here>
2358 (ee ff) * {here + 1}
2359 11 22 33 * {times}
2360 {times = times + 1}
2361) * 3
2362"coucou!"
2363----
2364
2365Output:
2366
2367----
2368aa bb cc dd ee ff ee ff ee ff ee ff ee ff 11 22 ┆ •••••••••••••••"
236933 ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ 3•••••••••••••••
2370ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2371ff ee ff ee ff 11 22 33 33 ee ff ee ff ee ff ee ┆ ••••••"33•••••••
2372ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2373ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2374ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2375ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2376ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2377ff ee ff ee ff ee ff ee ff ee ff ee ff ee ff ee ┆ ••••••••••••••••
2378ff ee ff ee ff ee ff ee ff ee ff ee ff 11 22 33 ┆ ••••••••••••••"3
237933 33 63 6f 75 63 6f 75 21 ┆ 33coucou!
2380----
2381====
2382
71aaa3f7
PP
2383== Command-line tool
2384
2385If you <<install-normand,installed>> the `normand` package, then you
2386can use the `normand` command-line tool:
2387
2388----
2389$ normand <<< '"ma gang de malades"' | hexdump -C
2390----
2391
2392----
239300000000 6d 61 20 67 61 6e 67 20 64 65 20 6d 61 6c 61 64 |ma gang de malad|
239400000010 65 73 |es|
2395----
2396
2397If you copy the `normand.py` module to your own project, then you can
2398run the module itself:
2399
2400----
2401$ python3 -m normand <<< '"ma gang de malades"' | hexdump -C
2402----
2403
2404----
240500000000 6d 61 20 67 61 6e 67 20 64 65 20 6d 61 6c 61 64 |ma gang de malad|
240600000010 65 73 |es|
2407----
2408
2409Without a path argument, the `normand` tool reads from the standard
2410input.
2411
2412The `normand` tool prints the generated binary data to the standard
2413output.
2414
2415Various options control the initial <<state,state>> of the processor:
2416use the `--help` option to learn more.
2417
2418== {py3} API
2419
e57a18e1 2420The whole `normand` package/module public API is:
71aaa3f7
PP
2421
2422[source,python]
2423----
e57a18e1 2424# Byte order.
71aaa3f7
PP
2425class ByteOrder(enum.Enum):
2426 # Big endian.
2427 BE = ...
2428
2429 # Little endian.
2430 LE = ...
2431
2432
e57a18e1
PP
2433# Text location.
2434class TextLocation:
71aaa3f7
PP
2435 # Line number.
2436 @property
2437 def line_no(self) -> int:
2438 ...
2439
2440 # Column number.
2441 @property
2442 def col_no(self) -> int:
2443 ...
2444
2445
f5dcb24c
PP
2446# Parsing error message.
2447class ParseErrorMessage:
2448 # Message text.
2449 @property
2450 def text(self):
2451 ...
2452
2453 # Source text location.
2454 @property
2455 def text_location(self):
2456 ...
2457
2458
e57a18e1 2459# Parsing error.
71aaa3f7 2460class ParseError(RuntimeError):
f5dcb24c
PP
2461 # Parsing error messages.
2462 #
2463 # The first message is the most _specific_ one.
71aaa3f7 2464 @property
f5dcb24c 2465 def messages(self):
71aaa3f7
PP
2466 ...
2467
2468
e57a18e1
PP
2469# Variables dictionary type (for type hints).
2470VariablesT = typing.Dict[str, typing.Union[int, float]]
2471
2472
2473# Labels dictionary type (for type hints).
2474LabelsT = typing.Dict[str, int]
1b8aa84a
PP
2475
2476
e57a18e1 2477# Parsing result.
71aaa3f7
PP
2478class ParseResult:
2479 # Generated data.
2480 @property
2481 def data(self) -> bytearray:
2482 ...
2483
2484 # Updated variable values.
2485 @property
1b8aa84a 2486 def variables(self) -> SymbolsT:
71aaa3f7
PP
2487 ...
2488
2489 # Updated main group label values.
2490 @property
1b8aa84a 2491 def labels(self) -> SymbolsT:
71aaa3f7
PP
2492 ...
2493
2494 # Final offset.
2495 @property
2496 def offset(self) -> int:
2497 ...
2498
2499 # Final byte order.
2500 @property
1b8aa84a 2501 def byte_order(self) -> typing.Optional[ByteOrder]:
71aaa3f7
PP
2502 ...
2503
1b8aa84a 2504
e57a18e1
PP
2505# Parses the `normand` input using the initial state defined by
2506# `init_variables`, `init_labels`, `init_offset`, and `init_byte_order`,
2507# and returns the corresponding parsing result.
71aaa3f7 2508def parse(normand: str,
1b8aa84a
PP
2509 init_variables: typing.Optional[SymbolsT] = None,
2510 init_labels: typing.Optional[SymbolsT] = None,
71aaa3f7
PP
2511 init_offset: int = 0,
2512 init_byte_order: typing.Optional[ByteOrder] = None) -> ParseResult:
2513 ...
2514----
2515
2516The `normand` parameter is the actual <<learn-normand,Normand input>>
2517while the other parameters control the initial <<state,state>>.
2518
2519The `parse()` function raises a `ParseError` instance should it fail to
2520parse the `normand` string for any reason.
bf8f3b38
PP
2521
2522== Development
2523
2524Normand is a https://python-poetry.org/[Poetry] project.
2525
2526To develop it, install it through Poetry and enter the virtual
2527environment:
2528
2529----
2530$ poetry install
2531$ poetry shell
2532$ normand <<< '"lol" * 10 0a'
2533----
2534
2535`normand.py` is processed by:
2536
2537* https://microsoft.github.io/pyright/[Pyright]
2538* https://github.com/psf/black[Black]
2539* https://pycqa.github.io/isort/[isort]
2540
2541=== Testing
2542
2543Use https://docs.pytest.org/[pytest] to test Normand once the package is
2544part of your virtual environment, for example:
2545
2546----
2547$ poetry install
2548$ poetry run pip3 install pytest
2549$ poetry run pytest
2550----
2551
2552The `pytest` project is currently not a development dependency in
2553`pyproject.toml` due to backward compatibiliy issues with
2554Python{nbsp}3.4.
2555
2556In the `tests` directory, each `*.nt` file is a test. The file name
2557prefix indicates what it's meant to test:
2558
2559`pass-`::
2560 Everything above the `---` line is the valid Normand input
2561 to test.
2562+
2563Everything below the `---` line is the expected data
2564(whitespace-separated hexadecimal bytes).
2565
2566`fail-`::
2567 Everything above the `---` line is the invalid Normand input
2568 to test.
2569+
2570Everything below the `---` line is the expected error message having
2571this form:
2572+
2573----
2574LINE:COL - MESSAGE
2575----
2576
2577=== Contributing
2578
2579Normand uses https://review.lttng.org/admin/repos/normand,general[Gerrit]
2580for code review.
2581
2582To report a bug, https://github.com/efficios/normand/issues/new[create a
2583GitHub issue].
This page took 0.130599 seconds and 4 git commands to generate.