Document -Map option.
[deliverable/binutils-gdb.git] / ld / ld.texinfo
1 \input texinfo
2 @setfilename ld.info
3 @syncodeindex ky cp
4 @c @include configdoc.texi
5
6 @c @smallbook
7 @c @cropmarks
8
9 @ifinfo
10 @format
11 START-INFO-DIR-ENTRY
12 * Ld:: The GNU linker.
13 END-INFO-DIR-ENTRY
14 @end format
15 @end ifinfo
16
17 @ifinfo
18 This file documents the GNU linker LD.
19
20 Copyright (C) 1991, 1992, 1993 Free Software Foundation, Inc.
21
22 Permission is granted to make and distribute verbatim copies of
23 this manual provided the copyright notice and this permission notice
24 are preserved on all copies.
25
26 Permission is granted to copy and distribute modified versions of this
27 manual under the conditions for verbatim copying, provided also that
28 the entire resulting derived work is distributed under the terms of a
29 permission notice identical to this one.
30
31 Permission is granted to copy and distribute translations of this manual
32 into another language, under the above conditions for modified versions.
33
34 @ignore
35 Permission is granted to process this file through Tex and print the
36 results, provided the printed document carries copying permission
37 notice identical to this one except for the removal of this paragraph
38 (this paragraph not being relevant to the printed manual).
39
40 @end ignore
41 @end ifinfo
42 @iftex
43 @finalout
44 @setchapternewpage odd
45 @settitle Using LD, the GNU linker
46 @titlepage
47 @title Using ld
48 @subtitle The GNU linker
49 @sp 1
50 @subtitle @code{ld} version 2
51 @subtitle March 1993
52 @author Steve Chamberlain and Roland Pesch
53 @author Cygnus Support
54 @page
55
56 @tex
57 {\parskip=0pt
58 \hfill Cygnus Support\par
59 \hfill steve\@cygnus.com, pesch\@cygnus.com\par
60 \hfill {\it Using LD, the GNU linker}\par
61 \hfill Edited by Jeffrey Osier (jeffrey\@cygnus.com), March 1993.\par
62 }
63 \global\parindent=0pt % Steve likes it this way.
64 @end tex
65
66 @vskip 0pt plus 1filll
67 Copyright @copyright{} 1991, 1992, 1993 Free Software Foundation, Inc.
68
69 Permission is granted to make and distribute verbatim copies of
70 this manual provided the copyright notice and this permission notice
71 are preserved on all copies.
72
73 Permission is granted to copy and distribute modified versions of this
74 manual under the conditions for verbatim copying, provided also that
75 the entire resulting derived work is distributed under the terms of a
76 permission notice identical to this one.
77
78 Permission is granted to copy and distribute translations of this manual
79 into another language, under the above conditions for modified versions.
80 @end titlepage
81 @end iftex
82 @c FIXME: Talk about importance of *order* of args, cmds to linker!
83
84 @ifinfo
85 @node Top
86 @top Using ld
87 This file documents the GNU linker ld.
88
89 @menu
90 * Overview:: Overview
91 * Invocation:: Invocation
92 * Commands:: Command Language
93 @ifset GENERIC
94 * Machine Dependent:: Machine Dependent Features
95 @end ifset
96 @ifclear GENERIC
97 @ifset H8300
98 * H8/300:: ld and the H8/300
99 @end ifset
100 @ifset I960
101 * i960:: ld and the Intel 960 family
102 @end ifset
103 @end ifclear
104 @ifclear SingleFormat
105 * BFD:: BFD
106 @end ifclear
107 @c Following blank line required for remaining bug in makeinfo conds/menus
108
109 * MRI:: MRI Compatible Script Files
110 * Index:: Index
111 @end menu
112 @end ifinfo
113
114 @node Overview
115 @chapter Overview
116
117 @cindex GNU linker
118 @cindex what is this?
119 @code{ld} combines a number of object and archive files, relocates
120 their data and ties up symbol references. Usually the last step in
121 compiling a program is to run @code{ld}.
122
123 @code{ld} accepts Linker Command Language files written in
124 a superset of AT&T's Link Editor Command Language syntax,
125 to provide explicit and total control over the linking process.
126
127 @ifclear SingleFormat
128 This version of @code{ld} uses the general purpose BFD libraries
129 to operate on object files. This allows @code{ld} to read, combine, and
130 write object files in many different formats---for example, COFF or
131 @code{a.out}. Different formats may be linked together to produce any
132 available kind of object file. @xref{BFD} for a list of formats
133 supported on various architectures.
134 @end ifclear
135
136 Aside from its flexibility, the GNU linker is more helpful than other
137 linkers in providing diagnostic information. Many linkers abandon
138 execution immediately upon encountering an error; whenever possible,
139 @code{ld} continues executing, allowing you to identify other errors
140 (or, in some cases, to get an output file in spite of the error).
141
142 @node Invocation
143 @chapter Invocation
144
145 The GNU linker @code{ld} is meant to cover a broad range of situations,
146 and to be as compatible as possible with other linkers. As a result,
147 you have many choices to control its behavior.
148
149 @ifset UsesEnvVars
150 @menu
151 * Options:: Command Line Options
152 * Environment:: Environment Variables
153 @end menu
154
155 @node Options
156 @section Command Line Options
157 @end ifset
158
159 @cindex command line
160 @cindex options
161 Here is a summary of the options you can use on the @code{ld} command
162 line:
163
164 @c FIXME! -relax only avail h8/300, i960. Conditionals screwed in examples.
165 @smallexample
166 ld [-o @var{output} ] @var{objfile}@dots{}
167 [ -A@var{architecture} ] [ -b @var{input-format} ] [ -Bstatic ]
168 [ -c @var{MRI-commandfile} ] [ -d | -dc | -dp ]
169 [ -defsym @var{symbol}=@var{expression} ]
170 [ -e @var{entry} ] [ -F ] [ -F @var{format} ]
171 [ -format @var{input-format} ] [ -g ] [ -G @var{size} ] [ -i ]
172 [ -l@var{ar} ] [ -L@var{searchdir} ] [ -M ] [ -Map @var{mapfile} ]
173 [ -m @var{emulation} ] [ -N | -n ] [ -noinhibit-exec ]
174 [ -R @var{filename} ] [ -relax ] [ -r | -Ur ] [ -S ] [ -s ]
175 [ -T @var{commandfile} ] [ -Ttext @var{textorg} ] [ -Tdata @var{dataorg} ]
176 [ -Tbss @var{bssorg} ] [ -t ] [ -u @var{sym}] [-V] [-v] [ -X ] [-x ]
177 [ -y@var{symbol} ] [ @{ @var{script} @} ]
178 @end smallexample
179
180 This plethora of command-line options may seem intimidating, but in
181 actual practice few of them are used in any particular context.
182 @cindex standard Unix system
183 For instance, a frequent use of @code{ld} is to link standard Unix
184 object files on a standard, supported Unix system. On such a system, to
185 link a file @code{hello.o}:
186
187 @example
188 ld -o @var{output} /lib/crt0.o hello.o -lc
189 @end example
190
191 This tells @code{ld} to produce a file called @var{output} as the
192 result of linking the file @code{/lib/crt0.o} with @code{hello.o} and
193 the library @code{libc.a}, which will come from the standard search
194 directories. (See the discussion of the @samp{-l} option below.)
195
196 The command-line options to @code{ld} may be specified in any order, and
197 may be repeated at will. Repeating most options with a
198 different argument will either have no further effect, or override prior
199 occurrences (those further to the left on the command line) of that
200 option.
201
202 @ifclear SingleFormat
203 The exceptions---which may meaningfully be used more than once---are
204 @samp{-A}, @samp{-b} (or its synonym @samp{-format}), @samp{-defsym},
205 @samp{-L}, @samp{-l}, @samp{-R}, and @samp{-u}.
206 @end ifclear
207 @ifset SingleFormat
208 The exceptions---which may meaningfully be used more than once---are
209 @samp{-A}, @samp{-defsym}, @samp{-L}, @samp{-l}, @samp{-R}, and @samp{-u}.
210 @end ifset
211
212 @cindex object files
213 The list of object files to be linked together, shown as @var{objfile}@dots{},
214 may follow, precede, or be mixed in with command-line options, except that
215 an @var{objfile} argument may not be placed between an option and
216 its argument.
217
218 Usually the linker is invoked with at least one object file, but other
219 forms of binary input files can also be specified with @samp{-l},
220 @samp{-R}, and the script command language. If @emph{no} binary input
221 files at all are specified, the linker does not produce any output, and
222 issues the message @samp{No input files}.
223
224 Option arguments must either follow the option letter without intervening
225 whitespace, or be given as separate arguments immediately following the
226 option that requires them.
227
228 @table @code
229 @item @var{objfile}@dots{}
230 The object files to be linked.
231
232 @ifset I960
233 @cindex architectures
234 @kindex -A@var{arch}
235 @item -A@var{architecture}
236 In the current release of @code{ld}, this option is useful only for the
237 Intel 960 family of architectures. In that @code{ld} configuration, the
238 @var{architecture} argument identifies the particular architecture in
239 the 960 family, enabling some safeguards and modifying the
240 archive-library search path. @xref{i960,,@code{ld} and the Intel 960
241 family}, for details.
242
243 Future releases of @code{ld} may support similar functionality for
244 other architecture families.
245 @end ifset
246
247 @ifclear SingleFormat
248 @cindex binary input format
249 @kindex -b @var{format}
250 @cindex input format
251 @item -b @var{input-format}
252 @cindex input format
253 Specify the binary format for input object files that follow this option
254 on the command line. You don't usually need to specify this, as
255 @code{ld} is configured to expect as a default input format the most
256 usual format on each machine. @var{input-format} is a text string, the
257 name of a particular format supported by the BFD libraries.
258 @w{@samp{-format @var{input-format}}} has the same effect. @xref{BFD}.
259
260 You may want to use this option if you are linking files with an unusual
261 binary format. You can also use @samp{-b} to switch formats explicitly (when
262 linking object files of different formats), by including
263 @samp{-b @var{input-format}} before each group of object files in a
264 particular format.
265
266 The default format is taken from the environment variable
267 @code{GNUTARGET}.
268 @ifset UsesEnvVars
269 @xref{Environment}.
270 @end ifset
271 You can also define the input
272 format from a script, using the command @code{TARGET}; see @ref{Other
273 Commands}.
274 @end ifclear
275
276 @kindex -Bstatic
277 @item -Bstatic
278 Ignored. This option is accepted for command-line compatibility with
279 the SunOS linker.
280
281 @kindex -c @var{MRI-cmdfile}
282 @cindex compatibility, MRI
283 @item -c @var{MRI-commandfile}
284 For compatibility with linkers produced by MRI, @code{ld} accepts script
285 files written in an alternate, restricted command language, described in
286 @ref{MRI,,MRI Compatible Script Files}. Introduce MRI script files with
287 the option @samp{-c}; use the @samp{-T} option to run linker
288 scripts written in the general-purpose @code{ld} scripting language.
289 If @var{MRI-cmdfile} does not exist, @code{ld} looks for it in the directories
290 specified by any @samp{-L} options.
291
292 @cindex common allocation
293 @kindex -d
294 @item -d
295 @kindex -dc
296 @itemx -dc
297 @kindex -dp
298 @itemx -dp
299 These three options are equivalent; multiple forms are supported for
300 compatibility with other linkers. They
301 assign space to common symbols even if a relocatable output file is
302 specified (with @samp{-r}). The script command
303 @code{FORCE_COMMON_ALLOCATION} has the same effect. @xref{Other
304 Commands}.
305
306 @cindex symbols, from command line
307 @kindex -defsym @var{symbol}=@var{exp}
308 @item -defsym @var{symbol}=@var{expression}
309 Create a global symbol in the output file, containing the absolute
310 address given by @var{expression}. You may use this option as many
311 times as necessary to define multiple symbols in the command line. A
312 limited form of arithmetic is supported for the @var{expression} in this
313 context: you may give a hexadecimal constant or the name of an existing
314 symbol, or use @code{+} and @code{-} to add or subtract hexadecimal
315 constants or symbols. If you need more elaborate expressions, consider
316 using the linker command language from a script (@pxref{Assignment, ,
317 Assignment: Symbol Definitions}). @emph{Note:} there should be no
318 white space between @var{symbol}, the equals sign (``@key{=}''), and
319 @var{expression}.
320
321 @cindex entry point, from command line
322 @kindex -e @var{entry}
323 @item -e @var{entry}
324 Use @var{entry} as the explicit symbol for beginning execution of your
325 program, rather than the default entry point. @xref{Entry Point}, for a
326 discussion of defaults and other ways of specifying the
327 entry point.
328
329 @ifclear SingleFormat
330 @kindex -F
331 @item -F
332 @itemx -F@var{format}
333 Ignored. Some older linkers used this option throughout a compilation
334 toolchain for specifying object-file format for both input and output
335 object files. The mechanisms @code{ld} uses for this purpose (the
336 @samp{-b} or @samp{-format} options for input files, the @code{TARGET}
337 command in linker scripts for output files, the @code{GNUTARGET}
338 environment variable) are more flexible, but @code{ld} accepts the
339 @samp{-F} option for compatibility with scripts written to call the old
340 linker.
341
342 @kindex -format
343 @item -format @var{input-format}
344 Synonym for @samp{-b @var{input-format}}.
345 @end ifclear
346
347 @kindex -g
348 @item -g
349 Ignored. Provided for compatibility with other tools.
350
351 @kindex -G
352 @cindex object size
353 @item -G@var{value}
354 @itemx -G @var{value}
355 Set the maximum size of objects to be optimized using the GP register to
356 @var{size} under MIPS ECOFF. Ignored for other object file formats.
357
358 @kindex -i
359 @cindex incremental link
360 @item -i
361 Perform an incremental link (same as option @samp{-r}).
362
363 @cindex archive files, from cmd line
364 @kindex -l@var{ar}
365 @item -l@var{ar}
366 Add archive file @var{ar} to the list of files to link. This
367 option may be used any number of times. @code{ld} will search its
368 path-list for occurrences of @code{lib@var{ar}.a} for every @var{ar}
369 specified.
370
371 @cindex search directory, from cmd line
372 @kindex -L@var{dir}
373 @item -L@var{searchdir}
374 Add path @var{searchdir} to the list of paths that @code{ld} will search
375 for archive libraries and @code{ld} control scripts. You may use this
376 option any number of times.
377
378 @ifset UsesEnvVars
379 The default set of paths searched (without being specified with
380 @samp{-L}) depends on which emulation mode @code{ld} is using, and in
381 some cases also on how it was configured. @xref{Environment}.
382 @end ifset
383
384 The paths can also be specified in a link script with the
385 @code{SEARCH_DIR} command.
386
387 @cindex link map
388 @kindex -M
389 @item -M
390 Print (to the standard output) a link map---diagnostic information
391 about where symbols are mapped by @code{ld}, and information on global
392 common storage allocation.
393
394 @cindex link map
395 @kindex -Map
396 @item -Map @var{mapfile}
397 Print to the file @var{mapfile} a link map---diagnostic information
398 about where symbols are mapped by @code{ld}, and information on global
399 common storage allocation.
400
401 @cindex emulation
402 @kindex -m @var{emulation}
403 @item -m@var{emulation}
404 @itemx -m @var{emulation}
405 Emulate the @var{emulation} linker. You can list the available
406 emulations with the @samp{-V} option. The
407 default is the system for which you configured @code{ld}.
408
409 @kindex -N
410 @cindex read/write from cmd line
411 @kindex OMAGIC
412 @item -N
413 Set the text and data sections to be readable and writable. Also, do
414 not page-align the data segment. If the output format supports Unix
415 style magic numbers, mark the output as @code{OMAGIC}.
416
417 @item -n
418 @kindex -n
419 @cindex read-only text
420 @kindex NMAGIC
421 Set the text segment to be read only, and mark the output as
422 @code{NMAGIC} if possible.
423
424 @item -noinhibit-exec
425 @cindex output file after errors
426 @kindex -noinhibit-exec
427 Retain the executable output file whenever it is still usable.
428 Normally, the linker will not produce an output file if it encounters
429 errors during the link process; it exits without writing an output file
430 when it issues any error whatsoever.
431
432 @item -o @var{output}
433 @kindex -o @var{output}
434 @cindex naming the output file
435 Use @var{output} as the name for the program produced by @code{ld}; if this
436 option is not specified, the name @file{a.out} is used by default. The
437 script command @code{OUTPUT} can also specify the output file name.
438
439 @item -R @var{filename}
440 @kindex -R @var{file}
441 @cindex symbol-only input
442 On some platforms, this option performs global optimizations
443 that become possible when the linker resolves addressing in the
444 program, such as relaxing address modes and synthesizing new
445 instructions in the output object file.
446
447 @item -relax
448 @kindex -relax
449 @cindex synthesizing linker
450 @cindex relaxing addressing modes
451 An option with machine dependent effects. Currently this option is only
452 supported on the H8/300.
453 @ifset H8300
454 @xref{H8/300,,@code{ld} and the H8/300}.
455 @end ifset
456
457 On some platforms, use option performs global optimizations that
458 become possible when the linker resolves addressing in the program, such
459 as relaxing address modes and synthesizing new instructions in the
460 output object file.
461
462 On platforms where this is not supported, @samp{-relax} is accepted, but
463 ignored.
464
465 @item -r
466 @cindex partial link
467 @cindex relocatable output
468 @kindex -r
469 Generate relocatable output---i.e., generate an output file that can in
470 turn serve as input to @code{ld}. This is often called @dfn{partial
471 linking}. As a side effect, in environments that support standard Unix
472 magic numbers, this option also sets the output file's magic number to
473 @code{OMAGIC}.
474 @c ; see @code{-N}.
475 If this option is not specified, an absolute file is produced. When
476 linking C++ programs, this option @emph{will not} resolve references to
477 constructors; to do that, use @samp{-Ur}.
478
479 This option does the same as @code{-i}.
480
481 @item -S
482 @kindex -S
483 @cindex strip debugger symbols
484 Omit debugger symbol information (but not all symbols) from the output file.
485
486 @item -s
487 @kindex -s
488 @cindex strip all symbols
489 Omit all symbol information from the output file.
490
491 @item @{ @var{script} @}
492 @kindex @{ @var{script} @}
493 @cindex scripts on command line
494 You can, if you wish, include a script of linker commands directly in
495 the command line instead of referring to it via an input file. When the
496 character @samp{@{} occurs on the command line, the linker switches to
497 interpreting the command language until the end of the list of commands
498 is reached; the end is indicated with a closing brace @samp{@}}.
499 @code{ld} does not recognize other command-line options while parsing
500 the script. @xref{Commands}, for a description of the command language.
501
502 @item -Tbss @var{bssorg}
503 @kindex -Tbss @var{bssorg}
504 @itemx -Tdata @var{dataorg}
505 @kindex -Tdata @var{dataorg}
506 @itemx -Ttext @var{textorg}
507 @kindex -Ttext @var{textorg}
508 @cindex segment origins, cmd line
509 Use @var{org} as the starting address for---respectively---the
510 @code{bss}, @code{data}, or the @code{text} segment of the output file.
511 @var{org} must be a single hexadecimal integer;
512 for compatibility with other linkers, you may omit the leading
513 @samp{0x} usually associated with hexadecimal values.
514
515 @item -T @var{commandfile}
516 @itemx -T@var{commandfile}
517 @kindex -T @var{script}
518 @cindex script files
519 Read link commands from the file
520 @var{commandfile}. These commands completely override @code{ld}'s
521 default link format (rather than adding to it); @var{commandfile} must
522 specify everything necessary to describe the target format.
523 @xref{Commands}.
524 If @var{commandfile} does not exist, @code{ld} looks for it in the directories
525 specified by any @samp{-L} options.
526
527 You may also include a script of link commands directly in the command
528 line by bracketing it between @samp{@{} and @samp{@}}.
529
530 @item -t
531 @kindex -t
532 @cindex verbose
533 @cindex input files, displaying
534 Print the names of the input files as @code{ld} processes them.
535
536 @item -u @var{sym}
537 @kindex -u @var{sym}
538 @cindex undefined symbol
539 Force @var{sym} to be entered in the output file as an undefined symbol.
540 Doing this may, for example, trigger linking of additional modules from
541 standard libraries. @samp{-u} may be repeated with different option
542 arguments to enter additional undefined symbols.
543 @c Nice idea, but no such command: This option is equivalent
544 @c to the @code{EXTERN} linker command.
545
546 @item -Ur
547 @kindex -Ur
548 @cindex constructors
549 For anything other than C++ programs, this option is equivalent to
550 @samp{-r}: it generates relocatable output---i.e., an output file that can in
551 turn serve as input to @code{ld}. When linking C++ programs, @samp{-Ur}
552 @emph{will} resolve references to constructors, unlike @samp{-r}.
553
554 @item -V
555 @kindex -V
556 @cindex version
557 Display the version number for @code{ld} and list the supported emulations.
558 Print which input files can and can not be opened.
559
560 @item -v
561 @kindex -v
562 @cindex version
563 Display the version number for @code{ld}.
564 Print which input files can and can not be opened.
565
566 @item -X
567 @kindex -X
568 @cindex local symbols, deleting
569 @cindex L, deleting symbols beginning
570 If @samp{-s} or @samp{-S} is also specified, delete only local symbols
571 beginning with @samp{L}.
572
573 @item -x
574 @kindex -x
575 @cindex deleting local symbols
576 If @samp{-s} or @samp{-S} is also specified, delete all local symbols,
577 not just those beginning with @samp{L}.
578
579 @item -y@var{symbol}
580 @kindex -y@var{symbol}
581 @cindex symbol tracing
582 Print the name of each linked file in which @var{symbol} appears. This
583 option may be given any number of times. On many systems it is necessary
584 to prepend an underscore.
585
586 This option is useful when you have an undefined symbol in your link but
587 don't know where the reference is coming from.
588 @end table
589
590 @ifset UsesEnvVars
591 @node Environment
592 @section Environment Variables
593
594 You can change the behavior of @code{ld} with the environment
595 variable @code{GNUTARGET}.
596
597 @kindex GNUTARGET
598 @cindex default input format
599 @code{GNUTARGET} determines the input-file object format if you don't
600 use @samp{-b} (or its synonym @samp{-format}). Its value should be one
601 of the BFD names for an input format (@pxref{BFD}). If there is no
602 @code{GNUTARGET} in the environment, @code{ld} uses the natural format
603 of the host. If @code{GNUTARGET} is set to @code{default} then BFD attempts to discover the
604 input format by examining binary input files; this method often
605 succeeds, but there are potential ambiguities, since there is no method
606 of ensuring that the magic number used to specify object-file formats is
607 unique. However, the configuration procedure for BFD on each system
608 places the conventional format for that system first in the search-list,
609 so ambiguities are resolved in favor of convention.
610 @end ifset
611
612 @node Commands
613 @chapter Command Language
614
615 @cindex command files
616 The command language provides explicit control over the link process,
617 allowing complete specification of the mapping between the linker's
618 input files and its output. It controls:
619 @itemize @bullet
620 @item
621 input files
622 @item
623 file formats
624 @item
625 output file format
626 @item
627 addresses of sections
628 @item
629 placement of common blocks
630 @end itemize
631
632 You may supply a command file (also known as a link script) to the
633 linker either explicitly through the @samp{-T} option, or implicitly as
634 an ordinary file. If the linker opens a file which it cannot recognize
635 as a supported object or archive format, it tries to interpret the file
636 as a command file.
637
638 You can also include a script directly on the @code{ld} command line,
639 delimited by the characters @samp{@{} and @samp{@}}.
640
641 @menu
642 * Scripts:: Linker Scripts
643 * Expressions:: Expressions
644 * MEMORY:: MEMORY Command
645 * SECTIONS:: SECTIONS Command
646 * Entry Point:: The Entry Point
647 * Other Commands:: Other Commands
648 @end menu
649
650 @node Scripts
651 @section Linker Scripts
652 The @code{ld} command language is a collection of statements; some are
653 simple keywords setting a particular option, some are used to select and
654 group input files or name output files; and two statement
655 types have a fundamental and pervasive impact on the linking process.
656
657 @cindex fundamental script commands
658 @cindex commands, fundamental
659 @cindex output file layout
660 @cindex layout of output file
661 The most fundamental command of the @code{ld} command language is the
662 @code{SECTIONS} command (@pxref{SECTIONS}). Every meaningful command
663 script must have a @code{SECTIONS} command: it specifies a
664 ``picture'' of the output file's layout, in varying degrees of detail.
665 No other command is required in all cases.
666
667 The @code{MEMORY} command complements @code{SECTIONS} by describing the
668 available memory in the target architecture. This command is optional;
669 if you don't use a @code{MEMORY} command, @code{ld} assumes sufficient
670 memory is available in a contiguous block for all output.
671 @xref{MEMORY}.
672
673 @cindex comments
674 You may include comments in linker scripts just as in C: delimited
675 by @samp{/*} and @samp{*/}. As in C, comments are syntactically
676 equivalent to whitespace.
677
678 @node Expressions
679 @section Expressions
680 @cindex expression syntax
681 @cindex arithmetic
682 Many useful commands involve arithmetic expressions. The syntax for
683 expressions in the command language is identical to that of C
684 expressions, with the following features:
685 @itemize @bullet
686 @item
687 All expressions evaluated as integers and
688 are of ``long'' or ``unsigned long'' type.
689 @item
690 All constants are integers.
691 @item
692 All of the C arithmetic operators are provided.
693 @item
694 You may reference, define, and create global variables.
695 @item
696 You may call special purpose built-in functions.
697 @end itemize
698
699 @menu
700 * Integers:: Integers
701 * Symbols:: Symbol Names
702 * Location Counter:: The Location Counter
703 * Operators:: Operators
704 * Evaluation:: Evaluation
705 * Assignment:: Assignment: Defining Symbols
706 * Built-ins:: Built-In Functions
707 @end menu
708
709 @node Integers
710 @subsection Integers
711 @cindex integer notation
712 @cindex octal integers
713 An octal integer is @samp{0} followed by zero or more of the octal
714 digits (@samp{01234567}).
715 @example
716 _as_octal = 0157255;
717 @end example
718
719 @cindex decimal integers
720 A decimal integer starts with a non-zero digit followed by zero or
721 more digits (@samp{0123456789}).
722 @example
723 _as_decimal = 57005;
724 @end example
725
726 @cindex hexadecimal integers
727 @kindex 0x
728 A hexadecimal integer is @samp{0x} or @samp{0X} followed by one or
729 more hexadecimal digits chosen from @samp{0123456789abcdefABCDEF}.
730 @example
731 _as_hex = 0xdead;
732 @end example
733
734 @cindex negative integers
735 To write a negative integer, use
736 the prefix operator @samp{-}; @pxref{Operators}.
737 @example
738 _as_neg = -57005;
739 @end example
740
741 @cindex scaled integers
742 @cindex K and M integer suffixes
743 @cindex M and K integer suffixes
744 @cindex suffixes for integers
745 @cindex integer suffixes
746 Additionally the suffixes @code{K} and @code{M} may be used to scale a
747 constant by
748 @c TEXI2ROFF-KILL
749 @ifinfo
750 @c END TEXI2ROFF-KILL
751 @code{1024} or @code{1024*1024}
752 @c TEXI2ROFF-KILL
753 @end ifinfo
754 @tex
755 ${\rm 1024}$ or ${\rm 1024}^2$
756 @end tex
757 @c END TEXI2ROFF-KILL
758 respectively. For example, the following all refer to the same quantity:
759
760 @example
761 _fourk_1 = 4K;
762 _fourk_2 = 4096;
763 _fourk_3 = 0x1000;
764 @end example
765
766 @node Symbols
767 @subsection Symbol Names
768 @cindex symbol names
769 @cindex names
770 @cindex quoted symbol names
771 @kindex "
772 Unless quoted, symbol names start with a letter, underscore, point or
773 hyphen and may include any letters, underscores, digits, points,
774 and minus signs. Unquoted symbol names must not conflict with any
775 keywords. You can specify a symbol which contains odd characters or has
776 the same name as a keyword, by surrounding the symbol name in double quotes:
777 @example
778 "SECTION" = 9;
779 "with a space" = "also with a space" + 10;
780 @end example
781
782 @node Location Counter
783 @subsection The Location Counter
784 @kindex .
785 @cindex dot
786 @cindex location counter
787 @cindex current output location
788 The special linker variable @dfn{dot} @samp{.} always contains the
789 current output location counter. Since the @code{.} always refers to
790 a location in an output section, it must always appear in an
791 expression within a @code{SECTIONS} command. The @code{.} symbol
792 may appear anywhere that an ordinary symbol is allowed in an
793 expression, but its assignments have a side effect. Assigning a value
794 to the @code{.} symbol will cause the location counter to be moved.
795 @cindex holes
796 This may be used to create holes in the output section. The location
797 counter may never be moved backwards.
798 @example
799 SECTIONS
800 @{
801 output :
802 @{
803 file1(.text)
804 . = . + 1000;
805 file2(.text)
806 . += 1000;
807 file3(.text)
808 @} = 0x1234;
809 @}
810 @end example
811 @noindent
812 In the previous example, @code{file1} is located at the beginning of the
813 output section, then there is a 1000 byte gap. Then @code{file2}
814 appears, also with a 1000 byte gap following before @code{file3} is
815 loaded. The notation @samp{= 0x1234} specifies what data to write in
816 the gaps (@pxref{Section Options}).
817
818 @node Operators
819 @subsection Operators
820 @cindex Operators for arithmetic
821 @cindex arithmetic operators
822 @cindex precedence in expressions
823 The linker recognizes the standard C set of arithmetic operators, with
824 the standard bindings and precedence levels:
825 @c TEXI2ROFF-KILL
826 @ifinfo
827 @c END TEXI2ROFF-KILL
828 @example
829 precedence associativity Operators Notes
830 (highest)
831 1 left ! - ~ (1)
832 2 left * / %
833 3 left + -
834 4 left >> <<
835 5 left == != > < <= >=
836 6 left &
837 7 left |
838 8 left &&
839 9 left ||
840 10 right ? :
841 11 right &= += -= *= /= (2)
842 (lowest)
843 @end example
844 Notes:
845 (1) Prefix operators
846 (2) @xref{Assignment}
847 @c TEXI2ROFF-KILL
848 @end ifinfo
849 @tex
850 \vskip \baselineskip
851 %"lispnarrowing" is the extra indent used generally for @example
852 \hskip\lispnarrowing\vbox{\offinterlineskip
853 \hrule
854 \halign
855 {\vrule#&\strut\hfil\ #\ \hfil&\vrule#&\strut\hfil\ #\ \hfil&\vrule#&\strut\hfil\ {\tt #}\ \hfil&\vrule#\cr
856 height2pt&\omit&&\omit&&\omit&\cr
857 &Precedence&& Associativity &&{\rm Operators}&\cr
858 height2pt&\omit&&\omit&&\omit&\cr
859 \noalign{\hrule}
860 height2pt&\omit&&\omit&&\omit&\cr
861 &highest&&&&&\cr
862 % '176 is tilde, '~' in tt font
863 &1&&left&&\qquad- \char'176\ !\qquad\dag&\cr
864 &2&&left&&* / \%&\cr
865 &3&&left&&+ -&\cr
866 &4&&left&&>> <<&\cr
867 &5&&left&&== != > < <= >=&\cr
868 &6&&left&&\&&\cr
869 &7&&left&&|&\cr
870 &8&&left&&{\&\&}&\cr
871 &9&&left&&||&\cr
872 &10&&right&&? :&\cr
873 &11&&right&&\qquad\&= += -= *= /=\qquad\ddag&\cr
874 &lowest&&&&&\cr
875 height2pt&\omit&&\omit&&\omit&\cr}
876 \hrule}
877 @end tex
878 @iftex
879 {
880 @obeylines@parskip=0pt@parindent=0pt
881 @dag@quad Prefix operators.
882 @ddag@quad @xref{Assignment}.
883 }
884 @end iftex
885 @c END TEXI2ROFF-KILL
886
887 @node Evaluation
888 @subsection Evaluation
889
890 @cindex lazy evaluation
891 @cindex expression evaluation order
892 The linker uses ``lazy evaluation'' for expressions; it only calculates
893 an expression when absolutely necessary. The linker needs the value of
894 the start address, and the lengths of memory regions, in order to do any
895 linking at all; these values are computed as soon as possible when the
896 linker reads in the command file. However, other values (such as symbol
897 values) are not known or needed until after storage allocation. Such
898 values are evaluated later, when other information (such as the sizes of
899 output sections) is available for use in the symbol assignment
900 expression.
901
902 @node Assignment
903 @subsection Assignment: Defining Symbols
904 @cindex assignment in scripts
905 @cindex symbol definition, scripts
906 @cindex variables, defining
907 You may create global symbols, and assign values (addresses) to global
908 symbols, using any of the C assignment operators:
909
910 @table @code
911 @item @var{symbol} = @var{expression} ;
912 @itemx @var{symbol} &= @var{expression} ;
913 @itemx @var{symbol} += @var{expression} ;
914 @itemx @var{symbol} -= @var{expression} ;
915 @itemx @var{symbol} *= @var{expression} ;
916 @itemx @var{symbol} /= @var{expression} ;
917 @end table
918
919 Two things distinguish assignment from other operators in @code{ld}
920 expressions.
921 @itemize @bullet
922 @item
923 Assignment may only be used at the root of an expression;
924 @samp{a=b+3;} is allowed, but @samp{a+b=3;} is an error.
925
926 @kindex ;
927 @cindex semicolon
928 @item
929 You must place a trailing semicolon (``@key{;}'') at the end of an
930 assignment statement.
931 @end itemize
932
933 Assignment statements may appear:
934 @itemize @bullet
935 @item
936 as commands in their own right in an @code{ld} script; or
937 @item
938 as independent statements within a @code{SECTIONS} command; or
939 @item
940 as part of the contents of a section definition in a
941 @code{SECTIONS} command.
942 @end itemize
943
944 The first two cases are equivalent in effect---both define a symbol with
945 an absolute address. The last case defines a symbol whose address is
946 relative to a particular section (@pxref{SECTIONS}).
947
948 @cindex absolute and relocatable symbols
949 @cindex relocatable and absolute symbols
950 @cindex symbols, relocatable and absolute
951 When a linker expression is evaluated and assigned to a variable, it is
952 given either an absolute or a relocatable type. An absolute expression
953 type is one in which the symbol contains the value that it will have in
954 the output file, a relocatable expression type is one in which the
955 value is expressed as a fixed offset from the base of a section.
956
957 The type of the expression is controlled by its position in the script
958 file. A symbol assigned within a section definition is created relative
959 to the base of the section; a symbol assigned in any other place is
960 created as an absolute symbol. Since a symbol created within a
961 section definition is relative to the base of the section, it
962 will remain relocatable if relocatable output is requested. A symbol
963 may be created with an absolute value even when assigned to within a
964 section definition by using the absolute assignment function
965 @code{ABSOLUTE}. For example, to create an absolute symbol whose address
966 is the last byte of an output section named @code{.data}:
967 @example
968 SECTIONS@{ @dots{}
969 .data :
970 @{
971 *(.data)
972 _edata = ABSOLUTE(.) ;
973 @}
974 @dots{} @}
975 @end example
976
977 The linker tries to put off the evaluation of an assignment until all
978 the terms in the source expression are known (@pxref{Evaluation}). For
979 instance, the sizes of sections cannot be known until after allocation,
980 so assignments dependent upon these are not performed until after
981 allocation. Some expressions, such as those depending upon the location
982 counter @dfn{dot}, @samp{.} must be evaluated during allocation. If the
983 result of an expression is required, but the value is not available,
984 then an error results. For example, a script like the following
985 @example
986 SECTIONS @{ @dots{}
987 text 9+this_isnt_constant :
988 @{ @dots{}
989 @}
990 @dots{} @}
991 @end example
992 @kindex Non constant expression
993 @noindent
994 will cause the error message ``@code{Non constant expression for initial
995 address}''.
996
997 @node Built-ins
998 @subsection Built-In Functions
999 @cindex functions in expression language
1000 The command language includes a number of built-in
1001 functions for use in link script expressions.
1002 @table @code
1003 @item ABSOLUTE(@var{exp})
1004 @kindex ABSOLUTE(@var{exp})
1005 @cindex expression, absolute
1006 Return the absolute (non-relocatable, as opposed to non-negative) value
1007 of the expression @var{exp}. Primarily useful to assign an absolute
1008 value to a symbol within a section definition, where symbol values are
1009 normally section-relative.
1010
1011 @item ADDR(@var{section})
1012 @kindex ADDR(@var{section})
1013 @cindex section address
1014 Return the absolute address of the named @var{section}. Your script must
1015 previously have defined the location of that section. In the following
1016 example, @code{symbol_1} and @code{symbol_2} are assigned identical
1017 values:
1018 @example
1019 SECTIONS@{ @dots{}
1020 .output1 :
1021 @{
1022 start_of_output_1 = ABSOLUTE(.);
1023 @dots{}
1024 @}
1025 .output :
1026 @{
1027 symbol_1 = ADDR(.output1);
1028 symbol_2 = start_of_output_1;
1029 @}
1030 @dots{} @}
1031 @end example
1032
1033 @item ALIGN(@var{exp})
1034 @kindex ALIGN(@var{exp})
1035 @cindex rounding up location counter
1036 Return the result of the current location counter (@code{.}) aligned to
1037 the next @var{exp} boundary. @var{exp} must be an expression whose
1038 value is a power of two. This is equivalent to
1039 @example
1040 (. + @var{exp} - 1) & ~(@var{exp} - 1)
1041 @end example
1042
1043 @code{ALIGN} doesn't change the value of the location counter---it just
1044 does arithmetic on it. As an example, to align the output @code{.data}
1045 section to the next @code{0x2000} byte boundary after the preceding
1046 section and to set a variable within the section to the next
1047 @code{0x8000} boundary after the input sections:
1048 @example
1049 SECTIONS@{ @dots{}
1050 .data ALIGN(0x2000): @{
1051 *(.data)
1052 variable = ALIGN(0x8000);
1053 @}
1054 @dots{} @}
1055 @end example
1056 @noindent
1057 The first use of @code{ALIGN} in this example specifies the location of
1058 a section because it is used as the optional @var{start} attribute of a
1059 section definition (@pxref{Section Options}). The second use simply
1060 defines the value of a variable.
1061
1062 The built-in @code{NEXT} is closely related to @code{ALIGN}.
1063
1064 @item DEFINED(@var{symbol})
1065 @kindex DEFINED(@var{symbol})
1066 @cindex symbol defaults
1067 Return 1 if @var{symbol} is in the linker global symbol table and is
1068 defined, otherwise return 0. You can use this function to provide default
1069 values for symbols. For example, the following command-file fragment shows how
1070 to set a global symbol @code{begin} to the first location in the
1071 @code{.text} section---but if a symbol called @code{begin} already
1072 existed, its value is preserved:
1073 @smallexample
1074 SECTIONS@{ @dots{}
1075 .text : @{
1076 begin = DEFINED(begin) ? begin : . ;
1077 @dots{}
1078 @}
1079 @dots{} @}
1080 @end smallexample
1081
1082 @item NEXT(@var{exp})
1083 @kindex NEXT(@var{exp})
1084 @cindex unallocated address, next
1085 Return the next unallocated address that is a multiple of @var{exp}.
1086 This function is closely related to @code{ALIGN(@var{exp})}; unless you
1087 use the @code{MEMORY} command to define discontinuous memory for the
1088 output file, the two functions are equivalent.
1089
1090 @item SIZEOF(@var{section})
1091 @kindex SIZEOF(@var{section})
1092 @cindex section size
1093 Return the size in bytes of the named @var{section}, if that section has
1094 been allocated. In the following example, @code{symbol_1} and
1095 @code{symbol_2} are assigned identical values:
1096 @c What does it return if the section hasn't been allocated? 0?
1097 @example
1098 SECTIONS@{ @dots{}
1099 .output @{
1100 .start = . ;
1101 @dots{}
1102 .end = . ;
1103 @}
1104 symbol_1 = .end - .start ;
1105 symbol_2 = SIZEOF(.output);
1106 @dots{} @}
1107
1108 @end example
1109
1110 @item SIZEOF_HEADERS
1111 @kindex SIZEOF_HEADERS
1112 @cindex header size
1113 @itemx sizeof_headers
1114 @kindex sizeof_headers
1115 Return the size in bytes of the output file's headers. You can use this number
1116 as the start address of the first section, if you choose, to facilitate
1117 paging.
1118
1119 @end table
1120
1121 @node MEMORY
1122 @section MEMORY Command
1123 @kindex MEMORY
1124 @cindex regions of memory
1125 @cindex discontinuous memory
1126 @cindex allocating memory
1127 The linker's default configuration permits allocation of all available memory.
1128 You can override this configuration by using the @code{MEMORY} command. The
1129 @code{MEMORY} command describes the location and size of blocks of
1130 memory in the target. By using it carefully, you can describe which
1131 memory regions may be used by the linker, and which memory regions it
1132 must avoid. The linker does not shuffle sections to fit into the
1133 available regions, but does move the requested sections into the correct
1134 regions and issue errors when the regions become too full.
1135
1136 The command files may contain at most one use of the @code{MEMORY}
1137 command; however, you can define as many blocks of memory within it as
1138 you wish. The syntax is:
1139
1140 @example
1141 MEMORY
1142 @{
1143 @var{name} (@var{attr}) : ORIGIN = @var{origin}, LENGTH = @var{len}
1144 @dots{}
1145 @}
1146 @end example
1147 @table @code
1148 @item @var{name}
1149 @cindex naming memory regions
1150 is a name used internally by the linker to refer to the region. Any
1151 symbol name may be used. The region names are stored in a separate
1152 name space, and will not conflict with symbols, file names or section
1153 names. Use distinct names to specify multiple regions.
1154 @item (@var{attr})
1155 @cindex memory region attributes
1156 is an optional list of attributes, permitted for compatibility with the
1157 AT&T linker but not used by @code{ld} beyond checking that the
1158 attribute list is valid. Valid attribute lists must be made up of the
1159 characters ``@code{LIRWX}''. If you omit the attribute list, you may
1160 omit the parentheses around it as well.
1161 @item @var{origin}
1162 @kindex ORIGIN =
1163 @kindex o =
1164 @kindex org =
1165 is the start address of the region in physical memory. It is
1166 an expression that must evaluate to a constant before
1167 memory allocation is performed. The keyword @code{ORIGIN} may be
1168 abbreviated to @code{org} or @code{o}.
1169 @item @var{len}
1170 @kindex LENGTH =
1171 @kindex len =
1172 @kindex l =
1173 is the size in bytes of the region (an expression).
1174 The keyword @code{LENGTH} may be abbreviated to @code{len} or @code{l}.
1175 @end table
1176
1177 For example, to specify that memory has two regions available for
1178 allocation---one starting at 0 for 256 kilobytes, and the other
1179 starting at @code{0x40000000} for four megabytes:
1180
1181 @example
1182 MEMORY
1183 @{
1184 rom : ORIGIN = 0, LENGTH = 256K
1185 ram : org = 0x40000000, l = 4M
1186 @}
1187 @end example
1188
1189 Once you have defined a region of memory named @var{mem}, you can direct
1190 specific output sections there by using a command ending in
1191 @samp{>@var{mem}} within the @code{SECTIONS} command (@pxref{Section
1192 Options}). If the combined output sections directed to a region are too
1193 big for the region, the linker will issue an error message.
1194
1195 @node SECTIONS
1196 @section SECTIONS Command
1197 @kindex SECTIONS
1198 The @code{SECTIONS} command controls exactly where input sections are
1199 placed into output sections, their order and to which output sections
1200 they are allocated.
1201
1202 You may use at most one @code{SECTIONS} command in a commands file,
1203 but you can have as many statements within it as you wish. Statements
1204 within the @code{SECTIONS} command can do one of three things:
1205 @itemize @bullet
1206 @item
1207 define the entry point;
1208 @item
1209 assign a value to a symbol;
1210 @item
1211 describe the placement of a named output section, and what input
1212 sections make it up.
1213 @end itemize
1214
1215 The first two possibilities---defining the entry point, and defining
1216 symbols---can also be done outside the @code{SECTIONS} command:
1217 @pxref{Entry Point}, @pxref{Assignment}. They are permitted here as
1218 well for your convenience in reading the script, so that symbols or the
1219 entry point can be defined at meaningful points in your output-file
1220 layout.
1221
1222 When no @code{SECTIONS} command is specified, the default action
1223 of the linker is to place each input section into an identically named
1224 output section in the order that the sections are first encountered in
1225 the input files; if all input sections are present in the first file,
1226 for example, the order of sections in the output file will match the
1227 order in the first input file.
1228
1229 @menu
1230 * Section Definition:: Section Definitions
1231 * Section Contents:: Section Contents
1232 * Section Options:: Optional Section Attributes
1233 @end menu
1234
1235 @node Section Definition
1236 @subsection Section Definitions
1237 @cindex section definition
1238 The most frequently used statement in the @code{SECTIONS} command is
1239 the @dfn{section definition}, which you can use to specify the
1240 properties of an output section: its location, alignment, contents,
1241 fill pattern, and target memory region. Most of
1242 these specifications are optional; the simplest form of a section
1243 definition is
1244 @example
1245 SECTIONS @{ @dots{}
1246 @var{secname} : @{
1247 @var{contents}
1248 @}
1249 @dots{} @}
1250 @end example
1251 @cindex naming output sections
1252 @noindent
1253 @var{secname} is the name of the output section, and @var{contents} a
1254 specification of what goes there---for example, a list of input files or
1255 sections of input files. As you might assume, the whitespace shown is
1256 optional. You do need the colon @samp{:} and the braces @samp{@{@}},
1257 however.
1258
1259 @var{secname} must meet the constraints of your output format. In
1260 formats which only support a limited number of sections, such as
1261 @code{a.out}, the name must be one of the names supported by the format
1262 (@code{a.out}, for example, allows only @code{.text}, @code{.data} or
1263 @code{.bss}). If the output format supports any number of sections, but
1264 with numbers and not names (as is the case for Oasys), the name should be
1265 supplied as a quoted numeric string. A section name may consist of any
1266 sequence characters, but any name which does not conform to the standard
1267 @code{ld} symbol name syntax must be quoted.
1268 @xref{Symbols, , Symbol Names}.
1269
1270 @node Section Contents
1271 @subsection Section Contents
1272 @cindex contents of a section
1273 In a section definition, you can specify the contents of an output section by
1274 listing particular object files, by listing particular input-file
1275 sections, or by a combination of the two. You can also place arbitrary
1276 data in the section, and define symbols relative to the beginning of the
1277 section.
1278
1279 The @var{contents} of a section definition may include any of the
1280 following kinds of statement. You can include as many of these as you
1281 like in a single section definition, separated from one another by
1282 whitespace.
1283
1284 @table @code
1285 @item @var{filename}
1286 @kindex @var{filename}
1287 @cindex input files, section defn
1288 @cindex files, including in output sections
1289 You may simply name a particular input file to be placed in the current
1290 output section; @emph{all} sections from that file are placed in the
1291 current section definition. To specify a list of particular files by
1292 name:
1293 @example
1294 .data : @{ afile.o bfile.o cfile.o @}
1295 @end example
1296 @noindent
1297 The example also illustrates that multiple statements can be included in
1298 the contents of a section definition, since each file name is a separate
1299 statement.
1300
1301 If the file name has already been mentioned in another section
1302 definition, with an explicit section name list, then only those sections
1303 which have not yet been allocated are used.
1304
1305 @item @var{filename}( @var{section} )
1306 @itemx @var{filename}( @var{section}, @var{section}, @dots{} )
1307 @itemx @var{filename}( @var{section} @var{section} @dots{} )
1308 @kindex @var{filename}(@var{section})
1309 @cindex files and sections, section defn
1310 You can name one or more sections from your input files, for
1311 insertion in the current output section. If you wish to specify a list
1312 of input-file sections inside the parentheses, you may separate the
1313 section names by either commas or whitespace.
1314
1315 @item * (@var{section})
1316 @itemx * (@var{section}, @var{section}, @dots{})
1317 @itemx * (@var{section} @var{section} @dots{}
1318 @cindex input sections to output section
1319 @kindex *(@var{section})
1320 Instead of explicitly naming particular input files in a link control
1321 script, you can refer to @emph{all} files from the @code{ld} command
1322 line: use @samp{*} instead of a particular file name before the
1323 parenthesized input-file section list.
1324
1325 For example, to copy sections @code{1} through @code{4} from an Oasys file
1326 into the @code{.text} section of an @code{a.out} file, and sections @code{13}
1327 and @code{14} into the @code{.data} section:
1328 @example
1329 SECTIONS @{
1330 .text :@{
1331 *("1" "2" "3" "4")
1332 @}
1333
1334 .data :@{
1335 *("13" "14")
1336 @}
1337 @}
1338 @end example
1339
1340 If you have already explicitly included some files by name, @samp{*}
1341 refers to all @emph{remaining} files---those whose places in the output
1342 file have not yet been defined.
1343
1344 @item [ @var{section} ]
1345 @itemx [ @var{section}, @var{section}, @dots{} ]
1346 @itemx [ @var{section} @var{section} @dots{} ]
1347 @kindex [ @var{sections} ]
1348 This is an alternate notation to specify named sections from all
1349 unallocated input files; its effect is exactly the same as that of
1350 @samp{* (@var{section}@dots{})}
1351
1352 @item @var{filename}@code{( COMMON )}
1353 @itemx ( COMMON )
1354 @kindex ( COMMON )
1355 @cindex uninitialized data
1356 @cindex commons in output
1357 Specify where in your output file to place uninitialized data
1358 with this notation. @code{*(COMMON)} by itself refers to all
1359 uninitialized data from all input files (so far as it is not yet
1360 allocated); @var{filename}@code{(COMMON)} refers to uninitialized data
1361 from a particular file. Both are special cases of the general
1362 mechanisms for specifying where to place input-file sections:
1363 @code{ld} permits you to refer to uninitialized data as if it
1364 were in an input-file section named @code{COMMON}, regardless of the
1365 input file's format.
1366 @end table
1367
1368 For example, the following command script arranges the output file into
1369 three consecutive sections, named @code{.text}, @code{.data}, and
1370 @code{.bss}, taking the input for each from the correspondingly named
1371 sections of all the input files:
1372 @example
1373 SECTIONS @{
1374 .text : @{ *(.text) @}
1375 .data : @{ *(.data) @}
1376 .bss : @{ *(.bss) *(COMMON) @}
1377 @}
1378 @end example
1379
1380 The following example reads all of the sections from file @code{all.o}
1381 and places them at the start of output section @code{outputa} which
1382 starts at location @code{0x10000}. All of section @code{.input1} from
1383 file @code{foo.o} follows immediately, in the same output section. All
1384 of section @code{.input2} from @code{foo.o} goes into output section
1385 @code{outputb}, followed by section @code{.input1} from @code{foo1.o}.
1386 All of the remaining @code{.input1} and @code{.input2} sections from any
1387 files are written to output section @code{outputc}.
1388
1389 @example
1390 SECTIONS @{
1391 outputa 0x10000 :
1392 @{
1393 all.o
1394 foo.o (.input1)
1395 @}
1396 outputb :
1397 @{
1398 foo.o (.input2)
1399 foo1.o (.input1)
1400 @}
1401 outputc :
1402 @{
1403 *(.input1)
1404 *(.input2)
1405 @}
1406 @}
1407 @end example
1408
1409 There are still more kinds of statements permitted in the contents of
1410 output section definitions. The foregoing statements permitted you to
1411 arrange, in your output file, data originating from your input files.
1412 You can also place data directly in an output section from the link
1413 command script. Most of these additional statements involve
1414 expressions; @pxref{Expressions}. Although these statements are shown
1415 separately here for ease of presentation, no such segregation is needed
1416 within a section definition in the @code{SECTIONS} command; you can
1417 intermix them freely with any of the statements we've just described.
1418
1419 @table @code
1420 @item CREATE_OBJECT_SYMBOLS
1421 @kindex CREATE_OBJECT_SYMBOLS
1422 @cindex input filename symbols
1423 @cindex filename symbols
1424 Create a symbol for each input file
1425 in the current section, set to the address of the first byte of
1426 data written from the input file. For instance, with @code{a.out}
1427 files it is conventional to have a symbol for each input file. You can
1428 accomplish this by defining the output @code{.text} section as follows:
1429 @example
1430 SECTIONS @{
1431 .text 0x2020 :
1432 @{
1433 CREATE_OBJECT_SYMBOLS
1434 *(.text)
1435 _etext = ALIGN(0x2000);
1436 @}
1437 @dots{}
1438 @}
1439 @end example
1440
1441 If @code{objsym} is a file containing this script, and @code{a.o},
1442 @code{b.o}, @code{c.o}, and @code{d.o} are four input files with
1443 contents like the following---
1444 @example
1445 /* a.c */
1446
1447 afunction() @{ @}
1448 int adata=1;
1449 int abss;
1450 @end example
1451
1452 @noindent
1453 @samp{ld -M sample a.o b.o c.o d.o} would create a map like this,
1454 containing symbols matching the object file names:
1455 @example
1456 00000000 A __DYNAMIC
1457 00004020 B _abss
1458 00004000 D _adata
1459 00002020 T _afunction
1460 00004024 B _bbss
1461 00004008 D _bdata
1462 00002038 T _bfunction
1463 00004028 B _cbss
1464 00004010 D _cdata
1465 00002050 T _cfunction
1466 0000402c B _dbss
1467 00004018 D _ddata
1468 00002068 T _dfunction
1469 00004020 D _edata
1470 00004030 B _end
1471 00004000 T _etext
1472 00002020 t a.o
1473 00002038 t b.o
1474 00002050 t c.o
1475 00002068 t d.o
1476 @end example
1477
1478 @item @var{symbol} = @var{expression} ;
1479 @kindex @var{symbol} = @var{expression} ;
1480 @itemx @var{symbol} @var{f}= @var{expression} ;
1481 @kindex @var{symbol} @var{f}= @var{expression} ;
1482 @var{symbol} is any symbol name (@pxref{Symbols}). ``@var{f}=''
1483 refers to any of the operators @code{&= += -= *= /=} which combine
1484 arithmetic and assignment.
1485
1486 @cindex assignment, in section defn
1487 When you assign a value to a symbol within a particular section
1488 definition, the value is relative to the beginning of the section
1489 (@pxref{Assignment}). If you write
1490 @example
1491 SECTIONS @{
1492 abs = 14 ;
1493 @dots{}
1494 .data : @{ @dots{} rel = 14 ; @dots{} @}
1495 abs2 = 14 + ADDR(.data);
1496 @dots{}
1497 @}
1498 @end example
1499 @c FIXME: Try above example!
1500 @noindent
1501 @code{abs} and @code{rel} do not have the same value; @code{rel} has the
1502 same value as @code{abs2}.
1503
1504 @item BYTE(@var{expression})
1505 @kindex BYTE(@var{expression})
1506 @itemx SHORT(@var{expression})
1507 @kindex SHORT(@var{expression})
1508 @itemx LONG(@var{expression})
1509 @kindex LONG(@var{expression})
1510 @cindex direct output
1511 By including one of these three statements in a section definition, you
1512 can explicitly place one, two, or four bytes (respectively) at the
1513 current address of that section.
1514
1515 @ifclear SingleFormat
1516 Multiple-byte quantities are represented in whatever byte order is
1517 appropriate for the output file format (@pxref{BFD}).
1518 @end ifclear
1519
1520 @item FILL(@var{expression})
1521 @kindex FILL(@var{expression})
1522 @cindex holes, filling
1523 @cindex unspecified memory
1524 Specifies the ``fill pattern'' for the current section. Any otherwise
1525 unspecified regions of memory within the section (for example, regions
1526 you skip over by assigning a new value to the location counter @samp{.})
1527 are filled with the two least significant bytes from the
1528 @var{expression} argument. A @code{FILL} statement covers memory
1529 locations @emph{after} the point it occurs in the section definition; by
1530 including more than one @code{FILL} statement, you can have different
1531 fill patterns in different parts of an output section.
1532 @end table
1533
1534 @node Section Options
1535 @subsection Optional Section Attributes
1536 @cindex section defn, full syntax
1537 Here is the full syntax of a section definition, including all the
1538 optional portions:
1539
1540 @smallexample
1541 SECTIONS @{
1542 @dots{}
1543 @var{secname} @var{start} BLOCK(@var{align}) (NOLOAD) : @{ @var{contents} @} =@var{fill} >@var{region}
1544 @dots{}
1545 @}
1546 @end smallexample
1547
1548 @var{secname} and @var{contents} are required. @xref{Section
1549 Definition}, and @pxref{Section Contents} for details on @var{contents}.
1550 The remaining elements---@var{start}, @code{BLOCK(@var{align)}},
1551 @code{(NOLOAD)} @code{=@var{fill}}, and @code{>@var{region}}---are all
1552 optional.
1553
1554 @table @code
1555 @item @var{start}
1556 @cindex start address, section
1557 @cindex section start
1558 @cindex section address
1559 You can force the output section to be loaded at a specified address by
1560 specifying @var{start} immediately following the section name.
1561 @var{start} can be represented as any expression. The following
1562 example generates section @var{output} at location
1563 @code{0x40000000}:
1564 @example
1565 SECTIONS @{
1566 @dots{}
1567 output 0x40000000: @{
1568 @dots{}
1569 @}
1570 @dots{}
1571 @}
1572 @end example
1573
1574 @item BLOCK(@var{align})
1575 @kindex BLOCK(@var{align})
1576 @cindex section alignment
1577 @cindex aligning sections
1578 You can include @code{BLOCK()} specification to advance
1579 the location counter @code{.} prior to the beginning of the section, so
1580 that the section will begin at the specified alignment. @var{align} is
1581 an expression.
1582
1583 @item (NOLOAD)
1584 @kindex NOLOAD
1585 @cindex prevent unnecessary loading
1586 Use @samp{(NOLOAD)} to prevent a section from being loaded into memory
1587 each time it is accessed. For example, in the script sample below, the
1588 @code{ROM} segment is addressed at memory location @samp{0} and does not
1589 need to be loaded into each object file:
1590 @example
1591 SECTIONS @{
1592 ROM 0 (NOLOAD) : @{ @dots{} @}
1593 @dots{}
1594 @}
1595 @end example
1596
1597 @item =@var{fill}
1598 @kindex =@var{fill}
1599 @cindex section fill pattern
1600 @cindex fill pattern, entire section
1601 Including
1602 @code{=@var{fill}} in a section definition specifies the initial fill
1603 value for that section.
1604 You may use any expression to specify @var{fill}.
1605 Any unallocated holes in the current output
1606 section when written to the output file will be filled with the two
1607 least significant bytes of the value, repeated as necessary. You can
1608 also change the fill value with a @code{FILL} statement in the
1609 @var{contents} of a section definition.
1610
1611 @item >@var{region}
1612 @kindex >@var{region}
1613 @cindex section, assigning to memory region
1614 @cindex memory regions and sections
1615 Assign this section to a previously defined region of memory.
1616 @xref{MEMORY}.
1617
1618 @end table
1619
1620 @node Entry Point
1621 @section The Entry Point
1622 @kindex ENTRY(@var{symbol})
1623 @cindex start of execution
1624 @cindex first instruction
1625 The linker command language includes a command specifically for
1626 defining the first executable instruction in an output file (its
1627 @dfn{entry point}). Its argument is a symbol name:
1628 @example
1629 ENTRY(@var{symbol})
1630 @end example
1631
1632 Like symbol assignments, the @code{ENTRY} command may be placed either
1633 as an independent command in the command file, or among the section
1634 definitions within the @code{SECTIONS} command---whatever makes the most
1635 sense for your layout.
1636
1637 @cindex entry point, defaults
1638 @code{ENTRY} is only one of several ways of choosing the entry point.
1639 You may indicate it in any of the following ways (shown in descending
1640 order of priority: methods higher in the list override methods lower down).
1641 @itemize @bullet
1642 @item
1643 the @samp{-e} @var{entry} command-line option;
1644 @item
1645 the @code{ENTRY(@var{symbol}} command in a linker control script;
1646 @item
1647 the value of the symbol @code{start}, if present;
1648 @item
1649 the value of the symbol @code{_main}, if present;
1650 @item
1651 the address of the first byte of the @code{.text} section, if present;
1652 @item
1653 The address @code{0}.
1654 @end itemize
1655
1656 For example, you can use these rules to generate an entry point with an
1657 assignment statement: if no symbol @code{start} is defined within your
1658 input files, you can simply define it, assigning it an appropriate
1659 value---
1660 @example
1661 start = 0x2020;
1662 @end example
1663
1664 @noindent
1665 The example shows an absolute address, but you can use any expression.
1666 For example, if your input object files use some other symbol-name
1667 convention for the entry point, you can just assign the value of
1668 whatever symbol contains the start address to @code{start}:
1669 @example
1670 start = other_symbol ;
1671 @end example
1672
1673 @node Other Commands
1674 @section Other Commands
1675 The command language includes a number of other commands that you can
1676 use for specialized purposes. They are similar in purpose to
1677 command-line options.
1678
1679 @table @code
1680 @item FLOAT
1681 @kindex FLOAT
1682 @itemx NOFLOAT
1683 @kindex NOFLOAT
1684 These keywords were used in some older linkers to request a particular
1685 math subroutine library. @code{ld} doesn't use the keywords, assuming
1686 instead that any necessary subroutines are in libraries specified using
1687 the general mechanisms for linking to archives; but to permit the use of
1688 scripts that were written for the older linkers, the keywords
1689 @code{FLOAT} and @code{NOFLOAT} are accepted and ignored.
1690
1691 @item FORCE_COMMON_ALLOCATION
1692 @kindex FORCE_COMMON_ALLOCATION
1693 @cindex common allocation
1694 This command has the same effect as the @samp{-d} command-line option:
1695 to make @code{ld} assign space to common symbols even if a relocatable
1696 output file is specified (@samp{-r}).
1697
1698 @item INPUT ( @var{file}, @var{file}, @dots{} )
1699 @kindex INPUT ( @var{files} )
1700 @itemx INPUT ( @var{file} @var{file} @dots{} )
1701 @cindex binary input files
1702 Use this command to include binary input files in the link, without
1703 including them in a particular section definition. Files specified this
1704 way are treated identically to object files listed on the command line.
1705
1706 @ignore
1707 @item MAP ( @var{name} )
1708 @kindex MAP ( @var{name} )
1709 @c MAP(...) appears to look for an F in the arg, ignoring all other
1710 @c chars; if it finds one, it sets "map_option_f" to true. But nothing
1711 @c checks map_option_f. Apparently a stub for the future...
1712 @end ignore
1713
1714 @item OUTPUT ( @var{filename} )
1715 @kindex OUTPUT ( @var{filename} )
1716 @cindex naming the output file
1717 Use this command to name the link output file @var{filename}. The
1718 effect of @code{OUTPUT(@var{filename})} is identical to the effect of
1719 @w{@samp{-o @var{filename}}}, and whichever is encountered last will
1720 control the name actually used to name the output file. In particular,
1721 you can use this command to supply a default output-file name other than
1722 @code{a.out}.
1723
1724 @ifclear SingleFormat
1725 @item OUTPUT_ARCH ( @var{bfdname} )
1726 @kindex OUTPUT_ARCH ( @var{bfdname} )
1727 @cindex machine architecture, output
1728 Specify a particular output machine architecture, with one of the names
1729 used by the BFD back-end routines (@pxref{BFD}). This command is often
1730 unnecessary; the architecture is most often set implicitly by either the
1731 system BFD configuration or as a side effect of the @code{OUTPUT_FORMAT}
1732 command.
1733
1734 @item OUTPUT_FORMAT ( @var{bfdname} )
1735 @kindex OUTPUT_FORMAT ( @var{bfdname} )
1736 @cindex format, output file
1737 Specify a particular output format, with one of the names used by the
1738 BFD back-end routines (@pxref{BFD}). This selection will only affect
1739 the output file; the related command @code{TARGET} affects primarily
1740 input files.
1741 @end ifclear
1742
1743 @item SEARCH_DIR ( @var{path} )
1744 @kindex SEARCH_DIR ( @var{path} )
1745 @cindex path for libraries
1746 @cindex search path, libraries
1747 Add @var{path} to the list of paths where @code{ld} looks for
1748 archive libraries. @code{SEARCH_DIR(@var{path})} has the same
1749 effect as @samp{-L@var{path}} on the command line.
1750
1751 @item STARTUP ( @var{filename} )
1752 @kindex STARTUP ( @var{filename} )
1753 @cindex first input file
1754 Ensure that @var{filename} is the first input file used in the link
1755 process.
1756
1757 @ifclear SingleFormat
1758 @item TARGET ( @var{format} )
1759 @cindex input file format
1760 @kindex TARGET ( @var{format} )
1761 Change the input-file object code format (like the command-line option
1762 @samp{-b} or its synonym @samp{-format}). The argument @var{format} is
1763 one of the strings used by BFD to name binary formats. In the current
1764 @code{ld} implementation, if @code{TARGET} is specified but
1765 @code{OUTPUT_FORMAT} is not, the last @code{TARGET} argument is also
1766 used as the default format for the @code{ld} output file.
1767 @xref{BFD}.
1768
1769 @kindex GNUTARGET
1770 If you don't use the @code{TARGET} command, @code{ld} uses the value of
1771 the environment variable @code{GNUTARGET}, if available, to select the
1772 output file format. If that variable is also absent, @code{ld} uses
1773 the default format configured for your machine in the BFD libraries.
1774 @end ifclear
1775 @end table
1776
1777 @ifset GENERIC
1778 @node Machine Dependent
1779 @chapter Machine Dependent Features
1780
1781 @cindex machine dependencies
1782 @code{ld} has additional features on some platforms; the following
1783 sections describe them. Machines where @code{ld} has no additional
1784 functionality are not listed.
1785
1786 @menu
1787 * H8/300:: @code{ld} and the H8/300
1788 * i960:: @code{ld} and the Intel 960 family
1789 @end menu
1790 @end ifset
1791
1792 @c FIXME! This could use @up/@down, but there seems to be a conflict
1793 @c between those and node-defaulting.
1794 @ifset H8300
1795 @ifclear GENERIC
1796 @up
1797 @end ifclear
1798 @node H8/300
1799 @section @code{ld} and the H8/300
1800
1801 @cindex H8/300 support
1802 For the H8/300, @code{ld} can perform these global optimizations when
1803 you specify the @samp{-relax} command-line option.
1804
1805 @table @emph
1806 @item relaxing address modes
1807 @cindex relaxing on H8/300
1808 @code{ld} finds all @code{jsr} and @code{jmp} instructions whose
1809 targets are within eight bits, and turns them into eight-bit
1810 program-counter relative @code{bsr} and @code{bra} instructions,
1811 respectively.
1812
1813 @item synthesizing instructions
1814 @cindex synthesizing on H8/300
1815 @c FIXME: specifically mov.b, or any mov instructions really?
1816 @code{ld} finds all @code{mov.b} instructions which use the
1817 sixteen-bit absolute address form, but refer to the top
1818 page of memory, and changes them to use the eight-bit address form.
1819 (That is: the linker turns @samp{mov.b @code{@@}@var{aa}:16} into
1820 @samp{mov.b @code{@@}@var{aa}:8} whenever the address @var{aa} is in the
1821 top page of memory).
1822 @end table
1823 @ifclear GENERIC
1824 @down
1825 @end ifclear
1826 @end ifset
1827
1828 @ifset I960
1829 @ifclear GENERIC
1830 @up
1831 @end ifclear
1832 @node i960
1833 @section @code{ld} and the Intel 960 family
1834
1835 @cindex i960 support
1836
1837 You can use the @samp{-A@var{architecture}} command line option to
1838 specify one of the two-letter names identifying members of the 960
1839 family; the option specifies the desired output target, and warns of any
1840 incompatible instructions in the input files. It also modifies the
1841 linker's search strategy for archive libraries, to support the use of
1842 libraries specific to each particular architecture, by including in the
1843 search loop names suffixed with the string identifying the architecture.
1844
1845 For example, if your @code{ld} command line included @w{@samp{-ACA}} as
1846 well as @w{@samp{-ltry}}, the linker would look (in its built-in search
1847 paths, and in any paths you specify with @samp{-L}) for a library with
1848 the names
1849
1850 @example
1851 try
1852 libtry.a
1853 tryca
1854 libtryca.a
1855 @end example
1856
1857 @noindent
1858 The first two possibilities would be considered in any event; the last
1859 two are due to the use of @w{@samp{-ACA}}.
1860
1861 You can meaningfully use @samp{-A} more than once on a command line, since
1862 the 960 architecture family allows combination of target architectures; each
1863 use will add another pair of name variants to search for when @w{@samp{-l}}
1864 specifies a library.
1865 @ifclear GENERIC
1866 @down
1867 @end ifclear
1868 @end ifset
1869
1870 @ifclear SingleFormat
1871 @node BFD
1872 @chapter BFD
1873
1874 @cindex back end
1875 @cindex object file management
1876 The linker accesses object and archive files using the BFD libraries.
1877 These libraries allow the linker to use the same routines to operate on
1878 object files whatever the object file format. A different object file
1879 format can be supported simply by creating a new BFD back end and adding
1880 it to the library. You can use @code{objdump -i}
1881 (@pxref{objdump,,objdump,binutils.info,The GNU Binary Utilities}) to
1882 list all the formats available for each architecture under BFD. This
1883 was the list of formats, and of architectures supported for each format,
1884 as of the time this manual was prepared:
1885 @cindex formats available
1886 @cindex architectures available
1887 @example
1888 BFD header file version 0.18
1889 a.out-i386
1890 (header big endian, data big endian)
1891 m68k:68020
1892 a29k
1893 sparc
1894 i386
1895 a.out-sunos-big
1896 (header big endian, data big endian)
1897 m68k:68020
1898 a29k
1899 sparc
1900 i386
1901 b.out.big
1902 (header big endian, data little endian)
1903 i960:core
1904 b.out.little
1905 (header little endian, data little endian)
1906 i960:core
1907 coff-a29k-big
1908 (header big endian, data big endian)
1909 a29k
1910 coff-h8300
1911 (header big endian, data big endian)
1912 H8/300
1913 coff-i386
1914 (header little endian, data little endian)
1915 i386
1916 coff-Intel-big
1917 (header big endian, data little endian)
1918 i960:core
1919 coff-Intel-little
1920 (header little endian, data little endian)
1921 i960:core
1922 coff-m68k
1923 (header big endian, data big endian)
1924 m68k:68020
1925 coff-m88kbcs
1926 (header big endian, data big endian)
1927 m88k:88100
1928 ecoff-bigmips
1929 (header big endian, data big endian)
1930 mips
1931 ecoff-littlemips
1932 (header little endian, data little endian)
1933 mips
1934 elf-big
1935 (header big endian, data big endian)
1936 m68k:68020
1937 vax
1938 i960:core
1939 a29k
1940 sparc
1941 mips
1942 i386
1943 m88k:88100
1944 H8/300
1945 rs6000:6000
1946 elf-little
1947 (header little endian, data little endian)
1948 m68k:68020
1949 vax
1950 i960:core
1951 a29k
1952 sparc
1953 mips
1954 i386
1955 m88k:88100
1956 H8/300
1957 rs6000:6000
1958 ieee
1959 (header big endian, data big endian)
1960 m68k:68020
1961 vax
1962 i960:core
1963 a29k
1964 sparc
1965 mips
1966 i386
1967 m88k:88100
1968 H8/300
1969 rs6000:6000
1970 srec
1971 (header big endian, data big endian)
1972 m68k:68020
1973 vax
1974 i960:core
1975 a29k
1976 sparc
1977 mips
1978 i386
1979 m88k:88100
1980 H8/300
1981 rs6000:6000
1982 @end example
1983
1984 @cindex BFD requirements
1985 @cindex requirements for BFD
1986 As with most implementations, BFD is a compromise between
1987 several conflicting requirements. The major factor influencing
1988 BFD design was efficiency: any time used converting between
1989 formats is time which would not have been spent had BFD not
1990 been involved. This is partly offset by abstraction payback; since
1991 BFD simplifies applications and back ends, more time and care
1992 may be spent optimizing algorithms for a greater speed.
1993
1994 One minor artifact of the BFD solution which you should bear in
1995 mind is the potential for information loss. There are two places where
1996 useful information can be lost using the BFD mechanism: during
1997 conversion and during output. @xref{BFD information loss}.
1998
1999 @menu
2000 * BFD outline:: How it works: an outline of BFD
2001 * BFD information loss:: Information Loss
2002 * Mechanism:: Mechanism
2003 @end menu
2004
2005 @node BFD outline
2006 @section How it works: an outline of BFD
2007 @cindex opening object files
2008 When an object file is opened, BFD subroutines automatically
2009 determine the format of the input object file, and build a descriptor in
2010 memory with pointers to routines that will be used to access elements of
2011 the object file's data structures.
2012
2013 As different information from the the object files is required,
2014 BFD reads from different sections of the file and processes them.
2015 For example, a very common operation for the linker is processing symbol
2016 tables. Each BFD back end provides a routine for converting
2017 between the object file's representation of symbols and an internal
2018 canonical format. When the linker asks for the symbol table of an object
2019 file, it calls through the memory pointer to the BFD
2020 back end routine which reads and converts the table into a canonical
2021 form. The linker then operates upon the common form. When the link is
2022 finished and the linker writes the symbol table of the output file,
2023 another BFD back end routine is called which takes the newly
2024 created symbol table and converts it into the chosen output format.
2025
2026 @node BFD information loss
2027 @section Information Loss
2028 @emph{Information can be lost during output.} The output formats
2029 supported by BFD do not provide identical facilities, and
2030 information which may be described in one form has nowhere to go in
2031 another format. One example of this is alignment information in
2032 @code{b.out}. There is nowhere in an @code{a.out} format file to store
2033 alignment information on the contained data, so when a file is linked
2034 from @code{b.out} and an @code{a.out} image is produced, alignment
2035 information will not propagate to the output file. (The linker will
2036 still use the alignment information internally, so the link is performed
2037 correctly).
2038
2039 Another example is COFF section names. COFF files may contain an
2040 unlimited number of sections, each one with a textual section name. If
2041 the target of the link is a format which does not have many sections (e.g.,
2042 @code{a.out}) or has sections without names (e.g., the Oasys format) the
2043 link cannot be done simply. You can circumvent this problem by
2044 describing the desired input-to-output section mapping with the command
2045 language.
2046
2047 @emph{Information can be lost during canonicalization.} The BFD
2048 internal canonical form of the external formats is not exhaustive; there
2049 are structures in input formats for which there is no direct
2050 representation internally. This means that the BFD back ends
2051 cannot maintain all possible data richness through the transformation
2052 between external to internal and back to external formats.
2053
2054 This limitation is only a problem when using the linker to read one
2055 format and write another. Each BFD back end is responsible for
2056 maintaining as much data as possible, and the internal BFD
2057 canonical form has structures which are opaque to the BFD core,
2058 and exported only to the back ends. When a file is read in one format,
2059 the canonical form is generated for BFD and the linker. At the
2060 same time, the back end saves away any information which would otherwise
2061 be lost. If the data is then written back in the same format, the back
2062 end routine will be able to use the canonical form provided by the
2063 BFD core as well as the information it prepared earlier. Since
2064 there is a great deal of commonality between back ends,
2065 there is no information lost when
2066 linking big endian COFF to little endian COFF, or from @code{a.out} to
2067 @code{b.out}. When a mixture of formats is linked, the information is
2068 only lost from the files whose format differs from the destination.
2069
2070 @node Mechanism
2071 @section Mechanism
2072 The greatest potential for loss of information occurs when there is the least
2073 overlap between the information provided by the source format, that
2074 stored by the canonical format, and that needed by the
2075 destination format. A brief description of the canonical form may help
2076 you understand which kinds of data you can count on preserving across
2077 conversions.
2078 @cindex BFD canonical format
2079 @cindex internal object-file format
2080
2081 @table @emph
2082 @item files
2083 Information on target machine architecture, particular implementation,
2084 and format type are stored on a per-file basis. Other information
2085 includes a demand pagable bit and a write protected bit.
2086 Information like Unix magic numbers is not stored here---only the magic
2087 numbers' meaning, so a @code{ZMAGIC} file would have both the demand pagable
2088 bit and the write protected text bit set.
2089
2090 The byte order of the target is stored on a per-file basis, so that big-
2091 and little-endian object files may be linked with one another.
2092
2093 @item sections
2094 Each section in the input file contains the name of the section, the
2095 original address in the object file, various options, size and alignment
2096 information and pointers into other BFD data structures.
2097
2098 @item symbols
2099 Each symbol contains a pointer to the object file which originally
2100 defined it, its name, its value, and various option bits. When a
2101 BFD back end reads in a symbol table, the back end relocates all
2102 symbols to make them relative to the base of the section where they were
2103 defined. Doing this ensures that each symbol points to its containing
2104 section. Each symbol also has a varying amount of hidden
2105 private data for the BFD back end. Since the symbol points to the
2106 original file, the private data format for that symbol is accessible.
2107 @code{ld} can operate on a collection of symbols of wildly different
2108 formats without problems.
2109
2110 Normal global and simple local symbols are maintained on output, so an
2111 output file (no matter its format) will retain symbols pointing to
2112 functions and to global, static, and common variables. Some symbol
2113 information is not worth retaining; in @code{a.out}, type information is
2114 stored in the symbol table as long symbol names. This information would
2115 be useless to most COFF debuggers and may be thrown away with
2116 appropriate command line switches. (The GNU debugger @code{gdb} does
2117 support @code{a.out} style debugging information in COFF).
2118
2119 There is one word of type information within the symbol, so if the
2120 format supports symbol type information within symbols (for example, COFF,
2121 IEEE, Oasys) and the type is simple enough to fit within one word
2122 (nearly everything but aggregates), the information will be preserved.
2123
2124 @item relocation level
2125 Each canonical BFD relocation record contains a pointer to the symbol to
2126 relocate to, the offset of the data to relocate, the section the data
2127 is in, and a pointer to a relocation type descriptor. Relocation is
2128 performed by passing messages through the relocation type
2129 descriptor and the symbol pointer. Therefore, relocations can be performed
2130 on output data using a relocation method that is only available in one of the
2131 input formats. For instance, Oasys provides a byte relocation format.
2132 A relocation record requesting this relocation type would point
2133 indirectly to a routine to perform this, so the relocation may be
2134 performed on a byte being written to a COFF file, even though 68k COFF
2135 has no such relocation type.
2136 @c FIXME why specific reference to 68K above?
2137
2138 @item line numbers
2139 Object formats can contain, for debugging purposes, some form of mapping
2140 between symbols, source line numbers, and addresses in the output file.
2141 These addresses have to be relocated along with the symbol information.
2142 Each symbol with an associated list of line number records points to the
2143 first record of the list. The head of a line number list consists of a
2144 pointer to the symbol, which allows finding out the address of the
2145 function whose line number is being described. The rest of the list is
2146 made up of pairs: offsets into the section and line numbers. Any format
2147 which can simply derive this information can pass it successfully
2148 between formats (COFF, IEEE and Oasys).
2149 @end table
2150 @end ifclear
2151
2152 @node MRI
2153 @appendix MRI Compatible Script Files
2154 @cindex MRI compatibility
2155 To aid users making the transition to @sc{gnu} @code{ld} from the MRI
2156 linker, @code{ld} can use MRI compatible linker scripts as an
2157 alternative to the more general-purpose linker scripting language
2158 described in @ref{Commands,,Command Language}. MRI compatible linker
2159 scripts have a much simpler command set than the scripting language
2160 otherwise used with @code{ld}. @sc{gnu} @code{ld} supports the most
2161 commonly used MRI linker commands; these commands are described here.
2162
2163 You can specify a file containing an MRI-compatible script using the
2164 @samp{-c} command-line option.
2165
2166 Each command in an MRI-compatible script occupies its own line; each
2167 command line starts with the keyword that identifies the command (though
2168 blank lines are also allowed for punctuation). If a line of an
2169 MRI-compatible script begins with an unrecognized keyword, @code{ld}
2170 issues a warning message, but continues processing the script.
2171
2172 Lines beginning with @samp{*} are comments.
2173
2174 You can write these commands using all upper-case letters, or all
2175 lower case; for example, @samp{chip} is the same as @samp{CHIP}.
2176 The following list shows only the upper-case form of each command.
2177
2178 @table @code
2179 @item ABSOLUTE @var{secname}
2180 @item ABSOLUTE @var{secname}, @var{secname}, @dots{} @var{secname}
2181 @cindex @code{ABSOLUTE} (MRI)
2182 Normally, @code{ld} includes in the output file all sections from all
2183 the input files. However, in an MRI-compatible script, you can use the
2184 @code{ABSOLUTE} command to restrict the sections that will be present in
2185 your output program. If the @code{ABSOLUTE} command is used at all in a
2186 script, then only the sections named explicitly in @code{ABSOLUTE}
2187 commands will appear in the linker output. You can still use other
2188 input sections (whatever you select on the command line, or using
2189 @code{LOAD}) to resolve addresses in the output file.
2190
2191 @item ALIAS @var{out-secname}, @var{in-secname}
2192 @cindex @code{ALIAS} (MRI)
2193 Use this command to place the data from input section @var{in-secname}
2194 in a section called @var{out-secname} in the linker output file.
2195
2196 @var{in-secname} may be an integer.
2197
2198 @item BASE @var{expression}
2199 @cindex @code{BASE} (MRI)
2200 Use the value of @var{expression} as the lowest address (other than
2201 absolute addresses) in the output file.
2202
2203 @item CHIP @var{expression}
2204 @itemx CHIP @var{expression}, @var{expression}
2205 @cindex @code{CHIP} (MRI)
2206 This command does nothing; it is accepted only for compatibility.
2207
2208 @item END
2209 @cindex @code{END} (MRI)
2210 This command does nothing whatever; it's only accepted for compatibility.
2211
2212 @item FORMAT @var{output-format}
2213 @cindex @code{FORMAT} (MRI)
2214 Similar to the @code{OUTPUT_FORMAT} command in the more general linker
2215 language, but restricted to one of these output formats:
2216 @enumerate
2217 @item
2218 S-records, if @var{output-format} is @samp{S}
2219
2220 @item
2221 IEEE, if @var{output-format} is @samp{IEEE}
2222
2223 @item
2224 COFF (the @samp{coff-m68k} variant in BFD), if @var{output-format} is
2225 @samp{COFF}
2226 @end enumerate
2227
2228 @item LIST @var{anything}@dots{}
2229 @cindex @code{LIST} (MRI)
2230 Print (to the standard output file) a link map, as produced by the
2231 @code{ld} command-line option @samp{-M}.
2232
2233 The keyword @code{LIST} may be followed by anything on the
2234 same line, with no change in its effect.
2235
2236 @item LOAD @var{filename}
2237 @item LOAD @var{filename}, @var{filename}, @dots{} @var{filename}
2238 @cindex @code{LOAD} (MRI)
2239 Include one or more object file @var{filename} in the link; this has the
2240 same effect as specifying @var{filename} directly on the @code{ld}
2241 command line.
2242
2243 @item NAME @var{output-name}
2244 @cindex @code{NAME} (MRI)
2245 @var{output-name} is the name for the program produced by @code{ld}; the
2246 MRI-compatible command @code{NAME} is equivalent to the command-line
2247 option @samp{-o} or the general script language command @code{OUTPUT}.
2248
2249 @item ORDER @var{secname}, @var{secname}, @dots{} @var{secname}
2250 @itemx ORDER @var{secname} @var{secname} @var{secname}
2251 @cindex @code{ORDER} (MRI)
2252 Normally, @code{ld} orders the sections in its output file in the
2253 order in which they first appear in the input files. In an MRI-compatible
2254 script, you can override this ordering with the @code{ORDER} command. The
2255 sections you list with @code{ORDER} will appear first in your output
2256 file, in the order specified.
2257
2258 @item PUBLIC @var{name}=@var{expression}
2259 @itemx PUBLIC @var{name},@var{expression}
2260 @itemx PUBLIC @var{name} @var{expression}
2261 @cindex @code{PUBLIC} (MRI)
2262 Supply a value (@var{expression}) for external symbol
2263 @var{name} used in the linker input files.
2264
2265 @item SECT @var{secname}, @var{expression}
2266 @itemx SECT @var{secname}=@var{expression}
2267 @itemx SECT @var{secname} @var{expression}
2268 @cindex @code{SECT} (MRI)
2269 You can use any of these three forms of the @code{SECT} command to
2270 specify the start address (@var{expression}) for section @var{secname}.
2271 If you have more than one @code{SECT} statement for the same
2272 @var{secname}, only the @emph{first} sets the start address.
2273 @end table
2274
2275
2276 @node Index
2277 @unnumbered Index
2278
2279 @printindex cp
2280
2281 @tex
2282 % I think something like @colophon should be in texinfo. In the
2283 % meantime:
2284 \long\def\colophon{\hbox to0pt{}\vfill
2285 \centerline{The body of this manual is set in}
2286 \centerline{\fontname\tenrm,}
2287 \centerline{with headings in {\bf\fontname\tenbf}}
2288 \centerline{and examples in {\tt\fontname\tentt}.}
2289 \centerline{{\it\fontname\tenit\/} and}
2290 \centerline{{\sl\fontname\tensl\/}}
2291 \centerline{are used for emphasis.}\vfill}
2292 \page\colophon
2293 % Blame: pesch@cygnus.com, 28mar91.
2294 @end tex
2295
2296
2297 @contents
2298 @bye
2299
2300
This page took 0.08879 seconds and 5 git commands to generate.