* breakpoint.c, breakpoint.h (breakpoint_init_inferior): New function
[deliverable/binutils-gdb.git] / gdb / doc / stabs.texinfo
CommitLineData
e505224d
PB
1\input texinfo
2@setfilename stabs.info
3
6fe91f2c 4@c @finalout
a9ded3ac 5
e505224d
PB
6@ifinfo
7@format
8START-INFO-DIR-ENTRY
8a6d5d4f 9* Stabs:: The "stabs" debugging information format.
e505224d
PB
10END-INFO-DIR-ENTRY
11@end format
12@end ifinfo
13
14@ifinfo
8c59ee11 15This document describes the stabs debugging symbol tables.
e505224d 16
6fe91f2c 17Copyright 1992, 1993 Free Software Foundation, Inc.
ee5e0932
JK
18Contributed by Cygnus Support. Written by Julia Menapace, Jim Kingdon,
19and David MacKenzie.
e505224d
PB
20
21Permission is granted to make and distribute verbatim copies of
22this manual provided the copyright notice and this permission notice
23are preserved on all copies.
24
25@ignore
26Permission is granted to process this file through Tex and print the
27results, provided the printed document carries copying permission
28notice identical to this one except for the removal of this paragraph
29(this paragraph not being relevant to the printed manual).
30
31@end ignore
32Permission is granted to copy or distribute modified versions of this
33manual under the terms of the GPL (for which purpose this text may be
34regarded as a program in the language TeX).
35@end ifinfo
36
139741da 37@setchapternewpage odd
e505224d
PB
38@settitle STABS
39@titlepage
139741da 40@title The ``stabs'' debug format
f958d5cd 41@author Julia Menapace, Jim Kingdon, David MacKenzie
e505224d
PB
42@author Cygnus Support
43@page
44@tex
45\def\$#1${{#1}} % Kluge: collect RCS revision info without $...$
46\xdef\manvers{\$Revision$} % For use in headers, footers too
47{\parskip=0pt
48\hfill Cygnus Support\par
49\hfill \manvers\par
50\hfill \TeX{}info \texinfoversion\par
51}
52@end tex
53
54@vskip 0pt plus 1filll
6fe91f2c 55Copyright @copyright{} 1992, 1993 Free Software Foundation, Inc.
899bafeb 56Contributed by Cygnus Support.
e505224d
PB
57
58Permission is granted to make and distribute verbatim copies of
59this manual provided the copyright notice and this permission notice
60are preserved on all copies.
61
62@end titlepage
63
899bafeb
RP
64@ifinfo
65@node Top
66@top The "stabs" representation of debugging information
e505224d 67
6ae55c65 68This document describes the stabs debugging format.
e505224d
PB
69
70@menu
8eb5e289 71* Overview:: Overview of stabs
bf9d2537 72* Program Structure:: Encoding of the structure of the program
6897f9ec 73* Constants:: Constants
6fe91f2c 74* Variables::
8c59ee11 75* Types:: Type definitions
bf9d2537 76* Symbol Tables:: Symbol information in symbol tables
8eb5e289 77* Cplusplus:: Appendixes:
bf9d2537
DM
78* Stab Types:: Symbol types in a.out files
79* Symbol Descriptors:: Table of symbol descriptors
80* Type Descriptors:: Table of type descriptors
81* Expanded Reference:: Reference information by stab type
8eb5e289 82* Questions:: Questions and anomolies
bf9d2537 83* XCOFF Differences:: Differences between GNU stabs in a.out
f958d5cd 84 and GNU stabs in XCOFF
bf9d2537 85* Sun Differences:: Differences between GNU stabs and Sun
139741da 86 native stabs
bf9d2537
DM
87* Stabs In ELF:: Stabs in an ELF file.
88* Symbol Types Index:: Index of symbolic stab symbol type names.
e505224d 89@end menu
899bafeb 90@end ifinfo
e505224d
PB
91
92
899bafeb 93@node Overview
bf9d2537 94@chapter Overview of Stabs
e505224d 95
139741da
RP
96@dfn{Stabs} refers to a format for information that describes a program
97to a debugger. This format was apparently invented by
98@c FIXME! <<name of inventor>> at
99the University of California at Berkeley, for the @code{pdx} Pascal
100debugger; the format has spread widely since then.
101
8c59ee11 102This document is one of the few published sources of documentation on
dd8126d9 103stabs. It is believed to be comprehensive for stabs used by C. The
bf9d2537
DM
104lists of symbol descriptors (@pxref{Symbol Descriptors}) and type
105descriptors (@pxref{Type Descriptors}) are believed to be completely
dd8126d9
JK
106comprehensive. Stabs for COBOL-specific features and for variant
107records (used by Pascal and Modula-2) are poorly documented here.
108
109Other sources of information on stabs are @cite{Dbx and Dbxtool
110Interfaces}, 2nd edition, by Sun, 1988, and @cite{AIX Version 3.2 Files
111Reference}, Fourth Edition, September 1992, "dbx Stabstring Grammar" in
112the a.out section, page 2-31. This document is believed to incorporate
113the information from those two sources except where it explictly directs
114you to them for more information.
8c59ee11 115
e505224d 116@menu
8eb5e289 117* Flow:: Overview of debugging information flow
bf9d2537
DM
118* Stabs Format:: Overview of stab format
119* String Field:: The string field
120* C Example:: A simple example in C source
121* Assembly Code:: The simple example at the assembly level
e505224d
PB
122@end menu
123
899bafeb 124@node Flow
bf9d2537 125@section Overview of Debugging Information Flow
e505224d 126
139741da 127The GNU C compiler compiles C source in a @file{.c} file into assembly
6fe91f2c
DM
128language in a @file{.s} file, which the assembler translates into
129a @file{.o} file, which the linker combines with other @file{.o} files and
139741da 130libraries to produce an executable file.
e505224d 131
6fe91f2c
DM
132With the @samp{-g} option, GCC puts in the @file{.s} file additional
133debugging information, which is slightly transformed by the assembler
134and linker, and carried through into the final executable. This
135debugging information describes features of the source file like line
136numbers, the types and scopes of variables, and function names,
137parameters, and scopes.
e505224d 138
6fe91f2c
DM
139For some object file formats, the debugging information is encapsulated
140in assembler directives known collectively as @dfn{stab} (symbol table)
141directives, which are interspersed with the generated code. Stabs are
f958d5cd
DM
142the native format for debugging information in the a.out and XCOFF
143object file formats. The GNU tools can also emit stabs in the COFF and
144ECOFF object file formats.
e505224d 145
139741da
RP
146The assembler adds the information from stabs to the symbol information
147it places by default in the symbol table and the string table of the
148@file{.o} file it is building. The linker consolidates the @file{.o}
149files into one executable file, with one symbol table and one string
150table. Debuggers use the symbol and string tables in the executable as
151a source of debugging information about the program.
e505224d 152
bf9d2537
DM
153@node Stabs Format
154@section Overview of Stab Format
e505224d 155
6fe91f2c 156There are three overall formats for stab assembler directives,
139741da 157differentiated by the first word of the stab. The name of the directive
6fe91f2c
DM
158describes which combination of four possible data fields follows. It is
159either @code{.stabs} (string), @code{.stabn} (number), or @code{.stabd}
f958d5cd 160(dot). IBM's XCOFF assembler uses @code{.stabx} (and some other
63cef7d7
JK
161directives such as @code{.file} and @code{.bi}) instead of
162@code{.stabs}, @code{.stabn} or @code{.stabd}.
e505224d
PB
163
164The overall format of each class of stab is:
165
166@example
0a95c18c
JK
167.stabs "@var{string}",@var{type},@var{other},@var{desc},@var{value}
168.stabn @var{type},@var{other},@var{desc},@var{value}
169.stabd @var{type},@var{other},@var{desc}
6fe91f2c 170.stabx "@var{string}",@var{value},@var{type},@var{sdb-type}
e505224d
PB
171@end example
172
63cef7d7
JK
173@c what is the correct term for "current file location"? My AIX
174@c assembler manual calls it "the value of the current location counter".
6fe91f2c 175For @code{.stabn} and @code{.stabd}, there is no @var{string} (the
bf9d2537 176@code{n_strx} field is zero; see @ref{Symbol Tables}). For
6fe91f2c
DM
177@code{.stabd}, the @var{value} field is implicit and has the value of
178the current file location. For @code{.stabx}, the @var{sdb-type} field
0a95c18c
JK
179is unused for stabs and can always be set to zero. The @var{other}
180field is almost always unused and can be set to zero.
6fe91f2c
DM
181
182The number in the @var{type} field gives some basic information about
183which type of stab this is (or whether it @emph{is} a stab, as opposed
184to an ordinary symbol). Each valid type number defines a different stab
685a5e86 185type; further, the stab type defines the exact interpretation of, and
6fe91f2c 186possible values for, any remaining @var{string}, @var{desc}, or
bf9d2537 187@var{value} fields present in the stab. @xref{Stab Types}, for a list
685a5e86 188in numeric order of the valid @var{type} field values for stab directives.
6fe91f2c 189
bf9d2537 190@node String Field
0a95c18c 191@section The String Field
e505224d 192
0a95c18c
JK
193For most stabs the string field holds the meat of the
194debugging information. The flexible nature of this field
195is what makes stabs extensible. For some stab types the string field
139741da
RP
196contains only a name. For other stab types the contents can be a great
197deal more complex.
e505224d 198
0a95c18c 199The overall format of the string field for most stab types is:
e505224d
PB
200
201@example
46351197 202"@var{name}:@var{symbol-descriptor} @var{type-information}"
e505224d
PB
203@end example
204
139741da 205@var{name} is the name of the symbol represented by the stab.
6897f9ec 206@var{name} can be omitted, which means the stab represents an unnamed
8c59ee11 207object. For example, @samp{:t10=*2} defines type 10 as a pointer to
6897f9ec 208type 2, but does not give the type a name. Omitting the @var{name}
dd8126d9 209field is supported by AIX dbx and GDB after about version 4.8, but not
46351197
JK
210other debuggers. GCC sometimes uses a single space as the name instead
211of omitting the name altogether; apparently that is supported by most
6fe91f2c 212debuggers.
e505224d 213
685a5e86 214The @var{symbol-descriptor} following the @samp{:} is an alphabetic
139741da 215character that tells more specifically what kind of symbol the stab
685a5e86 216represents. If the @var{symbol-descriptor} is omitted, but type
139741da 217information follows, then the stab represents a local variable. For a
bf9d2537 218list of symbol descriptors, see @ref{Symbol Descriptors}. The @samp{c}
6fe91f2c
DM
219symbol descriptor is an exception in that it is not followed by type
220information. @xref{Constants}.
e505224d 221
685a5e86
DM
222@var{type-information} is either a @var{type-number}, or
223@samp{@var{type-number}=}. A @var{type-number} alone is a type
139741da 224reference, referring directly to a type that has already been defined.
e505224d 225
685a5e86 226The @samp{@var{type-number}=} form is a type definition, where the
e7bb76cc
JK
227number represents a new type which is about to be defined. The type
228definition may refer to other types by number, and those type numbers
229may be followed by @samp{=} and nested definitions.
e505224d
PB
230
231In a type definition, if the character that follows the equals sign is
685a5e86 232non-numeric then it is a @var{type-descriptor}, and tells what kind of
139741da 233type is about to be defined. Any other values following the
685a5e86 234@var{type-descriptor} vary, depending on the @var{type-descriptor}.
bf9d2537 235@xref{Type Descriptors}, for a list of @var{type-descriptor} values. If
685a5e86
DM
236a number follows the @samp{=} then the number is a @var{type-reference}.
237For a full description of types, @ref{Types}.
139741da 238
6897f9ec 239There is an AIX extension for type attributes. Following the @samp{=}
685a5e86 240are any number of type attributes. Each one starts with @samp{@@} and
dd8126d9
JK
241ends with @samp{;}. Debuggers, including AIX's dbx and GDB 4.10, skip
242any type attributes they do not recognize. GDB 4.9 and other versions
243of dbx may not do this. Because of a conflict with C++
8c59ee11
JK
244(@pxref{Cplusplus}), new attributes should not be defined which begin
245with a digit, @samp{(}, or @samp{-}; GDB may be unable to distinguish
246those from the C++ type descriptor @samp{@@}. The attributes are:
6897f9ec
JK
247
248@table @code
249@item a@var{boundary}
8c59ee11 250@var{boundary} is an integer specifying the alignment. I assume it
6897f9ec
JK
251applies to all variables of this type.
252
253@item s@var{size}
8c59ee11 254Size in bits of a variable of this type.
6897f9ec
JK
255
256@item p@var{integer}
257Pointer class (for checking). Not sure what this means, or how
258@var{integer} is interpreted.
259
260@item P
261Indicate this is a packed type, meaning that structure fields or array
262elements are placed more closely in memory, to save memory at the
263expense of speed.
264@end table
265
0a95c18c 266All of this can make the string field quite long. All
dd8126d9
JK
267versions of GDB, and some versions of dbx, can handle arbitrarily long
268strings. But many versions of dbx cretinously limit the strings to
269about 80 characters, so compilers which must work with such dbx's need
b6963343
JK
270to split the @code{.stabs} directive into several @code{.stabs}
271directives. Each stab duplicates exactly all but the
0a95c18c 272string field. The string field of
b6963343
JK
273every stab except the last is marked as continued with a
274double-backslash at the end. Removing the backslashes and concatenating
0a95c18c 275the string fields of each stab produces the original,
b6963343 276long string.
e505224d 277
bf9d2537
DM
278@node C Example
279@section A Simple Example in C Source
e505224d
PB
280
281To get the flavor of how stabs describe source information for a C
282program, let's look at the simple program:
283
284@example
6fe91f2c 285main()
e505224d 286@{
139741da 287 printf("Hello world");
e505224d
PB
288@}
289@end example
290
139741da
RP
291When compiled with @samp{-g}, the program above yields the following
292@file{.s} file. Line numbers have been added to make it easier to refer
293to parts of the @file{.s} file in the description of the stabs that
294follows.
e505224d 295
bf9d2537
DM
296@node Assembly Code
297@section The Simple Example at the Assembly Level
e505224d 298
6fe91f2c
DM
299This simple ``hello world'' example demonstrates several of the stab
300types used to describe C language source files.
301
e505224d
PB
302@example
3031 gcc2_compiled.:
3042 .stabs "/cygint/s1/users/jcm/play/",100,0,0,Ltext0
3053 .stabs "hello.c",100,0,0,Ltext0
3064 .text
3075 Ltext0:
3086 .stabs "int:t1=r1;-2147483648;2147483647;",128,0,0,0
3097 .stabs "char:t2=r2;0;127;",128,0,0,0
3108 .stabs "long int:t3=r1;-2147483648;2147483647;",128,0,0,0
3119 .stabs "unsigned int:t4=r1;0;-1;",128,0,0,0
31210 .stabs "long unsigned int:t5=r1;0;-1;",128,0,0,0
31311 .stabs "short int:t6=r1;-32768;32767;",128,0,0,0
31412 .stabs "long long int:t7=r1;0;-1;",128,0,0,0
31513 .stabs "short unsigned int:t8=r1;0;65535;",128,0,0,0
31614 .stabs "long long unsigned int:t9=r1;0;-1;",128,0,0,0
31715 .stabs "signed char:t10=r1;-128;127;",128,0,0,0
31816 .stabs "unsigned char:t11=r1;0;255;",128,0,0,0
31917 .stabs "float:t12=r1;4;0;",128,0,0,0
32018 .stabs "double:t13=r1;8;0;",128,0,0,0
32119 .stabs "long double:t14=r1;8;0;",128,0,0,0
32220 .stabs "void:t15=15",128,0,0,0
139741da 32321 .align 4
e505224d 32422 LC0:
139741da
RP
32523 .ascii "Hello, world!\12\0"
32624 .align 4
32725 .global _main
32826 .proc 1
e505224d
PB
32927 _main:
33028 .stabn 68,0,4,LM1
33129 LM1:
139741da
RP
33230 !#PROLOGUE# 0
33331 save %sp,-136,%sp
33432 !#PROLOGUE# 1
33533 call ___main,0
33634 nop
e505224d
PB
33735 .stabn 68,0,5,LM2
33836 LM2:
33937 LBB2:
139741da
RP
34038 sethi %hi(LC0),%o1
34139 or %o1,%lo(LC0),%o0
34240 call _printf,0
34341 nop
e505224d
PB
34442 .stabn 68,0,6,LM3
34543 LM3:
34644 LBE2:
34745 .stabn 68,0,6,LM4
34846 LM4:
34947 L1:
139741da
RP
35048 ret
35149 restore
e505224d
PB
35250 .stabs "main:F1",36,0,0,_main
35351 .stabn 192,0,0,LBB2
35452 .stabn 224,0,0,LBE2
355@end example
356
bf9d2537
DM
357@node Program Structure
358@chapter Encoding the Structure of the Program
e505224d 359
685a5e86
DM
360The elements of the program structure that stabs encode include the name
361of the main function, the names of the source and include files, the
362line numbers, procedure names and types, and the beginnings and ends of
363blocks of code.
364
e505224d 365@menu
bf9d2537
DM
366* Main Program:: Indicate what the main program is
367* Source Files:: The path and name of the source file
368* Include Files:: Names of include files
369* Line Numbers::
6fe91f2c 370* Procedures::
bf9d2537
DM
371* Nested Procedures::
372* Block Structure::
e505224d
PB
373@end menu
374
bf9d2537
DM
375@node Main Program
376@section Main Program
499a5faa 377
685a5e86 378@findex N_MAIN
499a5faa 379Most languages allow the main program to have any name. The
685a5e86 380@code{N_MAIN} stab type tells the debugger the name that is used in this
0a95c18c 381program. Only the string field is significant; it is the name of
685a5e86
DM
382a function which is the main program. Most C compilers do not use this
383stab (they expect the debugger to assume that the name is @code{main}),
384but some C compilers emit an @code{N_MAIN} stab for the @code{main}
385function.
499a5faa 386
bf9d2537
DM
387@node Source Files
388@section Paths and Names of the Source Files
e505224d 389
685a5e86 390@findex N_SO
63cef7d7
JK
391Before any other stabs occur, there must be a stab specifying the source
392file. This information is contained in a symbol of stab type
0a95c18c
JK
393@code{N_SO}; the string field contains the name of the file. The
394value of the symbol is the start address of the portion of the
685a5e86 395text section corresponding to that file.
e505224d 396
0a95c18c 397With the Sun Solaris2 compiler, the desc field contains a
ded6bcab 398source-language code.
685a5e86 399@c Do the debuggers use it? What are the codes? -djm
ded6bcab 400
6fe91f2c 401Some compilers (for example, GCC2 and SunOS4 @file{/bin/cc}) also
63cef7d7
JK
402include the directory in which the source was compiled, in a second
403@code{N_SO} symbol preceding the one containing the file name. This
ded6bcab 404symbol can be distinguished by the fact that it ends in a slash. Code
685a5e86 405from the @code{cfront} C++ compiler can have additional @code{N_SO} symbols for
ded6bcab
JK
406nonexistent source files after the @code{N_SO} for the real source file;
407these are believed to contain no useful information.
e505224d 408
63cef7d7
JK
409For example:
410
411@example
baf4ded0 412.stabs "/cygint/s1/users/jcm/play/",100,0,0,Ltext0 # @r{100 is N_SO}
63cef7d7
JK
413.stabs "hello.c",100,0,0,Ltext0
414 .text
415Ltext0:
416@end example
417
418Instead of @code{N_SO} symbols, XCOFF uses a @code{.file} assembler
419directive which assembles to a standard COFF @code{.file} symbol;
420explaining this in detail is outside the scope of this document.
421
bf9d2537
DM
422@node Include Files
423@section Names of Include Files
6fe91f2c 424
685a5e86 425There are several schemes for dealing with include files: the
6fe91f2c
DM
426traditional @code{N_SOL} approach, Sun's @code{N_BINCL} approach, and the
427XCOFF @code{C_BINCL} approach (which despite the similar name has little in
63cef7d7
JK
428common with @code{N_BINCL}).
429
685a5e86 430@findex N_SOL
63cef7d7 431An @code{N_SOL} symbol specifies which include file subsequent symbols
f4548a46
JK
432refer to. The string field is the name of the file and the value is the
433text address corresponding to the end of the previous include file and
434the start of this one. To specify the main source file again, use an
435@code{N_SOL} symbol with the name of the main source file.
685a5e86 436
685a5e86
DM
437@findex N_BINCL
438@findex N_EINCL
439@findex N_EXCL
43603088
JK
440The @code{N_BINCL} approach works as follows. An @code{N_BINCL} symbol
441specifies the start of an include file. In an object file, only the
0a95c18c 442string is significant; the Sun linker puts data into some of the
43603088 443other fields. The end of the include file is marked by an
0a95c18c 444@code{N_EINCL} symbol (which has no string field). In an object
43603088
JK
445file, there is no significant data in the @code{N_EINCL} symbol; the Sun
446linker puts data into some of the fields. @code{N_BINCL} and
447@code{N_EINCL} can be nested.
448
449If the linker detects that two source files have identical stabs between
450an @code{N_BINCL} and @code{N_EINCL} pair (as will generally be the case
685a5e86
DM
451for a header file), then it only puts out the stabs once. Each
452additional occurance is replaced by an @code{N_EXCL} symbol. I believe
453the Sun (SunOS4, not sure about Solaris) linker is the only one which
454supports this feature.
455@c What do the fields of N_EXCL contain? -djm
685a5e86 456
685a5e86
DM
457@findex C_BINCL
458@findex C_EINCL
63cef7d7 459For the start of an include file in XCOFF, use the @file{.bi} assembler
6fe91f2c 460directive, which generates a @code{C_BINCL} symbol. A @file{.ei}
63cef7d7
JK
461directive, which generates a @code{C_EINCL} symbol, denotes the end of
462the include file. Both directives are followed by the name of the
0a95c18c
JK
463source file in quotes, which becomes the string for the symbol.
464The value of each symbol, produced automatically by the assembler
685a5e86
DM
465and linker, is the offset into the executable of the beginning
466(inclusive, as you'd expect) or end (inclusive, as you would not expect)
467of the portion of the COFF line table that corresponds to this include
468file. @code{C_BINCL} and @code{C_EINCL} do not nest.
63cef7d7 469
bf9d2537
DM
470@node Line Numbers
471@section Line Numbers
e505224d 472
685a5e86
DM
473@findex N_SLINE
474An @code{N_SLINE} symbol represents the start of a source line. The
ac31351a 475desc field contains the line number and the value
f0f4b04e 476contains the code address for the start of that source line. On most
6fe91f2c 477machines the address is absolute; for Sun's stabs-in-ELF, it is relative
f0f4b04e 478to the function in which the @code{N_SLINE} symbol occurs.
e505224d 479
685a5e86
DM
480@findex N_DSLINE
481@findex N_BSLINE
63cef7d7
JK
482GNU documents @code{N_DSLINE} and @code{N_BSLINE} symbols for line
483numbers in the data or bss segments, respectively. They are identical
484to @code{N_SLINE} but are relocated differently by the linker. They
485were intended to be used to describe the source location of a variable
6fe91f2c 486declaration, but I believe that GCC2 actually puts the line number in
0a95c18c
JK
487the desc field of the stab for the variable itself. GDB has been
488ignoring these symbols (unless they contain a string field) since
685a5e86 489at least GDB 3.5.
139741da 490
63cef7d7
JK
491For single source lines that generate discontiguous code, such as flow
492of control statements, there may be more than one line number entry for
493the same source line. In this case there is a line number entry at the
494start of each code range, each with the same line number.
e505224d 495
56bfba9c
JK
496XCOFF does not use stabs for line numbers. Instead, it uses COFF line
497numbers (which are outside the scope of this document). Standard COFF
498line numbers cannot deal with include files, but in XCOFF this is fixed
f19027a6 499with the @code{C_BINCL} method of marking include files (@pxref{Include
408f6c34 500Files}).
685a5e86 501
899bafeb 502@node Procedures
6897f9ec
JK
503@section Procedures
504
f19027a6 505@findex N_FUN, for functions
43603088
JK
506@findex N_FNAME
507@findex N_STSYM, for functions (Sun acc)
508@findex N_GSYM, for functions (Sun acc)
509All of the following stabs normally use the @code{N_FUN} symbol type.
510However, Sun's @code{acc} compiler on SunOS4 uses @code{N_GSYM} and
511@code{N_STSYM}, which means that the value of the stab for the function
512is useless and the debugger must get the address of the function from
513the non-stab symbols instead. BSD Fortran is said to use @code{N_FNAME}
514with the same restriction; the value of the symbol is not useful (I'm
515not sure it really does use this, because GDB doesn't handle this and no
516one has complained).
6897f9ec 517
dd8126d9 518A function is represented by an @samp{F} symbol descriptor for a global
43603088 519(extern) function, and @samp{f} for a static (local) function. The
ac31351a 520value is the address of the start of the function (absolute
43603088
JK
521for @code{a.out}; relative to the start of the file for Sun's
522stabs-in-ELF). The type information of the stab represents the return
523type of the function; thus @samp{foo:f5} means that foo is a function
524returning type 5. There is no need to try to get the line number of the
525start of the function from the stab for the function; it is in the next
526@code{N_SLINE} symbol.
527
528@c FIXME: verify whether the "I suspect" below is true or not.
529Some compilers (such as Sun's Solaris compiler) support an extension for
530specifying the types of the arguments. I suspect this extension is not
531used for old (non-prototyped) function definitions in C. If the
532extension is in use, the type information of the stab for the function
533is followed by type information for each argument, with each argument
534preceded by @samp{;}. An argument type of 0 means that additional
535arguments are being passed, whose types and number may vary (@samp{...}
536in ANSI C). GDB has tolerated this extension (parsed the syntax, if not
537necessarily used the information) since at least version 4.8; I don't
538know whether all versions of dbx tolerate it. The argument types given
539here are not redundant with the symbols for the formal parameters
540(@pxref{Parameters}); they are the types of the arguments as they are
541passed, before any conversions might take place. For example, if a C
542function which is declared without a prototype takes a @code{float}
543argument, the value is passed as a @code{double} but then converted to a
544@code{float}. Debuggers need to use the types given in the arguments
545when printing values, but when calling the function they need to use the
546types given in the symbol defining the function.
ded6bcab
JK
547
548If the return type and types of arguments of a function which is defined
6fe91f2c 549in another source file are specified (i.e., a function prototype in ANSI
ded6bcab
JK
550C), traditionally compilers emit no stab; the only way for the debugger
551to find the information is if the source file where the function is
552defined was also compiled with debugging symbols. As an extension the
553Solaris compiler uses symbol descriptor @samp{P} followed by the return
554type of the function, followed by the arguments, each preceded by
555@samp{;}, as in a stab with symbol descriptor @samp{f} or @samp{F}.
556This use of symbol descriptor @samp{P} can be distinguished from its use
bf9d2537 557for register parameters (@pxref{Register Parameters}) by the fact that it has
ded6bcab
JK
558symbol type @code{N_FUN}.
559
6897f9ec
JK
560The AIX documentation also defines symbol descriptor @samp{J} as an
561internal function. I assume this means a function nested within another
6fe91f2c 562function. It also says symbol descriptor @samp{m} is a module in
6897f9ec
JK
563Modula-2 or extended Pascal.
564
565Procedures (functions which do not return values) are represented as
6fe91f2c
DM
566functions returning the @code{void} type in C. I don't see why this couldn't
567be used for all languages (inventing a @code{void} type for this purpose if
6897f9ec
JK
568necessary), but the AIX documentation defines @samp{I}, @samp{P}, and
569@samp{Q} for internal, global, and static procedures, respectively.
570These symbol descriptors are unusual in that they are not followed by
571type information.
572
43603088
JK
573The following example shows a stab for a function @code{main} which
574returns type number @code{1}. The @code{_main} specified for the value
575is a reference to an assembler label which is used to fill in the start
576address of the function.
685a5e86
DM
577
578@example
43603088 579.stabs "main:F1",36,0,0,_main # @r{36 is N_FUN}
685a5e86
DM
580@end example
581
582The stab representing a procedure is located immediately following the
583code of the procedure. This stab is in turn directly followed by a
584group of other stabs describing elements of the procedure. These other
585stabs describe the procedure's parameters, its block local variables, and
586its block structure.
685a5e86 587
bf9d2537
DM
588@node Nested Procedures
589@section Nested Procedures
685a5e86 590
43603088
JK
591For any of the symbol descriptors representing procedures, after the
592symbol descriptor and the type information is optionally a scope
593specifier. This consists of a comma, the name of the procedure, another
594comma, and the name of the enclosing procedure. The first name is local
595to the scope specified, and seems to be redundant with the name of the
596symbol (before the @samp{:}). This feature is used by GCC, and
597presumably Pascal, Modula-2, etc., compilers, for nested functions.
6ea34847
JK
598
599If procedures are nested more than one level deep, only the immediately
685a5e86 600containing scope is specified. For example, this code:
6ea34847
JK
601
602@example
603int
604foo (int x)
605@{
606 int bar (int y)
607 @{
608 int baz (int z)
6fe91f2c
DM
609 @{
610 return x + y + z;
611 @}
6ea34847
JK
612 return baz (x + 2 * y);
613 @}
614 return x + bar (3 * x);
615@}
616@end example
617
618@noindent
619produces the stabs:
620
621@example
baf4ded0 622.stabs "baz:f1,baz,bar",36,0,0,_baz.15 # @r{36 is N_FUN}
6ea34847
JK
623.stabs "bar:f1,bar,foo",36,0,0,_bar.12
624.stabs "foo:F1",36,0,0,_foo
625@end example
6897f9ec 626
bf9d2537
DM
627@node Block Structure
628@section Block Structure
e505224d 629
685a5e86
DM
630@findex N_LBRAC
631@findex N_RBRAC
139741da 632The program's block structure is represented by the @code{N_LBRAC} (left
f0f4b04e 633brace) and the @code{N_RBRAC} (right brace) stab types. The variables
dd8126d9 634defined inside a block precede the @code{N_LBRAC} symbol for most
f0f4b04e 635compilers, including GCC. Other compilers, such as the Convex, Acorn
f958d5cd 636RISC machine, and Sun @code{acc} compilers, put the variables after the
0a95c18c 637@code{N_LBRAC} symbol. The values of the @code{N_LBRAC} and
f0f4b04e
JK
638@code{N_RBRAC} symbols are the start and end addresses of the code of
639the block, respectively. For most machines, they are relative to the
640starting address of this source file. For the Gould NP1, they are
6fe91f2c 641absolute. For Sun's stabs-in-ELF, they are relative to the function in
f0f4b04e 642which they occur.
e505224d 643
139741da 644The @code{N_LBRAC} and @code{N_RBRAC} stabs that describe the block
f0f4b04e 645scope of a procedure are located after the @code{N_FUN} stab that
6fe91f2c 646represents the procedure itself.
e505224d 647
0a95c18c 648Sun documents the desc field of @code{N_LBRAC} and
f0f4b04e 649@code{N_RBRAC} symbols as containing the nesting level of the block.
0a95c18c 650However, dbx seems to not care, and GCC always sets desc to
f0f4b04e 651zero.
e505224d 652
6897f9ec
JK
653@node Constants
654@chapter Constants
655
656The @samp{c} symbol descriptor indicates that this stab represents a
657constant. This symbol descriptor is an exception to the general rule
658that symbol descriptors are followed by type information. Instead, it
659is followed by @samp{=} and one of the following:
660
661@table @code
b273dc0f 662@item b @var{value}
6897f9ec
JK
663Boolean constant. @var{value} is a numeric value; I assume it is 0 for
664false or 1 for true.
665
b273dc0f 666@item c @var{value}
6897f9ec
JK
667Character constant. @var{value} is the numeric value of the constant.
668
b273dc0f
JK
669@item e @var{type-information} , @var{value}
670Constant whose value can be represented as integral.
671@var{type-information} is the type of the constant, as it would appear
bf9d2537 672after a symbol descriptor (@pxref{String Field}). @var{value} is the
b273dc0f
JK
673numeric value of the constant. GDB 4.9 does not actually get the right
674value if @var{value} does not fit in a host @code{int}, but it does not
675do anything violent, and future debuggers could be extended to accept
676integers of any size (whether unsigned or not). This constant type is
677usually documented as being only for enumeration constants, but GDB has
678never imposed that restriction; I don't know about other debuggers.
679
680@item i @var{value}
681Integer constant. @var{value} is the numeric value. The type is some
682sort of generic integer type (for GDB, a host @code{int}); to specify
683the type explicitly, use @samp{e} instead.
684
685@item r @var{value}
6897f9ec
JK
686Real constant. @var{value} is the real value, which can be @samp{INF}
687(optionally preceded by a sign) for infinity, @samp{QNAN} for a quiet
688NaN (not-a-number), or @samp{SNAN} for a signalling NaN. If it is a
689normal number the format is that accepted by the C library function
690@code{atof}.
691
b273dc0f 692@item s @var{string}
6897f9ec
JK
693String constant. @var{string} is a string enclosed in either @samp{'}
694(in which case @samp{'} characters within the string are represented as
695@samp{\'} or @samp{"} (in which case @samp{"} characters within the
696string are represented as @samp{\"}).
697
b273dc0f 698@item S @var{type-information} , @var{elements} , @var{bits} , @var{pattern}
6897f9ec 699Set constant. @var{type-information} is the type of the constant, as it
bf9d2537 700would appear after a symbol descriptor (@pxref{String Field}).
685a5e86 701@var{elements} is the number of elements in the set (does this means
a03f27c3
JK
702how many bits of @var{pattern} are actually used, which would be
703redundant with the type, or perhaps the number of bits set in
704@var{pattern}? I don't get it), @var{bits} is the number of bits in the
705constant (meaning it specifies the length of @var{pattern}, I think),
706and @var{pattern} is a hexadecimal representation of the set. AIX
707documentation refers to a limit of 32 bytes, but I see no reason why
708this limit should exist. This form could probably be used for arbitrary
709constants, not just sets; the only catch is that @var{pattern} should be
710understood to be target, not host, byte order and format.
6897f9ec
JK
711@end table
712
713The boolean, character, string, and set constants are not supported by
685a5e86 714GDB 4.9, but it ignores them. GDB 4.8 and earlier gave an error
6897f9ec
JK
715message and refused to read symbols from the file containing the
716constants.
717
685a5e86 718The above information is followed by @samp{;}.
e505224d 719
899bafeb 720@node Variables
e505224d
PB
721@chapter Variables
722
685a5e86
DM
723Different types of stabs describe the various ways that variables can be
724allocated: on the stack, globally, in registers, in common blocks,
725statically, or as arguments to a function.
726
e505224d 727@menu
bf9d2537
DM
728* Stack Variables:: Variables allocated on the stack.
729* Global Variables:: Variables used by more than one source file.
730* Register Variables:: Variables in registers.
731* Common Blocks:: Variables statically allocated together.
24dcc707 732* Statics:: Variables local to one source file.
f19027a6 733* Based Variables:: Fortran pointer based variables.
24dcc707 734* Parameters:: Variables for arguments to functions.
e505224d
PB
735@end menu
736
bf9d2537
DM
737@node Stack Variables
738@section Automatic Variables Allocated on the Stack
e505224d 739
685a5e86
DM
740If a variable's scope is local to a function and its lifetime is only as
741long as that function executes (C calls such variables
742@dfn{automatic}), it can be allocated in a register (@pxref{Register
bf9d2537 743Variables}) or on the stack.
e505224d 744
685a5e86 745@findex N_LSYM
43603088
JK
746Each variable allocated on the stack has a stab with the symbol
747descriptor omitted. Since type information should begin with a digit,
748@samp{-}, or @samp{(}, only those characters precluded from being used
749for symbol descriptors. However, the Acorn RISC machine (ARM) is said
750to get this wrong: it puts out a mere type definition here, without the
751preceding @samp{@var{type-number}=}. This is a bad idea; there is no
752guarantee that type descriptors are distinct from symbol descriptors.
753Stabs for stack variables use the @code{N_LSYM} stab type.
e505224d 754
0a95c18c 755The value of the stab is the offset of the variable within the
685a5e86
DM
756local variables. On most machines this is an offset from the frame
757pointer and is negative. The location of the stab specifies which block
bf9d2537 758it is defined in; see @ref{Block Structure}.
e505224d 759
685a5e86 760For example, the following C code:
e505224d 761
e7bb76cc
JK
762@example
763int
764main ()
765@{
766 int x;
767@}
768@end example
139741da 769
685a5e86 770produces the following stabs:
e505224d 771
e7bb76cc 772@example
baf4ded0
JK
773.stabs "main:F1",36,0,0,_main # @r{36 is N_FUN}
774.stabs "x:1",128,0,0,-12 # @r{128 is N_LSYM}
775.stabn 192,0,0,LBB2 # @r{192 is N_LBRAC}
776.stabn 224,0,0,LBE2 # @r{224 is N_RBRAC}
e505224d
PB
777@end example
778
685a5e86 779@xref{Procedures} for more information on the @code{N_FUN} stab, and
bf9d2537 780@ref{Block Structure} for more information on the @code{N_LBRAC} and
685a5e86 781@code{N_RBRAC} stabs.
e505224d 782
bf9d2537
DM
783@node Global Variables
784@section Global Variables
e505224d 785
685a5e86
DM
786@findex N_GSYM
787A variable whose scope is not specific to just one source file is
baf4ded0
JK
788represented by the @samp{G} symbol descriptor. These stabs use the
789@code{N_GSYM} stab type. The type information for the stab
bf9d2537 790(@pxref{String Field}) gives the type of the variable.
e505224d 791
baf4ded0 792For example, the following source code:
6fe91f2c 793
e505224d 794@example
baf4ded0 795char g_foo = 'c';
e505224d
PB
796@end example
797
139741da 798@noindent
baf4ded0 799yields the following assembly code:
e505224d
PB
800
801@example
baf4ded0
JK
802.stabs "g_foo:G2",32,0,0,0 # @r{32 is N_GSYM}
803 .global _g_foo
804 .data
805_g_foo:
806 .byte 99
e505224d
PB
807@end example
808
baf4ded0
JK
809The address of the variable represented by the @code{N_GSYM} is not
810contained in the @code{N_GSYM} stab. The debugger gets this information
811from the external symbol for the global variable. In the example above,
812the @code{.global _g_foo} and @code{_g_foo:} lines tell the assembler to
813produce an external symbol.
e505224d 814
bf9d2537
DM
815@node Register Variables
816@section Register Variables
139741da 817
685a5e86 818@findex N_RSYM
8c59ee11
JK
819@c According to an old version of this manual, AIX uses C_RPSYM instead
820@c of C_RSYM. I am skeptical; this should be verified.
6897f9ec 821Register variables have their own stab type, @code{N_RSYM}, and their
ac31351a 822own symbol descriptor, @samp{r}. The stab's value is the
6897f9ec 823number of the register where the variable data will be stored.
685a5e86 824@c .stabs "name:type",N_RSYM,0,RegSize,RegNumber (Sun doc)
e505224d 825
6897f9ec 826AIX defines a separate symbol descriptor @samp{d} for floating point
935d305d 827registers. This seems unnecessary; why not just just give floating
807e8368
JK
828point registers different register numbers? I have not verified whether
829the compiler actually uses @samp{d}.
e505224d 830
6897f9ec 831If the register is explicitly allocated to a global variable, but not
685a5e86 832initialized, as in:
e505224d
PB
833
834@example
6897f9ec 835register int g_bar asm ("%g5");
e505224d
PB
836@end example
837
685a5e86
DM
838@noindent
839then the stab may be emitted at the end of the object file, with
6897f9ec 840the other bss symbols.
e505224d 841
bf9d2537
DM
842@node Common Blocks
843@section Common Blocks
807e8368
JK
844
845A common block is a statically allocated section of memory which can be
846referred to by several source files. It may contain several variables.
685a5e86
DM
847I believe Fortran is the only language with this feature.
848
685a5e86
DM
849@findex N_BCOMM
850@findex N_ECOMM
851A @code{N_BCOMM} stab begins a common block and an @code{N_ECOMM} stab
852ends it. The only field that is significant in these two stabs is the
0a95c18c 853string, which names a normal (non-debugging) symbol that gives the
685a5e86 854address of the common block.
685a5e86 855
685a5e86
DM
856@findex N_ECOML
857Each stab between the @code{N_BCOMM} and the @code{N_ECOMM} specifies a
0a95c18c 858member of that common block; its value is the offset within the
685a5e86
DM
859common block of that variable. The @code{N_ECOML} stab type is
860documented for this purpose, but Sun's Fortran compiler uses
861@code{N_GSYM} instead. The test case I looked at had a common block
862local to a function and it used the @samp{V} symbol descriptor; I assume
863one would use @samp{S} if not local to a function (that is, if a common
864block @emph{can} be anything other than local to a function).
807e8368 865
24dcc707 866@node Statics
bf9d2537 867@section Static Variables
e505224d 868
24dcc707
JK
869Initialized static variables are represented by the @samp{S} and
870@samp{V} symbol descriptors. @samp{S} means file scope static, and
871@samp{V} means procedure scope static.
e505224d 872
935d305d
JK
873@c This is probably not worth mentioning; it is only true on the sparc
874@c for `double' variables which although declared const are actually in
875@c the data segment (the text segment can't guarantee 8 byte alignment).
6fe91f2c 876@c (although GCC
dd8126d9 877@c 2.4.5 has a bug in that it uses @code{N_FUN}, so neither dbx nor GDB can
935d305d 878@c find the variables)
685a5e86
DM
879@findex N_STSYM
880@findex N_LCSYM
f19027a6
JK
881@findex N_FUN, for variables
882@findex N_ROSYM
31a932d8
JK
883In a.out files, @code{N_STSYM} means the data section, @code{N_FUN}
884means the text section, and @code{N_LCSYM} means the bss section. For
885those systems with a read-only data section separate from the text
886section (Solaris), @code{N_ROSYM} means the read-only data section.
e505224d 887
685a5e86 888For example, the source lines:
e505224d
PB
889
890@example
24dcc707
JK
891static const int var_const = 5;
892static int var_init = 2;
893static int var_noinit;
e505224d
PB
894@end example
895
24dcc707
JK
896@noindent
897yield the following stabs:
e505224d
PB
898
899@example
baf4ded0 900.stabs "var_const:S1",36,0,0,_var_const # @r{36 is N_FUN}
685a5e86 901@dots{}
baf4ded0 902.stabs "var_init:S1",38,0,0,_var_init # @r{38 is N_STSYM}
685a5e86 903@dots{}
baf4ded0 904.stabs "var_noinit:S1",40,0,0,_var_noinit # @r{40 is N_LCSYM}
e505224d 905@end example
685a5e86
DM
906
907In XCOFF files, each symbol has a section number, so the stab type
31a932d8 908need not indicate the section.
685a5e86
DM
909
910In ECOFF files, the storage class is used to specify the section, so the
31a932d8 911stab type need not indicate the section.
685a5e86 912
31a932d8
JK
913In ELF files, for Solaris 2.1, symbol descriptor @samp{S} means that the
914address is absolute (ld relocates it) and symbol descriptor @samp{V}
915means that the address is relative to the start of the relevant section
916for that compilation unit. I don't know what it does for @samp{S} stabs
917on Solaris 2.3 (in which ld no longer relocates stabs). For more
918information on ld stab relocation, @xref{Stabs In ELF}.
e505224d 919
f19027a6
JK
920@node Based Variables
921@section Fortran Based Variables
922
923Fortran (at least, the Sun and SGI dialects of FORTRAN-77) has a feature
924which allows allocating arrays with @code{malloc}, but which avoids
925blurring the line between arrays and pointers the way that C does. In
926stabs such a variable uses the @samp{b} symbol descriptor.
927
928For example, the Fortran declarations
929
930@example
931real foo, foo10(10), foo10_5(10,5)
932pointer (foop, foo)
933pointer (foo10p, foo10)
934pointer (foo105p, foo10_5)
935@end example
936
937produce the stabs
938
939@example
940foo:b6
941foo10:bar3;1;10;6
942foo10_5:bar3;1;5;ar3;1;10;6
943@end example
944
945In this example, @code{real} is type 6 and type 3 is an integral type
946which is the type of the subscripts of the array (probably
947@code{integer}).
948
949The @samp{b} symbol descriptor is like @samp{V} in that it denotes a
950statically allocated symbol whose scope is local to a function; see
951@xref{Statics}. The value of the symbol, instead of being the address
952of the variable itself, is the address of a pointer to that variable.
953So in the above example, the value of the @code{foo} stab is the address
954of a pointer to a real, the value of the @code{foo10} stab is the
955address of a pointer to a 10-element array of reals, and the value of
956the @code{foo10_5} stab is the address of a pointer to a 5-element array
957of 10-element arrays of reals.
958
899bafeb 959@node Parameters
907a9cab
JK
960@section Parameters
961
43603088 962Formal parameters to a function are represented by a stab (or sometimes
685a5e86
DM
963two; see below) for each parameter. The stabs are in the order in which
964the debugger should print the parameters (i.e., the order in which the
dd8126d9
JK
965parameters are declared in the source file). The exact form of the stab
966depends on how the parameter is being passed.
e505224d 967
685a5e86
DM
968@findex N_PSYM
969Parameters passed on the stack use the symbol descriptor @samp{p} and
0a95c18c 970the @code{N_PSYM} symbol type. The value of the symbol is an offset
dd8126d9 971used to locate the parameter on the stack; its exact meaning is
685a5e86 972machine-dependent, but on most machines it is an offset from the frame
dd8126d9 973pointer.
b82ea042 974
685a5e86
DM
975As a simple example, the code:
976
977@example
978main (argc, argv)
979 int argc;
980 char **argv;
981@end example
982
983produces the stabs:
984
985@example
986.stabs "main:F1",36,0,0,_main # @r{36 is N_FUN}
987.stabs "argc:p1",160,0,0,68 # @r{160 is N_PSYM}
988.stabs "argv:p20=*21=*2",160,0,0,72
989@end example
990
991The type definition of @code{argv} is interesting because it contains
992several type definitions. Type 21 is pointer to type 2 (char) and
993@code{argv} (type 20) is pointer to type 21.
43603088
JK
994
995@c FIXME: figure out what these mean and describe them coherently.
408f6c34
JK
996The following symbol descriptors are also said to go with @code{N_PSYM}.
997The value of the symbol is said to be an offset from the argument
998pointer (I'm not sure whether this is true or not).
43603088
JK
999
1000@example
408f6c34
JK
1001pP (<<??>>)
1002pF Fortran function parameter
1003X (function result variable)
43603088 1004@end example
685a5e86
DM
1005
1006@menu
bf9d2537
DM
1007* Register Parameters::
1008* Local Variable Parameters::
1009* Reference Parameters::
1010* Conformant Arrays::
685a5e86
DM
1011@end menu
1012
bf9d2537
DM
1013@node Register Parameters
1014@subsection Passing Parameters in Registers
685a5e86
DM
1015
1016If the parameter is passed in a register, then traditionally there are
1017two symbols for each argument:
e505224d
PB
1018
1019@example
baf4ded0
JK
1020.stabs "arg:p1" . . . ; N_PSYM
1021.stabs "arg:r1" . . . ; N_RSYM
e505224d
PB
1022@end example
1023
685a5e86
DM
1024Debuggers use the second one to find the value, and the first one to
1025know that it is an argument.
1026
685a5e86 1027@findex C_RPSYM
43603088 1028@findex N_RSYM, for parameters
685a5e86
DM
1029Because that approach is kind of ugly, some compilers use symbol
1030descriptor @samp{P} or @samp{R} to indicate an argument which is in a
1031register. Symbol type @code{C_RPSYM} is used with @samp{R} and
ac31351a 1032@code{N_RSYM} is used with @samp{P}. The symbol's value is
685a5e86
DM
1033the register number. @samp{P} and @samp{R} mean the same thing; the
1034difference is that @samp{P} is a GNU invention and @samp{R} is an IBM
1035(XCOFF) invention. As of version 4.9, GDB should handle either one.
e505224d 1036
685a5e86
DM
1037There is at least one case where GCC uses a @samp{p} and @samp{r} pair
1038rather than @samp{P}; this is where the argument is passed in the
1039argument list and then loaded into a register.
b82ea042 1040
685a5e86 1041According to the AIX documentation, symbol descriptor @samp{D} is for a
acf7d010
JK
1042parameter passed in a floating point register. This seems
1043unnecessary---why not just use @samp{R} with a register number which
23aed449 1044indicates that it's a floating point register? I haven't verified
6897f9ec
JK
1045whether the system actually does what the documentation indicates.
1046
43603088
JK
1047@c FIXME: On the hppa this is for any type > 8 bytes, I think, and not
1048@c for small structures (investigate).
c156f3c1
JK
1049On the sparc and hppa, for a @samp{P} symbol whose type is a structure
1050or union, the register contains the address of the structure. On the
685a5e86
DM
1051sparc, this is also true of a @samp{p} and @samp{r} pair (using Sun
1052@code{cc}) or a @samp{p} symbol. However, if a (small) structure is
1053really in a register, @samp{r} is used. And, to top it all off, on the
1054hppa it might be a structure which was passed on the stack and loaded
1055into a register and for which there is a @samp{p} and @samp{r} pair! I
1056believe that symbol descriptor @samp{i} is supposed to deal with this
1057case (it is said to mean "value parameter by reference, indirect
1058access"; I don't know the source for this information), but I don't know
1059details or what compilers or debuggers use it, if any (not GDB or GCC).
1060It is not clear to me whether this case needs to be dealt with
bf9d2537 1061differently than parameters passed by reference (@pxref{Reference Parameters}).
685a5e86 1062
bf9d2537
DM
1063@node Local Variable Parameters
1064@subsection Storing Parameters as Local Variables
685a5e86
DM
1065
1066There is a case similar to an argument in a register, which is an
1067argument that is actually stored as a local variable. Sometimes this
98ef6f31
JK
1068happens when the argument was passed in a register and then the compiler
1069stores it as a local variable. If possible, the compiler should claim
685a5e86
DM
1070that it's in a register, but this isn't always done.
1071
43603088 1072@findex N_LSYM, for parameter
685a5e86
DM
1073Some compilers use the pair of symbols approach described above
1074(@samp{@var{arg}:p} followed by @samp{@var{arg}:}); this includes GCC1
1075(not GCC2) on the sparc when passing a small structure and GCC2
1076(sometimes) when the argument type is @code{float} and it is passed as a
1077@code{double} and converted to @code{float} by the prologue (in the
1078latter case the type of the @samp{@var{arg}:p} symbol is @code{double}
f3bb0be2
JK
1079and the type of the @samp{@var{arg}:} symbol is @code{float}).
1080
1081GCC, at least on the 960, has another solution to the same problem. It
1082uses a single @samp{p} symbol descriptor for an argument which is stored
1083as a local variable but uses @code{N_LSYM} instead of @code{N_PSYM}. In
1084this case, the value of the symbol is an offset relative to the local
1085variables for that function, not relative to the arguments; on some
1086machines those are the same thing, but not on all.
1087
1088@c This is mostly just background info; the part that logically belongs
1089@c here is the last sentence.
1090On the VAX or on other machines in which the calling convention includes
1091the number of words of arguments actually passed, the debugger (GDB at
1092least) uses the parameter symbols to keep track of whether it needs to
1093print nameless arguments in addition to the formal parameters which it
1094has printed because each one has a stab. For example, in
1095
1096@example
1097extern int fprintf (FILE *stream, char *format, @dots{});
1098@dots{}
1099fprintf (stdout, "%d\n", x);
1100@end example
1101
1102there are stabs for @code{stream} and @code{format}. On most machines,
1103the debugger can only print those two arguments (because it has no way
1104of knowing that additional arguments were passed), but on the VAX or
1105other machines with a calling convention which indicates the number of
1106words of arguments, the debugger can print all three arguments. To do
1107so, the parameter symbol (symbol descriptor @samp{p}) (not necessarily
1108@samp{r} or symbol descriptor omitted symbols) needs to contain the
1109actual type as passed (for example, @code{double} not @code{float} if it
1110is passed as a double and converted to a float).
685a5e86 1111
bf9d2537
DM
1112@node Reference Parameters
1113@subsection Passing Parameters by Reference
685a5e86
DM
1114
1115If the parameter is passed by reference (e.g., Pascal @code{VAR}
1116parameters), then the symbol descriptor is @samp{v} if it is in the
1117argument list, or @samp{a} if it in a register. Other than the fact
1118that these contain the address of the parameter rather than the
1119parameter itself, they are identical to @samp{p} and @samp{R},
1120respectively. I believe @samp{a} is an AIX invention; @samp{v} is
1121supported by all stabs-using systems as far as I know.
1122
bf9d2537
DM
1123@node Conformant Arrays
1124@subsection Passing Conformant Array Parameters
6897f9ec
JK
1125
1126@c Is this paragraph correct? It is based on piecing together patchy
1127@c information and some guesswork
685a5e86 1128Conformant arrays are a feature of Modula-2, and perhaps other
6897f9ec 1129languages, in which the size of an array parameter is not known to the
685a5e86 1130called function until run-time. Such parameters have two stabs: a
6897f9ec 1131@samp{x} for the array itself, and a @samp{C}, which represents the size
0a95c18c 1132of the array. The value of the @samp{x} stab is the offset in the
6897f9ec 1133argument list where the address of the array is stored (it this right?
0a95c18c 1134it is a guess); the value of the @samp{C} stab is the offset in the
6897f9ec
JK
1135argument list where the size of the array (in elements? in bytes?) is
1136stored.
1137
8c59ee11 1138@node Types
bf9d2537 1139@chapter Defining Types
e505224d 1140
685a5e86
DM
1141The examples so far have described types as references to previously
1142defined types, or defined in terms of subranges of or pointers to
1143previously defined types. This chapter describes the other type
1144descriptors that may follow the @samp{=} in a type definition.
e505224d
PB
1145
1146@menu
bf9d2537
DM
1147* Builtin Types:: Integers, floating point, void, etc.
1148* Miscellaneous Types:: Pointers, sets, files, etc.
1149* Cross-References:: Referring to a type not yet defined.
8c59ee11
JK
1150* Subranges:: A type with a specific range.
1151* Arrays:: An aggregate type of same-typed elements.
1152* Strings:: Like an array but also has a length.
1153* Enumerations:: Like an integer but the values have names.
1154* Structures:: An aggregate type of different-typed elements.
ded6bcab
JK
1155* Typedefs:: Giving a type a name.
1156* Unions:: Different types sharing storage.
bf9d2537 1157* Function Types::
e505224d
PB
1158@end menu
1159
bf9d2537
DM
1160@node Builtin Types
1161@section Builtin Types
e505224d 1162
8c59ee11
JK
1163Certain types are built in (@code{int}, @code{short}, @code{void},
1164@code{float}, etc.); the debugger recognizes these types and knows how
685a5e86 1165to handle them. Thus, don't be surprised if some of the following ways
8c59ee11
JK
1166of specifying builtin types do not specify everything that a debugger
1167would need to know about the type---in some cases they merely specify
1168enough information to distinguish the type from other types.
1169
1170The traditional way to define builtin types is convolunted, so new ways
dd8126d9
JK
1171have been invented to describe them. Sun's @code{acc} uses special
1172builtin type descriptors (@samp{b} and @samp{R}), and IBM uses negative
685a5e86 1173type numbers. GDB accepts all three ways, as of version 4.8; dbx just
dd8126d9
JK
1174accepts the traditional builtin types and perhaps one of the other two
1175formats. The following sections describe each of these formats.
8c59ee11
JK
1176
1177@menu
bf9d2537
DM
1178* Traditional Builtin Types:: Put on your seatbelts and prepare for kludgery
1179* Builtin Type Descriptors:: Builtin types with special type descriptors
1180* Negative Type Numbers:: Builtin types using negative type numbers
8c59ee11
JK
1181@end menu
1182
bf9d2537
DM
1183@node Traditional Builtin Types
1184@subsection Traditional Builtin Types
8c59ee11 1185
685a5e86
DM
1186This is the traditional, convoluted method for defining builtin types.
1187There are several classes of such type definitions: integer, floating
1188point, and @code{void}.
1189
1190@menu
bf9d2537
DM
1191* Traditional Integer Types::
1192* Traditional Other Types::
685a5e86
DM
1193@end menu
1194
bf9d2537
DM
1195@node Traditional Integer Types
1196@subsubsection Traditional Integer Types
685a5e86
DM
1197
1198Often types are defined as subranges of themselves. If the bounding values
1199fit within an @code{int}, then they are given normally. For example:
8c59ee11
JK
1200
1201@example
baf4ded0 1202.stabs "int:t1=r1;-2147483648;2147483647;",128,0,0,0 # @r{128 is N_LSYM}
8c59ee11
JK
1203.stabs "char:t2=r2;0;127;",128,0,0,0
1204@end example
1205
1206Builtin types can also be described as subranges of @code{int}:
1207
1208@example
1209.stabs "unsigned short:t6=r1;0;65535;",128,0,0,0
1210@end example
1211
685a5e86
DM
1212If the lower bound of a subrange is 0 and the upper bound is -1,
1213the type is an unsigned integral type whose bounds are too
1214big to describe in an @code{int}. Traditionally this is only used for
1215@code{unsigned int} and @code{unsigned long}:
8c59ee11
JK
1216
1217@example
1218.stabs "unsigned int:t4=r1;0;-1;",128,0,0,0
8c59ee11
JK
1219@end example
1220
43603088
JK
1221For larger types, GCC 2.4.5 puts out bounds in octal, with a leading 0.
1222In this case a negative bound consists of a number which is a 1 bit
1223followed by a bunch of 0 bits, and a positive bound is one in which a
1224bunch of bits are 1. All known versions of dbx and GDB version 4 accept
1225this, but GDB 3.5 refuses to read the whole file containing such
1226symbols. So GCC 2.3.3 did not output the proper size for these types.
1227@c FIXME: How about an example?
685a5e86 1228
b273dc0f 1229If the lower bound of a subrange is 0 and the upper bound is negative,
685a5e86 1230the type is an unsigned integral type whose size in bytes is the
b273dc0f
JK
1231absolute value of the upper bound. I believe this is a Convex
1232convention for @code{unsigned long long}.
1233
1234If the lower bound of a subrange is negative and the upper bound is 0,
685a5e86 1235the type is a signed integral type whose size in bytes is
b273dc0f
JK
1236the absolute value of the lower bound. I believe this is a Convex
1237convention for @code{long long}. To distinguish this from a legitimate
1238subrange, the type should be a subrange of itself. I'm not sure whether
1239this is the case for Convex.
1240
bf9d2537
DM
1241@node Traditional Other Types
1242@subsubsection Traditional Other Types
685a5e86
DM
1243
1244If the upper bound of a subrange is 0 and the lower bound is positive,
1245the type is a floating point type, and the lower bound of the subrange
1246indicates the number of bytes in the type:
8c59ee11
JK
1247
1248@example
1249.stabs "float:t12=r1;4;0;",128,0,0,0
1250.stabs "double:t13=r1;8;0;",128,0,0,0
1251@end example
1252
1253However, GCC writes @code{long double} the same way it writes
dd8126d9 1254@code{double}, so there is no way to distinguish.
8c59ee11
JK
1255
1256@example
1257.stabs "long double:t14=r1;8;0;",128,0,0,0
1258@end example
1259
dd8126d9
JK
1260Complex types are defined the same way as floating-point types; there is
1261no way to distinguish a single-precision complex from a double-precision
1262floating-point type.
8c59ee11
JK
1263
1264The C @code{void} type is defined as itself:
1265
1266@example
1267.stabs "void:t15=15",128,0,0,0
1268@end example
1269
1270I'm not sure how a boolean type is represented.
1271
bf9d2537
DM
1272@node Builtin Type Descriptors
1273@subsection Defining Builtin Types Using Builtin Type Descriptors
8c59ee11 1274
685a5e86
DM
1275This is the method used by Sun's @code{acc} for defining builtin types.
1276These are the type descriptors to define builtin types:
8c59ee11
JK
1277
1278@table @code
1a8b5668
JK
1279@c FIXME: clean up description of width and offset, once we figure out
1280@c what they mean
8c59ee11
JK
1281@item b @var{signed} @var{char-flag} @var{width} ; @var{offset} ; @var{nbits} ;
1282Define an integral type. @var{signed} is @samp{u} for unsigned or
1283@samp{s} for signed. @var{char-flag} is @samp{c} which indicates this
1284is a character type, or is omitted. I assume this is to distinguish an
1285integral type from a character type of the same size, for example it
1286might make sense to set it for the C type @code{wchar_t} so the debugger
1287can print such variables differently (Solaris does not do this). Sun
1288sets it on the C types @code{signed char} and @code{unsigned char} which
1289arguably is wrong. @var{width} and @var{offset} appear to be for small
1290objects stored in larger ones, for example a @code{short} in an
1291@code{int} register. @var{width} is normally the number of bytes in the
1292type. @var{offset} seems to always be zero. @var{nbits} is the number
1293of bits in the type.
1294
1295Note that type descriptor @samp{b} used for builtin types conflicts with
bf9d2537 1296its use for Pascal space types (@pxref{Miscellaneous Types}); they can
8c59ee11
JK
1297be distinguished because the character following the type descriptor
1298will be a digit, @samp{(}, or @samp{-} for a Pascal space type, or
1299@samp{u} or @samp{s} for a builtin type.
1300
1301@item w
1302Documented by AIX to define a wide character type, but their compiler
bf9d2537 1303actually uses negative type numbers (@pxref{Negative Type Numbers}).
8c59ee11 1304
685a5e86
DM
1305@item R @var{fp-type} ; @var{bytes} ;
1306Define a floating point type. @var{fp-type} has one of the following values:
1a8b5668
JK
1307
1308@table @code
1309@item 1 (NF_SINGLE)
1310IEEE 32-bit (single precision) floating point format.
1311
1312@item 2 (NF_DOUBLE)
1313IEEE 64-bit (double precision) floating point format.
1314
1315@item 3 (NF_COMPLEX)
1316@item 4 (NF_COMPLEX16)
1317@item 5 (NF_COMPLEX32)
3d4cf720
JK
1318@c "GDB source" really means @file{include/aout/stab_gnu.h}, but trying
1319@c to put that here got an overfull hbox.
1320These are for complex numbers. A comment in the GDB source describes
685a5e86
DM
1321them as Fortran @code{complex}, @code{double complex}, and
1322@code{complex*16}, respectively, but what does that mean? (i.e., Single
1323precision? Double precison?).
1a8b5668
JK
1324
1325@item 6 (NF_LDOUBLE)
43603088 1326Long double. This should probably only be used for Sun format
685a5e86
DM
1327@code{long double}, and new codes should be used for other floating
1328point formats (@code{NF_DOUBLE} can be used if a @code{long double} is
1329really just an IEEE double, of course).
1a8b5668
JK
1330@end table
1331
1332@var{bytes} is the number of bytes occupied by the type. This allows a
1333debugger to perform some operations with the type even if it doesn't
685a5e86 1334understand @var{fp-type}.
8c59ee11
JK
1335
1336@item g @var{type-information} ; @var{nbits}
1337Documented by AIX to define a floating type, but their compiler actually
bf9d2537 1338uses negative type numbers (@pxref{Negative Type Numbers}).
8c59ee11
JK
1339
1340@item c @var{type-information} ; @var{nbits}
1341Documented by AIX to define a complex type, but their compiler actually
bf9d2537 1342uses negative type numbers (@pxref{Negative Type Numbers}).
8c59ee11
JK
1343@end table
1344
1345The C @code{void} type is defined as a signed integral type 0 bits long:
1346@example
1347.stabs "void:t19=bs0;0;0",128,0,0,0
1348@end example
e9f687d5
JK
1349The Solaris compiler seems to omit the trailing semicolon in this case.
1350Getting sloppy in this way is not a swift move because if a type is
1351embedded in a more complex expression it is necessary to be able to tell
1352where it ends.
8c59ee11
JK
1353
1354I'm not sure how a boolean type is represented.
1355
bf9d2537
DM
1356@node Negative Type Numbers
1357@subsection Negative Type Numbers
8c59ee11 1358
685a5e86 1359This is the method used in XCOFF for defining builtin types.
8c59ee11
JK
1360Since the debugger knows about the builtin types anyway, the idea of
1361negative type numbers is simply to give a special type number which
685a5e86 1362indicates the builtin type. There is no stab defining these types.
8c59ee11 1363
23afb447
JK
1364There are several subtle issues with negative type numbers.
1365
1366One is the size of the type. A builtin type (for example the C types
1367@code{int} or @code{long}) might have different sizes depending on
1368compiler options, the target architecture, the ABI, etc. This issue
1369doesn't come up for IBM tools since (so far) they just target the
1370RS/6000; the sizes indicated below for each size are what the IBM
1371RS/6000 tools use. To deal with differing sizes, either define separate
1372negative type numbers for each size (which works but requires changing
1373the debugger, and, unless you get both AIX dbx and GDB to accept the
1374change, introduces an incompatibility), or use a type attribute
1375(@pxref{String Field}) to define a new type with the appropriate size
1376(which merely requires a debugger which understands type attributes,
1377like AIX dbx). For example,
1378
1379@example
1380.stabs "boolean:t10=@@s8;-16",128,0,0,0
1381@end example
1382
1383defines an 8-bit boolean type, and
1384
1385@example
1386.stabs "boolean:t10=@@s64;-16",128,0,0,0
1387@end example
1388
1389defines a 64-bit boolean type.
1390
1391A similar issue is the format of the type. This comes up most often for
1392floating-point types, which could have various formats (particularly
1393extended doubles, which vary quite a bit even among IEEE systems).
1394Again, it is best to define a new negative type number for each
1395different format; changing the format based on the target system has
1396various problems. One such problem is that the Alpha has both VAX and
1397IEEE floating types. One can easily imagine one library using the VAX
1398types and another library in the same executable using the IEEE types.
1399Another example is that the interpretation of whether a boolean is true
1400or false can be based on the least significant bit, most significant
1401bit, whether it is zero, etc., and different compilers (or different
1402options to the same compiler) might provide different kinds of boolean.
1403
1404The last major issue is the names of the types. The name of a given
1405type depends @emph{only} on the negative type number given; these do not
1406vary depending on the language, the target system, or anything else.
1407One can always define separate type numbers---in the following list you
1408will see for example separate @code{int} and @code{integer*4} types
1409which are identical except for the name. But compatibility can be
1410maintained by not inventing new negative type numbers and instead just
1411defining a new type with a new name. For example:
1412
1413@example
1414.stabs "CARDINAL:t10=-8",128,0,0,0
1415@end example
1416
1417Here is the list of negative type numbers. The phrase @dfn{integral
1418type} is used to mean twos-complement (I strongly suspect that all
1419machines which use stabs use twos-complement; most machines use
1420twos-complement these days).
b273dc0f 1421
8c59ee11
JK
1422@table @code
1423@item -1
1424@code{int}, 32 bit signed integral type.
1425
1426@item -2
dd8126d9 1427@code{char}, 8 bit type holding a character. Both GDB and dbx on AIX
8c59ee11 1428treat this as signed. GCC uses this type whether @code{char} is signed
685a5e86 1429or not, which seems like a bad idea. The AIX compiler (@code{xlc}) seems to
8c59ee11
JK
1430avoid this type; it uses -5 instead for @code{char}.
1431
1432@item -3
1433@code{short}, 16 bit signed integral type.
1434
1435@item -4
1436@code{long}, 32 bit signed integral type.
1437
1438@item -5
1439@code{unsigned char}, 8 bit unsigned integral type.
1440
1441@item -6
1442@code{signed char}, 8 bit signed integral type.
1443
1444@item -7
1445@code{unsigned short}, 16 bit unsigned integral type.
1446
1447@item -8
1448@code{unsigned int}, 32 bit unsigned integral type.
1449
1450@item -9
1451@code{unsigned}, 32 bit unsigned integral type.
1452
1453@item -10
1454@code{unsigned long}, 32 bit unsigned integral type.
1455
1456@item -11
1457@code{void}, type indicating the lack of a value.
1458
1459@item -12
1460@code{float}, IEEE single precision.
1461
1462@item -13
1463@code{double}, IEEE double precision.
1464
1465@item -14
b273dc0f
JK
1466@code{long double}, IEEE double precision. The compiler claims the size
1467will increase in a future release, and for binary compatibility you have
1468to avoid using @code{long double}. I hope when they increase it they
1469use a new negative type number.
8c59ee11
JK
1470
1471@item -15
b273dc0f 1472@code{integer}. 32 bit signed integral type.
8c59ee11
JK
1473
1474@item -16
455c8603
JK
1475@code{boolean}. 32 bit type. How is the truth value encoded? Is it
1476the least significant bit or is it a question of whether the whole value
1477is zero or non-zero?
8c59ee11
JK
1478
1479@item -17
b273dc0f 1480@code{short real}. IEEE single precision.
8c59ee11
JK
1481
1482@item -18
b273dc0f 1483@code{real}. IEEE double precision.
8c59ee11
JK
1484
1485@item -19
b273dc0f 1486@code{stringptr}. @xref{Strings}.
8c59ee11
JK
1487
1488@item -20
dcb9e869 1489@code{character}, 8 bit unsigned character type.
8c59ee11
JK
1490
1491@item -21
6fe91f2c 1492@code{logical*1}, 8 bit type. This Fortran type has a split
01c4b039 1493personality in that it is used for boolean variables, but can also be
03ffea63
JK
1494used for unsigned integers. 0 is false, 1 is true, and other values are
1495non-boolean.
8c59ee11
JK
1496
1497@item -22
6fe91f2c 1498@code{logical*2}, 16 bit type. This Fortran type has a split
01c4b039 1499personality in that it is used for boolean variables, but can also be
03ffea63
JK
1500used for unsigned integers. 0 is false, 1 is true, and other values are
1501non-boolean.
8c59ee11
JK
1502
1503@item -23
6fe91f2c 1504@code{logical*4}, 32 bit type. This Fortran type has a split
01c4b039 1505personality in that it is used for boolean variables, but can also be
03ffea63
JK
1506used for unsigned integers. 0 is false, 1 is true, and other values are
1507non-boolean.
8c59ee11
JK
1508
1509@item -24
6fe91f2c 1510@code{logical}, 32 bit type. This Fortran type has a split
0e84d6ec 1511personality in that it is used for boolean variables, but can also be
03ffea63
JK
1512used for unsigned integers. 0 is false, 1 is true, and other values are
1513non-boolean.
8c59ee11
JK
1514
1515@item -25
b273dc0f
JK
1516@code{complex}. A complex type consisting of two IEEE single-precision
1517floating point values.
8c59ee11
JK
1518
1519@item -26
b273dc0f
JK
1520@code{complex}. A complex type consisting of two IEEE double-precision
1521floating point values.
8c59ee11
JK
1522
1523@item -27
1524@code{integer*1}, 8 bit signed integral type.
1525
1526@item -28
1527@code{integer*2}, 16 bit signed integral type.
1528
1529@item -29
1530@code{integer*4}, 32 bit signed integral type.
1531
1532@item -30
dcb9e869
JK
1533@code{wchar}. Wide character, 16 bits wide, unsigned (what format?
1534Unicode?).
8c59ee11
JK
1535@end table
1536
bf9d2537
DM
1537@node Miscellaneous Types
1538@section Miscellaneous Types
8c59ee11
JK
1539
1540@table @code
1541@item b @var{type-information} ; @var{bytes}
1542Pascal space type. This is documented by IBM; what does it mean?
1543
685a5e86 1544This use of the @samp{b} type descriptor can be distinguished
bf9d2537
DM
1545from its use for builtin integral types (@pxref{Builtin Type
1546Descriptors}) because the character following the type descriptor is
8c59ee11
JK
1547always a digit, @samp{(}, or @samp{-}.
1548
1549@item B @var{type-information}
43603088 1550A volatile-qualified version of @var{type-information}. This is
685a5e86 1551a Sun extension. References and stores to a variable with a
43603088 1552volatile-qualified type must not be optimized or cached; they
685a5e86 1553must occur as the user specifies them.
8c59ee11
JK
1554
1555@item d @var{type-information}
1556File of type @var{type-information}. As far as I know this is only used
1557by Pascal.
1558
1559@item k @var{type-information}
43603088
JK
1560A const-qualified version of @var{type-information}. This is a Sun
1561extension. A variable with a const-qualified type cannot be modified.
8c59ee11
JK
1562
1563@item M @var{type-information} ; @var{length}
1564Multiple instance type. The type seems to composed of @var{length}
1565repetitions of @var{type-information}, for example @code{character*3} is
1566represented by @samp{M-2;3}, where @samp{-2} is a reference to a
bf9d2537 1567character type (@pxref{Negative Type Numbers}). I'm not sure how this
6fe91f2c
DM
1568differs from an array. This appears to be a Fortran feature.
1569@var{length} is a bound, like those in range types; see @ref{Subranges}.
8c59ee11
JK
1570
1571@item S @var{type-information}
1572Pascal set type. @var{type-information} must be a small type such as an
1573enumeration or a subrange, and the type is a bitmask whose length is
1574specified by the number of elements in @var{type-information}.
1575
1576@item * @var{type-information}
1577Pointer to @var{type-information}.
139741da 1578@end table
e505224d 1579
bf9d2537
DM
1580@node Cross-References
1581@section Cross-References to Other Types
8c59ee11 1582
685a5e86
DM
1583A type can be used before it is defined; one common way to deal with
1584that situation is just to use a type reference to a type which has not
1585yet been defined.
8c59ee11
JK
1586
1587Another way is with the @samp{x} type descriptor, which is followed by
1588@samp{s} for a structure tag, @samp{u} for a union tag, or @samp{e} for
1589a enumerator tag, followed by the name of the tag, followed by @samp{:}.
685a5e86 1590For example, the following C declarations:
e505224d
PB
1591
1592@example
8c59ee11
JK
1593struct foo;
1594struct foo *bar;
e505224d
PB
1595@end example
1596
685a5e86
DM
1597@noindent
1598produce:
8c59ee11
JK
1599
1600@example
1601.stabs "bar:G16=*17=xsfoo:",32,0,0,0
1602@end example
1603
1604Not all debuggers support the @samp{x} type descriptor, so on some
1605machines GCC does not use it. I believe that for the above example it
1606would just emit a reference to type 17 and never define it, but I
1607haven't verified that.
1608
1609Modula-2 imported types, at least on AIX, use the @samp{i} type
1610descriptor, which is followed by the name of the module from which the
1611type is imported, followed by @samp{:}, followed by the name of the
1612type. There is then optionally a comma followed by type information for
685a5e86 1613the type. This differs from merely naming the type (@pxref{Typedefs}) in
8c59ee11
JK
1614that it identifies the module; I don't understand whether the name of
1615the type given here is always just the same as the name we are giving
1616it, or whether this type descriptor is used with a nameless stab
bf9d2537 1617(@pxref{String Field}), or what. The symbol ends with @samp{;}.
e505224d 1618
8c59ee11 1619@node Subranges
bf9d2537 1620@section Subrange Types
8c59ee11
JK
1621
1622The @samp{r} type descriptor defines a type as a subrange of another
685a5e86
DM
1623type. It is followed by type information for the type of which it is a
1624subrange, a semicolon, an integral lower bound, a semicolon, an
8c59ee11 1625integral upper bound, and a semicolon. The AIX documentation does not
63cef7d7
JK
1626specify the trailing semicolon, in an effort to specify array indexes
1627more cleanly, but a subrange which is not an array index has always
466bdeb2 1628included a trailing semicolon (@pxref{Arrays}).
8c59ee11 1629
8cfe3beb 1630Instead of an integer, either bound can be one of the following:
8c59ee11
JK
1631
1632@table @code
1633@item A @var{offset}
1634The bound is passed by reference on the stack at offset @var{offset}
1635from the argument list. @xref{Parameters}, for more information on such
1636offsets.
1637
1638@item T @var{offset}
1639The bound is passed by value on the stack at offset @var{offset} from
1640the argument list.
1641
1642@item a @var{register-number}
1643The bound is pased by reference in register number
1644@var{register-number}.
1645
1646@item t @var{register-number}
1647The bound is passed by value in register number @var{register-number}.
1648
1649@item J
1650There is no bound.
1651@end table
1652
bf9d2537 1653Subranges are also used for builtin types; see @ref{Traditional Builtin Types}.
8c59ee11
JK
1654
1655@node Arrays
bf9d2537 1656@section Array Types
8c59ee11
JK
1657
1658Arrays use the @samp{a} type descriptor. Following the type descriptor
63cef7d7 1659is the type of the index and the type of the array elements. If the
685a5e86
DM
1660index type is a range type, it ends in a semicolon; otherwise
1661(for example, if it is a type reference), there does not
63cef7d7
JK
1662appear to be any way to tell where the types are separated. In an
1663effort to clean up this mess, IBM documents the two types as being
1664separated by a semicolon, and a range type as not ending in a semicolon
1665(but this is not right for range types which are not array indexes,
1666@pxref{Subranges}). I think probably the best solution is to specify
1667that a semicolon ends a range type, and that the index type and element
1668type of an array are separated by a semicolon, but that if the index
1669type is a range type, the extra semicolon can be omitted. GDB (at least
1670through version 4.9) doesn't support any kind of index type other than a
1671range anyway; I'm not sure about dbx.
6aa83a79 1672
ee59134e 1673It is well established, and widely used, that the type of the index,
3d4cf720 1674unlike most types found in the stabs, is merely a type definition, not
bf9d2537 1675type information (@pxref{String Field}) (that is, it need not start with
685a5e86 1676@samp{@var{type-number}=} if it is defining a new type). According to a
3d4cf720
JK
1677comment in GDB, this is also true of the type of the array elements; it
1678gives @samp{ar1;1;10;ar1;1;10;4} as a legitimate way to express a two
1679dimensional array. According to AIX documentation, the element type
1680must be type information. GDB accepts either.
ee59134e 1681
43603088
JK
1682The type of the index is often a range type, expressed as the type
1683descriptor @samp{r} and some parameters. It defines the size of the
1684array. In the example below, the range @samp{r1;0;2;} defines an index
1685type which is a subrange of type 1 (integer), with a lower bound of 0
1686and an upper bound of 2. This defines the valid range of subscripts of
1687a three-element C array.
e505224d 1688
685a5e86 1689For example, the definition:
e505224d
PB
1690
1691@example
8c59ee11
JK
1692char char_vec[3] = @{'a','b','c'@};
1693@end example
e505224d 1694
8c59ee11 1695@noindent
685a5e86 1696produces the output:
8c59ee11
JK
1697
1698@example
1699.stabs "char_vec:G19=ar1;0;2;2",32,0,0,0
1700 .global _char_vec
1701 .align 4
1702_char_vec:
1703 .byte 97
1704 .byte 98
1705 .byte 99
1706@end example
1707
685a5e86 1708If an array is @dfn{packed}, the elements are spaced more
8c59ee11
JK
1709closely than normal, saving memory at the expense of speed. For
1710example, an array of 3-byte objects might, if unpacked, have each
1711element aligned on a 4-byte boundary, but if packed, have no padding.
1712One way to specify that something is packed is with type attributes
bf9d2537 1713(@pxref{String Field}). In the case of arrays, another is to use the
8c59ee11
JK
1714@samp{P} type descriptor instead of @samp{a}. Other than specifying a
1715packed array, @samp{P} is identical to @samp{a}.
1716
1717@c FIXME-what is it? A pointer?
1718An open array is represented by the @samp{A} type descriptor followed by
1719type information specifying the type of the array elements.
1720
1721@c FIXME: what is the format of this type? A pointer to a vector of pointers?
1722An N-dimensional dynamic array is represented by
1723
1724@example
1725D @var{dimensions} ; @var{type-information}
1726@end example
1727
1728@c Does dimensions really have this meaning? The AIX documentation
1729@c doesn't say.
1730@var{dimensions} is the number of dimensions; @var{type-information}
1731specifies the type of the array elements.
1732
1733@c FIXME: what is the format of this type? A pointer to some offsets in
1734@c another array?
1735A subarray of an N-dimensional array is represented by
1736
1737@example
1738E @var{dimensions} ; @var{type-information}
e505224d
PB
1739@end example
1740
8c59ee11
JK
1741@c Does dimensions really have this meaning? The AIX documentation
1742@c doesn't say.
1743@var{dimensions} is the number of dimensions; @var{type-information}
1744specifies the type of the array elements.
1745
1746@node Strings
1747@section Strings
1748
1749Some languages, like C or the original Pascal, do not have string types,
1750they just have related things like arrays of characters. But most
1751Pascals and various other languages have string types, which are
1752indicated as follows:
1753
1754@table @code
1755@item n @var{type-information} ; @var{bytes}
1756@var{bytes} is the maximum length. I'm not sure what
1757@var{type-information} is; I suspect that it means that this is a string
1758of @var{type-information} (thus allowing a string of integers, a string
1759of wide characters, etc., as well as a string of characters). Not sure
1760what the format of this type is. This is an AIX feature.
1761
1762@item z @var{type-information} ; @var{bytes}
1763Just like @samp{n} except that this is a gstring, not an ordinary
1764string. I don't know the difference.
1765
1766@item N
1767Pascal Stringptr. What is this? This is an AIX feature.
1768@end table
1769
899bafeb 1770@node Enumerations
6fe91f2c 1771@section Enumerations
e505224d 1772
8c59ee11 1773Enumerations are defined with the @samp{e} type descriptor.
e505224d 1774
8c59ee11
JK
1775@c FIXME: Where does this information properly go? Perhaps it is
1776@c redundant with something we already explain.
685a5e86 1777The source line below declares an enumeration type at file scope.
6fe91f2c
DM
1778The type definition is located after the @code{N_RBRAC} that marks the end of
1779the previous procedure's block scope, and before the @code{N_FUN} that marks
8c59ee11 1780the beginning of the next procedure's block scope. Therefore it does not
6fe91f2c 1781describe a block local symbol, but a file local one.
8c59ee11
JK
1782
1783The source line:
e505224d
PB
1784
1785@example
8c59ee11 1786enum e_places @{first,second=3,last@};
e505224d
PB
1787@end example
1788
899bafeb 1789@noindent
685a5e86 1790generates the following stab:
e505224d 1791
899bafeb 1792@example
8c59ee11 1793.stabs "e_places:T22=efirst:0,second:3,last:4,;",128,0,0,0
899bafeb 1794@end example
e505224d 1795
685a5e86
DM
1796The symbol descriptor (@samp{T}) says that the stab describes a
1797structure, enumeration, or union tag. The type descriptor @samp{e},
1798following the @samp{22=} of the type definition narrows it down to an
1799enumeration type. Following the @samp{e} is a list of the elements of
1800the enumeration. The format is @samp{@var{name}:@var{value},}. The
43603088 1801list of elements ends with @samp{;}.
e505224d 1802
8c59ee11
JK
1803There is no standard way to specify the size of an enumeration type; it
1804is determined by the architecture (normally all enumerations types are
180532 bits). There should be a way to specify an enumeration type of
685a5e86 1806another size; type attributes would be one way to do this. @xref{Stabs
bf9d2537 1807Format}.
8c59ee11
JK
1808
1809@node Structures
1810@section Structures
e505224d 1811
685a5e86 1812The encoding of structures in stabs can be shown with an example.
e505224d
PB
1813
1814The following source code declares a structure tag and defines an
685a5e86
DM
1815instance of the structure in global scope. Then a @code{typedef} equates the
1816structure tag with a new type. Seperate stabs are generated for the
1817structure tag, the structure @code{typedef}, and the structure instance. The
1818stabs for the tag and the @code{typedef} are emited when the definitions are
e505224d
PB
1819encountered. Since the structure elements are not initialized, the
1820stab and code for the structure variable itself is located at the end
685a5e86 1821of the program in the bss section.
e505224d
PB
1822
1823@example
685a5e86
DM
1824struct s_tag @{
1825 int s_int;
1826 float s_float;
1827 char s_char_vec[8];
1828 struct s_tag* s_next;
1829@} g_an_s;
e505224d 1830
685a5e86
DM
1831typedef struct s_tag s_typedef;
1832@end example
e505224d 1833
685a5e86
DM
1834The structure tag has an @code{N_LSYM} stab type because, like the
1835enumeration, the symbol has file scope. Like the enumeration, the
1836symbol descriptor is @samp{T}, for enumeration, structure, or tag type.
43603088 1837The type descriptor @samp{s} following the @samp{16=} of the type
685a5e86 1838definition narrows the symbol type to structure.
e505224d 1839
43603088 1840Following the @samp{s} type descriptor is the number of bytes the
685a5e86
DM
1841structure occupies, followed by a description of each structure element.
1842The structure element descriptions are of the form @var{name:type, bit
1843offset from the start of the struct, number of bits in the element}.
e505224d 1844
43603088
JK
1845@c FIXME: phony line break. Can probably be fixed by using an example
1846@c with fewer fields.
685a5e86 1847@example
43603088 1848# @r{128 is N_LSYM}
685a5e86
DM
1849.stabs "s_tag:T16=s20s_int:1,0,32;s_float:12,32,32;
1850 s_char_vec:17=ar1;0;7;2,64,64;s_next:18=*16,128,32;;",128,0,0,0
612dbd4c 1851@end example
6fe91f2c 1852
685a5e86
DM
1853In this example, the first two structure elements are previously defined
1854types. For these, the type following the @samp{@var{name}:} part of the
1855element description is a simple type reference. The other two structure
e505224d 1856elements are new types. In this case there is a type definition
685a5e86
DM
1857embedded after the @samp{@var{name}:}. The type definition for the
1858array element looks just like a type definition for a standalone array.
1859The @code{s_next} field is a pointer to the same kind of structure that
1860the field is an element of. So the definition of structure type 16
1861contains a type definition for an element which is a pointer to type 16.
e505224d 1862
899bafeb 1863@node Typedefs
bf9d2537 1864@section Giving a Type a Name
e505224d 1865
e7bb76cc 1866To give a type a name, use the @samp{t} symbol descriptor. The type
bf9d2537 1867is specified by the type information (@pxref{String Field}) for the stab.
e7bb76cc 1868For example,
e505224d 1869
899bafeb 1870@example
43603088 1871.stabs "s_typedef:t16",128,0,0,0 # @r{128 is N_LSYM}
899bafeb 1872@end example
e505224d 1873
8c59ee11 1874specifies that @code{s_typedef} refers to type number 16. Such stabs
43603088 1875have symbol type @code{N_LSYM} (or @code{C_DECL} for XCOFF).
e505224d 1876
685a5e86 1877If you are specifying the tag name for a structure, union, or
8c59ee11
JK
1878enumeration, use the @samp{T} symbol descriptor instead. I believe C is
1879the only language with this feature.
e505224d 1880
8c59ee11
JK
1881If the type is an opaque type (I believe this is a Modula-2 feature),
1882AIX provides a type descriptor to specify it. The type descriptor is
1883@samp{o} and is followed by a name. I don't know what the name
1884means---is it always the same as the name of the type, or is this type
bf9d2537 1885descriptor used with a nameless stab (@pxref{String Field})? There
8c59ee11
JK
1886optionally follows a comma followed by type information which defines
1887the type of this type. If omitted, a semicolon is used in place of the
e7bb76cc 1888comma and the type information, and the type is much like a generic
8c59ee11
JK
1889pointer type---it has a known size but little else about it is
1890specified.
e505224d 1891
899bafeb 1892@node Unions
6fe91f2c 1893@section Unions
e505224d 1894
e505224d 1895@example
685a5e86
DM
1896union u_tag @{
1897 int u_int;
1898 float u_float;
1899 char* u_char;
1900@} an_u;
e505224d
PB
1901@end example
1902
685a5e86
DM
1903This code generates a stab for a union tag and a stab for a union
1904variable. Both use the @code{N_LSYM} stab type. If a union variable is
e505224d 1905scoped locally to the procedure in which it is defined, its stab is
6fe91f2c 1906located immediately preceding the @code{N_LBRAC} for the procedure's block
e505224d
PB
1907start.
1908
685a5e86 1909The stab for the union tag, however, is located preceding the code for
6fe91f2c 1910the procedure in which it is defined. The stab type is @code{N_LSYM}. This
e505224d 1911would seem to imply that the union type is file scope, like the struct
f958d5cd
DM
1912type @code{s_tag}. This is not true. The contents and position of the stab
1913for @code{u_type} do not convey any infomation about its procedure local
e505224d
PB
1914scope.
1915
43603088
JK
1916@c FIXME: phony line break. Can probably be fixed by using an example
1917@c with fewer fields.
5bc927fb 1918@smallexample
43603088 1919# @r{128 is N_LSYM}
685a5e86
DM
1920.stabs "u_tag:T23=u4u_int:1,0,32;u_float:12,0,32;u_char:21,0,32;;",
1921 128,0,0,0
5bc927fb 1922@end smallexample
e505224d 1923
685a5e86
DM
1924The symbol descriptor @samp{T}, following the @samp{name:} means that
1925the stab describes an enumeration, structure, or union tag. The type
1926descriptor @samp{u}, following the @samp{23=} of the type definition,
1927narrows it down to a union type definition. Following the @samp{u} is
1928the number of bytes in the union. After that is a list of union element
1929descriptions. Their format is @var{name:type, bit offset into the
1930union, number of bytes for the element;}.
e505224d 1931
685a5e86 1932The stab for the union variable is:
e505224d 1933
899bafeb 1934@example
43603088 1935.stabs "an_u:23",128,0,0,-20 # @r{128 is N_LSYM}
899bafeb 1936@end example
e505224d 1937
43603088 1938@samp{-20} specifies where the variable is stored (@pxref{Stack
bf9d2537 1939Variables}).
43603088 1940
bf9d2537
DM
1941@node Function Types
1942@section Function Types
e505224d 1943
685a5e86
DM
1944Various types can be defined for function variables. These types are
1945not used in defining functions (@pxref{Procedures}); they are used for
1946things like pointers to functions.
e505224d 1947
8c59ee11
JK
1948The simple, traditional, type is type descriptor @samp{f} is followed by
1949type information for the return type of the function, followed by a
1950semicolon.
1951
685a5e86
DM
1952This does not deal with functions for which the number and types of the
1953parameters are part of the type, as in Modula-2 or ANSI C. AIX provides
1954extensions to specify these, using the @samp{f}, @samp{F}, @samp{p}, and
1955@samp{R} type descriptors.
8c59ee11 1956
685a5e86 1957First comes the type descriptor. If it is @samp{f} or @samp{F}, this
43603088
JK
1958type involves a function rather than a procedure, and the type
1959information for the return type of the function follows, followed by a
1960comma. Then comes the number of parameters to the function and a
1961semicolon. Then, for each parameter, there is the name of the parameter
1962followed by a colon (this is only present for type descriptors @samp{R}
1963and @samp{F} which represent Pascal function or procedure parameters),
1964type information for the parameter, a comma, 0 if passed by reference or
19651 if passed by value, and a semicolon. The type definition ends with a
1966semicolon.
8c59ee11 1967
685a5e86 1968For example, this variable definition:
e505224d
PB
1969
1970@example
8c59ee11 1971int (*g_pf)();
e505224d
PB
1972@end example
1973
8c59ee11
JK
1974@noindent
1975generates the following code:
e505224d 1976
899bafeb 1977@example
8c59ee11
JK
1978.stabs "g_pf:G24=*25=f1",32,0,0,0
1979 .common _g_pf,4,"bss"
899bafeb 1980@end example
e505224d 1981
8c59ee11 1982The variable defines a new type, 24, which is a pointer to another new
685a5e86 1983type, 25, which is a function returning @code{int}.
e505224d 1984
bf9d2537
DM
1985@node Symbol Tables
1986@chapter Symbol Information in Symbol Tables
e505224d 1987
6fe91f2c
DM
1988This chapter describes the format of symbol table entries
1989and how stab assembler directives map to them. It also describes the
1990transformations that the assembler and linker make on data from stabs.
e505224d 1991
685a5e86 1992@menu
bf9d2537
DM
1993* Symbol Table Format::
1994* Transformations On Symbol Tables::
685a5e86
DM
1995@end menu
1996
bf9d2537
DM
1997@node Symbol Table Format
1998@section Symbol Table Format
685a5e86
DM
1999
2000Each time the assembler encounters a stab directive, it puts
2001each field of the stab into a corresponding field in a symbol table
0a95c18c 2002entry of its output file. If the stab contains a string field, the
e505224d
PB
2003symbol table entry for that stab points to a string table entry
2004containing the string data from the stab. Assembler labels become
2005relocatable addresses. Symbol table entries in a.out have the format:
2006
dd8126d9 2007@c FIXME: should refer to external, not internal.
e505224d
PB
2008@example
2009struct internal_nlist @{
139741da
RP
2010 unsigned long n_strx; /* index into string table of name */
2011 unsigned char n_type; /* type of symbol */
2012 unsigned char n_other; /* misc info (usually empty) */
2013 unsigned short n_desc; /* description field */
2014 bfd_vma n_value; /* value of symbol */
e505224d
PB
2015@};
2016@end example
2017
0a95c18c
JK
2018If the stab has a string, the @code{n_strx} field holds the offset in
2019bytes of the string within the string table. The string is terminated
2020by a NUL character. If the stab lacks a string (for example, it was
2021produced by a @code{.stabn} or @code{.stabd} directive), the
2022@code{n_strx} field is zero.
685a5e86
DM
2023
2024Symbol table entries with @code{n_type} field values greater than 0x1f
2025originated as stabs generated by the compiler (with one random
2026exception). The other entries were placed in the symbol table of the
2027executable by the assembler or the linker.
e505224d 2028
bf9d2537
DM
2029@node Transformations On Symbol Tables
2030@section Transformations on Symbol Tables
e505224d
PB
2031
2032The linker concatenates object files and does fixups of externally
685a5e86 2033defined symbols.
e505224d 2034
685a5e86
DM
2035You can see the transformations made on stab data by the assembler and
2036linker by examining the symbol table after each pass of the build. To
2037do this, use @samp{nm -ap}, which dumps the symbol table, including
6fe91f2c
DM
2038debugging information, unsorted. For stab entries the columns are:
2039@var{value}, @var{other}, @var{desc}, @var{type}, @var{string}. For
2040assembler and linker symbols, the columns are: @var{value}, @var{type},
2041@var{string}.
e505224d 2042
43603088
JK
2043The low 5 bits of the stab type tell the linker how to relocate the
2044value of the stab. Thus for stab types like @code{N_RSYM} and
2045@code{N_LSYM}, where the value is an offset or a register number, the
2046low 5 bits are @code{N_ABS}, which tells the linker not to relocate the
2047value.
e505224d 2048
0a95c18c 2049Where the value of a stab contains an assembly language label,
e505224d
PB
2050it is transformed by each build step. The assembler turns it into a
2051relocatable address and the linker turns it into an absolute address.
685a5e86
DM
2052
2053@menu
bf9d2537
DM
2054* Transformations On Static Variables::
2055* Transformations On Global Variables::
cd61aa60 2056* ELF Transformations:: In ELF, things are a bit different.
685a5e86
DM
2057@end menu
2058
bf9d2537
DM
2059@node Transformations On Static Variables
2060@subsection Transformations on Static Variables
685a5e86 2061
e505224d
PB
2062This source line defines a static variable at file scope:
2063
899bafeb 2064@example
685a5e86 2065static int s_g_repeat
899bafeb 2066@end example
e505224d 2067
899bafeb 2068@noindent
6fe91f2c 2069The following stab describes the symbol:
e505224d 2070
899bafeb 2071@example
685a5e86 2072.stabs "s_g_repeat:S1",38,0,0,_s_g_repeat
899bafeb 2073@end example
e505224d 2074
899bafeb 2075@noindent
e505224d 2076The assembler transforms the stab into this symbol table entry in the
899bafeb 2077@file{.o} file. The location is expressed as a data segment offset.
e505224d 2078
899bafeb 2079@example
685a5e86 208000000084 - 00 0000 STSYM s_g_repeat:S1
899bafeb 2081@end example
e505224d 2082
899bafeb 2083@noindent
685a5e86 2084In the symbol table entry from the executable, the linker has made the
e505224d
PB
2085relocatable address absolute.
2086
899bafeb 2087@example
685a5e86 20880000e00c - 00 0000 STSYM s_g_repeat:S1
899bafeb 2089@end example
e505224d 2090
bf9d2537
DM
2091@node Transformations On Global Variables
2092@subsection Transformations on Global Variables
685a5e86 2093
e505224d 2094Stabs for global variables do not contain location information. In
685a5e86 2095this case, the debugger finds location information in the assembler or
e505224d
PB
2096linker symbol table entry describing the variable. The source line:
2097
899bafeb 2098@example
685a5e86 2099char g_foo = 'c';
899bafeb 2100@end example
e505224d 2101
899bafeb 2102@noindent
e505224d
PB
2103generates the stab:
2104
899bafeb 2105@example
685a5e86 2106.stabs "g_foo:G2",32,0,0,0
899bafeb 2107@end example
e505224d 2108
685a5e86
DM
2109The variable is represented by two symbol table entries in the object
2110file (see below). The first one originated as a stab. The second one
2111is an external symbol. The upper case @samp{D} signifies that the
2112@code{n_type} field of the symbol table contains 7, @code{N_DATA} with
ac31351a
JK
2113local linkage. The stab's value is zero since the value is not used for
2114@code{N_GSYM} stabs. The value of the linker symbol is the relocatable
2115address corresponding to the variable.
e505224d 2116
899bafeb 2117@example
685a5e86
DM
211800000000 - 00 0000 GSYM g_foo:G2
211900000080 D _g_foo
899bafeb 2120@end example
e505224d 2121
899bafeb 2122@noindent
e505224d 2123These entries as transformed by the linker. The linker symbol table
685a5e86 2124entry now holds an absolute address:
e505224d 2125
899bafeb 2126@example
685a5e86 212700000000 - 00 0000 GSYM g_foo:G2
899bafeb 2128@dots{}
685a5e86 21290000e008 D _g_foo
899bafeb 2130@end example
e505224d 2131
cd61aa60
JK
2132@node ELF Transformations
2133@subsection Transformations of Stabs in ELF Files
2134
2135For ELF files, use @code{objdump --stabs} instead of @code{nm} to show
2136the stabs in an object or executable file. @code{objdump} is a GNU
2137utility; Sun does not provide any equivalent.
2138
2139The following example is for a stab whose value is an address is
2140relative to the compilation unit (@pxref{Stabs In ELF}). For example,
2141if the source line
2142
2143@example
2144static int ld = 5;
2145@end example
2146
2147appears within a function, then the assembly language output from the
2148compiler contains:
2149
2150@example
2151.Ddata.data:
2152@dots{}
2153 .stabs "ld:V(0,3)",0x26,0,4,.L18-Ddata.data # @r{0x26 is N_STSYM}
2154@dots{}
2155.L18:
2156 .align 4
2157 .word 0x5
2158@end example
2159
2160Because the value is formed by subtracting one symbol from another, the
2161value is absolute, not relocatable, and so the object file contains
2162
2163@example
2164Symnum n_type n_othr n_desc n_value n_strx String
216531 STSYM 0 4 00000004 680 ld:V(0,3)
2166@end example
2167
2168without any relocations, and the executable file also contains
2169
2170@example
2171Symnum n_type n_othr n_desc n_value n_strx String
217231 STSYM 0 4 00000004 680 ld:V(0,3)
2173@end example
2174
8c59ee11 2175@node Cplusplus
bf9d2537 2176@chapter GNU C++ Stabs
e505224d
PB
2177
2178@menu
bf9d2537
DM
2179* Basic Cplusplus Types::
2180* Simple Classes::
2181* Class Instance::
8eb5e289 2182* Methods:: Method definition
6fe91f2c 2183* Protections::
bf9d2537
DM
2184* Method Modifiers::
2185* Virtual Methods::
6fe91f2c 2186* Inheritence::
bf9d2537
DM
2187* Virtual Base Classes::
2188* Static Members::
e505224d
PB
2189@end menu
2190
6fe91f2c 2191Type descriptors added for C++ descriptions:
e505224d
PB
2192
2193@table @code
2194@item #
6fe91f2c 2195method type (@code{##} if minimal debug)
e505224d 2196
8c59ee11
JK
2197@item @@
2198Member (class and variable) type. It is followed by type information
2199for the offset basetype, a comma, and type information for the type of
2200the field being pointed to. (FIXME: this is acknowledged to be
2201gibberish. Can anyone say what really goes here?).
2202
2203Note that there is a conflict between this and type attributes
bf9d2537 2204(@pxref{String Field}); both use type descriptor @samp{@@}.
8c59ee11
JK
2205Fortunately, the @samp{@@} type descriptor used in this C++ sense always
2206will be followed by a digit, @samp{(}, or @samp{-}, and type attributes
2207never start with those things.
e505224d
PB
2208@end table
2209
bf9d2537
DM
2210@node Basic Cplusplus Types
2211@section Basic Types For C++
e505224d
PB
2212
2213<< the examples that follow are based on a01.C >>
2214
2215
2216C++ adds two more builtin types to the set defined for C. These are
2217the unknown type and the vtable record type. The unknown type, type
221816, is defined in terms of itself like the void type.
2219
2220The vtable record type, type 17, is defined as a structure type and
6fe91f2c 2221then as a structure tag. The structure has four fields: delta, index,
e505224d
PB
2222pfn, and delta2. pfn is the function pointer.
2223
2224<< In boilerplate $vtbl_ptr_type, what are the fields delta,
2225index, and delta2 used for? >>
2226
2227This basic type is present in all C++ programs even if there are no
2228virtual methods defined.
2229
899bafeb 2230@display
e505224d 2231.stabs "struct_name:sym_desc(type)type_def(17)=type_desc(struct)struct_bytes(8)
139741da
RP
2232 elem_name(delta):type_ref(short int),bit_offset(0),field_bits(16);
2233 elem_name(index):type_ref(short int),bit_offset(16),field_bits(16);
2234 elem_name(pfn):type_def(18)=type_desc(ptr to)type_ref(void),
2235 bit_offset(32),field_bits(32);
2236 elem_name(delta2):type_def(short int);bit_offset(32),field_bits(16);;"
2237 N_LSYM, NIL, NIL
899bafeb 2238@end display
6fe91f2c 2239
899bafeb 2240@smallexample
e505224d 2241.stabs "$vtbl_ptr_type:t17=s8
139741da
RP
2242 delta:6,0,16;index:6,16,16;pfn:18=*15,32,32;delta2:6,32,16;;"
2243 ,128,0,0,0
899bafeb 2244@end smallexample
e505224d 2245
899bafeb 2246@display
e505224d 2247.stabs "name:sym_dec(struct tag)type_ref($vtbl_ptr_type)",N_LSYM,NIL,NIL,NIL
899bafeb 2248@end display
e505224d 2249
899bafeb 2250@example
e505224d 2251.stabs "$vtbl_ptr_type:T17",128,0,0,0
899bafeb 2252@end example
e505224d 2253
bf9d2537
DM
2254@node Simple Classes
2255@section Simple Class Definition
e505224d
PB
2256
2257The stabs describing C++ language features are an extension of the
2258stabs describing C. Stabs representing C++ class types elaborate
2259extensively on the stab format used to describe structure types in C.
2260Stabs representing class type variables look just like stabs
2261representing C language variables.
2262
2263Consider the following very simple class definition.
2264
2265@example
2266class baseA @{
2267public:
139741da
RP
2268 int Adat;
2269 int Ameth(int in, char other);
e505224d
PB
2270@};
2271@end example
2272
6fe91f2c 2273The class @code{baseA} is represented by two stabs. The first stab describes
e505224d 2274the class as a structure type. The second stab describes a structure
6fe91f2c 2275tag of the class type. Both stabs are of stab type @code{N_LSYM}. Since the
685a5e86 2276stab is not located between an @code{N_FUN} and an @code{N_LBRAC} stab this indicates
6fe91f2c 2277that the class is defined at file scope. If it were, then the @code{N_LSYM}
e505224d
PB
2278would signify a local variable.
2279
2280A stab describing a C++ class type is similar in format to a stab
2281describing a C struct, with each class member shown as a field in the
2282structure. The part of the struct format describing fields is
2283expanded to include extra information relevent to C++ class members.
2284In addition, if the class has multiple base classes or virtual
2285functions the struct format outside of the field parts is also
2286augmented.
2287
2288In this simple example the field part of the C++ class stab
2289representing member data looks just like the field part of a C struct
2290stab. The section on protections describes how its format is
2291sometimes extended for member data.
2292
2293The field part of a C++ class stab representing a member function
2294differs substantially from the field part of a C struct stab. It
6fe91f2c 2295still begins with @samp{name:} but then goes on to define a new type number
e505224d
PB
2296for the member function, describe its return type, its argument types,
2297its protection level, any qualifiers applied to the method definition,
2298and whether the method is virtual or not. If the method is virtual
2299then the method description goes on to give the vtable index of the
2300method, and the type number of the first base class defining the
6fe91f2c 2301method.
e505224d 2302
dd8126d9
JK
2303When the field name is a method name it is followed by two colons rather
2304than one. This is followed by a new type definition for the method.
2305This is a number followed by an equal sign and the type descriptor
2306@samp{#}, indicating a method type, and a second @samp{#}, indicating
2307that this is the @dfn{minimal} type of method definition used by GCC2,
2308not larger method definitions used by earlier versions of GCC. This is
2309followed by a type reference showing the return type of the method and a
e505224d
PB
2310semi-colon.
2311
dd8126d9
JK
2312The format of an overloaded operator method name differs from that of
2313other methods. It is @samp{op$::@var{operator-name}.} where
2314@var{operator-name} is the operator name such as @samp{+} or @samp{+=}.
2315The name ends with a period, and any characters except the period can
2316occur in the @var{operator-name} string.
e505224d 2317
dd8126d9
JK
2318The next part of the method description represents the arguments to the
2319method, preceeded by a colon and ending with a semi-colon. The types of
2320the arguments are expressed in the same way argument types are expressed
2321in C++ name mangling. In this example an @code{int} and a @code{char}
6fe91f2c 2322map to @samp{ic}.
e505224d
PB
2323
2324This is followed by a number, a letter, and an asterisk or period,
2325followed by another semicolon. The number indicates the protections
2326that apply to the member function. Here the 2 means public. The
2327letter encodes any qualifier applied to the method definition. In
6fe91f2c 2328this case, @samp{A} means that it is a normal function definition. The dot
e505224d
PB
2329shows that the method is not virtual. The sections that follow
2330elaborate further on these fields and describe the additional
2331information present for virtual methods.
2332
2333
899bafeb 2334@display
e505224d 2335.stabs "class_name:sym_desc(type)type_def(20)=type_desc(struct)struct_bytes(4)
139741da 2336 field_name(Adat):type(int),bit_offset(0),field_bits(32);
e505224d 2337
139741da 2338 method_name(Ameth)::type_def(21)=type_desc(method)return_type(int);
6fe91f2c 2339 :arg_types(int char);
139741da
RP
2340 protection(public)qualifier(normal)virtual(no);;"
2341 N_LSYM,NIL,NIL,NIL
899bafeb 2342@end display
e505224d 2343
899bafeb 2344@smallexample
e505224d
PB
2345.stabs "baseA:t20=s4Adat:1,0,32;Ameth::21=##1;:ic;2A.;;",128,0,0,0
2346
2347.stabs "class_name:sym_desc(struct tag)",N_LSYM,NIL,NIL,NIL
2348
2349.stabs "baseA:T20",128,0,0,0
899bafeb 2350@end smallexample
e505224d 2351
bf9d2537
DM
2352@node Class Instance
2353@section Class Instance
e505224d
PB
2354
2355As shown above, describing even a simple C++ class definition is
2356accomplished by massively extending the stab format used in C to
2357describe structure types. However, once the class is defined, C stabs
2358with no modifications can be used to describe class instances. The
2359following source:
2360
2361@example
2362main () @{
139741da 2363 baseA AbaseA;
e505224d
PB
2364@}
2365@end example
2366
899bafeb
RP
2367@noindent
2368yields the following stab describing the class instance. It looks no
e505224d
PB
2369different from a standard C stab describing a local variable.
2370
899bafeb 2371@display
e505224d 2372.stabs "name:type_ref(baseA)", N_LSYM, NIL, NIL, frame_ptr_offset
899bafeb 2373@end display
e505224d 2374
899bafeb 2375@example
e505224d 2376.stabs "AbaseA:20",128,0,0,-20
899bafeb 2377@end example
e505224d 2378
899bafeb 2379@node Methods
9d719a9c 2380@section Method Definition
e505224d
PB
2381
2382The class definition shown above declares Ameth. The C++ source below
2383defines Ameth:
2384
2385@example
6fe91f2c
DM
2386int
2387baseA::Ameth(int in, char other)
e505224d 2388@{
139741da 2389 return in;
e505224d
PB
2390@};
2391@end example
2392
2393
2394This method definition yields three stabs following the code of the
3a642a82
JK
2395method. One stab describes the method itself and following two describe
2396its parameters. Although there is only one formal argument all methods
6fe91f2c 2397have an implicit argument which is the @code{this} pointer. The @code{this}
3a642a82
JK
2398pointer is a pointer to the object on which the method was called. Note
2399that the method name is mangled to encode the class name and argument
2400types. Name mangling is described in the @sc{arm} (@cite{The Annotated
2401C++ Reference Manual}, by Ellis and Stroustrup, @sc{isbn}
24020-201-51459-1); @file{gpcompare.texi} in Cygnus GCC distributions
6fe91f2c 2403describes the differences between GNU mangling and @sc{arm}
3a642a82
JK
2404mangling.
2405@c FIXME: Use @xref, especially if this is generally installed in the
2406@c info tree.
2407@c FIXME: This information should be in a net release, either of GCC or
2408@c GDB. But gpcompare.texi doesn't seem to be in the FSF GCC.
e505224d 2409
612dbd4c 2410@example
e505224d 2411.stabs "name:symbol_desriptor(global function)return_type(int)",
6fe91f2c 2412 N_FUN, NIL, NIL, code_addr_of_method_start
e505224d
PB
2413
2414.stabs "Ameth__5baseAic:F1",36,0,0,_Ameth__5baseAic
612dbd4c 2415@end example
e505224d 2416
6fe91f2c
DM
2417Here is the stab for the @code{this} pointer implicit argument. The
2418name of the @code{this} pointer is always @code{this}. Type 19, the
2419@code{this} pointer is defined as a pointer to type 20, @code{baseA},
2420but a stab defining @code{baseA} has not yet been emited. Since the
2421compiler knows it will be emited shortly, here it just outputs a cross
2422reference to the undefined symbol, by prefixing the symbol name with
2423@samp{xs}.
e505224d 2424
612dbd4c 2425@example
e505224d 2426.stabs "name:sym_desc(register param)type_def(19)=
139741da 2427 type_desc(ptr to)type_ref(baseA)=
6fe91f2c 2428 type_desc(cross-reference to)baseA:",N_RSYM,NIL,NIL,register_number
e505224d 2429
c2dc518b 2430.stabs "this:P19=*20=xsbaseA:",64,0,0,8
612dbd4c 2431@end example
e505224d
PB
2432
2433The stab for the explicit integer argument looks just like a parameter
2434to a C function. The last field of the stab is the offset from the
2435argument pointer, which in most systems is the same as the frame
2436pointer.
2437
612dbd4c 2438@example
e505224d 2439.stabs "name:sym_desc(value parameter)type_ref(int)",
6fe91f2c 2440 N_PSYM,NIL,NIL,offset_from_arg_ptr
e505224d
PB
2441
2442.stabs "in:p1",160,0,0,72
612dbd4c 2443@end example
e505224d
PB
2444
2445<< The examples that follow are based on A1.C >>
2446
899bafeb 2447@node Protections
e505224d
PB
2448@section Protections
2449
2450
2451In the simple class definition shown above all member data and
2452functions were publicly accessable. The example that follows
2453contrasts public, protected and privately accessable fields and shows
2454how these protections are encoded in C++ stabs.
2455
dd8126d9 2456@c FIXME: What does "part of the string" mean?
e505224d 2457Protections for class member data are signified by two characters
dd8126d9
JK
2458embedded in the stab defining the class type. These characters are
2459located after the name: part of the string. @samp{/0} means private,
2460@samp{/1} means protected, and @samp{/2} means public. If these
2461characters are omited this means that the member is public. The
2462following C++ source:
e505224d
PB
2463
2464@example
2465class all_data @{
6fe91f2c 2466private:
139741da 2467 int priv_dat;
e505224d 2468protected:
139741da 2469 char prot_dat;
e505224d 2470public:
139741da 2471 float pub_dat;
e505224d
PB
2472@};
2473@end example
2474
899bafeb 2475@noindent
e505224d
PB
2476generates the following stab to describe the class type all_data.
2477
899bafeb 2478@display
e505224d 2479.stabs "class_name:sym_desc(type)type_def(19)=type_desc(struct)struct_bytes
139741da
RP
2480 data_name:/protection(private)type_ref(int),bit_offset,num_bits;
2481 data_name:/protection(protected)type_ref(char),bit_offset,num_bits;
2482 data_name:(/num omited, private)type_ref(float),bit_offset,num_bits;;"
2483 N_LSYM,NIL,NIL,NIL
899bafeb 2484@end display
e505224d 2485
899bafeb 2486@smallexample
e505224d 2487.stabs "all_data:t19=s12
139741da 2488 priv_dat:/01,0,32;prot_dat:/12,32,8;pub_dat:12,64,32;;",128,0,0,0
899bafeb 2489@end smallexample
e505224d
PB
2490
2491Protections for member functions are signified by one digit embeded in
2492the field part of the stab describing the method. The digit is 0 if
2493private, 1 if protected and 2 if public. Consider the C++ class
2494definition below:
2495
2496@example
2497class all_methods @{
2498private:
139741da 2499 int priv_meth(int in)@{return in;@};
e505224d 2500protected:
139741da 2501 char protMeth(char in)@{return in;@};
e505224d 2502public:
139741da 2503 float pubMeth(float in)@{return in;@};
e505224d
PB
2504@};
2505@end example
2506
2507It generates the following stab. The digit in question is to the left
6fe91f2c 2508of an @samp{A} in each case. Notice also that in this case two symbol
e505224d
PB
2509descriptors apply to the class name struct tag and struct type.
2510
899bafeb 2511@display
e505224d 2512.stabs "class_name:sym_desc(struct tag&type)type_def(21)=
139741da
RP
2513 sym_desc(struct)struct_bytes(1)
2514 meth_name::type_def(22)=sym_desc(method)returning(int);
2515 :args(int);protection(private)modifier(normal)virtual(no);
2516 meth_name::type_def(23)=sym_desc(method)returning(char);
2517 :args(char);protection(protected)modifier(normal)virual(no);
2518 meth_name::type_def(24)=sym_desc(method)returning(float);
2519 :args(float);protection(public)modifier(normal)virtual(no);;",
2520 N_LSYM,NIL,NIL,NIL
899bafeb 2521@end display
6fe91f2c 2522
899bafeb 2523@smallexample
e505224d 2524.stabs "all_methods:Tt21=s1priv_meth::22=##1;:i;0A.;protMeth::23=##2;:c;1A.;
139741da 2525 pubMeth::24=##12;:f;2A.;;",128,0,0,0
899bafeb 2526@end smallexample
e505224d 2527
bf9d2537
DM
2528@node Method Modifiers
2529@section Method Modifiers (@code{const}, @code{volatile}, @code{const volatile})
e505224d
PB
2530
2531<< based on a6.C >>
2532
2533In the class example described above all the methods have the normal
2534modifier. This method modifier information is located just after the
2535protection information for the method. This field has four possible
6fe91f2c
DM
2536character values. Normal methods use @samp{A}, const methods use
2537@samp{B}, volatile methods use @samp{C}, and const volatile methods use
2538@samp{D}. Consider the class definition below:
e505224d
PB
2539
2540@example
2541class A @{
2542public:
139741da
RP
2543 int ConstMeth (int arg) const @{ return arg; @};
2544 char VolatileMeth (char arg) volatile @{ return arg; @};
2545 float ConstVolMeth (float arg) const volatile @{return arg; @};
e505224d
PB
2546@};
2547@end example
2548
2549This class is described by the following stab:
2550
899bafeb 2551@display
e505224d 2552.stabs "class(A):sym_desc(struct)type_def(20)=type_desc(struct)struct_bytes(1)
139741da
RP
2553 meth_name(ConstMeth)::type_def(21)sym_desc(method)
2554 returning(int);:arg(int);protection(public)modifier(const)virtual(no);
2555 meth_name(VolatileMeth)::type_def(22)=sym_desc(method)
2556 returning(char);:arg(char);protection(public)modifier(volatile)virt(no)
2557 meth_name(ConstVolMeth)::type_def(23)=sym_desc(method)
2558 returning(float);:arg(float);protection(public)modifer(const volatile)
2559 virtual(no);;", @dots{}
899bafeb 2560@end display
6fe91f2c 2561
899bafeb 2562@example
e505224d 2563.stabs "A:T20=s1ConstMeth::21=##1;:i;2B.;VolatileMeth::22=##2;:c;2C.;
139741da 2564 ConstVolMeth::23=##12;:f;2D.;;",128,0,0,0
612dbd4c 2565@end example
e505224d 2566
bf9d2537
DM
2567@node Virtual Methods
2568@section Virtual Methods
e505224d 2569
6fe91f2c 2570<< The following examples are based on a4.C >>
e505224d
PB
2571
2572The presence of virtual methods in a class definition adds additional
2573data to the class description. The extra data is appended to the
2574description of the virtual method and to the end of the class
2575description. Consider the class definition below:
2576
2577@example
2578class A @{
2579public:
139741da
RP
2580 int Adat;
2581 virtual int A_virt (int arg) @{ return arg; @};
e505224d
PB
2582@};
2583@end example
6fe91f2c 2584
e505224d
PB
2585This results in the stab below describing class A. It defines a new
2586type (20) which is an 8 byte structure. The first field of the class
6fe91f2c
DM
2587struct is @samp{Adat}, an integer, starting at structure offset 0 and
2588occupying 32 bits.
e505224d
PB
2589
2590The second field in the class struct is not explicitly defined by the
2591C++ class definition but is implied by the fact that the class
2592contains a virtual method. This field is the vtable pointer. The
6fe91f2c 2593name of the vtable pointer field starts with @samp{$vf} and continues with a
e505224d
PB
2594type reference to the class it is part of. In this example the type
2595reference for class A is 20 so the name of its vtable pointer field is
6fe91f2c 2596@samp{$vf20}, followed by the usual colon.
e505224d
PB
2597
2598Next there is a type definition for the vtable pointer type (21).
6fe91f2c 2599This is in turn defined as a pointer to another new type (22).
e505224d
PB
2600
2601Type 22 is the vtable itself, which is defined as an array, indexed by
6aa83a79
JG
2602a range of integers between 0 and 1, and whose elements are of type
260317. Type 17 was the vtable record type defined by the boilerplate C++
2604type definitions, as shown earlier.
e505224d
PB
2605
2606The bit offset of the vtable pointer field is 32. The number of bits
2607in the field are not specified when the field is a vtable pointer.
6fe91f2c
DM
2608
2609Next is the method definition for the virtual member function @code{A_virt}.
e505224d
PB
2610Its description starts out using the same format as the non-virtual
2611member functions described above, except instead of a dot after the
6fe91f2c 2612@samp{A} there is an asterisk, indicating that the function is virtual.
e505224d 2613Since is is virtual some addition information is appended to the end
6fe91f2c 2614of the method description.
e505224d
PB
2615
2616The first number represents the vtable index of the method. This is a
261732 bit unsigned number with the high bit set, followed by a
2618semi-colon.
2619
2620The second number is a type reference to the first base class in the
2621inheritence hierarchy defining the virtual member function. In this
2622case the class stab describes a base class so the virtual function is
2623not overriding any other definition of the method. Therefore the
2624reference is to the type number of the class that the stab is
6fe91f2c 2625describing (20).
e505224d
PB
2626
2627This is followed by three semi-colons. One marks the end of the
2628current sub-section, one marks the end of the method field, and the
2629third marks the end of the struct definition.
2630
2631For classes containing virtual functions the very last section of the
2632string part of the stab holds a type reference to the first base
6fe91f2c 2633class. This is preceeded by @samp{~%} and followed by a final semi-colon.
e505224d 2634
899bafeb 2635@display
e505224d 2636.stabs "class_name(A):type_def(20)=sym_desc(struct)struct_bytes(8)
139741da
RP
2637 field_name(Adat):type_ref(int),bit_offset(0),field_bits(32);
2638 field_name(A virt func ptr):type_def(21)=type_desc(ptr to)type_def(22)=
6aa83a79 2639 sym_desc(array)index_type_ref(range of int from 0 to 1);
6fe91f2c 2640 elem_type_ref(vtbl elem type),
139741da
RP
2641 bit_offset(32);
2642 meth_name(A_virt)::typedef(23)=sym_desc(method)returning(int);
2643 :arg_type(int),protection(public)normal(yes)virtual(yes)
2644 vtable_index(1);class_first_defining(A);;;~%first_base(A);",
2645 N_LSYM,NIL,NIL,NIL
899bafeb 2646@end display
e505224d 2647
3d4cf720 2648@c FIXME: bogus line break.
899bafeb 2649@example
3d4cf720 2650.stabs "A:t20=s8Adat:1,0,32;$vf20:21=*22=ar1;0;1;17,32;
6fe91f2c 2651 A_virt::23=##1;:i;2A*-2147483647;20;;;~%20;",128,0,0,0
612dbd4c 2652@end example
e505224d 2653
2dd00294
JG
2654@node Inheritence
2655@section Inheritence
e505224d
PB
2656
2657Stabs describing C++ derived classes include additional sections that
2658describe the inheritence hierarchy of the class. A derived class stab
2659also encodes the number of base classes. For each base class it tells
2660if the base class is virtual or not, and if the inheritence is private
2661or public. It also gives the offset into the object of the portion of
6fe91f2c 2662the object corresponding to each base class.
e505224d
PB
2663
2664This additional information is embeded in the class stab following the
2665number of bytes in the struct. First the number of base classes
6fe91f2c 2666appears bracketed by an exclamation point and a comma.
e505224d
PB
2667
2668Then for each base type there repeats a series: two digits, a number,
6fe91f2c 2669a comma, another number, and a semi-colon.
e505224d
PB
2670
2671The first of the two digits is 1 if the base class is virtual and 0 if
2672not. The second digit is 2 if the derivation is public and 0 if not.
2673
2674The number following the first two digits is the offset from the start
6fe91f2c 2675of the object to the part of the object pertaining to the base class.
e505224d
PB
2676
2677After the comma, the second number is a type_descriptor for the base
2678type. Finally a semi-colon ends the series, which repeats for each
2679base class.
2680
6fe91f2c
DM
2681The source below defines three base classes @code{A}, @code{B}, and
2682@code{C} and the derived class @code{D}.
e505224d
PB
2683
2684
2685@example
2686class A @{
2687public:
139741da
RP
2688 int Adat;
2689 virtual int A_virt (int arg) @{ return arg; @};
e505224d
PB
2690@};
2691
2692class B @{
2693public:
6fe91f2c 2694 int B_dat;
139741da 2695 virtual int B_virt (int arg) @{return arg; @};
6fe91f2c 2696@};
e505224d
PB
2697
2698class C @{
6fe91f2c 2699public:
139741da 2700 int Cdat;
6fe91f2c 2701 virtual int C_virt (int arg) @{return arg; @};
e505224d
PB
2702@};
2703
2704class D : A, virtual B, public C @{
2705public:
139741da
RP
2706 int Ddat;
2707 virtual int A_virt (int arg ) @{ return arg+1; @};
2708 virtual int B_virt (int arg) @{ return arg+2; @};
2709 virtual int C_virt (int arg) @{ return arg+3; @};
2710 virtual int D_virt (int arg) @{ return arg; @};
e505224d
PB
2711@};
2712@end example
2713
2714Class stabs similar to the ones described earlier are generated for
6fe91f2c 2715each base class.
e505224d 2716
5bc927fb
RP
2717@c FIXME!!! the linebreaks in the following example probably make the
2718@c examples literally unusable, but I don't know any other way to get
2719@c them on the page.
63cef7d7
JK
2720@c One solution would be to put some of the type definitions into
2721@c separate stabs, even if that's not exactly what the compiler actually
2722@c emits.
899bafeb 2723@smallexample
5bc927fb
RP
2724.stabs "A:T20=s8Adat:1,0,32;$vf20:21=*22=ar1;0;1;17,32;
2725 A_virt::23=##1;:i;2A*-2147483647;20;;;~%20;",128,0,0,0
e505224d 2726
5bc927fb
RP
2727.stabs "B:Tt25=s8Bdat:1,0,32;$vf25:21,32;B_virt::26=##1;
2728 :i;2A*-2147483647;25;;;~%25;",128,0,0,0
e505224d 2729
5bc927fb
RP
2730.stabs "C:Tt28=s8Cdat:1,0,32;$vf28:21,32;C_virt::29=##1;
2731 :i;2A*-2147483647;28;;;~%28;",128,0,0,0
899bafeb 2732@end smallexample
e505224d 2733
6fe91f2c 2734In the stab describing derived class @code{D} below, the information about
e505224d
PB
2735the derivation of this class is encoded as follows.
2736
899bafeb 2737@display
e505224d 2738.stabs "derived_class_name:symbol_descriptors(struct tag&type)=
139741da
RP
2739 type_descriptor(struct)struct_bytes(32)!num_bases(3),
2740 base_virtual(no)inheritence_public(no)base_offset(0),
2741 base_class_type_ref(A);
2742 base_virtual(yes)inheritence_public(no)base_offset(NIL),
2743 base_class_type_ref(B);
2744 base_virtual(no)inheritence_public(yes)base_offset(64),
2745 base_class_type_ref(C); @dots{}
899bafeb 2746@end display
6fe91f2c 2747
5bc927fb 2748@c FIXME! fake linebreaks.
899bafeb 2749@smallexample
5bc927fb
RP
2750.stabs "D:Tt31=s32!3,000,20;100,25;0264,28;$vb25:24,128;Ddat:
2751 1,160,32;A_virt::32=##1;:i;2A*-2147483647;20;;B_virt:
2752 :32:i;2A*-2147483647;25;;C_virt::32:i;2A*-2147483647;
2753 28;;D_virt::32:i;2A*-2147483646;31;;;~%20;",128,0,0,0
899bafeb 2754@end smallexample
e505224d 2755
bf9d2537
DM
2756@node Virtual Base Classes
2757@section Virtual Base Classes
e505224d 2758
dd8126d9
JK
2759A derived class object consists of a concatination in memory of the data
2760areas defined by each base class, starting with the leftmost and ending
2761with the rightmost in the list of base classes. The exception to this
2762rule is for virtual inheritence. In the example above, class @code{D}
2763inherits virtually from base class @code{B}. This means that an
2764instance of a @code{D} object will not contain its own @code{B} part but
2765merely a pointer to a @code{B} part, known as a virtual base pointer.
e505224d
PB
2766
2767In a derived class stab, the base offset part of the derivation
2768information, described above, shows how the base class parts are
dd8126d9
JK
2769ordered. The base offset for a virtual base class is always given as 0.
2770Notice that the base offset for @code{B} is given as 0 even though
2771@code{B} is not the first base class. The first base class @code{A}
2772starts at offset 0.
e505224d 2773
6fe91f2c
DM
2774The field information part of the stab for class @code{D} describes the field
2775which is the pointer to the virtual base class @code{B}. The vbase pointer
2776name is @samp{$vb} followed by a type reference to the virtual base class.
2777Since the type id for @code{B} in this example is 25, the vbase pointer name
2778is @samp{$vb25}.
e505224d 2779
5bc927fb 2780@c FIXME!! fake linebreaks below
899bafeb 2781@smallexample
5bc927fb
RP
2782.stabs "D:Tt31=s32!3,000,20;100,25;0264,28;$vb25:24,128;Ddat:1,
2783 160,32;A_virt::32=##1;:i;2A*-2147483647;20;;B_virt::32:i;
2784 2A*-2147483647;25;;C_virt::32:i;2A*-2147483647;28;;D_virt:
2785 :32:i;2A*-2147483646;31;;;~%20;",128,0,0,0
899bafeb 2786@end smallexample
e505224d
PB
2787
2788Following the name and a semicolon is a type reference describing the
2789type of the virtual base class pointer, in this case 24. Type 24 was
6fe91f2c
DM
2790defined earlier as the type of the @code{B} class @code{this} pointer. The
2791@code{this} pointer for a class is a pointer to the class type.
e505224d 2792
899bafeb 2793@example
c2dc518b 2794.stabs "this:P24=*25=xsB:",64,0,0,8
899bafeb 2795@end example
e505224d
PB
2796
2797Finally the field offset part of the vbase pointer field description
6fe91f2c
DM
2798shows that the vbase pointer is the first field in the @code{D} object,
2799before any data fields defined by the class. The layout of a @code{D}
2800class object is a follows, @code{Adat} at 0, the vtable pointer for
2801@code{A} at 32, @code{Cdat} at 64, the vtable pointer for C at 96, the
2802virtual base pointer for @code{B} at 128, and @code{Ddat} at 160.
e505224d
PB
2803
2804
bf9d2537
DM
2805@node Static Members
2806@section Static Members
e505224d 2807
446e5d80
JG
2808The data area for a class is a concatenation of the space used by the
2809data members of the class. If the class has virtual methods, a vtable
e505224d 2810pointer follows the class data. The field offset part of each field
446e5d80 2811description in the class stab shows this ordering.
e505224d 2812
446e5d80 2813<< How is this reflected in stabs? See Cygnus bug #677 for some info. >>
e505224d 2814
bf9d2537
DM
2815@node Stab Types
2816@appendix Table of Stab Types
e505224d 2817
0a95c18c
JK
2818The following are all the possible values for the stab type field, for
2819@code{a.out} files, in numeric order. This does not apply to XCOFF, but
2820it does apply to stabs in ELF. Stabs in ECOFF use these values but add
28210x8f300 to distinguish them from non-stab symbols.
e505224d 2822
6fe91f2c
DM
2823The symbolic names are defined in the file @file{include/aout/stabs.def}.
2824
2825@menu
bf9d2537
DM
2826* Non-Stab Symbol Types:: Types from 0 to 0x1f
2827* Stab Symbol Types:: Types from 0x20 to 0xff
6fe91f2c
DM
2828@end menu
2829
bf9d2537
DM
2830@node Non-Stab Symbol Types
2831@appendixsec Non-Stab Symbol Types
6fe91f2c
DM
2832
2833The following types are used by the linker and assembler, not by stab
2834directives. Since this document does not attempt to describe aspects of
2835object file format other than the debugging format, no details are
2836given.
e505224d 2837
3d4cf720
JK
2838@c Try to get most of these to fit on a single line.
2839@iftex
2840@tableindent=1.5in
2841@end iftex
e505224d 2842
3d4cf720 2843@table @code
6fe91f2c 2844@item 0x0 N_UNDF
3d4cf720 2845Undefined symbol
e505224d 2846
6fe91f2c 2847@item 0x2 N_ABS
3d4cf720 2848File scope absolute symbol
e505224d 2849
6fe91f2c 2850@item 0x3 N_ABS | N_EXT
3d4cf720
JK
2851External absolute symbol
2852
6fe91f2c 2853@item 0x4 N_TEXT
3d4cf720
JK
2854File scope text symbol
2855
6fe91f2c 2856@item 0x5 N_TEXT | N_EXT
3d4cf720
JK
2857External text symbol
2858
6fe91f2c 2859@item 0x6 N_DATA
3d4cf720
JK
2860File scope data symbol
2861
6fe91f2c 2862@item 0x7 N_DATA | N_EXT
3d4cf720
JK
2863External data symbol
2864
6fe91f2c 2865@item 0x8 N_BSS
3d4cf720
JK
2866File scope BSS symbol
2867
6fe91f2c 2868@item 0x9 N_BSS | N_EXT
3d4cf720
JK
2869External BSS symbol
2870
6fe91f2c
DM
2871@item 0x0c N_FN_SEQ
2872Same as @code{N_FN}, for Sequent compilers
3d4cf720 2873
6fe91f2c 2874@item 0x0a N_INDR
3d4cf720
JK
2875Symbol is indirected to another symbol
2876
6fe91f2c 2877@item 0x12 N_COMM
dd8126d9 2878Common---visible after shared library dynamic link
3d4cf720 2879
6fe91f2c 2880@item 0x14 N_SETA
3d4cf720
JK
2881Absolute set element
2882
6fe91f2c 2883@item 0x16 N_SETT
3d4cf720
JK
2884Text segment set element
2885
6fe91f2c 2886@item 0x18 N_SETD
3d4cf720
JK
2887Data segment set element
2888
6fe91f2c 2889@item 0x1a N_SETB
3d4cf720
JK
2890BSS segment set element
2891
6fe91f2c 2892@item 0x1c N_SETV
3d4cf720
JK
2893Pointer to set vector
2894
6fe91f2c 2895@item 0x1e N_WARNING
3d4cf720
JK
2896Print a warning message during linking
2897
6fe91f2c
DM
2898@item 0x1f N_FN
2899File name of a @file{.o} file
3d4cf720
JK
2900@end table
2901
bf9d2537
DM
2902@node Stab Symbol Types
2903@appendixsec Stab Symbol Types
6fe91f2c 2904
3d4cf720
JK
2905The following symbol types indicate that this is a stab. This is the
2906full list of stab numbers, including stab types that are used in
2907languages other than C.
2908
2909@table @code
2910@item 0x20 N_GSYM
bf9d2537 2911Global symbol; see @ref{Global Variables}.
3d4cf720
JK
2912
2913@item 0x22 N_FNAME
43603088 2914Function name (for BSD Fortran); see @ref{Procedures}.
3d4cf720 2915
24dcc707
JK
2916@item 0x24 N_FUN
2917Function name (@pxref{Procedures}) or text segment variable
2918(@pxref{Statics}).
3d4cf720 2919
24dcc707 2920@item 0x26 N_STSYM
6fe91f2c 2921Data segment file-scope variable; see @ref{Statics}.
3d4cf720 2922
24dcc707 2923@item 0x28 N_LCSYM
6fe91f2c 2924BSS segment file-scope variable; see @ref{Statics}.
3d4cf720 2925
6fe91f2c 2926@item 0x2a N_MAIN
bf9d2537 2927Name of main routine; see @ref{Main Program}.
3d4cf720 2928
ded6bcab 2929@item 0x2c N_ROSYM
6fe91f2c 2930Variable in @code{.rodata} section; see @ref{Statics}.
ded6bcab 2931
6fe91f2c
DM
2932@item 0x30 N_PC
2933Global symbol (for Pascal); see @ref{N_PC}.
3d4cf720 2934
6fe91f2c
DM
2935@item 0x32 N_NSYMS
2936Number of symbols (according to Ultrix V4.0); see @ref{N_NSYMS}.
3d4cf720 2937
6fe91f2c
DM
2938@item 0x34 N_NOMAP
2939No DST map; see @ref{N_NOMAP}.
3d4cf720 2940
ded6bcab
JK
2941@c FIXME: describe this solaris feature in the body of the text (see
2942@c comments in include/aout/stab.def).
2943@item 0x38 N_OBJ
2944Object file (Solaris2).
2945
2946@c See include/aout/stab.def for (a little) more info.
2947@item 0x3c N_OPT
2948Debugger options (Solaris2).
2949
6fe91f2c 2950@item 0x40 N_RSYM
bf9d2537 2951Register variable; see @ref{Register Variables}.
3d4cf720 2952
6fe91f2c
DM
2953@item 0x42 N_M2C
2954Modula-2 compilation unit; see @ref{N_M2C}.
3d4cf720 2955
6fe91f2c 2956@item 0x44 N_SLINE
bf9d2537 2957Line number in text segment; see @ref{Line Numbers}.
3d4cf720 2958
6fe91f2c 2959@item 0x46 N_DSLINE
bf9d2537 2960Line number in data segment; see @ref{Line Numbers}.
3d4cf720 2961
6fe91f2c 2962@item 0x48 N_BSLINE
bf9d2537 2963Line number in bss segment; see @ref{Line Numbers}.
3d4cf720 2964
6fe91f2c
DM
2965@item 0x48 N_BROWS
2966Sun source code browser, path to @file{.cb} file; see @ref{N_BROWS}.
3d4cf720 2967
6fe91f2c
DM
2968@item 0x4a N_DEFD
2969GNU Modula2 definition module dependency; see @ref{N_DEFD}.
3d4cf720 2970
ded6bcab
JK
2971@item 0x4c N_FLINE
2972Function start/body/end line numbers (Solaris2).
2973
6fe91f2c
DM
2974@item 0x50 N_EHDECL
2975GNU C++ exception variable; see @ref{N_EHDECL}.
3d4cf720 2976
6fe91f2c
DM
2977@item 0x50 N_MOD2
2978Modula2 info "for imc" (according to Ultrix V4.0); see @ref{N_MOD2}.
3d4cf720 2979
6fe91f2c
DM
2980@item 0x54 N_CATCH
2981GNU C++ @code{catch} clause; see @ref{N_CATCH}.
3d4cf720 2982
6fe91f2c
DM
2983@item 0x60 N_SSYM
2984Structure of union element; see @ref{N_SSYM}.
3d4cf720 2985
ded6bcab
JK
2986@item 0x62 N_ENDM
2987Last stab for module (Solaris2).
2988
6fe91f2c 2989@item 0x64 N_SO
bf9d2537 2990Path and name of source file; see @ref{Source Files}.
3d4cf720 2991
935d305d 2992@item 0x80 N_LSYM
bf9d2537 2993Stack variable (@pxref{Stack Variables}) or type (@pxref{Typedefs}).
3d4cf720 2994
6fe91f2c 2995@item 0x82 N_BINCL
bf9d2537 2996Beginning of an include file (Sun only); see @ref{Include Files}.
3d4cf720 2997
6fe91f2c 2998@item 0x84 N_SOL
bf9d2537 2999Name of include file; see @ref{Include Files}.
3d4cf720 3000
6fe91f2c
DM
3001@item 0xa0 N_PSYM
3002Parameter variable; see @ref{Parameters}.
3d4cf720 3003
6fe91f2c 3004@item 0xa2 N_EINCL
bf9d2537 3005End of an include file; see @ref{Include Files}.
3d4cf720 3006
6fe91f2c
DM
3007@item 0xa4 N_ENTRY
3008Alternate entry point; see @ref{N_ENTRY}.
3d4cf720 3009
6fe91f2c 3010@item 0xc0 N_LBRAC
bf9d2537 3011Beginning of a lexical block; see @ref{Block Structure}.
3d4cf720 3012
6fe91f2c 3013@item 0xc2 N_EXCL
bf9d2537 3014Place holder for a deleted include file; see @ref{Include Files}.
3d4cf720 3015
6fe91f2c
DM
3016@item 0xc4 N_SCOPE
3017Modula2 scope information (Sun linker); see @ref{N_SCOPE}.
3d4cf720 3018
6fe91f2c 3019@item 0xe0 N_RBRAC
bf9d2537 3020End of a lexical block; see @ref{Block Structure}.
3d4cf720 3021
6fe91f2c 3022@item 0xe2 N_BCOMM
bf9d2537 3023Begin named common block; see @ref{Common Blocks}.
3d4cf720 3024
6fe91f2c 3025@item 0xe4 N_ECOMM
bf9d2537 3026End named common block; see @ref{Common Blocks}.
3d4cf720 3027
6fe91f2c 3028@item 0xe8 N_ECOML
bf9d2537 3029Member of a common block; see @ref{Common Blocks}.
3d4cf720 3030
ded6bcab
JK
3031@c FIXME: How does this really work? Move it to main body of document.
3032@item 0xea N_WITH
3033Pascal @code{with} statement: type,,0,0,offset (Solaris2).
3034
6fe91f2c
DM
3035@item 0xf0 N_NBTEXT
3036Gould non-base registers; see @ref{Gould}.
3d4cf720 3037
6fe91f2c
DM
3038@item 0xf2 N_NBDATA
3039Gould non-base registers; see @ref{Gould}.
3d4cf720
JK
3040
3041@item 0xf4 N_NBBSS
6fe91f2c 3042Gould non-base registers; see @ref{Gould}.
3d4cf720 3043
6fe91f2c
DM
3044@item 0xf6 N_NBSTS
3045Gould non-base registers; see @ref{Gould}.
3d4cf720 3046
6fe91f2c
DM
3047@item 0xf8 N_NBLCS
3048Gould non-base registers; see @ref{Gould}.
3d4cf720
JK
3049@end table
3050
3051@c Restore the default table indent
3052@iftex
3053@tableindent=.8in
3054@end iftex
e505224d 3055
bf9d2537
DM
3056@node Symbol Descriptors
3057@appendix Table of Symbol Descriptors
e505224d 3058
0a95c18c 3059The symbol descriptor is the character which follows the colon in many
bf9d2537 3060stabs, and which tells what kind of stab it is. @xref{String Field},
0a95c18c 3061for more information about their use.
6fe91f2c 3062
ed9708e2 3063@c Please keep this alphabetical
497e44a5 3064@table @code
466bdeb2
JK
3065@c In TeX, this looks great, digit is in italics. But makeinfo insists
3066@c on putting it in `', not realizing that @var should override @code.
3067@c I don't know of any way to make makeinfo do the right thing. Seems
3068@c like a makeinfo bug to me.
3069@item @var{digit}
8c59ee11
JK
3070@itemx (
3071@itemx -
bf9d2537 3072Variable on the stack; see @ref{Stack Variables}.
497e44a5 3073
6897f9ec 3074@item a
bf9d2537 3075Parameter passed by reference in register; see @ref{Reference Parameters}.
6897f9ec 3076
408f6c34 3077@item b
f19027a6 3078Based variable; see @ref{Based Variables}.
408f6c34 3079
6897f9ec 3080@item c
6fe91f2c 3081Constant; see @ref{Constants}.
6897f9ec 3082
ed9708e2 3083@item C
43603088 3084Conformant array bound (Pascal, maybe other languages); @ref{Conformant
bf9d2537 3085Arrays}. Name of a caught exception (GNU C++). These can be
685a5e86 3086distinguished because the latter uses @code{N_CATCH} and the former uses
8c59ee11 3087another symbol type.
6897f9ec
JK
3088
3089@item d
bf9d2537 3090Floating point register variable; see @ref{Register Variables}.
6897f9ec
JK
3091
3092@item D
bf9d2537 3093Parameter in floating point register; see @ref{Register Parameters}.
ed9708e2 3094
497e44a5 3095@item f
6fe91f2c 3096File scope function; see @ref{Procedures}.
497e44a5
JK
3097
3098@item F
6fe91f2c 3099Global function; see @ref{Procedures}.
497e44a5 3100
497e44a5 3101@item G
bf9d2537 3102Global variable; see @ref{Global Variables}.
497e44a5 3103
ed9708e2 3104@item i
bf9d2537 3105@xref{Register Parameters}.
ed9708e2 3106
6897f9ec 3107@item I
bf9d2537 3108Internal (nested) procedure; see @ref{Nested Procedures}.
6897f9ec
JK
3109
3110@item J
bf9d2537 3111Internal (nested) function; see @ref{Nested Procedures}.
6897f9ec
JK
3112
3113@item L
3114Label name (documented by AIX, no further information known).
3115
3116@item m
6fe91f2c 3117Module; see @ref{Procedures}.
6897f9ec 3118
ed9708e2 3119@item p
6fe91f2c 3120Argument list parameter; see @ref{Parameters}.
ed9708e2
JK
3121
3122@item pP
3123@xref{Parameters}.
3124
3125@item pF
6fe91f2c 3126Fortran Function parameter; see @ref{Parameters}.
ed9708e2
JK
3127
3128@item P
1a8b5668
JK
3129Unfortunately, three separate meanings have been independently invented
3130for this symbol descriptor. At least the GNU and Sun uses can be
3131distinguished by the symbol type. Global Procedure (AIX) (symbol type
685a5e86
DM
3132used unknown); see @ref{Procedures}. Register parameter (GNU) (symbol
3133type @code{N_PSYM}); see @ref{Parameters}. Prototype of function
3134referenced by this file (Sun @code{acc}) (symbol type @code{N_FUN}).
6897f9ec
JK
3135
3136@item Q
6fe91f2c 3137Static Procedure; see @ref{Procedures}.
6897f9ec
JK
3138
3139@item R
bf9d2537 3140Register parameter; see @ref{Register Parameters}.
ed9708e2 3141
497e44a5 3142@item r
bf9d2537 3143Register variable; see @ref{Register Variables}.
497e44a5
JK
3144
3145@item S
6fe91f2c 3146File scope variable; see @ref{Statics}.
497e44a5 3147
ed9708e2 3148@item t
6fe91f2c 3149Type name; see @ref{Typedefs}.
ed9708e2
JK
3150
3151@item T
685a5e86 3152Enumeration, structure, or union tag; see @ref{Typedefs}.
ed9708e2
JK
3153
3154@item v
bf9d2537 3155Parameter passed by reference; see @ref{Reference Parameters}.
ed9708e2 3156
497e44a5 3157@item V
6fe91f2c 3158Procedure scope static variable; see @ref{Statics}.
497e44a5 3159
6897f9ec 3160@item x
bf9d2537 3161Conformant array; see @ref{Conformant Arrays}.
6897f9ec 3162
ed9708e2 3163@item X
6fe91f2c 3164Function return variable; see @ref{Parameters}.
497e44a5 3165@end table
e505224d 3166
bf9d2537
DM
3167@node Type Descriptors
3168@appendix Table of Type Descriptors
e505224d 3169
0a95c18c
JK
3170The type descriptor is the character which follows the type number and
3171an equals sign. It specifies what kind of type is being defined.
bf9d2537 3172@xref{String Field}, for more information about their use.
6fe91f2c 3173
6897f9ec 3174@table @code
8c59ee11
JK
3175@item @var{digit}
3176@itemx (
bf9d2537 3177Type reference; see @ref{String Field}.
8c59ee11
JK
3178
3179@item -
bf9d2537 3180Reference to builtin type; see @ref{Negative Type Numbers}.
8c59ee11
JK
3181
3182@item #
6fe91f2c 3183Method (C++); see @ref{Cplusplus}.
6897f9ec
JK
3184
3185@item *
bf9d2537 3186Pointer; see @ref{Miscellaneous Types}.
8c59ee11
JK
3187
3188@item &
3189Reference (C++).
6897f9ec
JK
3190
3191@item @@
bf9d2537 3192Type Attributes (AIX); see @ref{String Field}. Member (class and variable)
6fe91f2c 3193type (GNU C++); see @ref{Cplusplus}.
e505224d 3194
6897f9ec 3195@item a
6fe91f2c 3196Array; see @ref{Arrays}.
8c59ee11
JK
3197
3198@item A
6fe91f2c 3199Open array; see @ref{Arrays}.
8c59ee11
JK
3200
3201@item b
bf9d2537
DM
3202Pascal space type (AIX); see @ref{Miscellaneous Types}. Builtin integer
3203type (Sun); see @ref{Builtin Type Descriptors}.
8c59ee11
JK
3204
3205@item B
bf9d2537 3206Volatile-qualified type; see @ref{Miscellaneous Types}.
8c59ee11
JK
3207
3208@item c
bf9d2537 3209Complex builtin type; see @ref{Builtin Type Descriptors}.
8c59ee11
JK
3210
3211@item C
3212COBOL Picture type. See AIX documentation for details.
3213
3214@item d
bf9d2537 3215File type; see @ref{Miscellaneous Types}.
8c59ee11
JK
3216
3217@item D
6fe91f2c 3218N-dimensional dynamic array; see @ref{Arrays}.
6897f9ec
JK
3219
3220@item e
6fe91f2c 3221Enumeration type; see @ref{Enumerations}.
8c59ee11
JK
3222
3223@item E
6fe91f2c 3224N-dimensional subarray; see @ref{Arrays}.
6897f9ec
JK
3225
3226@item f
bf9d2537 3227Function type; see @ref{Function Types}.
a03f27c3
JK
3228
3229@item F
bf9d2537 3230Pascal function parameter; see @ref{Function Types}
8c59ee11
JK
3231
3232@item g
bf9d2537 3233Builtin floating point type; see @ref{Builtin Type Descriptors}.
8c59ee11
JK
3234
3235@item G
3236COBOL Group. See AIX documentation for details.
3237
3238@item i
bf9d2537 3239Imported type; see @ref{Cross-References}.
8c59ee11
JK
3240
3241@item k
bf9d2537 3242Const-qualified type; see @ref{Miscellaneous Types}.
8c59ee11
JK
3243
3244@item K
3245COBOL File Descriptor. See AIX documentation for details.
3246
a03f27c3 3247@item M
bf9d2537 3248Multiple instance type; see @ref{Miscellaneous Types}.
a03f27c3 3249
8c59ee11 3250@item n
6fe91f2c 3251String type; see @ref{Strings}.
8c59ee11
JK
3252
3253@item N
6fe91f2c 3254Stringptr; see @ref{Strings}.
8c59ee11 3255
8c59ee11 3256@item o
6fe91f2c 3257Opaque type; see @ref{Typedefs}.
8c59ee11 3258
a03f27c3 3259@item p
bf9d2537 3260Procedure; see @ref{Function Types}.
a03f27c3 3261
8c59ee11 3262@item P
6fe91f2c 3263Packed array; see @ref{Arrays}.
6897f9ec
JK
3264
3265@item r
6fe91f2c 3266Range type; see @ref{Subranges}.
8c59ee11
JK
3267
3268@item R
bf9d2537
DM
3269Builtin floating type; see @ref{Builtin Type Descriptors} (Sun). Pascal
3270subroutine parameter; see @ref{Function Types} (AIX). Detecting this
a03f27c3
JK
3271conflict is possible with careful parsing (hint: a Pascal subroutine
3272parameter type will always contain a comma, and a builtin type
3273descriptor never will).
6897f9ec
JK
3274
3275@item s
6fe91f2c 3276Structure type; see @ref{Structures}.
8c59ee11
JK
3277
3278@item S
bf9d2537 3279Set type; see @ref{Miscellaneous Types}.
6897f9ec
JK
3280
3281@item u
6fe91f2c 3282Union; see @ref{Unions}.
8c59ee11
JK
3283
3284@item v
3285Variant record. This is a Pascal and Modula-2 feature which is like a
3286union within a struct in C. See AIX documentation for details.
3287
3288@item w
bf9d2537 3289Wide character; see @ref{Builtin Type Descriptors}.
8c59ee11
JK
3290
3291@item x
bf9d2537 3292Cross-reference; see @ref{Cross-References}.
6897f9ec 3293
8c59ee11 3294@item z
6fe91f2c 3295gstring; see @ref{Strings}.
6897f9ec 3296@end table
e505224d 3297
bf9d2537
DM
3298@node Expanded Reference
3299@appendix Expanded Reference by Stab Type
e505224d 3300
685a5e86 3301@c FIXME: This appendix should go away; see N_PSYM or N_SO for an example.
8c59ee11 3302
3d4cf720 3303For a full list of stab types, and cross-references to where they are
bf9d2537 3304described, see @ref{Stab Types}. This appendix just duplicates certain
3d4cf720
JK
3305information from the main body of this document; eventually the
3306information will all be in one place.
8c59ee11 3307
e505224d 3308Format of an entry:
6fe91f2c 3309
685a5e86 3310The first line is the symbol type (see @file{include/aout/stab.def}).
e505224d
PB
3311
3312The second line describes the language constructs the symbol type
3313represents.
3314
3315The third line is the stab format with the significant stab fields
3316named and the rest NIL.
3317
3318Subsequent lines expand upon the meaning and possible values for each
685a5e86 3319significant stab field. @samp{#} stands in for the type descriptor.
e505224d
PB
3320
3321Finally, any further information.
3322
899bafeb 3323@menu
8eb5e289
DZ
3324* N_PC:: Pascal global symbol
3325* N_NSYMS:: Number of symbols
3326* N_NOMAP:: No DST map
8eb5e289
DZ
3327* N_M2C:: Modula-2 compilation unit
3328* N_BROWS:: Path to .cb file for Sun source code browser
3329* N_DEFD:: GNU Modula2 definition module dependency
3330* N_EHDECL:: GNU C++ exception variable
3331* N_MOD2:: Modula2 information "for imc"
3332* N_CATCH:: GNU C++ "catch" clause
3333* N_SSYM:: Structure or union element
8eb5e289
DZ
3334* N_ENTRY:: Alternate entry point
3335* N_SCOPE:: Modula2 scope information (Sun only)
3336* Gould:: non-base register symbols used on Gould systems
3337* N_LENG:: Length of preceding entry
899bafeb
RP
3338@end menu
3339
899bafeb 3340@node N_PC
685a5e86 3341@section N_PC
e505224d 3342
685a5e86
DM
3343@deffn @code{.stabs} N_PC
3344@findex N_PC
3345Global symbol (for Pascal).
e505224d 3346
899bafeb 3347@example
e505224d
PB
3348"name" -> "symbol_name" <<?>>
3349value -> supposedly the line number (stab.def is skeptical)
899bafeb 3350@end example
e505224d 3351
899bafeb 3352@display
f958d5cd 3353@file{stabdump.c} says:
e505224d 3354
6fe91f2c 3355global pascal symbol: name,,0,subtype,line
e505224d 3356<< subtype? >>
899bafeb 3357@end display
685a5e86 3358@end deffn
e505224d 3359
899bafeb 3360@node N_NSYMS
685a5e86
DM
3361@section N_NSYMS
3362
3363@deffn @code{.stabn} N_NSYMS
3364@findex N_NSYMS
3365Number of symbols (according to Ultrix V4.0).
e505224d 3366
899bafeb 3367@display
139741da 3368 0, files,,funcs,lines (stab.def)
899bafeb 3369@end display
685a5e86 3370@end deffn
e505224d 3371
899bafeb 3372@node N_NOMAP
685a5e86
DM
3373@section N_NOMAP
3374
3375@deffn @code{.stabs} N_NOMAP
3376@findex N_NOMAP
935d305d
JK
3377No DST map for symbol (according to Ultrix V4.0). I think this means a
3378variable has been optimized out.
e505224d 3379
899bafeb 3380@display
139741da 3381 name, ,0,type,ignored (stab.def)
899bafeb 3382@end display
685a5e86 3383@end deffn
e505224d 3384
899bafeb 3385@node N_M2C
685a5e86 3386@section N_M2C
e505224d 3387
685a5e86
DM
3388@deffn @code{.stabs} N_M2C
3389@findex N_M2C
3390Modula-2 compilation unit.
e505224d 3391
899bafeb 3392@example
685a5e86 3393"string" -> "unit_name,unit_time_stamp[,code_time_stamp]"
e505224d
PB
3394desc -> unit_number
3395value -> 0 (main unit)
139741da 3396 1 (any other unit)
899bafeb 3397@end example
685a5e86 3398@end deffn
e505224d 3399
899bafeb 3400@node N_BROWS
685a5e86
DM
3401@section N_BROWS
3402
3403@deffn @code{.stabs} N_BROWS
3404@findex N_BROWS
6fe91f2c 3405Sun source code browser, path to @file{.cb} file
e505224d 3406
6fe91f2c 3407<<?>>
685a5e86 3408"path to associated @file{.cb} file"
e505224d 3409
0a95c18c 3410Note: N_BROWS has the same value as N_BSLINE.
685a5e86 3411@end deffn
e505224d 3412
899bafeb 3413@node N_DEFD
685a5e86
DM
3414@section N_DEFD
3415
3416@deffn @code{.stabn} N_DEFD
3417@findex N_DEFD
3418GNU Modula2 definition module dependency.
e505224d 3419
0a95c18c
JK
3420GNU Modula-2 definition module dependency. The value is the
3421modification time of the definition file. The other field is non-zero
3422if it is imported with the GNU M2 keyword @code{%INITIALIZE}. Perhaps
3423@code{N_M2C} can be used if there are enough empty fields?
685a5e86 3424@end deffn
e505224d 3425
899bafeb 3426@node N_EHDECL
685a5e86 3427@section N_EHDECL
e505224d 3428
685a5e86
DM
3429@deffn @code{.stabs} N_EHDECL
3430@findex N_EHDECL
3431GNU C++ exception variable <<?>>.
e505224d 3432
685a5e86
DM
3433"@var{string} is variable name"
3434
3435Note: conflicts with @code{N_MOD2}.
3436@end deffn
e505224d 3437
899bafeb 3438@node N_MOD2
685a5e86
DM
3439@section N_MOD2
3440
3441@deffn @code{.stab?} N_MOD2
3442@findex N_MOD2
899bafeb 3443Modula2 info "for imc" (according to Ultrix V4.0)
e505224d 3444
685a5e86
DM
3445Note: conflicts with @code{N_EHDECL} <<?>>
3446@end deffn
e505224d 3447
899bafeb 3448@node N_CATCH
685a5e86
DM
3449@section N_CATCH
3450
3451@deffn @code{.stabn} N_CATCH
3452@findex N_CATCH
6fe91f2c 3453GNU C++ @code{catch} clause
e505224d 3454
0a95c18c 3455GNU C++ @code{catch} clause. The value is its address. The desc field
685a5e86
DM
3456is nonzero if this entry is immediately followed by a @code{CAUGHT} stab
3457saying what exception was caught. Multiple @code{CAUGHT} stabs means
0a95c18c
JK
3458that multiple exceptions can be caught here. If desc is 0, it means all
3459exceptions are caught here.
685a5e86 3460@end deffn
e505224d 3461
899bafeb 3462@node N_SSYM
685a5e86
DM
3463@section N_SSYM
3464
3465@deffn @code{.stabn} N_SSYM
3466@findex N_SSYM
3467Structure or union element.
e505224d 3468
0a95c18c 3469The value is the offset in the structure.
899bafeb
RP
3470
3471<<?looking at structs and unions in C I didn't see these>>
685a5e86 3472@end deffn
e505224d 3473
899bafeb 3474@node N_ENTRY
685a5e86 3475@section N_ENTRY
e505224d 3476
685a5e86
DM
3477@deffn @code{.stabn} N_ENTRY
3478@findex N_ENTRY
6fe91f2c 3479Alternate entry point.
0a95c18c 3480The value is its address.
e505224d 3481<<?>>
685a5e86 3482@end deffn
e505224d 3483
899bafeb 3484@node N_SCOPE
685a5e86 3485@section N_SCOPE
e505224d 3486
685a5e86
DM
3487@deffn @code{.stab?} N_SCOPE
3488@findex N_SCOPE
e505224d
PB
3489Modula2 scope information (Sun linker)
3490<<?>>
685a5e86 3491@end deffn
e505224d 3492
899bafeb
RP
3493@node Gould
3494@section Non-base registers on Gould systems
ded6bcab 3495
685a5e86
DM
3496@deffn @code{.stab?} N_NBTEXT
3497@deffnx @code{.stab?} N_NBDATA
3498@deffnx @code{.stab?} N_NBBSS
3499@deffnx @code{.stab?} N_NBSTS
3500@deffnx @code{.stab?} N_NBLCS
3501@findex N_NBTEXT
3502@findex N_NBDATA
3503@findex N_NBBSS
3504@findex N_NBSTS
3505@findex N_NBLCS
ded6bcab
JK
3506These are used on Gould systems for non-base registers syms.
3507
3508However, the following values are not the values used by Gould; they are
3509the values which GNU has been documenting for these values for a long
3510time, without actually checking what Gould uses. I include these values
3511only because perhaps some someone actually did something with the GNU
3512information (I hope not, why GNU knowingly assigned wrong values to
3513these in the header file is a complete mystery to me).
e505224d 3514
899bafeb 3515@example
139741da
RP
3516240 0xf0 N_NBTEXT ??
3517242 0xf2 N_NBDATA ??
3518244 0xf4 N_NBBSS ??
3519246 0xf6 N_NBSTS ??
3520248 0xf8 N_NBLCS ??
899bafeb 3521@end example
685a5e86 3522@end deffn
e505224d 3523
899bafeb 3524@node N_LENG
685a5e86 3525@section N_LENG
e505224d 3526
685a5e86
DM
3527@deffn @code{.stabn} N_LENG
3528@findex N_LENG
e505224d 3529Second symbol entry containing a length-value for the preceding entry.
0a95c18c 3530The value is the length.
685a5e86 3531@end deffn
e505224d 3532
899bafeb 3533@node Questions
bf9d2537 3534@appendix Questions and Anomalies
e505224d
PB
3535
3536@itemize @bullet
3537@item
dd8126d9 3538@c I think this is changed in GCC 2.4.5 to put the line number there.
6fe91f2c 3539For GNU C stabs defining local and global variables (@code{N_LSYM} and
0a95c18c
JK
3540@code{N_GSYM}), the desc field is supposed to contain the source
3541line number on which the variable is defined. In reality the desc
dd8126d9 3542field is always 0. (This behavior is defined in @file{dbxout.c} and
0a95c18c 3543putting a line number in desc is controlled by @samp{#ifdef
dd8126d9
JK
3544WINNING_GDB}, which defaults to false). GDB supposedly uses this
3545information if you say @samp{list @var{var}}. In reality, @var{var} can
3546be a variable defined in the program and GDB says @samp{function
6fe91f2c 3547@var{var} not defined}.
e505224d
PB
3548
3549@item
6fe91f2c
DM
3550In GNU C stabs, there seems to be no way to differentiate tag types:
3551structures, unions, and enums (symbol descriptor @samp{T}) and typedefs
3552(symbol descriptor @samp{t}) defined at file scope from types defined locally
3553to a procedure or other more local scope. They all use the @code{N_LSYM}
e505224d 3554stab type. Types defined at procedure scope are emited after the
6fe91f2c 3555@code{N_RBRAC} of the preceding function and before the code of the
e505224d
PB
3556procedure in which they are defined. This is exactly the same as
3557types defined in the source file between the two procedure bodies.
4d7f562d 3558GDB overcompensates by placing all types in block #1, the block for
6fe91f2c
DM
3559symbols of file scope. This is true for default, @samp{-ansi} and
3560@samp{-traditional} compiler options. (Bugs gcc/1063, gdb/1066.)
e505224d
PB
3561
3562@item
6fe91f2c
DM
3563What ends the procedure scope? Is it the proc block's @code{N_RBRAC} or the
3564next @code{N_FUN}? (I believe its the first.)
e505224d
PB
3565
3566@item
24dcc707 3567@c FIXME: This should go with the other stuff about global variables.
e505224d
PB
3568Global variable stabs don't have location information. This comes
3569from the external symbol for the same variable. The external symbol
3570has a leading underbar on the _name of the variable and the stab does
3571not. How do we know these two symbol table entries are talking about
24dcc707
JK
3572the same symbol when their names are different? (Answer: the debugger
3573knows that external symbols have leading underbars).
e505224d 3574
24dcc707
JK
3575@c FIXME: This is absurdly vague; there all kinds of differences, some
3576@c of which are the same between gnu & sun, and some of which aren't.
dd8126d9
JK
3577@c In particular, I'm pretty sure GCC works with Sun dbx by default.
3578@c @item
3579@c Can GCC be configured to output stabs the way the Sun compiler
3580@c does, so that their native debugging tools work? <NO?> It doesn't by
3581@c default. GDB reads either format of stab. (GCC or SunC). How about
3582@c dbx?
e505224d
PB
3583@end itemize
3584
bf9d2537
DM
3585@node XCOFF Differences
3586@appendix Differences Between GNU Stabs in a.out and GNU Stabs in XCOFF
e505224d 3587
497e44a5 3588@c FIXME: Merge *all* these into the main body of the document.
f958d5cd 3589The AIX/RS6000 native object file format is XCOFF with stabs. This
497e44a5
JK
3590appendix only covers those differences which are not covered in the main
3591body of this document.
e505224d
PB
3592
3593@itemize @bullet
e505224d 3594@item
dd8126d9
JK
3595BSD a.out stab types correspond to AIX XCOFF storage classes. In general
3596the mapping is @code{N_@var{stabtype}} becomes @code{C_@var{stabtype}}.
3597Some stab types in a.out are not supported in XCOFF; most of these use
3598@code{C_DECL}.
e505224d 3599
24dcc707
JK
3600@c FIXME: Get C_* types for the block, figure out whether it is always
3601@c used (I suspect not), explain clearly, and move to node Statics.
dd8126d9
JK
3602Exception: initialised static @code{N_STSYM} and un-initialized static
3603@code{N_LCSYM} both map to the @code{C_STSYM} storage class. But the
bf9d2537 3604distinction is preserved because in XCOFF @code{N_STSYM} and
dd8126d9
JK
3605@code{N_LCSYM} must be emited in a named static block. Begin the block
3606with @samp{.bs s[RW] data_section_name} for @code{N_STSYM} or @samp{.bs
3607s bss_section_name} for @code{N_LCSYM}. End the block with @samp{.es}.
e505224d 3608
24dcc707
JK
3609@c FIXME: I think they are trying to say something about whether the
3610@c assembler defaults the value to the location counter.
e505224d 3611@item
685a5e86 3612If the XCOFF stab is an @code{N_FUN} (@code{C_FUN}) then follow the
dd8126d9 3613string field with @samp{,.} instead of just @samp{,}.
e505224d
PB
3614@end itemize
3615
6fe91f2c 3616I think that's it for @file{.s} file differences. They could stand to be
e505224d 3617better presented. This is just a list of what I have noticed so far.
6fe91f2c
DM
3618There are a @emph{lot} of differences in the information in the symbol
3619tables of the executable and object files.
e505224d 3620
f958d5cd 3621Mapping of a.out stab types to XCOFF storage classes:
e505224d
PB
3622
3623@example
139741da 3624stab type storage class
e505224d 3625-------------------------------
139741da 3626N_GSYM C_GSYM
43603088 3627N_FNAME unused
139741da
RP
3628N_FUN C_FUN
3629N_STSYM C_STSYM
3630N_LCSYM C_STSYM
43603088 3631N_MAIN unknown
139741da
RP
3632N_PC unknown
3633N_RSYM C_RSYM
dd8126d9 3634unknown C_RPSYM
139741da
RP
3635N_M2C unknown
3636N_SLINE unknown
3637N_DSLINE unknown
3638N_BSLINE unknown
3639N_BROWSE unchanged
3640N_CATCH unknown
3641N_SSYM unknown
3642N_SO unknown
3643N_LSYM C_LSYM
dd8126d9 3644various C_DECL
139741da
RP
3645N_BINCL unknown
3646N_SOL unknown
3647N_PSYM C_PSYM
3648N_EINCL unknown
3649N_ENTRY C_ENTRY
3650N_LBRAC unknown
3651N_EXCL unknown
3652N_SCOPE unknown
3653N_RBRAC unknown
3654N_BCOMM C_BCOMM
3655N_ECOMM C_ECOMM
3656N_ECOML C_ECOML
3657
3658N_LENG unknown
e505224d
PB
3659@end example
3660
bf9d2537
DM
3661@node Sun Differences
3662@appendix Differences Between GNU Stabs and Sun Native Stabs
e505224d 3663
497e44a5
JK
3664@c FIXME: Merge all this stuff into the main body of the document.
3665
e505224d
PB
3666@itemize @bullet
3667@item
6fe91f2c
DM
3668GNU C stabs define @emph{all} types, file or procedure scope, as
3669@code{N_LSYM}. Sun doc talks about using @code{N_GSYM} too.
e505224d 3670
e505224d 3671@item
4e9570e8
JK
3672Sun C stabs use type number pairs in the format
3673(@var{file-number},@var{type-number}) where @var{file-number} is a
3674number starting with 1 and incremented for each sub-source file in the
3675compilation. @var{type-number} is a number starting with 1 and
6fe91f2c
DM
3676incremented for each new type defined in the compilation. GNU C stabs
3677use the type number alone, with no source file number.
e505224d
PB
3678@end itemize
3679
bf9d2537
DM
3680@node Stabs In ELF
3681@appendix Using Stabs With The ELF Object File Format
935d305d 3682
6fe91f2c
DM
3683The ELF object file format allows tools to create object files with
3684custom sections containing any arbitrary data. To use stabs in ELF
935d305d
JK
3685object files, the tools create two custom sections, a section named
3686@code{.stab} which contains an array of fixed length structures, one
3687struct per stab, and a section named @code{.stabstr} containing all the
3688variable length strings that are referenced by stabs in the @code{.stab}
3689section. The byte order of the stabs binary data matches the byte order
6fe91f2c
DM
3690of the ELF file itself, as determined from the @code{EI_DATA} field in
3691the @code{e_ident} member of the ELF header.
935d305d 3692
4e9570e8 3693The first stab in the @code{.stab} section for each compilation unit is
935d305d
JK
3694synthetic, generated entirely by the assembler, with no corresponding
3695@code{.stab} directive as input to the assembler. This stab contains
3696the following fields:
cc4fb848 3697
935d305d
JK
3698@table @code
3699@item n_strx
3700Offset in the @code{.stabstr} section to the source filename.
cc4fb848 3701
935d305d
JK
3702@item n_type
3703@code{N_UNDF}.
cc4fb848 3704
935d305d 3705@item n_other
cc4fb848
FF
3706Unused field, always zero.
3707
935d305d 3708@item n_desc
6fe91f2c 3709Count of upcoming symbols, i.e., the number of remaining stabs for this
935d305d 3710source file.
cc4fb848 3711
935d305d
JK
3712@item n_value
3713Size of the string table fragment associated with this source file, in
cc4fb848 3714bytes.
935d305d 3715@end table
cc4fb848 3716
935d305d 3717The @code{.stabstr} section always starts with a null byte (so that string
cc4fb848
FF
3718offsets of zero reference a null string), followed by random length strings,
3719each of which is null byte terminated.
3720
6fe91f2c 3721The ELF section header for the @code{.stab} section has its
935d305d 3722@code{sh_link} member set to the section number of the @code{.stabstr}
6fe91f2c 3723section, and the @code{.stabstr} section has its ELF section
935d305d
JK
3724header @code{sh_type} member set to @code{SHT_STRTAB} to mark it as a
3725string table.
3726
31a932d8
JK
3727To keep linking fast, it is a bad idea to have the linker relocating
3728stabs, so (except for Solaris 2.2 and earlier, see below) none of the
3729addresses in the @code{n_value} field of the stabs are relocated by the
3730linker. Instead they are relative to the source file (or some entity
3731smaller than a source file, like a function). To find the address of
3732each section corresponding to a given source file, the compiler puts out
3733symbols giving the address of each section for a given source file.
3734Since these are ELF (not stab) symbols, the linker can relocate them
3735correctly. They are named @code{Bbss.bss} for the bss section,
3736@code{Ddata.data} for the data section, and @code{Drodata.rodata} for
cd61aa60
JK
3737the rodata section. For the text section, there is no such symbol. For
3738an example of how these symbols work, @xref{ELF Transformations}. GCC
31a932d8
JK
3739does not provide these symbols; it instead relies on the stabs getting
3740relocated, which loses for Solaris 2.3 (see below). Thus address which
3741would normally be relative to @code{Bbss.bss}, etc., are absolute. The
3742linker provided with Solaris 2.2 and earlier relocates stabs using
3743relocation information from a @code{.rela.stab} section, which means
3744that the value of an @code{N_FUN} stab in an executable is the actual
cd61aa60
JK
3745address. I think this is just standard ELF relocations, as it would do
3746for any section, rather than a special-purpose stabs hack. For Solaris
37472.3 and later, the linker ignores relocations for the stabs section.
3748The value of a @code{N_FUN} stab is zero and the address of a function
3749can be obtained from the ELF (non-stab) symbols. Sun, in reference to
3750bug 1142109, has verified that this is intentional. Because looking
3751things up in the ELF symbols would probably be slow, and this doesn't
3752provide any way to deal with nested functions, it would probably be
3753better to use a @code{Ttext.text} symbol for stabs-in-elf on non-Solaris
3754machines, and make the address in the @code{N_FUN} relative to the
3755@code{Ttext.text} symbol. In addition to @code{N_FUN} symbols, whether
3756the linker relocates stabs also affects some @code{N_ROSYM},
3757@code{N_STSYM}, and @code{N_LCSYM} symbols; see @ref{Statics}.
cc4fb848 3758
685a5e86
DM
3759@node Symbol Types Index
3760@unnumbered Symbol Types Index
3761
3762@printindex fn
3763
e505224d
PB
3764@contents
3765@bye
This page took 0.314777 seconds and 4 git commands to generate.