gdb/doc/gdbint.texinfo

   1 \input texinfo
   2 @setfilename gdbint.info
   3
   4 @ifinfo
   5 @format
   6 START-INFO-DIR-ENTRY
   7 * Gdb-Internals: (gdbint).      The GNU debugger's internals.
   8 END-INFO-DIR-ENTRY
   9 @end format
  10 @end ifinfo
  11
  12 @ifinfo
  13 This file documents the internals of the GNU debugger GDB.
  14
  15 Copyright 1990-1999 Free Software Foundation, Inc.
  16 Contributed by Cygnus Solutions.  Written by John Gilmore.
  17 Second Edition by Stan Shebs.
  18
  19 Permission is granted to make and distribute verbatim copies of this
  20 manual provided the copyright notice and this permission notice are
  21 preserved on all copies.
  22
  23 @ignore
  24 Permission is granted to process this file through Tex and print the
  25 results, provided the printed document carries copying permission notice
  26 identical to this one except for the removal of this paragraph (this
  27 paragraph not being relevant to the printed manual).
  28
  29 @end ignore
  30 Permission is granted to copy or distribute modified versions of this
  31 manual under the terms of the GPL (for which purpose this text may be
  32 regarded as a program in the language TeX).
  33 @end ifinfo
  34
  35 @setchapternewpage off
  36 @settitle GDB Internals
  37
  38 @titlepage
  39 @title{GDB Internals}
  40 @subtitle{A guide to the internals of the GNU debugger}
  41 @author John Gilmore
  42 @author Cygnus Solutions
  43 @author Second Edition:
  44 @author Stan Shebs
  45 @author Cygnus Solutions
  46 @page
  47 @tex
  48 \def\$#1${{#1}}  % Kluge: collect RCS revision info without $...$
  49 \xdef\manvers{\$Revision$}  % For use in headers, footers too
  50 {\parskip=0pt
  51 \hfill Cygnus Solutions\par
  52 \hfill \manvers\par
  53 \hfill \TeX{}info \texinfoversion\par
  54 }
  55 @end tex
  56
  57 @vskip 0pt plus 1filll
  58 Copyright @copyright{} 1990-1999 Free Software Foundation, Inc.
  59
  60 Permission is granted to make and distribute verbatim copies of
  61 this manual provided the copyright notice and this permission notice
  62 are preserved on all copies.
  63
  64 @end titlepage
  65
  66 @node Top
  67 @c Perhaps this should be the title of the document (but only for info,
  68 @c not for TeX).  Existing GNU manuals seem inconsistent on this point.
  69 @top Scope of this Document
  70
  71 This document documents the internals of the GNU debugger, GDB.  It
  72 includes description of GDB's key algorithms and operations, as well
  73 as the mechanisms that adapt GDB to specific hosts and targets.
  74
  75 @menu
  76 * Requirements::
  77 * Overall Structure::
  78 * Algorithms::
  79 * User Interface::
  80 * Symbol Handling::
  81 * Language Support::
  82 * Host Definition::
  83 * Target Architecture Definition::
  84 * Target Vector Definition::
  85 * Native Debugging::
  86 * Support Libraries::
  87 * Coding::
  88 * Porting GDB::
  89 * Hints::
  90 @end menu
  91
  92 @node Requirements
  93
  94 @chapter Requirements
  95
  96 Before diving into the internals, you should understand the formal
  97 requirements and other expectations for GDB.  Although some of these may
  98 seem obvious, there have been proposals for GDB that have run counter to
  99 these requirements.
 100
 101 First of all, GDB is a debugger.  It's not designed to be a front panel
 102 for embedded systems.  It's not a text editor.  It's not a shell.  It's
 103 not a programming environment.
 104
 105 GDB is an interactive tool.  Although a batch mode is available, GDB's
 106 primary role is to interact with a human programmer.
 107
 108 GDB should be responsive to the user.  A programmer hot on the trail of
 109 a nasty bug, and operating under a looming deadline, is going to be very
 110 impatient of everything, including the response time to debugger
 111 commands.
 112
 113 GDB should be relatively permissive, such as for expressions.  While the
 114 compiler should be picky (or have the option to be made picky), since
 115 source code lives for a long time usually, the programmer doing
 116 debugging shouldn't be spending time figuring out to mollify the
 117 debugger.
 118
 119 GDB will be called upon to deal with really large programs.  Executable
 120 sizes of 50 to 100 megabytes occur regularly, and we've heard reports of
 121 programs approaching 1 gigabyte in size.
 122
 123 GDB should be able to run everywhere.  No other debugger is available
 124 for even half as many configurations as GDB supports.
 125
 126
 127 @node Overall Structure
 128
 129 @chapter Overall Structure
 130
 131 GDB consists of three major subsystems: user interface, symbol handling
 132 (the ``symbol side''), and target system handling (the ``target side'').
 133
 134 Ther user interface consists of several actual interfaces, plus
 135 supporting code.
 136
 137 The symbol side consists of object file readers, debugging info
 138 interpreters, symbol table management, source language expression
 139 parsing, type and value printing.
 140
 141 The target side consists of execution control, stack frame analysis, and
 142 physical target manipulation.
 143
 144 The target side/symbol side division is not formal, and there are a
 145 number of exceptions.  For instance, core file support involves symbolic
 146 elements (the basic core file reader is in BFD) and target elements (it
 147 supplies the contents of memory and the values of registers).  Instead,
 148 this division is useful for understanding how the minor subsystems
 149 should fit together.
 150
 151 @section The Symbol Side
 152
 153 The symbolic side of GDB can be thought of as ``everything you can do in
 154 GDB without having a live program running''.  For instance, you can look
 155 at the types of variables, and evaluate many kinds of expressions.
 156
 157 @section The Target Side
 158
 159 The target side of GDB is the ``bits and bytes manipulator''.  Although
 160 it may make reference to symbolic info here and there, most of the
 161 target side will run with only a stripped executable available -- or
 162 even no executable at all, in remote debugging cases.
 163
 164 Operations such as disassembly, stack frame crawls, and register
 165 display, are able to work with no symbolic info at all.  In some cases,
 166 such as disassembly, GDB will use symbolic info to present addresses
 167 relative to symbols rather than as raw numbers, but it will work either
 168 way.
 169
 170 @section Configurations
 171
 172 @dfn{Host} refers to attributes of the system where GDB runs.
 173 @dfn{Target} refers to the system where the program being debugged
 174 executes.  In most cases they are the same machine, in which case a
 175 third type of @dfn{Native} attributes come into play.
 176
 177 Defines and include files needed to build on the host are host support.
 178 Examples are tty support, system defined types, host byte order, host
 179 float format.
 180
 181 Defines and information needed to handle the target format are target
 182 dependent.  Examples are the stack frame format, instruction set,
 183 breakpoint instruction, registers, and how to set up and tear down the stack
 184 to call a function.
 185
 186 Information that is only needed when the host and target are the same,
 187 is native dependent.  One example is Unix child process support; if the
 188 host and target are not the same, doing a fork to start the target
 189 process is a bad idea.  The various macros needed for finding the
 190 registers in the @code{upage}, running @code{ptrace}, and such are all
 191 in the native-dependent files.
 192
 193 Another example of native-dependent code is support for features that
 194 are really part of the target environment, but which require
 195 @code{#include} files that are only available on the host system.  Core
 196 file handling and @code{setjmp} handling are two common cases.
 197
 198 When you want to make GDB work ``native'' on a particular machine, you
 199 have to include all three kinds of information.
 200
 201
 202 @node Algorithms
 203
 204 @chapter Algorithms
 205
 206 GDB uses a number of debugging-specific algorithms.  They are often not
 207 very complicated, but get lost in the thicket of special cases and
 208 real-world issues.  This chapter describes the basic algorithms and
 209 mentions some of the specific target definitions that they use.
 210
 211 @section Frames
 212
 213 A frame is a construct that GDB uses to keep track of calling and called
 214 functions.
 215
 216 @code{FRAME_FP} in the machine description has no meaning to the
 217 machine-independent part of GDB, except that it is used when setting up
 218 a new frame from scratch, as follows:
 219
 220 @example
 221       create_new_frame (read_register (FP_REGNUM), read_pc ()));
 222 @end example
 223
 224 Other than that, all the meaning imparted to @code{FP_REGNUM} is
 225 imparted by the machine-dependent code.  So, @code{FP_REGNUM} can have
 226 any value that is convenient for the code that creates new frames.
 227 (@code{create_new_frame} calls @code{INIT_EXTRA_FRAME_INFO} if it is
 228 defined; that is where you should use the @code{FP_REGNUM} value, if
 229 your frames are nonstandard.)
 230
 231 Given a GDB frame, define @code{FRAME_CHAIN} to determine the address of
 232 the calling function's frame.  This will be used to create a new GDB
 233 frame struct, and then @code{INIT_EXTRA_FRAME_INFO} and
 234 @code{INIT_FRAME_PC} will be called for the new frame.
 235
 236 @section Breakpoint Handling
 237
 238 In general, a breakpoint is a user-designated location in the program
 239 where the user wants to regain control if program execution ever reaches
 240 that location.
 241
 242 There are two main ways to implement breakpoints; either as ``hardware''
 243 breakpoints or as ``software'' breakpoints.
 244
 245 Hardware breakpoints are sometimes available as a builtin debugging
 246 features with some chips.  Typically these work by having dedicated
 247 register into which the breakpoint address may be stored.  If the PC
 248 ever matches a value in a breakpoint registers, the CPU raises an
 249 exception and reports it to GDB.  Another possibility is when an
 250 emulator is in use; many emulators include circuitry that watches the
 251 address lines coming out from the processor, and force it to stop if the
 252 address matches a breakpoint's address.  A third possibility is that the
 253 target already has the ability to do breakpoints somehow; for instance,
 254 a ROM monitor may do its own software breakpoints.  So although these
 255 are not literally ``hardware breakpoints'', from GDB's point of view
 256 they work the same; GDB need not do nothing more than set the breakpoint
 257 and wait for something to happen.
 258
 259 Since they depend on hardware resources, hardware breakpoints may be
 260 limited in number; when the user asks for more, GDB will start trying to
 261 set software breakpoints.
 262
 263 Software breakpoints require GDB to do somewhat more work.  The basic
 264 theory is that GDB will replace a program instruction a trap, illegal
 265 divide, or some other instruction that will cause an exception, and then
 266 when it's encountered, GDB will take the exception and stop the program.
 267 When the user says to continue, GDB will restore the original
 268 instruction, single-step, re-insert the trap, and continue on.
 269
 270 Since it literally overwrites the program being tested, the program area
 271 must be writeable, so this technique won't work on programs in ROM.  It
 272 can also distort the behavior of programs that examine themselves,
 273 although the situation would be highly unusual.
 274
 275 Also, the software breakpoint instruction should be the smallest size of
 276 instruction, so it doesn't overwrite an instruction that might be a jump
 277 target, and cause disaster when the program jumps into the middle of the
 278 breakpoint instruction.  (Strictly speaking, the breakpoint must be no
 279 larger than the smallest interval between instructions that may be jump
 280 targets; perhaps there is an architecture where only even-numbered
 281 instructions may jumped to.)  Note that it's possible for an instruction
 282 set not to have any instructions usable for a software breakpoint,
 283 although in practice only the ARC has failed to define such an
 284 instruction.
 285
 286 The basic definition of the software breakpoint is the macro
 287 @code{BREAKPOINT}.
 288
 289 Basic breakpoint object handling is in @file{breakpoint.c}.  However,
 290 much of the interesting breakpoint action is in @file{infrun.c}.
 291
 292 @section Single Stepping
 293
 294 @section Signal Handling
 295
 296 @section Thread Handling
 297
 298 @section Inferior Function Calls
 299
 300 @section Longjmp Support
 301
 302 GDB has support for figuring out that the target is doing a
 303 @code{longjmp} and for stopping at the target of the jump, if we are
 304 stepping.  This is done with a few specialized internal breakpoints,
 305 which are visible in the @code{maint info breakpoint} command.
 306
 307 To make this work, you need to define a macro called
 308 @code{GET_LONGJMP_TARGET}, which will examine the @code{jmp_buf}
 309 structure and extract the longjmp target address.  Since @code{jmp_buf}
 310 is target specific, you will need to define it in the appropriate
 311 @file{tm-@var{xyz}.h} file.  Look in @file{tm-sun4os4.h} and
 312 @file{sparc-tdep.c} for examples of how to do this.
 313
 314 @node User Interface
 315
 316 @chapter User Interface
 317
 318 GDB has several user interfaces.  Although the command-line interface
 319 is the most common and most familiar, there are others.
 320
 321 @section Command Interpreter
 322
 323 The command interpreter in GDB is fairly simple.  It is designed to
 324 allow for the set of commands to be augmented dynamically, and also
 325 has a recursive subcommand capability, where the first argument to
 326 a command may itself direct a lookup on a different command list.
 327
 328 For instance, the @code{set} command just starts a lookup on the
 329 @code{setlist} command list, while @code{set thread} recurses
 330 to the @code{set_thread_cmd_list}.
 331
 332 To add commands in general, use @code{add_cmd}.  @code{add_com} adds to
 333 the main command list, and should be used for those commands.  The usual
 334 place to add commands is in the @code{_initialize_@var{xyz}} routines at the
 335 ends of most source files.
 336
 337 @section Console Printing
 338
 339 @section TUI
 340
 341 @section libgdb
 342
 343 @code{libgdb} was an abortive project of years ago.  The theory was to
 344 provide an API to GDB's functionality.
 345
 346 @node Symbol Handling
 347
 348 @chapter Symbol Handling
 349
 350 Symbols are a key part of GDB's operation.  Symbols include variables,
 351 functions, and types.
 352
 353 @section Symbol Reading
 354
 355 GDB reads symbols from ``symbol files''.  The usual symbol file is the
 356 file containing the program which GDB is debugging.  GDB can be directed
 357 to use a different file for symbols (with the @code{symbol-file}
 358 command), and it can also read more symbols via the ``add-file'' and
 359 ``load'' commands, or while reading symbols from shared libraries.
 360
 361 Symbol files are initially opened by code in @file{symfile.c} using the
 362 BFD library.  BFD identifies the type of the file by examining its
 363 header.  @code{symfile_init} then uses this identification to locate a
 364 set of symbol-reading functions.
 365
 366 Symbol reading modules identify themselves to GDB by calling
 367 @code{add_symtab_fns} during their module initialization.  The argument
 368 to @code{add_symtab_fns} is a @code{struct sym_fns} which contains the
 369 name (or name prefix) of the symbol format, the length of the prefix,
 370 and pointers to four functions.  These functions are called at various
 371 times to process symbol-files whose identification matches the specified
 372 prefix.
 373
 374 The functions supplied by each module are:
 375
 376 @table @code
 377 @item @var{xyz}_symfile_init(struct sym_fns *sf)
 378
 379 Called from @code{symbol_file_add} when we are about to read a new
 380 symbol file.  This function should clean up any internal state (possibly
 381 resulting from half-read previous files, for example) and prepare to
 382 read a new symbol file. Note that the symbol file which we are reading
 383 might be a new "main" symbol file, or might be a secondary symbol file
 384 whose symbols are being added to the existing symbol table.
 385
 386 The argument to @code{@var{xyz}_symfile_init} is a newly allocated
 387 @code{struct sym_fns} whose @code{bfd} field contains the BFD for the
 388 new symbol file being read.  Its @code{private} field has been zeroed,
 389 and can be modified as desired.  Typically, a struct of private
 390 information will be @code{malloc}'d, and a pointer to it will be placed
 391 in the @code{private} field.
 392
 393 There is no result from @code{@var{xyz}_symfile_init}, but it can call
 394 @code{error} if it detects an unavoidable problem.
 395
 396 @item @var{xyz}_new_init()
 397
 398 Called from @code{symbol_file_add} when discarding existing symbols.
 399 This function need only handle the symbol-reading module's internal
 400 state; the symbol table data structures visible to the rest of GDB will
 401 be discarded by @code{symbol_file_add}.  It has no arguments and no
 402 result.  It may be called after @code{@var{xyz}_symfile_init}, if a new
 403 symbol table is being read, or may be called alone if all symbols are
 404 simply being discarded.
 405
 406 @item @var{xyz}_symfile_read(struct sym_fns *sf, CORE_ADDR addr, int mainline)
 407
 408 Called from @code{symbol_file_add} to actually read the symbols from a
 409 symbol-file into a set of psymtabs or symtabs.
 410
 411 @code{sf} points to the struct sym_fns originally passed to
 412 @code{@var{xyz}_sym_init} for possible initialization.  @code{addr} is
 413 the offset between the file's specified start address and its true
 414 address in memory.  @code{mainline} is 1 if this is the main symbol
 415 table being read, and 0 if a secondary symbol file (e.g. shared library
 416 or dynamically loaded file) is being read.@refill
 417 @end table
 418
 419 In addition, if a symbol-reading module creates psymtabs when
 420 @var{xyz}_symfile_read is called, these psymtabs will contain a pointer
 421 to a function @code{@var{xyz}_psymtab_to_symtab}, which can be called
 422 from any point in the GDB symbol-handling code.
 423
 424 @table @code
 425 @item @var{xyz}_psymtab_to_symtab (struct partial_symtab *pst)
 426
 427 Called from @code{psymtab_to_symtab} (or the PSYMTAB_TO_SYMTAB macro) if
 428 the psymtab has not already been read in and had its @code{pst->symtab}
 429 pointer set.  The argument is the psymtab to be fleshed-out into a
 430 symtab.  Upon return, pst->readin should have been set to 1, and
 431 pst->symtab should contain a pointer to the new corresponding symtab, or
 432 zero if there were no symbols in that part of the symbol file.
 433 @end table
 434
 435 @section Partial Symbol Tables
 436
 437 GDB has three types of symbol tables.
 438
 439 @itemize @bullet
 440
 441 @item full symbol tables (symtabs).  These contain the main information
 442 about symbols and addresses.
 443
 444 @item partial symbol tables (psymtabs).  These contain enough
 445 information to know when to read the corresponding part of the full
 446 symbol table.
 447
 448 @item minimal symbol tables (msymtabs).  These contain information
 449 gleaned from non-debugging symbols.
 450
 451 @end itemize
 452
 453 This section describes partial symbol tables.
 454
 455 A psymtab is constructed by doing a very quick pass over an executable
 456 file's debugging information.  Small amounts of information are
 457 extracted -- enough to identify which parts of the symbol table will
 458 need to be re-read and fully digested later, when the user needs the
 459 information.  The speed of this pass causes GDB to start up very
 460 quickly.  Later, as the detailed rereading occurs, it occurs in small
 461 pieces, at various times, and the delay therefrom is mostly invisible to
 462 the user.
 463 @c (@xref{Symbol Reading}.)
 464
 465 The symbols that show up in a file's psymtab should be, roughly, those
 466 visible to the debugger's user when the program is not running code from
 467 that file.  These include external symbols and types, static symbols and
 468 types, and enum values declared at file scope.
 469
 470 The psymtab also contains the range of instruction addresses that the
 471 full symbol table would represent.
 472
 473 The idea is that there are only two ways for the user (or much of the
 474 code in the debugger) to reference a symbol:
 475
 476 @itemize @bullet
 477
 478 @item by its address
 479 (e.g. execution stops at some address which is inside a function in this
 480 file).  The address will be noticed to be in the range of this psymtab,
 481 and the full symtab will be read in.  @code{find_pc_function},
 482 @code{find_pc_line}, and other @code{find_pc_@dots{}} functions handle
 483 this.
 484
 485 @item by its name
 486 (e.g. the user asks to print a variable, or set a breakpoint on a
 487 function).  Global names and file-scope names will be found in the
 488 psymtab, which will cause the symtab to be pulled in.  Local names will
 489 have to be qualified by a global name, or a file-scope name, in which
 490 case we will have already read in the symtab as we evaluated the
 491 qualifier.  Or, a local symbol can be referenced when we are "in" a
 492 local scope, in which case the first case applies.  @code{lookup_symbol}
 493 does most of the work here.
 494
 495 @end itemize
 496
 497 The only reason that psymtabs exist is to cause a symtab to be read in
 498 at the right moment.  Any symbol that can be elided from a psymtab,
 499 while still causing that to happen, should not appear in it.  Since
 500 psymtabs don't have the idea of scope, you can't put local symbols in
 501 them anyway.  Psymtabs don't have the idea of the type of a symbol,
 502 either, so types need not appear, unless they will be referenced by
 503 name.
 504
 505 It is a bug for GDB to behave one way when only a psymtab has been read,
 506 and another way if the corresponding symtab has been read in.  Such bugs
 507 are typically caused by a psymtab that does not contain all the visible
 508 symbols, or which has the wrong instruction address ranges.
 509
 510 The psymtab for a particular section of a symbol-file (objfile) could be
 511 thrown away after the symtab has been read in.  The symtab should always
 512 be searched before the psymtab, so the psymtab will never be used (in a
 513 bug-free environment).  Currently, psymtabs are allocated on an obstack,
 514 and all the psymbols themselves are allocated in a pair of large arrays
 515 on an obstack, so there is little to be gained by trying to free them
 516 unless you want to do a lot more work.
 517
 518 @section Types
 519
 520 Fundamental Types (e.g., FT_VOID, FT_BOOLEAN).
 521
 522 These are the fundamental types that GDB uses internally.  Fundamental
 523 types from the various debugging formats (stabs, ELF, etc) are mapped
 524 into one of these.  They are basically a union of all fundamental types
 525 that gdb knows about for all the languages that GDB knows about.
 526
 527 Type Codes (e.g., TYPE_CODE_PTR, TYPE_CODE_ARRAY).
 528
 529 Each time GDB builds an internal type, it marks it with one of these
 530 types.  The type may be a fundamental type, such as TYPE_CODE_INT, or a
 531 derived type, such as TYPE_CODE_PTR which is a pointer to another type.
 532 Typically, several FT_* types map to one TYPE_CODE_* type, and are
 533 distinguished by other members of the type struct, such as whether the
 534 type is signed or unsigned, and how many bits it uses.
 535
 536 Builtin Types (e.g., builtin_type_void, builtin_type_char).
 537
 538 These are instances of type structs that roughly correspond to
 539 fundamental types and are created as global types for GDB to use for
 540 various ugly historical reasons.  We eventually want to eliminate these.
 541 Note for example that builtin_type_int initialized in gdbtypes.c is
 542 basically the same as a TYPE_CODE_INT type that is initialized in
 543 c-lang.c for an FT_INTEGER fundamental type.  The difference is that the
 544 builtin_type is not associated with any particular objfile, and only one
 545 instance exists, while c-lang.c builds as many TYPE_CODE_INT types as
 546 needed, with each one associated with some particular objfile.
 547
 548 @section Object File Formats
 549
 550 @subsection a.out
 551
 552 The @file{a.out} format is the original file format for Unix.  It
 553 consists of three sections: text, data, and bss, which are for program
 554 code, initialized data, and uninitialized data, respectively.
 555
 556 The @file{a.out} format is so simple that it doesn't have any reserved
 557 place for debugging information.  (Hey, the original Unix hackers used
 558 @file{adb}, which is a machine-language debugger.)  The only debugging
 559 format for @file{a.out} is stabs, which is encoded as a set of normal
 560 symbols with distinctive attributes.
 561
 562 The basic @file{a.out} reader is in @file{dbxread.c}.
 563
 564 @subsection COFF
 565
 566 The COFF format was introduced with System V Release 3 (SVR3) Unix.
 567 COFF files may have multiple sections, each prefixed by a header.  The
 568 number of sections is limited.
 569
 570 The COFF specification includes support for debugging.  Although this
 571 was a step forward, the debugging information was woefully limited.  For
 572 instance, it was not possible to represent code that came from an
 573 included file.
 574
 575 The COFF reader is in @file{coffread.c}.
 576
 577 @subsection ECOFF
 578
 579 ECOFF is an extended COFF originally introduced for Mips and Alpha
 580 workstations.
 581
 582 The basic ECOFF reader is in @file{mipsread.c}.
 583
 584 @subsection XCOFF
 585
 586 The IBM RS/6000 running AIX uses an object file format called XCOFF.
 587 The COFF sections, symbols, and line numbers are used, but debugging
 588 symbols are dbx-style stabs whose strings are located in the
 589 @samp{.debug} section (rather than the string table).  For more
 590 information, see @xref{Top,,,stabs,The Stabs Debugging Format}.
 591
 592 The shared library scheme has a clean interface for figuring out what
 593 shared libraries are in use, but the catch is that everything which
 594 refers to addresses (symbol tables and breakpoints at least) needs to be
 595 relocated for both shared libraries and the main executable.  At least
 596 using the standard mechanism this can only be done once the program has
 597 been run (or the core file has been read).
 598
 599 @subsection PE
 600
 601 Windows 95 and NT use the PE (Portable Executable) format for their
 602 executables.  PE is basically COFF with additional headers.
 603
 604 While BFD includes special PE support, GDB needs only the basic
 605 COFF reader.
 606
 607 @subsection ELF
 608
 609 The ELF format came with System V Release 4 (SVR4) Unix.  ELF is similar
 610 to COFF in being organized into a number of sections, but it removes
 611 many of COFF's limitations.
 612
 613 The basic ELF reader is in @file{elfread.c}.
 614
 615 @subsection SOM
 616
 617 SOM is HP's object file and debug format (not to be confused with IBM's
 618 SOM, which is a cross-language ABI).
 619
 620 The SOM reader is in @file{hpread.c}.
 621
 622 @subsection Other File Formats
 623
 624 Other file formats that have been supported by GDB include Netware
 625 Loadable Modules (@file{nlmread.c}.
 626
 627 @section Debugging File Formats
 628
 629 This section describes characteristics of debugging information that
 630 are independent of the object file format.
 631
 632 @subsection stabs
 633
 634 @code{stabs} started out as special symbols within the @code{a.out}
 635 format.  Since then, it has been encapsulated into other file
 636 formats, such as COFF and ELF.
 637
 638 While @file{dbxread.c} does some of the basic stab processing,
 639 including for encapsulated versions, @file{stabsread.c} does
 640 the real work.
 641
 642 @subsection COFF
 643
 644 The basic COFF definition includes debugging information.  The level
 645 of support is minimal and non-extensible, and is not often used.
 646
 647 @subsection Mips debug (Third Eye)
 648
 649 ECOFF includes a definition of a special debug format.
 650
 651 The file @file{mdebugread.c} implements reading for this format.
 652
 653 @subsection DWARF 1
 654
 655 DWARF 1 is a debugging format that was originally designed to be
 656 used with ELF in SVR4 systems.
 657
 658 @c CHILL_PRODUCER
 659 @c GCC_PRODUCER
 660 @c GPLUS_PRODUCER
 661 @c LCC_PRODUCER
 662 @c If defined, these are the producer strings in a DWARF 1 file.  All of
 663 @c these have reasonable defaults already.
 664
 665 The DWARF 1 reader is in @file{dwarfread.c}.
 666
 667 @subsection DWARF 2
 668
 669 DWARF 2 is an improved but incompatible version of DWARF 1.
 670
 671 The DWARF 2 reader is in @file{dwarf2read.c}.
 672
 673 @subsection SOM
 674
 675 Like COFF, the SOM definition includes debugging information.
 676
 677 @section Adding a New Symbol Reader to GDB
 678
 679 If you are using an existing object file format (a.out, COFF, ELF, etc),
 680 there is probably little to be done.
 681
 682 If you need to add a new object file format, you must first add it to
 683 BFD.  This is beyond the scope of this document.
 684
 685 You must then arrange for the BFD code to provide access to the
 686 debugging symbols.  Generally GDB will have to call swapping routines
 687 from BFD and a few other BFD internal routines to locate the debugging
 688 information.  As much as possible, GDB should not depend on the BFD
 689 internal data structures.
 690
 691 For some targets (e.g., COFF), there is a special transfer vector used
 692 to call swapping routines, since the external data structures on various
 693 platforms have different sizes and layouts.  Specialized routines that
 694 will only ever be implemented by one object file format may be called
 695 directly.  This interface should be described in a file
 696 @file{bfd/libxyz.h}, which is included by GDB.
 697
 698
 699 @node Language Support
 700
 701 @chapter Language Support
 702
 703 GDB's language support is mainly driven by the symbol reader, although
 704 it is possible for the user to set the source language manually.
 705
 706 GDB chooses the source language by looking at the extension of the file
 707 recorded in the debug info; @code{.c} means C, @code{.f} means Fortran,
 708 etc.  It may also use a special-purpose language identifier if the debug
 709 format supports it, such as DWARF.
 710
 711 @section Adding a Source Language to GDB
 712
 713 To add other languages to GDB's expression parser, follow the following
 714 steps:
 715
 716 @table @emph
 717 @item Create the expression parser.
 718
 719 This should reside in a file @file{@var{lang}-exp.y}.  Routines for
 720 building parsed expressions into a @samp{union exp_element} list are in
 721 @file{parse.c}.
 722
 723 Since we can't depend upon everyone having Bison, and YACC produces
 724 parsers that define a bunch of global names, the following lines
 725 @emph{must} be included at the top of the YACC parser, to prevent the
 726 various parsers from defining the same global names:
 727
 728 @example
 729 #define yyparse         @var{lang}_parse
 730 #define yylex   @var{lang}_lex
 731 #define yyerror         @var{lang}_error
 732 #define yylval  @var{lang}_lval
 733 #define yychar  @var{lang}_char
 734 #define yydebug         @var{lang}_debug
 735 #define yypact          @var{lang}_pact
 736 #define yyr1            @var{lang}_r1
 737 #define yyr2            @var{lang}_r2
 738 #define yydef           @var{lang}_def
 739 #define yychk           @var{lang}_chk
 740 #define yypgo           @var{lang}_pgo
 741 #define yyact   @var{lang}_act
 742 #define yyexca          @var{lang}_exca
 743 #define yyerrflag       @var{lang}_errflag
 744 #define yynerrs         @var{lang}_nerrs
 745 @end example
 746
 747 At the bottom of your parser, define a @code{struct language_defn} and
 748 initialize it with the right values for your language.  Define an
 749 @code{initialize_@var{lang}} routine and have it call
 750 @samp{add_language(@var{lang}_language_defn)} to tell the rest of GDB
 751 that your language exists.  You'll need some other supporting variables
 752 and functions, which will be used via pointers from your
 753 @code{@var{lang}_language_defn}.  See the declaration of @code{struct
 754 language_defn} in @file{language.h}, and the other @file{*-exp.y} files,
 755 for more information.
 756
 757 @item Add any evaluation routines, if necessary
 758
 759 If you need new opcodes (that represent the operations of the language),
 760 add them to the enumerated type in @file{expression.h}.  Add support
 761 code for these operations in @code{eval.c:evaluate_subexp()}.  Add cases
 762 for new opcodes in two functions from @file{parse.c}:
 763 @code{prefixify_subexp()} and @code{length_of_subexp()}.  These compute
 764 the number of @code{exp_element}s that a given operation takes up.
 765
 766 @item Update some existing code
 767
 768 Add an enumerated identifier for your language to the enumerated type
 769 @code{enum language} in @file{defs.h}.
 770
 771 Update the routines in @file{language.c} so your language is included.
 772 These routines include type predicates and such, which (in some cases)
 773 are language dependent.  If your language does not appear in the switch
 774 statement, an error is reported.
 775
 776 Also included in @file{language.c} is the code that updates the variable
 777 @code{current_language}, and the routines that translate the
 778 @code{language_@var{lang}} enumerated identifier into a printable
 779 string.
 780
 781 Update the function @code{_initialize_language} to include your
 782 language.  This function picks the default language upon startup, so is
 783 dependent upon which languages that GDB is built for.
 784
 785 Update @code{allocate_symtab} in @file{symfile.c} and/or symbol-reading
 786 code so that the language of each symtab (source file) is set properly.
 787 This is used to determine the language to use at each stack frame level.
 788 Currently, the language is set based upon the extension of the source
 789 file.  If the language can be better inferred from the symbol
 790 information, please set the language of the symtab in the symbol-reading
 791 code.
 792
 793 Add helper code to @code{expprint.c:print_subexp()} to handle any new
 794 expression opcodes you have added to @file{expression.h}.  Also, add the
 795 printed representations of your operators to @code{op_print_tab}.
 796
 797 @item Add a place of call
 798
 799 Add a call to @code{@var{lang}_parse()} and @code{@var{lang}_error} in
 800 @code{parse.c:parse_exp_1()}.
 801
 802 @item Use macros to trim code
 803
 804 The user has the option of building GDB for some or all of the
 805 languages.  If the user decides to build GDB for the language
 806 @var{lang}, then every file dependent on @file{language.h} will have the
 807 macro @code{_LANG_@var{lang}} defined in it.  Use @code{#ifdef}s to
 808 leave out large routines that the user won't need if he or she is not
 809 using your language.
 810
 811 Note that you do not need to do this in your YACC parser, since if GDB
 812 is not build for @var{lang}, then @file{@var{lang}-exp.tab.o} (the
 813 compiled form of your parser) is not linked into GDB at all.
 814
 815 See the file @file{configure.in} for how GDB is configured for different
 816 languages.
 817
 818 @item Edit @file{Makefile.in}
 819
 820 Add dependencies in @file{Makefile.in}.  Make sure you update the macro
 821 variables such as @code{HFILES} and @code{OBJS}, otherwise your code may
 822 not get linked in, or, worse yet, it may not get @code{tar}red into the
 823 distribution!
 824
 825 @end table
 826
 827
 828 @node Host Definition
 829
 830 @chapter Host Definition
 831
 832 With the advent of autoconf, it's rarely necessary to have host
 833 definition machinery anymore.
 834
 835 @section Adding a New Host
 836
 837 Most of GDB's host configuration support happens via autoconf.  It
 838 should be rare to need new host-specific definitions.  GDB still uses
 839 the host-specific definitions and files listed below, but these mostly
 840 exist for historical reasons, and should eventually disappear.
 841
 842 Several files control GDB's configuration for host systems:
 843
 844 @table @file
 845
 846 @item gdb/config/@var{arch}/@var{xyz}.mh
 847 Specifies Makefile fragments needed when hosting on machine @var{xyz}.
 848 In particular, this lists the required machine-dependent object files,
 849 by defining @samp{XDEPFILES=@dots{}}.  Also specifies the header file
 850 which describes host @var{xyz}, by defining @code{XM_FILE=
 851 xm-@var{xyz}.h}.  You can also define @code{CC}, @code{SYSV_DEFINE},
 852 @code{XM_CFLAGS}, @code{XM_ADD_FILES}, @code{XM_CLIBS}, @code{XM_CDEPS},
 853 etc.; see @file{Makefile.in}.
 854
 855 @item gdb/config/@var{arch}/xm-@var{xyz}.h
 856 (@file{xm.h} is a link to this file, created by configure).  Contains C
 857 macro definitions describing the host system environment, such as byte
 858 order, host C compiler and library.
 859
 860 @item gdb/@var{xyz}-xdep.c
 861 Contains any miscellaneous C code required for this machine as a host.
 862 On most machines it doesn't exist at all.  If it does exist, put
 863 @file{@var{xyz}-xdep.o} into the @code{XDEPFILES} line in
 864 @file{gdb/config/@var{arch}/@var{xyz}.mh}.
 865
 866 @end table
 867
 868 @subheading Generic Host Support Files
 869
 870 There are some ``generic'' versions of routines that can be used by
 871 various systems.  These can be customized in various ways by macros
 872 defined in your @file{xm-@var{xyz}.h} file.  If these routines work for
 873 the @var{xyz} host, you can just include the generic file's name (with
 874 @samp{.o}, not @samp{.c}) in @code{XDEPFILES}.
 875
 876 Otherwise, if your machine needs custom support routines, you will need
 877 to write routines that perform the same functions as the generic file.
 878 Put them into @code{@var{xyz}-xdep.c}, and put @code{@var{xyz}-xdep.o}
 879 into @code{XDEPFILES}.
 880
 881 @table @file
 882
 883 @item ser-unix.c
 884 This contains serial line support for Unix systems.  This is always
 885 included, via the makefile variable @code{SER_HARDWIRE}; override this
 886 variable in the @file{.mh} file to avoid it.
 887
 888 @item ser-go32.c
 889 This contains serial line support for 32-bit programs running under DOS,
 890 using the GO32 execution environment.
 891
 892 @item ser-tcp.c
 893 This contains generic TCP support using sockets.
 894
 895 @end table
 896
 897 @section Host Conditionals
 898
 899 When GDB is configured and compiled, various macros are defined or left
 900 undefined, to control compilation based on the attributes of the host
 901 system.  These macros and their meanings (or if the meaning is not
 902 documented here, then one of the source files where they are used is
 903 indicated) are:
 904
 905 @table @code
 906
 907 @item GDBINIT_FILENAME
 908 The default name of GDB's initialization file (normally @file{.gdbinit}).
 909
 910 @item MEM_FNS_DECLARED
 911 Your host config file defines this if it includes declarations of
 912 @code{memcpy} and @code{memset}.  Define this to avoid conflicts between
 913 the native include files and the declarations in @file{defs.h}.
 914
 915 @item NO_STD_REGS
 916 This macro is deprecated.
 917
 918 @item NO_SYS_FILE
 919 Define this if your system does not have a @code{<sys/file.h>}.
 920
 921 @item SIGWINCH_HANDLER
 922 If your host defines @code{SIGWINCH}, you can define this to be the name
 923 of a function to be called if @code{SIGWINCH} is received.
 924
 925 @item SIGWINCH_HANDLER_BODY
 926 Define this to expand into code that will define the function named by
 927 the expansion of @code{SIGWINCH_HANDLER}.
 928
 929 @item ALIGN_STACK_ON_STARTUP
 930 Define this if your system is of a sort that will crash in
 931 @code{tgetent} if the stack happens not to be longword-aligned when
 932 @code{main} is called.  This is a rare situation, but is known to occur
 933 on several different types of systems.
 934
 935 @item CRLF_SOURCE_FILES
 936 Define this if host files use @code{\r\n} rather than @code{\n} as a
 937 line terminator.  This will cause source file listings to omit @code{\r}
 938 characters when printing and it will allow \r\n line endings of files
 939 which are "sourced" by gdb.  It must be possible to open files in binary
 940 mode using @code{O_BINARY} or, for fopen, @code{"rb"}.
 941
 942 @item DEFAULT_PROMPT
 943 The default value of the prompt string (normally @code{"(gdb) "}).
 944
 945 @item DEV_TTY
 946 The name of the generic TTY device, defaults to @code{"/dev/tty"}.
 947
 948 @item FCLOSE_PROVIDED
 949 Define this if the system declares @code{fclose} in the headers included
 950 in @code{defs.h}.  This isn't needed unless your compiler is unusually
 951 anal.
 952
 953 @item FOPEN_RB
 954 Define this if binary files are opened the same way as text files.
 955
 956 @item GETENV_PROVIDED
 957 Define this if the system declares @code{getenv} in its headers included
 958 in @code{defs.h}. This isn't needed unless your compiler is unusually
 959 anal.
 960
 961 @item HAVE_MMAP
 962 In some cases, use the system call @code{mmap} for reading symbol
 963 tables.  For some machines this allows for sharing and quick updates.
 964
 965 @item HAVE_SIGSETMASK
 966 Define this if the host system has job control, but does not define
 967 @code{sigsetmask()}.  Currently, this is only true of the RS/6000.
 968
 969 @item HAVE_TERMIO
 970 Define this if the host system has @code{termio.h}.
 971
 972 @item HOST_BYTE_ORDER
 973 The ordering of bytes in the host.  This must be defined to be either
 974 @code{BIG_ENDIAN} or @code{LITTLE_ENDIAN}.
 975
 976 @item INT_MAX
 977 @item INT_MIN
 978 @item LONG_MAX
 979 @item UINT_MAX
 980 @item ULONG_MAX
 981 Values for host-side constants.
 982
 983 @item ISATTY
 984 Substitute for isatty, if not available.
 985
 986 @item LONGEST
 987 This is the longest integer type available on the host.  If not defined,
 988 it will default to @code{long long} or @code{long}, depending on
 989 @code{CC_HAS_LONG_LONG}.
 990
 991 @item CC_HAS_LONG_LONG
 992 Define this if the host C compiler supports ``long long''.  This is set
 993 by the configure script.
 994
 995 @item PRINTF_HAS_LONG_LONG
 996 Define this if the host can handle printing of long long integers via
 997 the printf format directive ``ll''. This is set by the configure script.
 998
 999 @item HAVE_LONG_DOUBLE
1000 Define this if the host C compiler supports ``long double''.  This is
1001 set by the configure script.
1002
1003 @item PRINTF_HAS_LONG_DOUBLE
1004 Define this if the host can handle printing of long double float-point
1005 numbers via the printf format directive ``Lg''. This is set by the
1006 configure script.
1007
1008 @item SCANF_HAS_LONG_DOUBLE
1009 Define this if the host can handle the parsing of long double
1010 float-point numbers via the scanf format directive directive
1011 ``Lg''. This is set by the configure script.
1012
1013 @item LSEEK_NOT_LINEAR
1014 Define this if @code{lseek (n)} does not necessarily move to byte number
1015 @code{n} in the file.  This is only used when reading source files.  It
1016 is normally faster to define @code{CRLF_SOURCE_FILES} when possible.
1017
1018 @item L_SET
1019 This macro is used as the argument to lseek (or, most commonly,
1020 bfd_seek).  FIXME, should be replaced by SEEK_SET instead, which is the
1021 POSIX equivalent.
1022
1023 @item MALLOC_INCOMPATIBLE
1024 Define this if the system's prototype for @code{malloc} differs from the
1025 @sc{ANSI} definition.
1026
1027 @item MMAP_BASE_ADDRESS
1028 When using HAVE_MMAP, the first mapping should go at this address.
1029
1030 @item MMAP_INCREMENT
1031 when using HAVE_MMAP, this is the increment between mappings.
1032
1033 @item NEED_POSIX_SETPGID
1034 Define this to use the POSIX version of @code{setpgid} to determine
1035 whether job control is available.
1036
1037 @item NORETURN
1038 If defined, this should be one or more tokens, such as @code{volatile},
1039 that can be used in both the declaration and definition of functions to
1040 indicate that they never return.  The default is already set correctly
1041 if compiling with GCC.  This will almost never need to be defined.
1042
1043 @item ATTR_NORETURN
1044 If defined, this should be one or more tokens, such as
1045 @code{__attribute__ ((noreturn))}, that can be used in the declarations
1046 of functions to indicate that they never return.  The default is already
1047 set correctly if compiling with GCC.  This will almost never need to be
1048 defined.
1049
1050 @item USE_GENERIC_DUMMY_FRAMES
1051 Define this to 1 if the target is using the generic inferior function
1052 call code.  See @code{blockframe.c} for more information.
1053
1054 @item USE_MMALLOC
1055 GDB will use the @code{mmalloc} library for memory allocation for symbol
1056 reading if this symbol is defined.  Be careful defining it since there
1057 are systems on which @code{mmalloc} does not work for some reason.  One
1058 example is the DECstation, where its RPC library can't cope with our
1059 redefinition of @code{malloc} to call @code{mmalloc}.  When defining
1060 @code{USE_MMALLOC}, you will also have to set @code{MMALLOC} in the
1061 Makefile, to point to the mmalloc library.  This define is set when you
1062 configure with --with-mmalloc.
1063
1064 @item NO_MMCHECK
1065 Define this if you are using @code{mmalloc}, but don't want the overhead
1066 of checking the heap with @code{mmcheck}.  Note that on some systems,
1067 the C runtime makes calls to malloc prior to calling @code{main}, and if
1068 @code{free} is ever called with these pointers after calling
1069 @code{mmcheck} to enable checking, a memory corruption abort is certain
1070 to occur.  These systems can still use mmalloc, but must define
1071 NO_MMCHECK.
1072
1073 @item MMCHECK_FORCE
1074 Define this to 1 if the C runtime allocates memory prior to
1075 @code{mmcheck} being called, but that memory is never freed so we don't
1076 have to worry about it triggering a memory corruption abort.  The
1077 default is 0, which means that @code{mmcheck} will only install the heap
1078 checking functions if there has not yet been any memory allocation
1079 calls, and if it fails to install the functions, gdb will issue a
1080 warning.  This is currently defined if you configure using
1081 --with-mmalloc.
1082
1083 @item NO_SIGINTERRUPT
1084 Define this to indicate that siginterrupt() is not available.
1085
1086 @item R_OK
1087 Define if this is not in a system .h file.
1088
1089 @item SEEK_CUR
1090 @item SEEK_SET
1091 Define these to appropriate value for the system lseek(), if not already
1092 defined.
1093
1094 @item STOP_SIGNAL
1095 This is the signal for stopping GDB.  Defaults to SIGTSTP.  (Only
1096 redefined for the Convex.)
1097
1098 @item USE_O_NOCTTY
1099 Define this if the interior's tty should be opened with the O_NOCTTY
1100 flag.  (FIXME: This should be a native-only flag, but @file{inflow.c} is
1101 always linked in.)
1102
1103 @item USG
1104 Means that System V (prior to SVR4) include files are in use.  (FIXME:
1105 This symbol is abused in @file{infrun.c}, @file{regex.c},
1106 @file{remote-nindy.c}, and @file{utils.c} for other things, at the
1107 moment.)
1108
1109 @item lint
1110 Define this to help placate lint in some situations.
1111
1112 @item volatile
1113 Define this to override the defaults of @code{__volatile__} or
1114 @code{/**/}.
1115
1116 @end table
1117
1118
1119 @node Target Architecture Definition
1120
1121 @chapter Target Architecture Definition
1122
1123 GDB's target architecture defines what sort of machine-language programs
1124 GDB can work with, and how it works with them.
1125
1126 At present, the target architecture definition consists of a number of C
1127 macros.
1128
1129 @section Registers and Memory
1130
1131 GDB's model of the target machine is rather simple.  GDB assumes the
1132 machine includes a bank of registers and a block of memory.  Each
1133 register may have a different size.
1134
1135 GDB does not have a magical way to match up with the compiler's idea of
1136 which registers are which; however, it is critical that they do match up
1137 accurately.  The only way to make this work is to get accurate
1138 information about the order that the compiler uses, and to reflect that
1139 in the @code{REGISTER_NAME} and related macros.
1140
1141 GDB can handle big-endian, little-endian, and bi-endian architectures.
1142
1143 @section Frame Interpretation
1144
1145 @section Inferior Call Setup
1146
1147 @section Compiler Characteristics
1148
1149 @section Target Conditionals
1150
1151 This section describes the macros that you can use to define the target
1152 machine.
1153
1154 @table @code
1155
1156 @item ADDITIONAL_OPTIONS
1157 @item ADDITIONAL_OPTION_CASES
1158 @item ADDITIONAL_OPTION_HANDLER
1159 @item ADDITIONAL_OPTION_HELP
1160 These are a set of macros that allow the addition of additional command
1161 line options to GDB.  They are currently used only for the unsupported
1162 i960 Nindy target, and should not be used in any other configuration.
1163
1164 @item ADDR_BITS_REMOVE (addr)
1165 If a raw machine address includes any bits that are not really part of
1166 the address, then define this macro to expand into an expression that
1167 zeros those bits in @var{addr}.  For example, the two low-order bits of
1168 a Motorola 88K address may be used by some kernels for their own
1169 purposes, since addresses must always be 4-byte aligned, and so are of
1170 no use for addressing.  Those bits should be filtered out with an
1171 expression such as @code{((addr) & ~3)}.
1172
1173 @item BEFORE_MAIN_LOOP_HOOK
1174 Define this to expand into any code that you want to execute before the
1175 main loop starts.  Although this is not, strictly speaking, a target
1176 conditional, that is how it is currently being used.  Note that if a
1177 configuration were to define it one way for a host and a different way
1178 for the target, GDB will probably not compile, let alone run correctly.
1179 This is currently used only for the unsupported i960 Nindy target, and
1180 should not be used in any other configuration.
1181
1182 @item BELIEVE_PCC_PROMOTION
1183 Define if the compiler promotes a short or char parameter to an int, but
1184 still reports the parameter as its original type, rather than the
1185 promoted type.
1186
1187 @item BELIEVE_PCC_PROMOTION_TYPE
1188 Define this if GDB should believe the type of a short argument when
1189 compiled by pcc, but look within a full int space to get its value.
1190 Only defined for Sun-3 at present.
1191
1192 @item BITS_BIG_ENDIAN
1193 Define this if the numbering of bits in the targets does *not* match the
1194 endianness of the target byte order.  A value of 1 means that the bits
1195 are numbered in a big-endian order, 0 means little-endian.
1196
1197 @item BREAKPOINT
1198 This is the character array initializer for the bit pattern to put into
1199 memory where a breakpoint is set.  Although it's common to use a trap
1200 instruction for a breakpoint, it's not required; for instance, the bit
1201 pattern could be an invalid instruction.  The breakpoint must be no
1202 longer than the shortest instruction of the architecture.
1203
1204 @var{BREAKPOINT} has been deprecated in favour of
1205 @var{BREAKPOINT_FROM_PC}.
1206
1207 @item BIG_BREAKPOINT
1208 @item LITTLE_BREAKPOINT
1209 Similar to BREAKPOINT, but used for bi-endian targets.
1210
1211 @var{BIG_BREAKPOINT} and @var{LITTLE_BREAKPOINT} have been deprecated in
1212 favour of @var{BREAKPOINT_FROM_PC}.
1213
1214 @item REMOTE_BREAKPOINT
1215 @item LITTLE_REMOTE_BREAKPOINT
1216 @item BIG_REMOTE_BREAKPOINT
1217 Similar to BREAKPOINT, but used for remote targets.
1218
1219 @var{BIG_REMOTE_BREAKPOINT} and @var{LITTLE_REMOTE_BREAKPOINT} have been
1220 deprecated in favour of @var{BREAKPOINT_FROM_PC}.
1221
1222 @item BREAKPOINT_FROM_PC (pcptr, lenptr)
1223
1224 Use the program counter to determine the contents and size of a
1225 breakpoint instruction.  It returns a pointer to a string of bytes that
1226 encode a breakpoint instruction, stores the length of the string to
1227 *lenptr, and adjusts pc (if necessary) to point to the actual memory
1228 location where the breakpoint should be inserted.
1229
1230 Although it is common to use a trap instruction for a breakpoint, it's
1231 not required; for instance, the bit pattern could be an invalid
1232 instruction.  The breakpoint must be no longer than the shortest
1233 instruction of the architecture.
1234
1235 Replaces all the other @var{BREAKPOINT} macros.
1236
1237 @item CALL_DUMMY_P
1238 A C expresson that is non-zero when the target suports inferior function
1239 calls.
1240
1241 @item CALL_DUMMY_WORDS
1242 Pointer to an array of @var{LONGEST} words of data containing
1243 host-byte-ordered @var{REGISTER_BYTES} sized values that partially
1244 specify the sequence of instructions needed for an inferior function
1245 call.
1246
1247 Should be deprecated in favour of a macro that uses target-byte-ordered
1248 data.
1249
1250 @item SIZEOF_CALL_DUMMY_WORDS
1251 The size of @var{CALL_DUMMY_WORDS}.  When @var{CALL_DUMMY_P} this must
1252 return a positive value.  See also @var{CALL_DUMMY_LENGTH}.
1253
1254 @item CALL_DUMMY
1255 A static initializer for @var{CALL_DUMMY_WORDS}.  Deprecated.
1256
1257 @item CALL_DUMMY_LOCATION
1258 inferior.h
1259
1260 @item CALL_DUMMY_STACK_ADJUST
1261 Stack adjustment needed when performing an inferior function call.
1262
1263 Should be deprecated in favor of something like @var{STACK_ALIGN}.
1264
1265 @item CALL_DUMMY_STACK_ADJUST_P
1266 Predicate for use of @var{CALL_DUMMY_STACK_ADJUST}.
1267
1268 Should be deprecated in favor of something like @var{STACK_ALIGN}.
1269
1270 @item CANNOT_FETCH_REGISTER (regno)
1271 A C expression that should be nonzero if @var{regno} cannot be fetched
1272 from an inferior process.  This is only relevant if
1273 @code{FETCH_INFERIOR_REGISTERS} is not defined.
1274
1275 @item CANNOT_STORE_REGISTER (regno)
1276 A C expression that should be nonzero if @var{regno} should not be
1277 written to the target.  This is often the case for program counters,
1278 status words, and other special registers.  If this is not defined, GDB
1279 will assume that all registers may be written.
1280
1281 @item DO_DEFERRED_STORES
1282 @item CLEAR_DEFERRED_STORES
1283 Define this to execute any deferred stores of registers into the inferior,
1284 and to cancel any deferred stores.
1285
1286 Currently only implemented correctly for native Sparc configurations?
1287
1288 @item CPLUS_MARKER
1289 Define this to expand into the character that G++ uses to distinguish
1290 compiler-generated identifiers from programmer-specified identifiers.
1291 By default, this expands into @code{'$'}.  Most System V targets should
1292 define this to @code{'.'}.
1293
1294 @item DBX_PARM_SYMBOL_CLASS
1295 Hook for the @code{SYMBOL_CLASS} of a parameter when decoding DBX symbol
1296 information.  In the i960, parameters can be stored as locals or as
1297 args, depending on the type of the debug record.
1298
1299 @item DECR_PC_AFTER_BREAK
1300 Define this to be the amount by which to decrement the PC after the
1301 program encounters a breakpoint.  This is often the number of bytes in
1302 BREAKPOINT, though not always.  For most targets this value will be 0.
1303
1304 @item DECR_PC_AFTER_HW_BREAK
1305 Similarly, for hardware breakpoints.
1306
1307 @item DISABLE_UNSETTABLE_BREAK addr
1308 If defined, this should evaluate to 1 if @var{addr} is in a shared
1309 library in which breakpoints cannot be set and so should be disabled.
1310
1311 @item DO_REGISTERS_INFO
1312 If defined, use this to print the value of a register or all registers.
1313
1314 @item END_OF_TEXT_DEFAULT
1315 This is an expression that should designate the end of the text section
1316 (? FIXME ?)
1317
1318 @item EXTRACT_RETURN_VALUE(type,regbuf,valbuf)
1319 Define this to extract a function's return value of type @var{type} from
1320 the raw register state @var{regbuf} and copy that, in virtual format,
1321 into @var{valbuf}.
1322
1323 @item EXTRACT_STRUCT_VALUE_ADDRESS(regbuf)
1324 When @var{EXTRACT_STRUCT_VALUE_ADDRESS_P} this is used to to extract
1325 from an array @var{regbuf} (containing the raw register state) the
1326 address in which a function should return its structure value, as a
1327 CORE_ADDR (or an expression that can be used as one).
1328
1329 @item EXTRACT_STRUCT_VALUE_ADDRESS_P
1330 Predicate for @var{EXTRACT_STRUCT_VALUE_ADDRESS}.
1331
1332 @item FLOAT_INFO
1333 If defined, then the `info float' command will print information about
1334 the processor's floating point unit.
1335
1336 @item FP_REGNUM
1337 If the virtual frame pointer is kept in a register, then define this
1338 macro to be the number (greater than or equal to zero) of that register.
1339
1340 This should only need to be defined if @code{TARGET_READ_FP} and
1341 @code{TARGET_WRITE_FP} are not defined.
1342
1343 @item FRAMELESS_FUNCTION_INVOCATION(fi)
1344 Define this to an expression that returns 1 if the function invocation
1345 represented by @var{fi} does not have a stack frame associated with it.
1346 Otherwise return 0.
1347
1348 @item FRAME_ARGS_ADDRESS_CORRECT
1349 stack.c
1350
1351 @item FRAME_CHAIN(frame)
1352 Given @var{frame}, return a pointer to the calling frame.
1353
1354 @item FRAME_CHAIN_COMBINE(chain,frame)
1355 Define this to take the frame chain pointer and the frame's nominal
1356 address and produce the nominal address of the caller's frame.
1357 Presently only defined for HP PA.
1358
1359 @item FRAME_CHAIN_VALID(chain,thisframe)
1360
1361 Define this to be an expression that returns zero if the given frame is
1362 an outermost frame, with no caller, and nonzero otherwise.  Three common
1363 definitions are available. @code{default_frame_chain_valid} (the
1364 default) is nonzero if the chain pointer is nonzero and given frame's PC
1365 is not inside the startup file (such as @file{crt0.o}).
1366 @code{alternate_frame_chain_valid} is nonzero if the chain pointer is
1367 nonzero and the given frame's PC is not in @code{main()} or a known
1368 entry point function (such as @code{_start()}).
1369
1370 @item FRAME_INIT_SAVED_REGS(frame)
1371 See @file{frame.h}.  Determines the address of all registers in the
1372 current stack frame storing each in @code{frame->saved_regs}.  Space for
1373 @code{frame->saved_regs} shall be allocated by
1374 @code{FRAME_INIT_SAVED_REGS} using either
1375 @code{frame_saved_regs_zalloc} or @code{frame_obstack_alloc}.
1376
1377 @var{FRAME_FIND_SAVED_REGS} and @var{EXTRA_FRAME_INFO} are deprecated.
1378
1379 @item FRAME_NUM_ARGS (fi)
1380 For the frame described by @var{fi} return the number of arguments that
1381 are being passed.  If the number of arguments is not known, return
1382 @code{-1}.
1383
1384 @item FRAME_SAVED_PC(frame)
1385 Given @var{frame}, return the pc saved there.  That is, the return
1386 address.
1387
1388 @item FUNCTION_EPILOGUE_SIZE
1389 For some COFF targets, the @code{x_sym.x_misc.x_fsize} field of the
1390 function end symbol is 0.  For such targets, you must define
1391 @code{FUNCTION_EPILOGUE_SIZE} to expand into the standard size of a
1392 function's epilogue.
1393
1394 @item GCC_COMPILED_FLAG_SYMBOL
1395 @item GCC2_COMPILED_FLAG_SYMBOL
1396 If defined, these are the names of the symbols that GDB will look for to
1397 detect that GCC compiled the file.  The default symbols are
1398 @code{gcc_compiled.} and @code{gcc2_compiled.}, respectively. (Currently
1399 only defined for the Delta 68.)
1400
1401 @item GDB_MULTI_ARCH
1402 If defined and non-zero, enables suport for multiple architectures
1403 within GDB.
1404
1405 The support can be enabled at two levels.  At level one, only
1406 definitions for previously undefined macros are provided; at level two,
1407 a multi-arch definition of all architecture dependant macros will be
1408 defined.
1409
1410 @item GDB_TARGET_IS_HPPA
1411 This determines whether horrible kludge code in dbxread.c and
1412 partial-stab.h is used to mangle multiple-symbol-table files from
1413 HPPA's.  This should all be ripped out, and a scheme like elfread.c
1414 used.
1415
1416 @item GDB_TARGET_IS_MACH386
1417 @item GDB_TARGET_IS_SUN3
1418 @item GDB_TARGET_IS_SUN386
1419 Kludges that should go away.
1420
1421 @item GET_LONGJMP_TARGET
1422 For most machines, this is a target-dependent parameter.  On the
1423 DECstation and the Iris, this is a native-dependent parameter, since
1424 <setjmp.h> is needed to define it.
1425
1426 This macro determines the target PC address that longjmp() will jump to,
1427 assuming that we have just stopped at a longjmp breakpoint.  It takes a
1428 CORE_ADDR * as argument, and stores the target PC value through this
1429 pointer.  It examines the current state of the machine as needed.
1430
1431 @item GET_SAVED_REGISTER
1432 Define this if you need to supply your own definition for the function
1433 @code{get_saved_register}.
1434
1435 @item HAVE_REGISTER_WINDOWS
1436 Define this if the target has register windows.
1437 @item REGISTER_IN_WINDOW_P (regnum)
1438 Define this to be an expression that is 1 if the given register is in
1439 the window.
1440
1441 @item IBM6000_TARGET
1442 Shows that we are configured for an IBM RS/6000 target.  This
1443 conditional should be eliminated (FIXME) and replaced by
1444 feature-specific macros.  It was introduced in haste and we are
1445 repenting at leisure.
1446
1447 @item IEEE_FLOAT
1448 Define this if the target system uses IEEE-format floating point numbers.
1449
1450 @item INIT_EXTRA_FRAME_INFO (fromleaf, frame)
1451 If additional information about the frame is required this should be
1452 stored in @code{frame->extra_info}.  Space for @code{frame->extra_info}
1453 is allocated using @code{frame_obstack_alloc}.
1454
1455 @item INIT_FRAME_PC (fromleaf, prev)
1456 This is a C statement that sets the pc of the frame pointed to by
1457 @var{prev}.  [By default...]
1458
1459 @item INNER_THAN (lhs,rhs)
1460 Returns non-zero if stack address @var{lhs} is inner than (nearer to the
1461 stack top) stack address @var{rhs}. Define this as @code{lhs < rhs} if
1462 the target's stack grows downward in memory, or @code{lhs > rsh} if the
1463 stack grows upward.
1464
1465 @item IN_SIGTRAMP (pc, name)
1466 Define this to return true if the given @var{pc} and/or @var{name}
1467 indicates that the current function is a sigtramp.
1468
1469 @item SIGTRAMP_START (pc)
1470 @item SIGTRAMP_END (pc)
1471 Define these to be the start and end address of the sigtramp for the
1472 given @var{pc}.  On machines where the address is just a compile time
1473 constant, the macro expansion will typically just ignore the supplied
1474 @var{pc}.
1475
1476 @item IN_SOLIB_CALL_TRAMPOLINE pc name
1477 Define this to evaluate to nonzero if the program is stopped in the
1478 trampoline that connects to a shared library.
1479
1480 @item IN_SOLIB_RETURN_TRAMPOLINE pc name
1481 Define this to evaluate to nonzero if the program is stopped in the
1482 trampoline that returns from a shared library.
1483
1484 @item IS_TRAPPED_INTERNALVAR (name)
1485 This is an ugly hook to allow the specification of special actions that
1486 should occur as a side-effect of setting the value of a variable
1487 internal to GDB.  Currently only used by the h8500.  Note that this
1488 could be either a host or target conditional.
1489
1490 @item NEED_TEXT_START_END
1491 Define this if GDB should determine the start and end addresses of the
1492 text section.  (Seems dubious.)
1493
1494 @item NO_HIF_SUPPORT
1495 (Specific to the a29k.)
1496
1497 @item SOFTWARE_SINGLE_STEP_P
1498 Define this as 1 if the target does not have a hardware single-step
1499 mechanism. The macro @code{SOFTWARE_SINGLE_STEP} must also be defined.
1500
1501 @item SOFTWARE_SINGLE_STEP(signal,insert_breapoints_p)
1502 A function that inserts or removes (dependant on
1503 @var{insert_breapoints_p}) breakpoints at each possible destinations of
1504 the next instruction. See @code{sparc-tdep.c} and @code{rs6000-tdep.c}
1505 for examples.
1506
1507 @item PCC_SOL_BROKEN
1508 (Used only in the Convex target.)
1509
1510 @item PC_IN_CALL_DUMMY
1511 inferior.h
1512
1513 @item PC_LOAD_SEGMENT
1514 If defined, print information about the load segment for the program
1515 counter.  (Defined only for the RS/6000.)
1516
1517 @item PC_REGNUM
1518 If the program counter is kept in a register, then define this macro to
1519 be the number (greater than or equal to zero) of that register.
1520
1521 This should only need to be defined if @code{TARGET_READ_PC} and
1522 @code{TARGET_WRITE_PC} are not defined.
1523
1524 @item NPC_REGNUM
1525 The number of the ``next program counter'' register, if defined.
1526
1527 @item NNPC_REGNUM
1528 The number of the ``next next program counter'' register, if defined.
1529 Currently, this is only defined for the Motorola 88K.
1530
1531 @item PRINT_REGISTER_HOOK (regno)
1532 If defined, this must be a function that prints the contents of the
1533 given register to standard output.
1534
1535 @item PRINT_TYPELESS_INTEGER
1536 This is an obscure substitute for @code{print_longest} that seems to
1537 have been defined for the Convex target.
1538
1539 @item PROCESS_LINENUMBER_HOOK
1540 A hook defined for XCOFF reading.
1541
1542 @item PROLOGUE_FIRSTLINE_OVERLAP
1543 (Only used in unsupported Convex configuration.)
1544
1545 @item PS_REGNUM
1546 If defined, this is the number of the processor status register.  (This
1547 definition is only used in generic code when parsing "$ps".)
1548
1549 @item POP_FRAME
1550 Used in @samp{call_function_by_hand} to remove an artificial stack
1551 frame.
1552
1553 @item PUSH_ARGUMENTS (nargs, args, sp, struct_return, struct_addr)
1554 Define this to push arguments onto the stack for inferior function
1555 call. Return the updated stack pointer value.
1556
1557 @item PUSH_DUMMY_FRAME
1558 Used in @samp{call_function_by_hand} to create an artificial stack frame.
1559
1560 @item REGISTER_BYTES
1561 The total amount of space needed to store GDB's copy of the machine's
1562 register state.
1563
1564 @item REGISTER_NAME(i)
1565 Return the name of register @var{i} as a string.  May return @var{NULL}
1566 or @var{NUL} to indicate that register @var{i} is not valid.
1567
1568 @item REGISTER_NAMES
1569 Deprecated in favor of @var{REGISTER_NAME}.
1570
1571 @item REG_STRUCT_HAS_ADDR (gcc_p, type)
1572 Define this to return 1 if the given type will be passed by pointer
1573 rather than directly.
1574
1575 @item SDB_REG_TO_REGNUM
1576 Define this to convert sdb register numbers into GDB regnums.  If not
1577 defined, no conversion will be done.
1578
1579 @item SHIFT_INST_REGS
1580 (Only used for m88k targets.)
1581
1582 @item SKIP_PROLOGUE (pc)
1583 A C expression that returns the address of the ``real'' code beyond the
1584 function entry prologue found at @var{pc}.
1585
1586 @item SKIP_PROLOGUE_FRAMELESS_P
1587 A C expression that should behave similarly, but that can stop as soon
1588 as the function is known to have a frame.  If not defined,
1589 @code{SKIP_PROLOGUE} will be used instead.
1590
1591 @item SKIP_TRAMPOLINE_CODE (pc)
1592 If the target machine has trampoline code that sits between callers and
1593 the functions being called, then define this macro to return a new PC
1594 that is at the start of the real function.
1595
1596 @item SP_REGNUM
1597 If the stack-pointer is kept in a register, then define this macro to be
1598 the number (greater than or equal to zero) of that register.
1599
1600 This should only need to be defined if @code{TARGET_WRITE_SP} and
1601 @code{TARGET_WRITE_SP} are not defined.
1602
1603 @item STAB_REG_TO_REGNUM
1604 Define this to convert stab register numbers (as gotten from `r'
1605 declarations) into GDB regnums.  If not defined, no conversion will be
1606 done.
1607
1608 @item STACK_ALIGN (addr)
1609 Define this to adjust the address to the alignment required for the
1610 processor's stack.
1611
1612 @item STEP_SKIPS_DELAY (addr)
1613 Define this to return true if the address is of an instruction with a
1614 delay slot.  If a breakpoint has been placed in the instruction's delay
1615 slot, GDB will single-step over that instruction before resuming
1616 normally.  Currently only defined for the Mips.
1617
1618 @item STORE_RETURN_VALUE (type, valbuf)
1619 A C expression that stores a function return value of type @var{type},
1620 where @var{valbuf} is the address of the value to be stored.
1621
1622 @item SUN_FIXED_LBRAC_BUG
1623 (Used only for Sun-3 and Sun-4 targets.)
1624
1625 @item SYMBOL_RELOADING_DEFAULT
1626 The default value of the `symbol-reloading' variable.  (Never defined in
1627 current sources.)
1628
1629 @item TARGET_BYTE_ORDER_DEFAULT
1630 The ordering of bytes in the target.  This must be either
1631 @code{BIG_ENDIAN} or @code{LITTLE_ENDIAN}.  This macro replaces
1632 @var{TARGET_BYTE_ORDER} which is deprecated.
1633
1634 @item TARGET_BYTE_ORDER_SELECTABLE_P
1635 Non-zero if the target has both @code{BIG_ENDIAN} and
1636 @code{LITTLE_ENDIAN} variants.  This macro replaces
1637 @var{TARGET_BYTE_ORDER_SELECTABLE} which is deprecated.
1638
1639 @item TARGET_CHAR_BIT
1640 Number of bits in a char; defaults to 8.
1641
1642 @item TARGET_COMPLEX_BIT
1643 Number of bits in a complex number; defaults to @code{2 * TARGET_FLOAT_BIT}.
1644
1645 At present this macro is not used.
1646
1647 @item TARGET_DOUBLE_BIT
1648 Number of bits in a double float; defaults to @code{8 * TARGET_CHAR_BIT}.
1649
1650 @item TARGET_DOUBLE_COMPLEX_BIT
1651 Number of bits in a double complex; defaults to @code{2 * TARGET_DOUBLE_BIT}.
1652
1653 At present this macro is not used.
1654
1655 @item TARGET_FLOAT_BIT
1656 Number of bits in a float; defaults to @code{4 * TARGET_CHAR_BIT}.
1657
1658 @item TARGET_INT_BIT
1659 Number of bits in an integer; defaults to @code{4 * TARGET_CHAR_BIT}.
1660
1661 @item TARGET_LONG_BIT
1662 Number of bits in a long integer; defaults to @code{4 * TARGET_CHAR_BIT}.
1663
1664 @item TARGET_LONG_DOUBLE_BIT
1665 Number of bits in a long double float;
1666 defaults to @code{2 * TARGET_DOUBLE_BIT}.
1667
1668 @item TARGET_LONG_LONG_BIT
1669 Number of bits in a long long integer; defaults to @code{2 * TARGET_LONG_BIT}.
1670
1671 @item TARGET_PTR_BIT
1672 Number of bits in a pointer; defaults to @code{TARGET_INT_BIT}.
1673
1674 @item TARGET_SHORT_BIT
1675 Number of bits in a short integer; defaults to @code{2 * TARGET_CHAR_BIT}.
1676
1677 @item TARGET_READ_PC
1678 @item TARGET_WRITE_PC (val, pid)
1679 @item TARGET_READ_SP
1680 @item TARGET_WRITE_SP
1681 @item TARGET_READ_FP
1682 @item TARGET_WRITE_FP
1683 These change the behavior of @code{read_pc}, @code{write_pc},
1684 @code{read_sp}, @code{write_sp}, @code{read_fp} and @code{write_fp}.
1685 For most targets, these may be left undefined.  GDB will call the read
1686 and write register functions with the relevant @code{_REGNUM} argument.
1687
1688 These macros are useful when a target keeps one of these registers in a
1689 hard to get at place; for example, part in a segment register and part
1690 in an ordinary register.
1691
1692 @item TARGET_VIRTUAL_FRAME_POINTER(pc,regp,offsetp)
1693 Returns a @code{(register, offset)} pair representing the virtual
1694 frame pointer in use at the code address @code{"pc"}.  If virtual
1695 frame pointers are not used, a default definition simply returns
1696 @code{FP_REGNUM}, with an offset of zero.
1697
1698 @item USE_STRUCT_CONVENTION (gcc_p, type)
1699 If defined, this must be an expression that is nonzero if a value of the
1700 given @var{type} being returned from a function must have space
1701 allocated for it on the stack.  @var{gcc_p} is true if the function
1702 being considered is known to have been compiled by GCC; this is helpful
1703 for systems where GCC is known to use different calling convention than
1704 other compilers.
1705
1706 @item VARIABLES_INSIDE_BLOCK (desc, gcc_p)
1707 For dbx-style debugging information, if the compiler puts variable
1708 declarations inside LBRAC/RBRAC blocks, this should be defined to be
1709 nonzero.  @var{desc} is the value of @code{n_desc} from the
1710 @code{N_RBRAC} symbol, and @var{gcc_p} is true if GDB has noticed the
1711 presence of either the @code{GCC_COMPILED_SYMBOL} or the
1712 @code{GCC2_COMPILED_SYMBOL}.  By default, this is 0.
1713
1714 @item OS9K_VARIABLES_INSIDE_BLOCK (desc, gcc_p)
1715 Similarly, for OS/9000.  Defaults to 1.
1716
1717 @end table
1718
1719 Motorola M68K target conditionals.
1720
1721 @table @code
1722
1723 @item BPT_VECTOR
1724 Define this to be the 4-bit location of the breakpoint trap vector.  If
1725 not defined, it will default to @code{0xf}.
1726
1727 @item REMOTE_BPT_VECTOR
1728 Defaults to @code{1}.
1729
1730 @end table
1731
1732 @section Adding a New Target
1733
1734 The following files define a target to GDB:
1735
1736 @table @file
1737
1738 @item gdb/config/@var{arch}/@var{ttt}.mt
1739 Contains a Makefile fragment specific to this target.  Specifies what
1740 object files are needed for target @var{ttt}, by defining
1741 @samp{TDEPFILES=@dots{}}.  Also specifies the header file which
1742 describes @var{ttt}, by defining @samp{TM_FILE= tm-@var{ttt}.h}.  You
1743 can also define @samp{TM_CFLAGS}, @samp{TM_CLIBS}, @samp{TM_CDEPS}, but
1744 these are now deprecated and may go away in future versions of GDB.
1745
1746 @item gdb/config/@var{arch}/tm-@var{ttt}.h
1747 (@file{tm.h} is a link to this file, created by configure).  Contains
1748 macro definitions about the target machine's registers, stack frame
1749 format and instructions.
1750
1751 @item gdb/@var{ttt}-tdep.c
1752 Contains any miscellaneous code required for this target machine.  On
1753 some machines it doesn't exist at all.  Sometimes the macros in
1754 @file{tm-@var{ttt}.h} become very complicated, so they are implemented
1755 as functions here instead, and the macro is simply defined to call the
1756 function.  This is vastly preferable, since it is easier to understand
1757 and debug.
1758
1759 @item gdb/config/@var{arch}/tm-@var{arch}.h
1760 This often exists to describe the basic layout of the target machine's
1761 processor chip (registers, stack, etc).  If used, it is included by
1762 @file{tm-@var{ttt}.h}.  It can be shared among many targets that use the
1763 same processor.
1764
1765 @item gdb/@var{arch}-tdep.c
1766 Similarly, there are often common subroutines that are shared by all
1767 target machines that use this particular architecture.
1768
1769 @end table
1770
1771 If you are adding a new operating system for an existing CPU chip, add a
1772 @file{config/tm-@var{os}.h} file that describes the operating system
1773 facilities that are unusual (extra symbol table info; the breakpoint
1774 instruction needed; etc).  Then write a @file{@var{arch}/tm-@var{os}.h}
1775 that just @code{#include}s @file{tm-@var{arch}.h} and
1776 @file{config/tm-@var{os}.h}.
1777
1778
1779 @node Target Vector Definition
1780
1781 @chapter Target Vector Definition
1782
1783 The target vector defines the interface between GDB's abstract handling
1784 of target systems, and the nitty-gritty code that actually exercises
1785 control over a process or a serial port.  GDB includes some 30-40
1786 different target vectors; however, each configuration of GDB includes
1787 only a few of them.
1788
1789 @section File Targets
1790
1791 Both executables and core files have target vectors.
1792
1793 @section Standard Protocol and Remote Stubs
1794
1795 GDB's file @file{remote.c} talks a serial protocol to code that runs in
1796 the target system.  GDB provides several sample ``stubs'' that can be
1797 integrated into target programs or operating systems for this purpose;
1798 they are named @file{*-stub.c}.
1799
1800 The GDB user's manual describes how to put such a stub into your target
1801 code.  What follows is a discussion of integrating the SPARC stub into a
1802 complicated operating system (rather than a simple program), by Stu
1803 Grossman, the author of this stub.
1804
1805 The trap handling code in the stub assumes the following upon entry to
1806 trap_low:
1807
1808 @enumerate
1809
1810 @item %l1 and %l2 contain pc and npc respectively at the time of the trap
1811
1812 @item traps are disabled
1813
1814 @item you are in the correct trap window
1815
1816 @end enumerate
1817
1818 As long as your trap handler can guarantee those conditions, then there
1819 is no reason why you shouldn't be able to `share' traps with the stub.
1820 The stub has no requirement that it be jumped to directly from the
1821 hardware trap vector.  That is why it calls @code{exceptionHandler()},
1822 which is provided by the external environment.  For instance, this could
1823 setup the hardware traps to actually execute code which calls the stub
1824 first, and then transfers to its own trap handler.
1825
1826 For the most point, there probably won't be much of an issue with
1827 `sharing' traps, as the traps we use are usually not used by the kernel,
1828 and often indicate unrecoverable error conditions.  Anyway, this is all
1829 controlled by a table, and is trivial to modify.  The most important
1830 trap for us is for @code{ta 1}.  Without that, we can't single step or
1831 do breakpoints.  Everything else is unnecessary for the proper operation
1832 of the debugger/stub.
1833
1834 From reading the stub, it's probably not obvious how breakpoints work.
1835 They are simply done by deposit/examine operations from GDB.
1836
1837 @section ROM Monitor Interface
1838
1839 @section Custom Protocols
1840
1841 @section Transport Layer
1842
1843 @section Builtin Simulator
1844
1845
1846 @node Native Debugging
1847
1848 @chapter Native Debugging
1849
1850 Several files control GDB's configuration for native support:
1851
1852 @table @file
1853
1854 @item gdb/config/@var{arch}/@var{xyz}.mh
1855 Specifies Makefile fragments needed when hosting @emph{or native} on
1856 machine @var{xyz}.  In particular, this lists the required
1857 native-dependent object files, by defining @samp{NATDEPFILES=@dots{}}.
1858 Also specifies the header file which describes native support on
1859 @var{xyz}, by defining @samp{NAT_FILE= nm-@var{xyz}.h}.  You can also
1860 define @samp{NAT_CFLAGS}, @samp{NAT_ADD_FILES}, @samp{NAT_CLIBS},
1861 @samp{NAT_CDEPS}, etc.; see @file{Makefile.in}.
1862
1863 @item gdb/config/@var{arch}/nm-@var{xyz}.h
1864 (@file{nm.h} is a link to this file, created by configure).  Contains C
1865 macro definitions describing the native system environment, such as
1866 child process control and core file support.
1867
1868 @item gdb/@var{xyz}-nat.c
1869 Contains any miscellaneous C code required for this native support of
1870 this machine.  On some machines it doesn't exist at all.
1871
1872 @end table
1873
1874 There are some ``generic'' versions of routines that can be used by
1875 various systems.  These can be customized in various ways by macros
1876 defined in your @file{nm-@var{xyz}.h} file.  If these routines work for
1877 the @var{xyz} host, you can just include the generic file's name (with
1878 @samp{.o}, not @samp{.c}) in @code{NATDEPFILES}.
1879
1880 Otherwise, if your machine needs custom support routines, you will need
1881 to write routines that perform the same functions as the generic file.
1882 Put them into @code{@var{xyz}-nat.c}, and put @code{@var{xyz}-nat.o}
1883 into @code{NATDEPFILES}.
1884
1885 @table @file
1886
1887 @item inftarg.c
1888 This contains the @emph{target_ops vector} that supports Unix child
1889 processes on systems which use ptrace and wait to control the child.
1890
1891 @item procfs.c
1892 This contains the @emph{target_ops vector} that supports Unix child
1893 processes on systems which use /proc to control the child.
1894
1895 @item fork-child.c
1896 This does the low-level grunge that uses Unix system calls to do a "fork
1897 and exec" to start up a child process.
1898
1899 @item infptrace.c
1900 This is the low level interface to inferior processes for systems using
1901 the Unix @code{ptrace} call in a vanilla way.
1902
1903 @end table
1904
1905 @section Native core file Support
1906
1907 @table @file
1908
1909 @item core-aout.c::fetch_core_registers()
1910 Support for reading registers out of a core file.  This routine calls
1911 @code{register_addr()}, see below.  Now that BFD is used to read core
1912 files, virtually all machines should use @code{core-aout.c}, and should
1913 just provide @code{fetch_core_registers} in @code{@var{xyz}-nat.c} (or
1914 @code{REGISTER_U_ADDR} in @code{nm-@var{xyz}.h}).
1915
1916 @item core-aout.c::register_addr()
1917 If your @code{nm-@var{xyz}.h} file defines the macro
1918 @code{REGISTER_U_ADDR(addr, blockend, regno)}, it should be defined to
1919 set @code{addr} to the offset within the @samp{user} struct of GDB
1920 register number @code{regno}.  @code{blockend} is the offset within the
1921 ``upage'' of @code{u.u_ar0}.  If @code{REGISTER_U_ADDR} is defined,
1922 @file{core-aout.c} will define the @code{register_addr()} function and
1923 use the macro in it.  If you do not define @code{REGISTER_U_ADDR}, but
1924 you are using the standard @code{fetch_core_registers()}, you will need
1925 to define your own version of @code{register_addr()}, put it into your
1926 @code{@var{xyz}-nat.c} file, and be sure @code{@var{xyz}-nat.o} is in
1927 the @code{NATDEPFILES} list.  If you have your own
1928 @code{fetch_core_registers()}, you may not need a separate
1929 @code{register_addr()}.  Many custom @code{fetch_core_registers()}
1930 implementations simply locate the registers themselves.@refill
1931
1932 @end table
1933
1934 When making GDB run native on a new operating system, to make it
1935 possible to debug core files, you will need to either write specific
1936 code for parsing your OS's core files, or customize
1937 @file{bfd/trad-core.c}.  First, use whatever @code{#include} files your
1938 machine uses to define the struct of registers that is accessible
1939 (possibly in the u-area) in a core file (rather than
1940 @file{machine/reg.h}), and an include file that defines whatever header
1941 exists on a core file (e.g. the u-area or a @samp{struct core}).  Then
1942 modify @code{trad_unix_core_file_p()} to use these values to set up the
1943 section information for the data segment, stack segment, any other
1944 segments in the core file (perhaps shared library contents or control
1945 information), ``registers'' segment, and if there are two discontiguous
1946 sets of registers (e.g.  integer and float), the ``reg2'' segment.  This
1947 section information basically delimits areas in the core file in a
1948 standard way, which the section-reading routines in BFD know how to seek
1949 around in.
1950
1951 Then back in GDB, you need a matching routine called
1952 @code{fetch_core_registers()}.  If you can use the generic one, it's in
1953 @file{core-aout.c}; if not, it's in your @file{@var{xyz}-nat.c} file.
1954 It will be passed a char pointer to the entire ``registers'' segment,
1955 its length, and a zero; or a char pointer to the entire ``regs2''
1956 segment, its length, and a 2.  The routine should suck out the supplied
1957 register values and install them into GDB's ``registers'' array.
1958
1959 If your system uses @file{/proc} to control processes, and uses ELF
1960 format core files, then you may be able to use the same routines for
1961 reading the registers out of processes and out of core files.
1962
1963 @section ptrace
1964
1965 @section /proc
1966
1967 @section win32
1968
1969 @section shared libraries
1970
1971 @section Native Conditionals
1972
1973 When GDB is configured and compiled, various macros are defined or left
1974 undefined, to control compilation when the host and target systems are
1975 the same.  These macros should be defined (or left undefined) in
1976 @file{nm-@var{system}.h}.
1977
1978 @table @code
1979
1980 @item ATTACH_DETACH
1981 If defined, then GDB will include support for the @code{attach} and
1982 @code{detach} commands.
1983
1984 @item CHILD_PREPARE_TO_STORE
1985 If the machine stores all registers at once in the child process, then
1986 define this to ensure that all values are correct.  This usually entails
1987 a read from the child.
1988
1989 [Note that this is incorrectly defined in @file{xm-@var{system}.h} files
1990 currently.]
1991
1992 @item FETCH_INFERIOR_REGISTERS
1993 Define this if the native-dependent code will provide its own routines
1994 @code{fetch_inferior_registers} and @code{store_inferior_registers} in
1995 @file{@var{HOST}-nat.c}.  If this symbol is @emph{not} defined, and
1996 @file{infptrace.c} is included in this configuration, the default
1997 routines in @file{infptrace.c} are used for these functions.
1998
1999 @item FILES_INFO_HOOK
2000 (Only defined for Convex.)
2001
2002 @item FP0_REGNUM
2003 This macro is normally defined to be the number of the first floating
2004 point register, if the machine has such registers.  As such, it would
2005 appear only in target-specific code.  However, /proc support uses this
2006 to decide whether floats are in use on this target.
2007
2008 @item GET_LONGJMP_TARGET
2009 For most machines, this is a target-dependent parameter.  On the
2010 DECstation and the Iris, this is a native-dependent parameter, since
2011 <setjmp.h> is needed to define it.
2012
2013 This macro determines the target PC address that longjmp() will jump to,
2014 assuming that we have just stopped at a longjmp breakpoint.  It takes a
2015 CORE_ADDR * as argument, and stores the target PC value through this
2016 pointer.  It examines the current state of the machine as needed.
2017
2018 @item KERNEL_U_ADDR
2019 Define this to the address of the @code{u} structure (the ``user
2020 struct'', also known as the ``u-page'') in kernel virtual memory.  GDB
2021 needs to know this so that it can subtract this address from absolute
2022 addresses in the upage, that are obtained via ptrace or from core files.
2023 On systems that don't need this value, set it to zero.
2024
2025 @item KERNEL_U_ADDR_BSD
2026 Define this to cause GDB to determine the address of @code{u} at
2027 runtime, by using Berkeley-style @code{nlist} on the kernel's image in
2028 the root directory.
2029
2030 @item KERNEL_U_ADDR_HPUX
2031 Define this to cause GDB to determine the address of @code{u} at
2032 runtime, by using HP-style @code{nlist} on the kernel's image in the
2033 root directory.
2034
2035 @item ONE_PROCESS_WRITETEXT
2036 Define this to be able to, when a breakpoint insertion fails, warn the
2037 user that another process may be running with the same executable.
2038
2039 @item PROC_NAME_FMT
2040 Defines the format for the name of a @file{/proc} device.  Should be
2041 defined in @file{nm.h} @emph{only} in order to override the default
2042 definition in @file{procfs.c}.
2043
2044 @item PTRACE_FP_BUG
2045 mach386-xdep.c
2046
2047 @item PTRACE_ARG3_TYPE
2048 The type of the third argument to the @code{ptrace} system call, if it
2049 exists and is different from @code{int}.
2050
2051 @item REGISTER_U_ADDR
2052 Defines the offset of the registers in the ``u area''.
2053
2054 @item SHELL_COMMAND_CONCAT
2055 If defined, is a string to prefix on the shell command used to start the
2056 inferior.
2057
2058 @item SHELL_FILE
2059 If defined, this is the name of the shell to use to run the inferior.
2060 Defaults to @code{"/bin/sh"}.
2061
2062 @item SOLIB_ADD (filename, from_tty, targ)
2063 Define this to expand into an expression that will cause the symbols in
2064 @var{filename} to be added to GDB's symbol table.
2065
2066 @item SOLIB_CREATE_INFERIOR_HOOK
2067 Define this to expand into any shared-library-relocation code that you
2068 want to be run just after the child process has been forked.
2069
2070 @item START_INFERIOR_TRAPS_EXPECTED
2071 When starting an inferior, GDB normally expects to trap twice; once when
2072 the shell execs, and once when the program itself execs.  If the actual
2073 number of traps is something other than 2, then define this macro to
2074 expand into the number expected.
2075
2076 @item SVR4_SHARED_LIBS
2077 Define this to indicate that SVR4-style shared libraries are in use.
2078
2079 @item USE_PROC_FS
2080 This determines whether small routines in @file{*-tdep.c}, which
2081 translate register values between GDB's internal representation and the
2082 /proc representation, are compiled.
2083
2084 @item U_REGS_OFFSET
2085 This is the offset of the registers in the upage.  It need only be
2086 defined if the generic ptrace register access routines in
2087 @file{infptrace.c} are being used (that is, @file{infptrace.c} is
2088 configured in, and @code{FETCH_INFERIOR_REGISTERS} is not defined).  If
2089 the default value from @file{infptrace.c} is good enough, leave it
2090 undefined.
2091
2092 The default value means that u.u_ar0 @emph{points to} the location of
2093 the registers.  I'm guessing that @code{#define U_REGS_OFFSET 0} means
2094 that u.u_ar0 @emph{is} the location of the registers.
2095
2096 @item CLEAR_SOLIB
2097 objfiles.c
2098
2099 @item DEBUG_PTRACE
2100 Define this to debug ptrace calls.
2101
2102 @end table
2103
2104
2105 @node Support Libraries
2106
2107 @chapter Support Libraries
2108
2109 @section BFD
2110
2111 BFD provides support for GDB in several ways:
2112
2113 @table @emph
2114
2115 @item identifying executable and core files
2116 BFD will identify a variety of file types, including a.out, coff, and
2117 several variants thereof, as well as several kinds of core files.
2118
2119 @item access to sections of files
2120 BFD parses the file headers to determine the names, virtual addresses,
2121 sizes, and file locations of all the various named sections in files
2122 (such as the text section or the data section).  GDB simply calls BFD to
2123 read or write section X at byte offset Y for length Z.
2124
2125 @item specialized core file support
2126 BFD provides routines to determine the failing command name stored in a
2127 core file, the signal with which the program failed, and whether a core
2128 file matches (i.e. could be a core dump of) a particular executable
2129 file.
2130
2131 @item locating the symbol information
2132 GDB uses an internal interface of BFD to determine where to find the
2133 symbol information in an executable file or symbol-file.  GDB itself
2134 handles the reading of symbols, since BFD does not ``understand'' debug
2135 symbols, but GDB uses BFD's cached information to find the symbols,
2136 string table, etc.
2137
2138 @end table
2139
2140 @section opcodes
2141
2142 The opcodes library provides GDB's disassembler.  (It's a separate
2143 library because it's also used in binutils, for @file{objdump}).
2144
2145 @section readline
2146
2147 @section mmalloc
2148
2149 @section libiberty
2150
2151 @section gnu-regex
2152
2153 Regex conditionals.
2154
2155 @table @code
2156
2157 @item C_ALLOCA
2158
2159 @item NFAILURES
2160
2161 @item RE_NREGS
2162
2163 @item SIGN_EXTEND_CHAR
2164
2165 @item SWITCH_ENUM_BUG
2166
2167 @item SYNTAX_TABLE
2168
2169 @item Sword
2170
2171 @item sparc
2172
2173 @end table
2174
2175 @section include
2176
2177 @node Coding
2178
2179 @chapter Coding
2180
2181 This chapter covers topics that are lower-level than the major
2182 algorithms of GDB.
2183
2184 @section Cleanups
2185
2186 Cleanups are a structured way to deal with things that need to be done
2187 later.  When your code does something (like @code{malloc} some memory,
2188 or open a file) that needs to be undone later (e.g. free the memory or
2189 close the file), it can make a cleanup.  The cleanup will be done at
2190 some future point: when the command is finished, when an error occurs,
2191 or when your code decides it's time to do cleanups.
2192
2193 You can also discard cleanups, that is, throw them away without doing
2194 what they say.  This is only done if you ask that it be done.
2195
2196 Syntax:
2197
2198 @table @code
2199
2200 @item struct cleanup *@var{old_chain};
2201 Declare a variable which will hold a cleanup chain handle.
2202
2203 @item @var{old_chain} = make_cleanup (@var{function}, @var{arg});
2204 Make a cleanup which will cause @var{function} to be called with
2205 @var{arg} (a @code{char *}) later.  The result, @var{old_chain}, is a
2206 handle that can be passed to @code{do_cleanups} or
2207 @code{discard_cleanups} later.  Unless you are going to call
2208 @code{do_cleanups} or @code{discard_cleanups} yourself, you can ignore
2209 the result from @code{make_cleanup}.
2210
2211 @item do_cleanups (@var{old_chain});
2212 Perform all cleanups done since @code{make_cleanup} returned
2213 @var{old_chain}.  E.g.:
2214 @example
2215 make_cleanup (a, 0);
2216 old = make_cleanup (b, 0);
2217 do_cleanups (old);
2218 @end example
2219 @noindent
2220 will call @code{b()} but will not call @code{a()}.  The cleanup that
2221 calls @code{a()} will remain in the cleanup chain, and will be done
2222 later unless otherwise discarded.@refill
2223
2224 @item discard_cleanups (@var{old_chain});
2225 Same as @code{do_cleanups} except that it just removes the cleanups from
2226 the chain and does not call the specified functions.
2227
2228 @end table
2229
2230 Some functions, e.g. @code{fputs_filtered()} or @code{error()}, specify
2231 that they ``should not be called when cleanups are not in place''.  This
2232 means that any actions you need to reverse in the case of an error or
2233 interruption must be on the cleanup chain before you call these
2234 functions, since they might never return to your code (they
2235 @samp{longjmp} instead).
2236
2237 @section Wrapping Output Lines
2238
2239 Output that goes through @code{printf_filtered} or @code{fputs_filtered}
2240 or @code{fputs_demangled} needs only to have calls to @code{wrap_here}
2241 added in places that would be good breaking points.  The utility
2242 routines will take care of actually wrapping if the line width is
2243 exceeded.
2244
2245 The argument to @code{wrap_here} is an indentation string which is
2246 printed @emph{only} if the line breaks there.  This argument is saved
2247 away and used later.  It must remain valid until the next call to
2248 @code{wrap_here} or until a newline has been printed through the
2249 @code{*_filtered} functions.  Don't pass in a local variable and then
2250 return!
2251
2252 It is usually best to call @code{wrap_here()} after printing a comma or
2253 space.  If you call it before printing a space, make sure that your
2254 indentation properly accounts for the leading space that will print if
2255 the line wraps there.
2256
2257 Any function or set of functions that produce filtered output must
2258 finish by printing a newline, to flush the wrap buffer, before switching
2259 to unfiltered (``@code{printf}'') output.  Symbol reading routines that
2260 print warnings are a good example.
2261
2262 @section GDB Coding Standards
2263
2264 GDB follows the GNU coding standards, as described in
2265 @file{etc/standards.texi}.  This file is also available for anonymous
2266 FTP from GNU archive sites.  GDB takes a strict interpretation of the
2267 standard; in general, when the GNU standard recommends a practice but
2268 does not require it, GDB requires it.
2269
2270 GDB follows an additional set of coding standards specific to GDB,
2271 as described in the following sections.
2272
2273 You can configure with @samp{--enable-build-warnings} to get GCC to
2274 check on a number of these rules.  GDB sources ought not to engender any
2275 complaints, unless they are caused by bogus host systems.  (The exact
2276 set of enabled warnings is currently @samp{-Wall -Wpointer-arith
2277 -Wstrict-prototypes -Wmissing-prototypes -Wmissing-declarations}.
2278
2279 @subsection Formatting
2280
2281 The standard GNU recommendations for formatting must be followed
2282 strictly.
2283
2284 Note that while in a definition, the function's name must be in column
2285 zero; in a function declaration, the name must be on the same line as
2286 the return type.
2287
2288 In addition, there must be a space between a function or macro name and
2289 the opening parenthesis of its argument list (except for macro
2290 definitions, as required by C).  There must not be a space after an open
2291 paren/bracket or before a close paren/bracket.
2292
2293 While additional whitespace is generally helpful for reading, do not use
2294 more than one blank line to separate blocks, and avoid adding whitespace
2295 after the end of a program line (as of 1/99, some 600 lines had whitespace
2296 after the semicolon).  Excess whitespace causes difficulties for diff and
2297 patch.
2298
2299 @subsection Comments
2300
2301 The standard GNU requirements on comments must be followed strictly.
2302
2303 Block comments must appear in the following form, with no `/*'- or
2304 '*/'-only lines, and no leading `*':
2305
2306 @example @code
2307 /* Wait for control to return from inferior to debugger.  If inferior
2308    gets a signal, we may decide to start it up again instead of
2309    returning.  That is why there is a loop in this function.  When
2310    this function actually returns it means the inferior should be left
2311    stopped and GDB should read more commands.  */
2312 @end example
2313
2314 (Note that this format is encouraged by Emacs; tabbing for a multi-line
2315 comment works correctly, and M-Q fills the block consistently.)
2316
2317 Put a blank line between the block comments preceding function or
2318 variable definitions, and the definition itself.
2319
2320 In general, put function-body comments on lines by themselves, rather
2321 than trying to fit them into the 20 characters left at the end of a
2322 line, since either the comment or the code will inevitably get longer
2323 than will fit, and then somebody will have to move it anyhow.
2324
2325 @subsection C Usage
2326
2327 Code must not depend on the sizes of C data types, the format of the
2328 host's floating point numbers, the alignment of anything, or the order
2329 of evaluation of expressions.
2330
2331 Use functions freely.  There are only a handful of compute-bound areas
2332 in GDB that might be affected by the overhead of a function call, mainly
2333 in symbol reading.  Most of GDB's performance is limited by the target
2334 interface (whether serial line or system call).
2335
2336 However, use functions with moderation.  A thousand one-line functions
2337 are just as hard to understand as a single thousand-line function.
2338
2339 @subsection Function Prototypes
2340
2341 Prototypes must be used to @emph{declare} functions but never to
2342 @emph{define} them.  Prototypes for GDB functions must include both the
2343 argument type and name, with the name matching that used in the actual
2344 function definition.
2345
2346 For the sake of compatibility with pre-ANSI compilers, define prototypes
2347 with the @code{PARAMS} macro:
2348
2349 @example @code
2350 extern int memory_remove_breakpoint PARAMS ((CORE_ADDR addr,
2351                                              char *contents_cache));
2352 @end example
2353
2354 Note the double parentheses around the parameter types.  This allows an
2355 arbitrary number of parameters to be described, without freaking out the
2356 C preprocessor.  When the function has no parameters, it should be
2357 described like:
2358
2359 @example @code
2360 extern void noprocess PARAMS ((void));
2361 @end example
2362
2363 The @code{PARAMS} macro expands to its argument in ANSI C, or to a
2364 simple @code{()} in traditional C.
2365
2366 All external functions should have a @code{PARAMS} declaration in a
2367 header file that callers include, except for @code{_initialize_*}
2368 functions, which must be external so that @file{init.c} construction
2369 works, but shouldn't be visible to random source files.
2370
2371 All static functions must be declared in a block near the top of the
2372 source file.
2373
2374 @subsection Clean Design
2375
2376 In addition to getting the syntax right, there's the little question of
2377 semantics.  Some things are done in certain ways in GDB because long
2378 experience has shown that the more obvious ways caused various kinds of
2379 trouble.
2380
2381 You can't assume the byte order of anything that comes from a target
2382 (including @var{value}s, object files, and instructions).  Such things
2383 must be byte-swapped using @code{SWAP_TARGET_AND_HOST} in GDB, or one of
2384 the swap routines defined in @file{bfd.h}, such as @code{bfd_get_32}.
2385
2386 You can't assume that you know what interface is being used to talk to
2387 the target system.  All references to the target must go through the
2388 current @code{target_ops} vector.
2389
2390 You can't assume that the host and target machines are the same machine
2391 (except in the ``native'' support modules).  In particular, you can't
2392 assume that the target machine's header files will be available on the
2393 host machine.  Target code must bring along its own header files --
2394 written from scratch or explicitly donated by their owner, to avoid
2395 copyright problems.
2396
2397 Insertion of new @code{#ifdef}'s will be frowned upon.  It's much better
2398 to write the code portably than to conditionalize it for various
2399 systems.
2400
2401 New @code{#ifdef}'s which test for specific compilers or manufacturers
2402 or operating systems are unacceptable.  All @code{#ifdef}'s should test
2403 for features.  The information about which configurations contain which
2404 features should be segregated into the configuration files.  Experience
2405 has proven far too often that a feature unique to one particular system
2406 often creeps into other systems; and that a conditional based on some
2407 predefined macro for your current system will become worthless over
2408 time, as new versions of your system come out that behave differently
2409 with regard to this feature.
2410
2411 Adding code that handles specific architectures, operating systems,
2412 target interfaces, or hosts, is not acceptable in generic code.  If a
2413 hook is needed at that point, invent a generic hook and define it for
2414 your configuration, with something like:
2415
2416 @example
2417 #ifdef  WRANGLE_SIGNALS
2418    WRANGLE_SIGNALS (signo);
2419 #endif
2420 @end example
2421
2422 In your host, target, or native configuration file, as appropriate,
2423 define @code{WRANGLE_SIGNALS} to do the machine-dependent thing.  Take a
2424 bit of care in defining the hook, so that it can be used by other ports
2425 in the future, if they need a hook in the same place.
2426
2427 If the hook is not defined, the code should do whatever "most" machines
2428 want.  Using @code{#ifdef}, as above, is the preferred way to do this,
2429 but sometimes that gets convoluted, in which case use
2430
2431 @example
2432 #ifndef SPECIAL_FOO_HANDLING
2433 #define SPECIAL_FOO_HANDLING(pc, sp) (0)
2434 #endif
2435 @end example
2436
2437 where the macro is used or in an appropriate header file.
2438
2439 Whether to include a @dfn{small} hook, a hook around the exact pieces of
2440 code which are system-dependent, or whether to replace a whole function
2441 with a hook depends on the case.  A good example of this dilemma can be
2442 found in @code{get_saved_register}.  All machines that GDB 2.8 ran on
2443 just needed the @code{FRAME_FIND_SAVED_REGS} hook to find the saved
2444 registers.  Then the SPARC and Pyramid came along, and
2445 @code{HAVE_REGISTER_WINDOWS} and @code{REGISTER_IN_WINDOW_P} were
2446 introduced.  Then the 29k and 88k required the @code{GET_SAVED_REGISTER}
2447 hook.  The first three are examples of small hooks; the latter replaces
2448 a whole function.  In this specific case, it is useful to have both
2449 kinds; it would be a bad idea to replace all the uses of the small hooks
2450 with @code{GET_SAVED_REGISTER}, since that would result in much
2451 duplicated code.  Other times, duplicating a few lines of code here or
2452 there is much cleaner than introducing a large number of small hooks.
2453
2454 Another way to generalize GDB along a particular interface is with an
2455 attribute struct.  For example, GDB has been generalized to handle
2456 multiple kinds of remote interfaces -- not by #ifdef's everywhere, but
2457 by defining the "target_ops" structure and having a current target (as
2458 well as a stack of targets below it, for memory references).  Whenever
2459 something needs to be done that depends on which remote interface we are
2460 using, a flag in the current target_ops structure is tested (e.g.
2461 `target_has_stack'), or a function is called through a pointer in the
2462 current target_ops structure.  In this way, when a new remote interface
2463 is added, only one module needs to be touched -- the one that actually
2464 implements the new remote interface.  Other examples of
2465 attribute-structs are BFD access to multiple kinds of object file
2466 formats, or GDB's access to multiple source languages.
2467
2468 Please avoid duplicating code.  For example, in GDB 3.x all the code
2469 interfacing between @code{ptrace} and the rest of GDB was duplicated in
2470 @file{*-dep.c}, and so changing something was very painful.  In GDB 4.x,
2471 these have all been consolidated into @file{infptrace.c}.
2472 @file{infptrace.c} can deal with variations between systems the same way
2473 any system-independent file would (hooks, #if defined, etc.), and
2474 machines which are radically different don't need to use infptrace.c at
2475 all.
2476
2477
2478 @node Porting GDB
2479
2480 @chapter Porting GDB
2481
2482 Most of the work in making GDB compile on a new machine is in specifying
2483 the configuration of the machine.  This is done in a dizzying variety of
2484 header files and configuration scripts, which we hope to make more
2485 sensible soon.  Let's say your new host is called an @var{xyz} (e.g.
2486 @samp{sun4}), and its full three-part configuration name is
2487 @code{@var{arch}-@var{xvend}-@var{xos}} (e.g.  @samp{sparc-sun-sunos4}).
2488 In particular:
2489
2490 In the top level directory, edit @file{config.sub} and add @var{arch},
2491 @var{xvend}, and @var{xos} to the lists of supported architectures,
2492 vendors, and operating systems near the bottom of the file.  Also, add
2493 @var{xyz} as an alias that maps to
2494 @code{@var{arch}-@var{xvend}-@var{xos}}.  You can test your changes by
2495 running
2496
2497 @example
2498 ./config.sub @var{xyz}
2499 @end example
2500 @noindent
2501 and
2502 @example
2503 ./config.sub @code{@var{arch}-@var{xvend}-@var{xos}}
2504 @end example
2505 @noindent
2506 which should both respond with @code{@var{arch}-@var{xvend}-@var{xos}}
2507 and no error messages.
2508
2509 You need to port BFD, if that hasn't been done already.  Porting BFD is
2510 beyond the scope of this manual.
2511
2512 To configure GDB itself, edit @file{gdb/configure.host} to recognize
2513 your system and set @code{gdb_host} to @var{xyz}, and (unless your
2514 desired target is already available) also edit @file{gdb/configure.tgt},
2515 setting @code{gdb_target} to something appropriate (for instance,
2516 @var{xyz}).
2517
2518 Finally, you'll need to specify and define GDB's host-, native-, and
2519 target-dependent @file{.h} and @file{.c} files used for your
2520 configuration.
2521
2522 @section Configuring GDB for Release
2523
2524 From the top level directory (containing @file{gdb}, @file{bfd},
2525 @file{libiberty}, and so on):
2526 @example
2527 make -f Makefile.in gdb.tar.gz
2528 @end example
2529
2530 This will properly configure, clean, rebuild any files that are
2531 distributed pre-built (e.g. @file{c-exp.tab.c} or @file{refcard.ps}),
2532 and will then make a tarfile.  (If the top level directory has already
2533 been configured, you can just do @code{make gdb.tar.gz} instead.)
2534
2535 This procedure requires:
2536 @itemize @bullet
2537 @item symbolic links
2538 @item @code{makeinfo} (texinfo2 level)
2539 @item @TeX{}
2540 @item @code{dvips}
2541 @item @code{yacc} or @code{bison}
2542 @end itemize
2543 @noindent
2544 @dots{} and the usual slew of utilities (@code{sed}, @code{tar}, etc.).
2545
2546 @subheading TEMPORARY RELEASE PROCEDURE FOR DOCUMENTATION
2547
2548 @file{gdb.texinfo} is currently marked up using the texinfo-2 macros,
2549 which are not yet a default for anything (but we have to start using
2550 them sometime).
2551
2552 For making paper, the only thing this implies is the right generation of
2553 @file{texinfo.tex} needs to be included in the distribution.
2554
2555 For making info files, however, rather than duplicating the texinfo2
2556 distribution, generate @file{gdb-all.texinfo} locally, and include the
2557 files @file{gdb.info*} in the distribution.  Note the plural;
2558 @code{makeinfo} will split the document into one overall file and five
2559 or so included files.
2560
2561 @node Hints
2562
2563 @chapter Hints
2564
2565 Check the @file{README} file, it often has useful information that does not
2566 appear anywhere else in the directory.
2567
2568 @menu
2569 * Getting Started::             Getting started working on GDB
2570 * Debugging GDB::               Debugging GDB with itself
2571 @end menu
2572
2573 @node Getting Started,,, Hints
2574
2575 @section Getting Started
2576
2577 GDB is a large and complicated program, and if you first starting to
2578 work on it, it can be hard to know where to start.  Fortunately, if you
2579 know how to go about it, there are ways to figure out what is going on.
2580
2581 This manual, the GDB Internals manual, has information which applies
2582 generally to many parts of GDB.
2583
2584 Information about particular functions or data structures are located in
2585 comments with those functions or data structures.  If you run across a
2586 function or a global variable which does not have a comment correctly
2587 explaining what is does, this can be thought of as a bug in GDB; feel
2588 free to submit a bug report, with a suggested comment if you can figure
2589 out what the comment should say.  If you find a comment which is
2590 actually wrong, be especially sure to report that.
2591
2592 Comments explaining the function of macros defined in host, target, or
2593 native dependent files can be in several places.  Sometimes they are
2594 repeated every place the macro is defined.  Sometimes they are where the
2595 macro is used.  Sometimes there is a header file which supplies a
2596 default definition of the macro, and the comment is there.  This manual
2597 also documents all the available macros.
2598 @c (@pxref{Host Conditionals}, @pxref{Target
2599 @c Conditionals}, @pxref{Native Conditionals}, and @pxref{Obsolete
2600 @c Conditionals})
2601
2602 Start with the header files.  Once you some idea of how GDB's internal
2603 symbol tables are stored (see @file{symtab.h}, @file{gdbtypes.h}), you
2604 will find it much easier to understand the code which uses and creates
2605 those symbol tables.
2606
2607 You may wish to process the information you are getting somehow, to
2608 enhance your understanding of it.  Summarize it, translate it to another
2609 language, add some (perhaps trivial or non-useful) feature to GDB, use
2610 the code to predict what a test case would do and write the test case
2611 and verify your prediction, etc.  If you are reading code and your eyes
2612 are starting to glaze over, this is a sign you need to use a more active
2613 approach.
2614
2615 Once you have a part of GDB to start with, you can find more
2616 specifically the part you are looking for by stepping through each
2617 function with the @code{next} command.  Do not use @code{step} or you
2618 will quickly get distracted; when the function you are stepping through
2619 calls another function try only to get a big-picture understanding
2620 (perhaps using the comment at the beginning of the function being
2621 called) of what it does.  This way you can identify which of the
2622 functions being called by the function you are stepping through is the
2623 one which you are interested in.  You may need to examine the data
2624 structures generated at each stage, with reference to the comments in
2625 the header files explaining what the data structures are supposed to
2626 look like.
2627
2628 Of course, this same technique can be used if you are just reading the
2629 code, rather than actually stepping through it.  The same general
2630 principle applies---when the code you are looking at calls something
2631 else, just try to understand generally what the code being called does,
2632 rather than worrying about all its details.
2633
2634 A good place to start when tracking down some particular area is with a
2635 command which invokes that feature.  Suppose you want to know how
2636 single-stepping works.  As a GDB user, you know that the @code{step}
2637 command invokes single-stepping.  The command is invoked via command
2638 tables (see @file{command.h}); by convention the function which actually
2639 performs the command is formed by taking the name of the command and
2640 adding @samp{_command}, or in the case of an @code{info} subcommand,
2641 @samp{_info}.  For example, the @code{step} command invokes the
2642 @code{step_command} function and the @code{info display} command invokes
2643 @code{display_info}.  When this convention is not followed, you might
2644 have to use @code{grep} or @kbd{M-x tags-search} in emacs, or run GDB on
2645 itself and set a breakpoint in @code{execute_command}.
2646
2647 If all of the above fail, it may be appropriate to ask for information
2648 on @code{bug-gdb}.  But @emph{never} post a generic question like ``I was
2649 wondering if anyone could give me some tips about understanding
2650 GDB''---if we had some magic secret we would put it in this manual.
2651 Suggestions for improving the manual are always welcome, of course.
2652
2653 @node Debugging GDB,,,Hints
2654
2655 @section Debugging GDB with itself
2656
2657 If GDB is limping on your machine, this is the preferred way to get it
2658 fully functional.  Be warned that in some ancient Unix systems, like
2659 Ultrix 4.2, a program can't be running in one process while it is being
2660 debugged in another.  Rather than typing the command @code{@w{./gdb
2661 ./gdb}}, which works on Suns and such, you can copy @file{gdb} to
2662 @file{gdb2} and then type @code{@w{./gdb ./gdb2}}.
2663
2664 When you run GDB in the GDB source directory, it will read a
2665 @file{.gdbinit} file that sets up some simple things to make debugging
2666 gdb easier.  The @code{info} command, when executed without a subcommand
2667 in a GDB being debugged by gdb, will pop you back up to the top level
2668 gdb.  See @file{.gdbinit} for details.
2669
2670 If you use emacs, you will probably want to do a @code{make TAGS} after
2671 you configure your distribution; this will put the machine dependent
2672 routines for your local machine where they will be accessed first by
2673 @kbd{M-.}
2674
2675 Also, make sure that you've either compiled GDB with your local cc, or
2676 have run @code{fixincludes} if you are compiling with gcc.
2677
2678 @section Submitting Patches
2679
2680 Thanks for thinking of offering your changes back to the community of
2681 GDB users.  In general we like to get well designed enhancements.
2682 Thanks also for checking in advance about the best way to transfer the
2683 changes.
2684
2685 The GDB maintainers will only install ``cleanly designed'' patches.  You
2686 may not always agree on what is clean design.
2687 @c @pxref{Coding Style}, @pxref{Clean Design}.
2688
2689 If the maintainers don't have time to put the patch in when it arrives,
2690 or if there is any question about a patch, it goes into a large queue
2691 with everyone else's patches and bug reports.
2692
2693 The legal issue is that to incorporate substantial changes requires a
2694 copyright assignment from you and/or your employer, granting ownership
2695 of the changes to the Free Software Foundation.  You can get the
2696 standard document for doing this by sending mail to
2697 @code{gnu@@prep.ai.mit.edu} and asking for it.  I recommend that people
2698 write in "All programs owned by the Free Software Foundation" as "NAME
2699 OF PROGRAM", so that changes in many programs (not just GDB, but GAS,
2700 Emacs, GCC, etc) can be contributed with only one piece of legalese
2701 pushed through the bureacracy and filed with the FSF.  I can't start
2702 merging changes until this paperwork is received by the FSF (their
2703 rules, which I follow since I maintain it for them).
2704
2705 Technically, the easiest way to receive changes is to receive each
2706 feature as a small context diff or unidiff, suitable for "patch".
2707 Each message sent to me should include the changes to C code and
2708 header files for a single feature, plus ChangeLog entries for each
2709 directory where files were modified, and diffs for any changes needed
2710 to the manuals (gdb/doc/gdb.texi or gdb/doc/gdbint.texi).  If there
2711 are a lot of changes for a single feature, they can be split down
2712 into multiple messages.
2713
2714 In this way, if I read and like the feature, I can add it to the
2715 sources with a single patch command, do some testing, and check it in.
2716 If you leave out the ChangeLog, I have to write one.  If you leave
2717 out the doc, I have to puzzle out what needs documenting.  Etc.
2718
2719 The reason to send each change in a separate message is that I will
2720 not install some of the changes.  They'll be returned to you with
2721 questions or comments.  If I'm doing my job, my message back to you
2722 will say what you have to fix in order to make the change acceptable.
2723 The reason to have separate messages for separate features is so
2724 that other changes (which I @emph{am} willing to accept) can be installed
2725 while one or more changes are being reworked.  If multiple features
2726 are sent in a single message, I tend to not put in the effort to sort
2727 out the acceptable changes from the unacceptable, so none of the
2728 features get installed until all are acceptable.
2729
2730 If this sounds painful or authoritarian, well, it is.  But I get a lot
2731 of bug reports and a lot of patches, and most of them don't get
2732 installed because I don't have the time to finish the job that the bug
2733 reporter or the contributor could have done.  Patches that arrive
2734 complete, working, and well designed, tend to get installed on the day
2735 they arrive.  The others go into a queue and get installed if and when
2736 I scan back over the queue -- which can literally take months
2737 sometimes.  It's in both our interests to make patch installation easy
2738 -- you get your changes installed, and I make some forward progress on
2739 GDB in a normal 12-hour day (instead of them having to wait until I
2740 have a 14-hour or 16-hour day to spend cleaning up patches before I
2741 can install them).
2742
2743 Please send patches directly to the GDB maintainers at
2744 @code{gdb-patches@@cygnus.com}.
2745
2746 @section Obsolete Conditionals
2747
2748 Fragments of old code in GDB sometimes reference or set the following
2749 configuration macros.  They should not be used by new code, and old uses
2750 should be removed as those parts of the debugger are otherwise touched.
2751
2752 @table @code
2753
2754 @item STACK_END_ADDR
2755 This macro used to define where the end of the stack appeared, for use
2756 in interpreting core file formats that don't record this address in the
2757 core file itself.  This information is now configured in BFD, and GDB
2758 gets the info portably from there.  The values in GDB's configuration
2759 files should be moved into BFD configuration files (if needed there),
2760 and deleted from all of GDB's config files.
2761
2762 Any @file{@var{foo}-xdep.c} file that references STACK_END_ADDR
2763 is so old that it has never been converted to use BFD.  Now that's old!
2764
2765 @item PYRAMID_CONTROL_FRAME_DEBUGGING
2766 pyr-xdep.c
2767 @item PYRAMID_CORE
2768 pyr-xdep.c
2769 @item PYRAMID_PTRACE
2770 pyr-xdep.c
2771
2772 @item REG_STACK_SEGMENT
2773 exec.c
2774
2775 @end table
2776
2777
2778 @contents
2779 @bye