gdb/doc/gdbint.texinfo

   1 \input texinfo
   2 @setfilename gdbint.info
   3
   4 @ifinfo
   5 @format
   6 START-INFO-DIR-ENTRY
   7 * Gdb-Internals: (gdbint).      The GNU debugger's internals.
   8 END-INFO-DIR-ENTRY
   9 @end format
  10 @end ifinfo
  11
  12 @ifinfo
  13 This file documents the internals of the GNU debugger GDB.
  14
  15 Copyright 1990, 91, 92, 93, 94, 95, 96, 97, 1998 Free Software Foundation, Inc.
  16 Contributed by Cygnus Solutions.  Written by John Gilmore.
  17 Second Edition by Stan Shebs.
  18
  19 Permission is granted to make and distribute verbatim copies of this
  20 manual provided the copyright notice and this permission notice are
  21 preserved on all copies.
  22
  23 @ignore
  24 Permission is granted to process this file through Tex and print the
  25 results, provided the printed document carries copying permission notice
  26 identical to this one except for the removal of this paragraph (this
  27 paragraph not being relevant to the printed manual).
  28
  29 @end ignore
  30 Permission is granted to copy or distribute modified versions of this
  31 manual under the terms of the GPL (for which purpose this text may be
  32 regarded as a program in the language TeX).
  33 @end ifinfo
  34
  35 @setchapternewpage off
  36 @settitle GDB Internals
  37
  38 @titlepage
  39 @title{GDB Internals}
  40 @subtitle{A guide to the internals of the GNU debugger}
  41 @author John Gilmore
  42 @author Cygnus Solutions
  43 @author Second Edition:
  44 @author Stan Shebs
  45 @author Cygnus Solutions
  46 @page
  47 @tex
  48 \def\$#1${{#1}}  % Kluge: collect RCS revision info without $...$
  49 \xdef\manvers{\$Revision$}  % For use in headers, footers too
  50 {\parskip=0pt
  51 \hfill Cygnus Solutions\par
  52 \hfill \manvers\par
  53 \hfill \TeX{}info \texinfoversion\par
  54 }
  55 @end tex
  56
  57 @vskip 0pt plus 1filll
  58 Copyright @copyright{} 1990, 91, 92, 93, 94, 95, 96, 97, 1998 Free Software Foundation, Inc.
  59
  60 Permission is granted to make and distribute verbatim copies of
  61 this manual provided the copyright notice and this permission notice
  62 are preserved on all copies.
  63
  64 @end titlepage
  65
  66 @node Top
  67 @c Perhaps this should be the title of the document (but only for info,
  68 @c not for TeX).  Existing GNU manuals seem inconsistent on this point.
  69 @top Scope of this Document
  70
  71 This document documents the internals of the GNU debugger, GDB.  It
  72 includes description of GDB's key algorithms and operations, as well
  73 as the mechanisms that adapt GDB to specific hosts and targets.
  74
  75 @menu
  76 * Requirements::
  77 * Overall Structure::
  78 * Algorithms::
  79 * User Interface::
  80 * Symbol Handling::
  81 * Language Support::
  82 * Host Definition::
  83 * Target Architecture Definition::
  84 * Target Vector Definition::
  85 * Native Debugging::
  86 * Support Libraries::
  87 * Coding::
  88 * Porting GDB::
  89 * Hints::
  90 @end menu
  91
  92 @node Requirements
  93
  94 @chapter Requirements
  95
  96 Before diving into the internals, you should understand the formal
  97 requirements and other expectations for GDB.  Although some of these may
  98 seem obvious, there have been proposals for GDB that have run counter to
  99 these requirements.
 100
 101 First of all, GDB is a debugger.  It's not designed to be a front panel
 102 for embedded systems.  It's not a text editor.  It's not a shell.  It's
 103 not a programming environment.
 104
 105 GDB is an interactive tool.  Although a batch mode is available, GDB's
 106 primary role is to interact with a human programmer.
 107
 108 GDB should be responsive to the user.  A programmer hot on the trail of
 109 a nasty bug, and operating under a looming deadline, is going to be very
 110 impatient of everything, including the response time to debugger
 111 commands.
 112
 113 GDB should be relatively permissive, such as for expressions.  While the
 114 compiler should be picky (or have the option to be made picky), since
 115 source code lives for a long time usually, the programmer doing
 116 debugging shouldn't be spending time figuring out to mollify the
 117 debugger.
 118
 119 GDB will be called upon to deal with really large programs.  Executable
 120 sizes of 50 to 100 megabytes occur regularly, and we've heard reports of
 121 programs approaching 1 gigabyte in size.
 122
 123 GDB should be able to run everywhere.  No other debugger is available
 124 for even half as many configurations as GDB supports.
 125
 126
 127 @node Overall Structure
 128
 129 @chapter Overall Structure
 130
 131 GDB consists of three major subsystems: user interface, symbol handling
 132 (the ``symbol side''), and target system handling (the ``target side'').
 133
 134 Ther user interface consists of several actual interfaces, plus
 135 supporting code.
 136
 137 The symbol side consists of object file readers, debugging info
 138 interpreters, symbol table management, source language expression
 139 parsing, type and value printing.
 140
 141 The target side consists of execution control, stack frame analysis, and
 142 physical target manipulation.
 143
 144 The target side/symbol side division is not formal, and there are a
 145 number of exceptions.  For instance, core file support involves symbolic
 146 elements (the basic core file reader is in BFD) and target elements (it
 147 supplies the contents of memory and the values of registers).  Instead,
 148 this division is useful for understanding how the minor subsystems
 149 should fit together.
 150
 151 @section The Symbol Side
 152
 153 The symbolic side of GDB can be thought of as ``everything you can do in
 154 GDB without having a live program running''.  For instance, you can look
 155 at the types of variables, and evaluate many kinds of expressions.
 156
 157 @section The Target Side
 158
 159 The target side of GDB is the ``bits and bytes manipulator''.  Although
 160 it may make reference to symbolic info here and there, most of the
 161 target side will run with only a stripped executable available -- or
 162 even no executable at all, in remote debugging cases.
 163
 164 Operations such as disassembly, stack frame crawls, and register
 165 display, are able to work with no symbolic info at all.  In some cases,
 166 such as disassembly, GDB will use symbolic info to present addresses
 167 relative to symbols rather than as raw numbers, but it will work either
 168 way.
 169
 170 @section Configurations
 171
 172 @dfn{Host} refers to attributes of the system where GDB runs.
 173 @dfn{Target} refers to the system where the program being debugged
 174 executes.  In most cases they are the same machine, in which case a
 175 third type of @dfn{Native} attributes come into play.
 176
 177 Defines and include files needed to build on the host are host support.
 178 Examples are tty support, system defined types, host byte order, host
 179 float format.
 180
 181 Defines and information needed to handle the target format are target
 182 dependent.  Examples are the stack frame format, instruction set,
 183 breakpoint instruction, registers, and how to set up and tear down the stack
 184 to call a function.
 185
 186 Information that is only needed when the host and target are the same,
 187 is native dependent.  One example is Unix child process support; if the
 188 host and target are not the same, doing a fork to start the target
 189 process is a bad idea.  The various macros needed for finding the
 190 registers in the @code{upage}, running @code{ptrace}, and such are all
 191 in the native-dependent files.
 192
 193 Another example of native-dependent code is support for features that
 194 are really part of the target environment, but which require
 195 @code{#include} files that are only available on the host system.  Core
 196 file handling and @code{setjmp} handling are two common cases.
 197
 198 When you want to make GDB work ``native'' on a particular machine, you
 199 have to include all three kinds of information.
 200
 201
 202 @node Algorithms
 203
 204 @chapter Algorithms
 205
 206 GDB uses a number of debugging-specific algorithms.  They are often not
 207 very complicated, but get lost in the thicket of special cases and
 208 real-world issues.  This chapter describes the basic algorithms and
 209 mentions some of the specific target definitions that they use.
 210
 211 @section Frames
 212
 213 A frame is a construct that GDB uses to keep track of calling and called
 214 functions.
 215
 216 @code{FRAME_FP} in the machine description has no meaning to the
 217 machine-independent part of GDB, except that it is used when setting up
 218 a new frame from scratch, as follows:
 219
 220 @example
 221       create_new_frame (read_register (FP_REGNUM), read_pc ()));
 222 @end example
 223
 224 Other than that, all the meaning imparted to @code{FP_REGNUM} is
 225 imparted by the machine-dependent code.  So, @code{FP_REGNUM} can have
 226 any value that is convenient for the code that creates new frames.
 227 (@code{create_new_frame} calls @code{INIT_EXTRA_FRAME_INFO} if it is
 228 defined; that is where you should use the @code{FP_REGNUM} value, if
 229 your frames are nonstandard.)
 230
 231 Given a GDB frame, define @code{FRAME_CHAIN} to determine the address of
 232 the calling function's frame.  This will be used to create a new GDB
 233 frame struct, and then @code{INIT_EXTRA_FRAME_INFO} and
 234 @code{INIT_FRAME_PC} will be called for the new frame.
 235
 236 @section Breakpoint Handling
 237
 238 In general, a breakpoint is a user-designated location in the program
 239 where the user wants to regain control if program execution ever reaches
 240 that location.
 241
 242 There are two main ways to implement breakpoints; either as ``hardware''
 243 breakpoints or as ``software'' breakpoints.
 244
 245 Hardware breakpoints are sometimes available as a builtin debugging
 246 features with some chips.  Typically these work by having dedicated
 247 register into which the breakpoint address may be stored.  If the PC
 248 ever matches a value in a breakpoint registers, the CPU raises an
 249 exception and reports it to GDB.  Another possibility is when an
 250 emulator is in use; many emulators include circuitry that watches the
 251 address lines coming out from the processor, and force it to stop if the
 252 address matches a breakpoint's address.  A third possibility is that the
 253 target already has the ability to do breakpoints somehow; for instance,
 254 a ROM monitor may do its own software breakpoints.  So although these
 255 are not literally ``hardware breakpoints'', from GDB's point of view
 256 they work the same; GDB need not do nothing more than set the breakpoint
 257 and wait for something to happen.
 258
 259 Since they depend on hardware resources, hardware breakpoints may be
 260 limited in number; when the user asks for more, GDB will start trying to
 261 set software breakpoints.
 262
 263 Software breakpoints require GDB to do somewhat more work.  The basic
 264 theory is that GDB will replace a program instruction a trap, illegal
 265 divide, or some other instruction that will cause an exception, and then
 266 when it's encountered, GDB will take the exception and stop the program.
 267 When the user says to continue, GDB will restore the original
 268 instruction, single-step, re-insert the trap, and continue on.
 269
 270 Since it literally overwrites the program being tested, the program area
 271 must be writeable, so this technique won't work on programs in ROM.  It
 272 can also distort the behavior of programs that examine themselves,
 273 although the situation would be highly unusual.
 274
 275 Also, the software breakpoint instruction should be the smallest size of
 276 instruction, so it doesn't overwrite an instruction that might be a jump
 277 target, and cause disaster when the program jumps into the middle of the
 278 breakpoint instruction.  (Strictly speaking, the breakpoint must be no
 279 larger than the smallest interval between instructions that may be jump
 280 targets; perhaps there is an architecture where only even-numbered
 281 instructions may jumped to.)  Note that it's possible for an instruction
 282 set not to have any instructions usable for a software breakpoint,
 283 although in practice only the ARC has failed to define such an
 284 instruction.
 285
 286 The basic definition of the software breakpoint is the macro
 287 @code{BREAKPOINT}.
 288
 289 Basic breakpoint object handling is in @file{breakpoint.c}.  However,
 290 much of the interesting breakpoint action is in @file{infrun.c}.
 291
 292 @section Single Stepping
 293
 294 @section Signal Handling
 295
 296 @section Thread Handling
 297
 298 @section Inferior Function Calls
 299
 300 @section Longjmp Support
 301
 302 GDB has support for figuring out that the target is doing a
 303 @code{longjmp} and for stopping at the target of the jump, if we are
 304 stepping.  This is done with a few specialized internal breakpoints,
 305 which are visible in the @code{maint info breakpoint} command.
 306
 307 To make this work, you need to define a macro called
 308 @code{GET_LONGJMP_TARGET}, which will examine the @code{jmp_buf}
 309 structure and extract the longjmp target address.  Since @code{jmp_buf}
 310 is target specific, you will need to define it in the appropriate
 311 @file{tm-@var{xyz}.h} file.  Look in @file{tm-sun4os4.h} and
 312 @file{sparc-tdep.c} for examples of how to do this.
 313
 314 @node User Interface
 315
 316 @chapter User Interface
 317
 318 GDB has several user interfaces.  Although the command-line interface
 319 is the most common and most familiar, there are others.
 320
 321 @section Command Interpreter
 322
 323 The command interpreter in GDB is fairly simple.  It is designed to
 324 allow for the set of commands to be augmented dynamically, and also
 325 has a recursive subcommand capability, where the first argument to
 326 a command may itself direct a lookup on a different command list.
 327
 328 For instance, the @code{set} command just starts a lookup on the
 329 @code{setlist} command list, while @code{set thread} recurses
 330 to the @code{set_thread_cmd_list}.
 331
 332 To add commands in general, use @code{add_cmd}.  @code{add_com} adds to
 333 the main command list, and should be used for those commands.  The usual
 334 place to add commands is in the @code{_initialize_@var{xyz}} routines at the
 335 ends of most source files.
 336
 337 @section Console Printing
 338
 339 @section TUI
 340
 341 @section libgdb
 342
 343 @code{libgdb} was an abortive project of years ago.  The theory was to
 344 provide an API to GDB's functionality.
 345
 346 @node Symbol Handling
 347
 348 @chapter Symbol Handling
 349
 350 Symbols are a key part of GDB's operation.  Symbols include variables,
 351 functions, and types.
 352
 353 @section Symbol Reading
 354
 355 GDB reads symbols from ``symbol files''.  The usual symbol file is the
 356 file containing the program which GDB is debugging.  GDB can be directed
 357 to use a different file for symbols (with the @code{symbol-file}
 358 command), and it can also read more symbols via the ``add-file'' and
 359 ``load'' commands, or while reading symbols from shared libraries.
 360
 361 Symbol files are initially opened by code in @file{symfile.c} using the
 362 BFD library.  BFD identifies the type of the file by examining its
 363 header.  @code{symfile_init} then uses this identification to locate a
 364 set of symbol-reading functions.
 365
 366 Symbol reading modules identify themselves to GDB by calling
 367 @code{add_symtab_fns} during their module initialization.  The argument
 368 to @code{add_symtab_fns} is a @code{struct sym_fns} which contains the
 369 name (or name prefix) of the symbol format, the length of the prefix,
 370 and pointers to four functions.  These functions are called at various
 371 times to process symbol-files whose identification matches the specified
 372 prefix.
 373
 374 The functions supplied by each module are:
 375
 376 @table @code
 377 @item @var{xyz}_symfile_init(struct sym_fns *sf)
 378
 379 Called from @code{symbol_file_add} when we are about to read a new
 380 symbol file.  This function should clean up any internal state (possibly
 381 resulting from half-read previous files, for example) and prepare to
 382 read a new symbol file. Note that the symbol file which we are reading
 383 might be a new "main" symbol file, or might be a secondary symbol file
 384 whose symbols are being added to the existing symbol table.
 385
 386 The argument to @code{@var{xyz}_symfile_init} is a newly allocated
 387 @code{struct sym_fns} whose @code{bfd} field contains the BFD for the
 388 new symbol file being read.  Its @code{private} field has been zeroed,
 389 and can be modified as desired.  Typically, a struct of private
 390 information will be @code{malloc}'d, and a pointer to it will be placed
 391 in the @code{private} field.
 392
 393 There is no result from @code{@var{xyz}_symfile_init}, but it can call
 394 @code{error} if it detects an unavoidable problem.
 395
 396 @item @var{xyz}_new_init()
 397
 398 Called from @code{symbol_file_add} when discarding existing symbols.
 399 This function need only handle the symbol-reading module's internal
 400 state; the symbol table data structures visible to the rest of GDB will
 401 be discarded by @code{symbol_file_add}.  It has no arguments and no
 402 result.  It may be called after @code{@var{xyz}_symfile_init}, if a new
 403 symbol table is being read, or may be called alone if all symbols are
 404 simply being discarded.
 405
 406 @item @var{xyz}_symfile_read(struct sym_fns *sf, CORE_ADDR addr, int mainline)
 407
 408 Called from @code{symbol_file_add} to actually read the symbols from a
 409 symbol-file into a set of psymtabs or symtabs.
 410
 411 @code{sf} points to the struct sym_fns originally passed to
 412 @code{@var{xyz}_sym_init} for possible initialization.  @code{addr} is
 413 the offset between the file's specified start address and its true
 414 address in memory.  @code{mainline} is 1 if this is the main symbol
 415 table being read, and 0 if a secondary symbol file (e.g. shared library
 416 or dynamically loaded file) is being read.@refill
 417 @end table
 418
 419 In addition, if a symbol-reading module creates psymtabs when
 420 @var{xyz}_symfile_read is called, these psymtabs will contain a pointer
 421 to a function @code{@var{xyz}_psymtab_to_symtab}, which can be called
 422 from any point in the GDB symbol-handling code.
 423
 424 @table @code
 425 @item @var{xyz}_psymtab_to_symtab (struct partial_symtab *pst)
 426
 427 Called from @code{psymtab_to_symtab} (or the PSYMTAB_TO_SYMTAB macro) if
 428 the psymtab has not already been read in and had its @code{pst->symtab}
 429 pointer set.  The argument is the psymtab to be fleshed-out into a
 430 symtab.  Upon return, pst->readin should have been set to 1, and
 431 pst->symtab should contain a pointer to the new corresponding symtab, or
 432 zero if there were no symbols in that part of the symbol file.
 433 @end table
 434
 435 @section Partial Symbol Tables
 436
 437 GDB has three types of symbol tables.
 438
 439 @itemize @bullet
 440
 441 @item full symbol tables (symtabs).  These contain the main information
 442 about symbols and addresses.
 443
 444 @item partial symbol tables (psymtabs).  These contain enough
 445 information to know when to read the corresponding part of the full
 446 symbol table.
 447
 448 @item minimal symbol tables (msymtabs).  These contain information
 449 gleaned from non-debugging symbols.
 450
 451 @end itemize
 452
 453 This section describes partial symbol tables.
 454
 455 A psymtab is constructed by doing a very quick pass over an executable
 456 file's debugging information.  Small amounts of information are
 457 extracted -- enough to identify which parts of the symbol table will
 458 need to be re-read and fully digested later, when the user needs the
 459 information.  The speed of this pass causes GDB to start up very
 460 quickly.  Later, as the detailed rereading occurs, it occurs in small
 461 pieces, at various times, and the delay therefrom is mostly invisible to
 462 the user.
 463 @c (@xref{Symbol Reading}.)
 464
 465 The symbols that show up in a file's psymtab should be, roughly, those
 466 visible to the debugger's user when the program is not running code from
 467 that file.  These include external symbols and types, static symbols and
 468 types, and enum values declared at file scope.
 469
 470 The psymtab also contains the range of instruction addresses that the
 471 full symbol table would represent.
 472
 473 The idea is that there are only two ways for the user (or much of the
 474 code in the debugger) to reference a symbol:
 475
 476 @itemize @bullet
 477
 478 @item by its address
 479 (e.g. execution stops at some address which is inside a function in this
 480 file).  The address will be noticed to be in the range of this psymtab,
 481 and the full symtab will be read in.  @code{find_pc_function},
 482 @code{find_pc_line}, and other @code{find_pc_@dots{}} functions handle
 483 this.
 484
 485 @item by its name
 486 (e.g. the user asks to print a variable, or set a breakpoint on a
 487 function).  Global names and file-scope names will be found in the
 488 psymtab, which will cause the symtab to be pulled in.  Local names will
 489 have to be qualified by a global name, or a file-scope name, in which
 490 case we will have already read in the symtab as we evaluated the
 491 qualifier.  Or, a local symbol can be referenced when we are "in" a
 492 local scope, in which case the first case applies.  @code{lookup_symbol}
 493 does most of the work here.
 494
 495 @end itemize
 496
 497 The only reason that psymtabs exist is to cause a symtab to be read in
 498 at the right moment.  Any symbol that can be elided from a psymtab,
 499 while still causing that to happen, should not appear in it.  Since
 500 psymtabs don't have the idea of scope, you can't put local symbols in
 501 them anyway.  Psymtabs don't have the idea of the type of a symbol,
 502 either, so types need not appear, unless they will be referenced by
 503 name.
 504
 505 It is a bug for GDB to behave one way when only a psymtab has been read,
 506 and another way if the corresponding symtab has been read in.  Such bugs
 507 are typically caused by a psymtab that does not contain all the visible
 508 symbols, or which has the wrong instruction address ranges.
 509
 510 The psymtab for a particular section of a symbol-file (objfile) could be
 511 thrown away after the symtab has been read in.  The symtab should always
 512 be searched before the psymtab, so the psymtab will never be used (in a
 513 bug-free environment).  Currently, psymtabs are allocated on an obstack,
 514 and all the psymbols themselves are allocated in a pair of large arrays
 515 on an obstack, so there is little to be gained by trying to free them
 516 unless you want to do a lot more work.
 517
 518 @section Types
 519
 520 Fundamental Types (e.g., FT_VOID, FT_BOOLEAN).
 521
 522 These are the fundamental types that GDB uses internally.  Fundamental
 523 types from the various debugging formats (stabs, ELF, etc) are mapped
 524 into one of these.  They are basically a union of all fundamental types
 525 that gdb knows about for all the languages that GDB knows about.
 526
 527 Type Codes (e.g., TYPE_CODE_PTR, TYPE_CODE_ARRAY).
 528
 529 Each time GDB builds an internal type, it marks it with one of these
 530 types.  The type may be a fundamental type, such as TYPE_CODE_INT, or a
 531 derived type, such as TYPE_CODE_PTR which is a pointer to another type.
 532 Typically, several FT_* types map to one TYPE_CODE_* type, and are
 533 distinguished by other members of the type struct, such as whether the
 534 type is signed or unsigned, and how many bits it uses.
 535
 536 Builtin Types (e.g., builtin_type_void, builtin_type_char).
 537
 538 These are instances of type structs that roughly correspond to
 539 fundamental types and are created as global types for GDB to use for
 540 various ugly historical reasons.  We eventually want to eliminate these.
 541 Note for example that builtin_type_int initialized in gdbtypes.c is
 542 basically the same as a TYPE_CODE_INT type that is initialized in
 543 c-lang.c for an FT_INTEGER fundamental type.  The difference is that the
 544 builtin_type is not associated with any particular objfile, and only one
 545 instance exists, while c-lang.c builds as many TYPE_CODE_INT types as
 546 needed, with each one associated with some particular objfile.
 547
 548 @section Object File Formats
 549
 550 @subsection a.out
 551
 552 The @file{a.out} format is the original file format for Unix.  It
 553 consists of three sections: text, data, and bss, which are for program
 554 code, initialized data, and uninitialized data, respectively.
 555
 556 The @file{a.out} format is so simple that it doesn't have any reserved
 557 place for debugging information.  (Hey, the original Unix hackers used
 558 @file{adb}, which is a machine-language debugger.)  The only debugging
 559 format for @file{a.out} is stabs, which is encoded as a set of normal
 560 symbols with distinctive attributes.
 561
 562 The basic @file{a.out} reader is in @file{dbxread.c}.
 563
 564 @subsection COFF
 565
 566 The COFF format was introduced with System V Release 3 (SVR3) Unix.
 567 COFF files may have multiple sections, each prefixed by a header.  The
 568 number of sections is limited.
 569
 570 The COFF specification includes support for debugging.  Although this
 571 was a step forward, the debugging information was woefully limited.  For
 572 instance, it was not possible to represent code that came from an
 573 included file.
 574
 575 The COFF reader is in @file{coffread.c}.
 576
 577 @subsection ECOFF
 578
 579 ECOFF is an extended COFF originally introduced for Mips and Alpha
 580 workstations.
 581
 582 The basic ECOFF reader is in @file{mipsread.c}.
 583
 584 @subsection XCOFF
 585
 586 The IBM RS/6000 running AIX uses an object file format called XCOFF.
 587 The COFF sections, symbols, and line numbers are used, but debugging
 588 symbols are dbx-style stabs whose strings are located in the
 589 @samp{.debug} section (rather than the string table).  For more
 590 information, see @xref{Top,,,stabs,The Stabs Debugging Format}.
 591
 592 The shared library scheme has a clean interface for figuring out what
 593 shared libraries are in use, but the catch is that everything which
 594 refers to addresses (symbol tables and breakpoints at least) needs to be
 595 relocated for both shared libraries and the main executable.  At least
 596 using the standard mechanism this can only be done once the program has
 597 been run (or the core file has been read).
 598
 599 @subsection PE
 600
 601 Windows 95 and NT use the PE (Portable Executable) format for their
 602 executables.  PE is basically COFF with additional headers.
 603
 604 While BFD includes special PE support, GDB needs only the basic
 605 COFF reader.
 606
 607 @subsection ELF
 608
 609 The ELF format came with System V Release 4 (SVR4) Unix.  ELF is similar
 610 to COFF in being organized into a number of sections, but it removes
 611 many of COFF's limitations.
 612
 613 The basic ELF reader is in @file{elfread.c}.
 614
 615 @subsection SOM
 616
 617 SOM is HP's object file and debug format (not to be confused with IBM's
 618 SOM, which is a cross-language ABI).
 619
 620 The SOM reader is in @file{hpread.c}.
 621
 622 @subsection Other File Formats
 623
 624 Other file formats that have been supported by GDB include Netware
 625 Loadable Modules (@file{nlmread.c}.
 626
 627 @section Debugging File Formats
 628
 629 This section describes characteristics of debugging information that
 630 are independent of the object file format.
 631
 632 @subsection stabs
 633
 634 @code{stabs} started out as special symbols within the @code{a.out}
 635 format.  Since then, it has been encapsulated into other file
 636 formats, such as COFF and ELF.
 637
 638 While @file{dbxread.c} does some of the basic stab processing,
 639 including for encapsulated versions, @file{stabsread.c} does
 640 the real work.
 641
 642 @subsection COFF
 643
 644 The basic COFF definition includes debugging information.  The level
 645 of support is minimal and non-extensible, and is not often used.
 646
 647 @subsection Mips debug (Third Eye)
 648
 649 ECOFF includes a definition of a special debug format.
 650
 651 The file @file{mdebugread.c} implements reading for this format.
 652
 653 @subsection DWARF 1
 654
 655 DWARF 1 is a debugging format that was originally designed to be
 656 used with ELF in SVR4 systems.
 657
 658 @c CHILL_PRODUCER
 659 @c GCC_PRODUCER
 660 @c GPLUS_PRODUCER
 661 @c LCC_PRODUCER
 662 @c If defined, these are the producer strings in a DWARF 1 file.  All of
 663 @c these have reasonable defaults already.
 664
 665 The DWARF 1 reader is in @file{dwarfread.c}.
 666
 667 @subsection DWARF 2
 668
 669 DWARF 2 is an improved but incompatible version of DWARF 1.
 670
 671 The DWARF 2 reader is in @file{dwarf2read.c}.
 672
 673 @subsection SOM
 674
 675 Like COFF, the SOM definition includes debugging information.
 676
 677 @section Adding a New Symbol Reader to GDB
 678
 679 If you are using an existing object file format (a.out, COFF, ELF, etc),
 680 there is probably little to be done.
 681
 682 If you need to add a new object file format, you must first add it to
 683 BFD.  This is beyond the scope of this document.
 684
 685 You must then arrange for the BFD code to provide access to the
 686 debugging symbols.  Generally GDB will have to call swapping routines
 687 from BFD and a few other BFD internal routines to locate the debugging
 688 information.  As much as possible, GDB should not depend on the BFD
 689 internal data structures.
 690
 691 For some targets (e.g., COFF), there is a special transfer vector used
 692 to call swapping routines, since the external data structures on various
 693 platforms have different sizes and layouts.  Specialized routines that
 694 will only ever be implemented by one object file format may be called
 695 directly.  This interface should be described in a file
 696 @file{bfd/libxyz.h}, which is included by GDB.
 697
 698
 699 @node Language Support
 700
 701 @chapter Language Support
 702
 703 GDB's language support is mainly driven by the symbol reader, although
 704 it is possible for the user to set the source language manually.
 705
 706 GDB chooses the source language by looking at the extension of the file
 707 recorded in the debug info; @code{.c} means C, @code{.f} means Fortran,
 708 etc.  It may also use a special-purpose language identifier if the debug
 709 format supports it, such as DWARF.
 710
 711 @section Adding a Source Language to GDB
 712
 713 To add other languages to GDB's expression parser, follow the following
 714 steps:
 715
 716 @table @emph
 717 @item Create the expression parser.
 718
 719 This should reside in a file @file{@var{lang}-exp.y}.  Routines for
 720 building parsed expressions into a @samp{union exp_element} list are in
 721 @file{parse.c}.
 722
 723 Since we can't depend upon everyone having Bison, and YACC produces
 724 parsers that define a bunch of global names, the following lines
 725 @emph{must} be included at the top of the YACC parser, to prevent the
 726 various parsers from defining the same global names:
 727
 728 @example
 729 #define yyparse         @var{lang}_parse
 730 #define yylex   @var{lang}_lex
 731 #define yyerror         @var{lang}_error
 732 #define yylval  @var{lang}_lval
 733 #define yychar  @var{lang}_char
 734 #define yydebug         @var{lang}_debug
 735 #define yypact          @var{lang}_pact
 736 #define yyr1            @var{lang}_r1
 737 #define yyr2            @var{lang}_r2
 738 #define yydef           @var{lang}_def
 739 #define yychk           @var{lang}_chk
 740 #define yypgo           @var{lang}_pgo
 741 #define yyact   @var{lang}_act
 742 #define yyexca          @var{lang}_exca
 743 #define yyerrflag       @var{lang}_errflag
 744 #define yynerrs         @var{lang}_nerrs
 745 @end example
 746
 747 At the bottom of your parser, define a @code{struct language_defn} and
 748 initialize it with the right values for your language.  Define an
 749 @code{initialize_@var{lang}} routine and have it call
 750 @samp{add_language(@var{lang}_language_defn)} to tell the rest of GDB
 751 that your language exists.  You'll need some other supporting variables
 752 and functions, which will be used via pointers from your
 753 @code{@var{lang}_language_defn}.  See the declaration of @code{struct
 754 language_defn} in @file{language.h}, and the other @file{*-exp.y} files,
 755 for more information.
 756
 757 @item Add any evaluation routines, if necessary
 758
 759 If you need new opcodes (that represent the operations of the language),
 760 add them to the enumerated type in @file{expression.h}.  Add support
 761 code for these operations in @code{eval.c:evaluate_subexp()}.  Add cases
 762 for new opcodes in two functions from @file{parse.c}:
 763 @code{prefixify_subexp()} and @code{length_of_subexp()}.  These compute
 764 the number of @code{exp_element}s that a given operation takes up.
 765
 766 @item Update some existing code
 767
 768 Add an enumerated identifier for your language to the enumerated type
 769 @code{enum language} in @file{defs.h}.
 770
 771 Update the routines in @file{language.c} so your language is included.
 772 These routines include type predicates and such, which (in some cases)
 773 are language dependent.  If your language does not appear in the switch
 774 statement, an error is reported.
 775
 776 Also included in @file{language.c} is the code that updates the variable
 777 @code{current_language}, and the routines that translate the
 778 @code{language_@var{lang}} enumerated identifier into a printable
 779 string.
 780
 781 Update the function @code{_initialize_language} to include your
 782 language.  This function picks the default language upon startup, so is
 783 dependent upon which languages that GDB is built for.
 784
 785 Update @code{allocate_symtab} in @file{symfile.c} and/or symbol-reading
 786 code so that the language of each symtab (source file) is set properly.
 787 This is used to determine the language to use at each stack frame level.
 788 Currently, the language is set based upon the extension of the source
 789 file.  If the language can be better inferred from the symbol
 790 information, please set the language of the symtab in the symbol-reading
 791 code.
 792
 793 Add helper code to @code{expprint.c:print_subexp()} to handle any new
 794 expression opcodes you have added to @file{expression.h}.  Also, add the
 795 printed representations of your operators to @code{op_print_tab}.
 796
 797 @item Add a place of call
 798
 799 Add a call to @code{@var{lang}_parse()} and @code{@var{lang}_error} in
 800 @code{parse.c:parse_exp_1()}.
 801
 802 @item Use macros to trim code
 803
 804 The user has the option of building GDB for some or all of the
 805 languages.  If the user decides to build GDB for the language
 806 @var{lang}, then every file dependent on @file{language.h} will have the
 807 macro @code{_LANG_@var{lang}} defined in it.  Use @code{#ifdef}s to
 808 leave out large routines that the user won't need if he or she is not
 809 using your language.
 810
 811 Note that you do not need to do this in your YACC parser, since if GDB
 812 is not build for @var{lang}, then @file{@var{lang}-exp.tab.o} (the
 813 compiled form of your parser) is not linked into GDB at all.
 814
 815 See the file @file{configure.in} for how GDB is configured for different
 816 languages.
 817
 818 @item Edit @file{Makefile.in}
 819
 820 Add dependencies in @file{Makefile.in}.  Make sure you update the macro
 821 variables such as @code{HFILES} and @code{OBJS}, otherwise your code may
 822 not get linked in, or, worse yet, it may not get @code{tar}red into the
 823 distribution!
 824
 825 @end table
 826
 827
 828 @node Host Definition
 829
 830 @chapter Host Definition
 831
 832 With the advent of autoconf, it's rarely necessary to have host
 833 definition machinery anymore.
 834
 835 @section Adding a New Host
 836
 837 Most of GDB's host configuration support happens via autoconf.  It
 838 should be rare to need new host-specific definitions.  GDB still uses
 839 the host-specific definitions and files listed below, but these mostly
 840 exist for historical reasons, and should eventually disappear.
 841
 842 Several files control GDB's configuration for host systems:
 843
 844 @table @file
 845
 846 @item gdb/config/@var{arch}/@var{xyz}.mh
 847 Specifies Makefile fragments needed when hosting on machine @var{xyz}.
 848 In particular, this lists the required machine-dependent object files,
 849 by defining @samp{XDEPFILES=@dots{}}.  Also specifies the header file
 850 which describes host @var{xyz}, by defining @code{XM_FILE=
 851 xm-@var{xyz}.h}.  You can also define @code{CC}, @code{SYSV_DEFINE},
 852 @code{XM_CFLAGS}, @code{XM_ADD_FILES}, @code{XM_CLIBS}, @code{XM_CDEPS},
 853 etc.; see @file{Makefile.in}.
 854
 855 @item gdb/config/@var{arch}/xm-@var{xyz}.h
 856 (@file{xm.h} is a link to this file, created by configure).  Contains C
 857 macro definitions describing the host system environment, such as byte
 858 order, host C compiler and library.
 859
 860 @item gdb/@var{xyz}-xdep.c
 861 Contains any miscellaneous C code required for this machine as a host.
 862 On most machines it doesn't exist at all.  If it does exist, put
 863 @file{@var{xyz}-xdep.o} into the @code{XDEPFILES} line in
 864 @file{gdb/config/@var{arch}/@var{xyz}.mh}.
 865
 866 @end table
 867
 868 @subheading Generic Host Support Files
 869
 870 There are some ``generic'' versions of routines that can be used by
 871 various systems.  These can be customized in various ways by macros
 872 defined in your @file{xm-@var{xyz}.h} file.  If these routines work for
 873 the @var{xyz} host, you can just include the generic file's name (with
 874 @samp{.o}, not @samp{.c}) in @code{XDEPFILES}.
 875
 876 Otherwise, if your machine needs custom support routines, you will need
 877 to write routines that perform the same functions as the generic file.
 878 Put them into @code{@var{xyz}-xdep.c}, and put @code{@var{xyz}-xdep.o}
 879 into @code{XDEPFILES}.
 880
 881 @table @file
 882
 883 @item ser-unix.c
 884 This contains serial line support for Unix systems.  This is always
 885 included, via the makefile variable @code{SER_HARDWIRE}; override this
 886 variable in the @file{.mh} file to avoid it.
 887
 888 @item ser-go32.c
 889 This contains serial line support for 32-bit programs running under DOS,
 890 using the GO32 execution environment.
 891
 892 @item ser-tcp.c
 893 This contains generic TCP support using sockets.
 894
 895 @end table
 896
 897 @section Host Conditionals
 898
 899 When GDB is configured and compiled, various macros are defined or left
 900 undefined, to control compilation based on the attributes of the host
 901 system.  These macros and their meanings (or if the meaning is not
 902 documented here, then one of the source files where they are used is
 903 indicated) are:
 904
 905 @table @code
 906
 907 @item GDBINIT_FILENAME
 908 The default name of GDB's initialization file (normally @file{.gdbinit}).
 909
 910 @item MEM_FNS_DECLARED
 911 Your host config file defines this if it includes declarations of
 912 @code{memcpy} and @code{memset}.  Define this to avoid conflicts between
 913 the native include files and the declarations in @file{defs.h}.
 914
 915 @item NO_SYS_FILE
 916 Define this if your system does not have a @code{<sys/file.h>}.
 917
 918 @item SIGWINCH_HANDLER
 919 If your host defines @code{SIGWINCH}, you can define this to be the name
 920 of a function to be called if @code{SIGWINCH} is received.
 921
 922 @item SIGWINCH_HANDLER_BODY
 923 Define this to expand into code that will define the function named by
 924 the expansion of @code{SIGWINCH_HANDLER}.
 925
 926 @item ALIGN_STACK_ON_STARTUP
 927 Define this if your system is of a sort that will crash in
 928 @code{tgetent} if the stack happens not to be longword-aligned when
 929 @code{main} is called.  This is a rare situation, but is known to occur
 930 on several different types of systems.
 931
 932 @item CRLF_SOURCE_FILES
 933 Define this if host files use @code{\r\n} rather than @code{\n} as a
 934 line terminator.  This will cause source file listings to omit @code{\r}
 935 characters when printing and it will allow \r\n line endings of files
 936 which are "sourced" by gdb.  It must be possible to open files in binary
 937 mode using @code{O_BINARY} or, for fopen, @code{"rb"}.
 938
 939 @item DEFAULT_PROMPT
 940 The default value of the prompt string (normally @code{"(gdb) "}).
 941
 942 @item DEV_TTY
 943 The name of the generic TTY device, defaults to @code{"/dev/tty"}.
 944
 945 @item FCLOSE_PROVIDED
 946 Define this if the system declares @code{fclose} in the headers included
 947 in @code{defs.h}.  This isn't needed unless your compiler is unusually
 948 anal.
 949
 950 @item FOPEN_RB
 951 Define this if binary files are opened the same way as text files.
 952
 953 @item GETENV_PROVIDED
 954 Define this if the system declares @code{getenv} in its headers included
 955 in @code{defs.h}. This isn't needed unless your compiler is unusually
 956 anal.
 957
 958 @item HAVE_MMAP
 959 In some cases, use the system call @code{mmap} for reading symbol
 960 tables.  For some machines this allows for sharing and quick updates.
 961
 962 @item HAVE_SIGSETMASK
 963 Define this if the host system has job control, but does not define
 964 @code{sigsetmask()}.  Currently, this is only true of the RS/6000.
 965
 966 @item HAVE_TERMIO
 967 Define this if the host system has @code{termio.h}.
 968
 969 @item HOST_BYTE_ORDER
 970 The ordering of bytes in the host.  This must be defined to be either
 971 @code{BIG_ENDIAN} or @code{LITTLE_ENDIAN}.
 972
 973 @item INT_MAX
 974 @item INT_MIN
 975 @item LONG_MAX
 976 @item UINT_MAX
 977 @item ULONG_MAX
 978 Values for host-side constants.
 979
 980 @item ISATTY
 981 Substitute for isatty, if not available.
 982
 983 @item LONGEST
 984 This is the longest integer type available on the host.  If not defined,
 985 it will default to @code{long long} or @code{long}, depending on
 986 @code{CC_HAS_LONG_LONG}.
 987
 988 @item CC_HAS_LONG_LONG
 989 Define this if the host C compiler supports ``long long''.  This is set
 990 by the configure script.
 991
 992 @item PRINTF_HAS_LONG_LONG
 993 Define this if the host can handle printing of long long integers via
 994 the printf format directive ``ll''. This is set by the configure script.
 995
 996 @item HAVE_LONG_DOUBLE
 997 Define this if the host C compiler supports ``long double''.  This is
 998 set by the configure script.
 999
1000 @item PRINTF_HAS_LONG_DOUBLE
1001 Define this if the host can handle printing of long double float-point
1002 numbers via the printf format directive ``Lg''. This is set by the
1003 configure script.
1004
1005 @item SCANF_HAS_LONG_DOUBLE
1006 Define this if the host can handle the parsing of long double
1007 float-point numbers via the scanf format directive directive
1008 ``Lg''. This is set by the configure script.
1009
1010 @item LSEEK_NOT_LINEAR
1011 Define this if @code{lseek (n)} does not necessarily move to byte number
1012 @code{n} in the file.  This is only used when reading source files.  It
1013 is normally faster to define @code{CRLF_SOURCE_FILES} when possible.
1014
1015 @item L_SET
1016 This macro is used as the argument to lseek (or, most commonly,
1017 bfd_seek).  FIXME, should be replaced by SEEK_SET instead, which is the
1018 POSIX equivalent.
1019
1020 @item MAINTENANCE_CMDS
1021 If the value of this is 1, then a number of optional maintenance
1022 commands are compiled in.
1023
1024 @item MALLOC_INCOMPATIBLE
1025 Define this if the system's prototype for @code{malloc} differs from the
1026 @sc{ANSI} definition.
1027
1028 @item MMAP_BASE_ADDRESS
1029 When using HAVE_MMAP, the first mapping should go at this address.
1030
1031 @item MMAP_INCREMENT
1032 when using HAVE_MMAP, this is the increment between mappings.
1033
1034 @item NEED_POSIX_SETPGID
1035 Define this to use the POSIX version of @code{setpgid} to determine
1036 whether job control is available.
1037
1038 @item NORETURN
1039 If defined, this should be one or more tokens, such as @code{volatile},
1040 that can be used in both the declaration and definition of functions to
1041 indicate that they never return.  The default is already set correctly
1042 if compiling with GCC.  This will almost never need to be defined.
1043
1044 @item ATTR_NORETURN
1045 If defined, this should be one or more tokens, such as
1046 @code{__attribute__ ((noreturn))}, that can be used in the declarations
1047 of functions to indicate that they never return.  The default is already
1048 set correctly if compiling with GCC.  This will almost never need to be
1049 defined.
1050
1051 @item USE_MMALLOC
1052 GDB will use the @code{mmalloc} library for memory allocation for symbol
1053 reading if this symbol is defined.  Be careful defining it since there
1054 are systems on which @code{mmalloc} does not work for some reason.  One
1055 example is the DECstation, where its RPC library can't cope with our
1056 redefinition of @code{malloc} to call @code{mmalloc}.  When defining
1057 @code{USE_MMALLOC}, you will also have to set @code{MMALLOC} in the
1058 Makefile, to point to the mmalloc library.  This define is set when you
1059 configure with --with-mmalloc.
1060
1061 @item NO_MMCHECK
1062 Define this if you are using @code{mmalloc}, but don't want the overhead
1063 of checking the heap with @code{mmcheck}.  Note that on some systems,
1064 the C runtime makes calls to malloc prior to calling @code{main}, and if
1065 @code{free} is ever called with these pointers after calling
1066 @code{mmcheck} to enable checking, a memory corruption abort is certain
1067 to occur.  These systems can still use mmalloc, but must define
1068 NO_MMCHECK.
1069
1070 @item MMCHECK_FORCE
1071 Define this to 1 if the C runtime allocates memory prior to
1072 @code{mmcheck} being called, but that memory is never freed so we don't
1073 have to worry about it triggering a memory corruption abort.  The
1074 default is 0, which means that @code{mmcheck} will only install the heap
1075 checking functions if there has not yet been any memory allocation
1076 calls, and if it fails to install the functions, gdb will issue a
1077 warning.  This is currently defined if you configure using
1078 --with-mmalloc.
1079
1080 @item NO_SIGINTERRUPT
1081 Define this to indicate that siginterrupt() is not available.
1082
1083 @item R_OK
1084 Define if this is not in a system .h file.
1085
1086 @item SEEK_CUR
1087 @item SEEK_SET
1088 Define these to appropriate value for the system lseek(), if not already
1089 defined.
1090
1091 @item STOP_SIGNAL
1092 This is the signal for stopping GDB.  Defaults to SIGTSTP.  (Only
1093 redefined for the Convex.)
1094
1095 @item USE_O_NOCTTY
1096 Define this if the interior's tty should be opened with the O_NOCTTY
1097 flag.  (FIXME: This should be a native-only flag, but @file{inflow.c} is
1098 always linked in.)
1099
1100 @item USG
1101 Means that System V (prior to SVR4) include files are in use.  (FIXME:
1102 This symbol is abused in @file{infrun.c}, @file{regex.c},
1103 @file{remote-nindy.c}, and @file{utils.c} for other things, at the
1104 moment.)
1105
1106 @item lint
1107 Define this to help placate lint in some situations.
1108
1109 @item volatile
1110 Define this to override the defaults of @code{__volatile__} or
1111 @code{/**/}.
1112
1113 @end table
1114
1115
1116 @node Target Architecture Definition
1117
1118 @chapter Target Architecture Definition
1119
1120 GDB's target architecture defines what sort of machine-language programs
1121 GDB can work with, and how it works with them.
1122
1123 At present, the target architecture definition consists of a number of C
1124 macros.
1125
1126 @section Registers and Memory
1127
1128 GDB's model of the target machine is rather simple.  GDB assumes the
1129 machine includes a bank of registers and a block of memory.  Each
1130 register may have a different size.
1131
1132 GDB does not have a magical way to match up with the compiler's idea of
1133 which registers are which; however, it is critical that they do match up
1134 accurately.  The only way to make this work is to get accurate
1135 information about the order that the compiler uses, and to reflect that
1136 in the @code{REGISTER_NAME} and related macros.
1137
1138 GDB can handle big-endian, little-endian, and bi-endian architectures.
1139
1140 @section Frame Interpretation
1141
1142 @section Inferior Call Setup
1143
1144 @section Compiler Characteristics
1145
1146 @section Target Conditionals
1147
1148 This section describes the macros that you can use to define the target
1149 machine.
1150
1151 @table @code
1152
1153 @item ADDITIONAL_OPTIONS
1154 @item ADDITIONAL_OPTION_CASES
1155 @item ADDITIONAL_OPTION_HANDLER
1156 @item ADDITIONAL_OPTION_HELP
1157 These are a set of macros that allow the addition of additional command
1158 line options to GDB.  They are currently used only for the unsupported
1159 i960 Nindy target, and should not be used in any other configuration.
1160
1161 @item ADDR_BITS_REMOVE (addr)
1162 If a raw machine address includes any bits that are not really part of
1163 the address, then define this macro to expand into an expression that
1164 zeros those bits in @var{addr}.  For example, the two low-order bits of
1165 a Motorola 88K address may be used by some kernels for their own
1166 purposes, since addresses must always be 4-byte aligned, and so are of
1167 no use for addressing.  Those bits should be filtered out with an
1168 expression such as @code{((addr) & ~3)}.
1169
1170 @item BEFORE_MAIN_LOOP_HOOK
1171 Define this to expand into any code that you want to execute before the
1172 main loop starts.  Although this is not, strictly speaking, a target
1173 conditional, that is how it is currently being used.  Note that if a
1174 configuration were to define it one way for a host and a different way
1175 for the target, GDB will probably not compile, let alone run correctly.
1176 This is currently used only for the unsupported i960 Nindy target, and
1177 should not be used in any other configuration.
1178
1179 @item BELIEVE_PCC_PROMOTION
1180 Define if the compiler promotes a short or char parameter to an int, but
1181 still reports the parameter as its original type, rather than the
1182 promoted type.
1183
1184 @item BELIEVE_PCC_PROMOTION_TYPE
1185 Define this if GDB should believe the type of a short argument when
1186 compiled by pcc, but look within a full int space to get its value.
1187 Only defined for Sun-3 at present.
1188
1189 @item BITS_BIG_ENDIAN
1190 Define this if the numbering of bits in the targets does *not* match the
1191 endianness of the target byte order.  A value of 1 means that the bits
1192 are numbered in a big-endian order, 0 means little-endian.
1193
1194 @item BREAKPOINT
1195 This is the character array initializer for the bit pattern to put into
1196 memory where a breakpoint is set.  Although it's common to use a trap
1197 instruction for a breakpoint, it's not required; for instance, the bit
1198 pattern could be an invalid instruction.  The breakpoint must be no
1199 longer than the shortest instruction of the architecture.
1200
1201 @item BIG_BREAKPOINT
1202 @item LITTLE_BREAKPOINT
1203 Similar to BREAKPOINT, but used for bi-endian targets.
1204
1205 @item REMOTE_BREAKPOINT
1206 @item LITTLE_REMOTE_BREAKPOINT
1207 @item BIG_REMOTE_BREAKPOINT
1208 Similar to BREAKPOINT, but used for remote targets.
1209
1210 @item BREAKPOINT_FROM_PC (pcptr, lenptr)
1211
1212 Use the program counter to determine the contents and size of a
1213 breakpoint instruction.  It returns a pointer to a string of bytes that
1214 encode a breakpoint instruction, stores the length of the string to
1215 *lenptr, and adjusts pc (if necessary) to point to the actual memory
1216 location where the breakpoint should be inserted.
1217
1218 Although it is common to use a trap instruction for a breakpoint, it's
1219 not required; for instance, the bit pattern could be an invalid
1220 instruction.  The breakpoint must be no longer than the shortest
1221 instruction of the architecture.
1222
1223 Replaces all the other BREAKPOINTs.
1224
1225 @item CALL_DUMMY
1226 valops.c
1227 @item CALL_DUMMY_LOCATION
1228 inferior.h
1229 @item CALL_DUMMY_STACK_ADJUST
1230 valops.c
1231
1232 @item CANNOT_FETCH_REGISTER (regno)
1233 A C expression that should be nonzero if @var{regno} cannot be fetched
1234 from an inferior process.  This is only relevant if
1235 @code{FETCH_INFERIOR_REGISTERS} is not defined.
1236
1237 @item CANNOT_STORE_REGISTER (regno)
1238 A C expression that should be nonzero if @var{regno} should not be
1239 written to the target.  This is often the case for program counters,
1240 status words, and other special registers.  If this is not defined, GDB
1241 will assume that all registers may be written.
1242
1243 @item DO_DEFERRED_STORES
1244 @item CLEAR_DEFERRED_STORES
1245 Define this to execute any deferred stores of registers into the inferior,
1246 and to cancel any deferred stores.
1247
1248 Currently only implemented correctly for native Sparc configurations?
1249
1250 @item CPLUS_MARKER
1251 Define this to expand into the character that G++ uses to distinguish
1252 compiler-generated identifiers from programmer-specified identifiers.
1253 By default, this expands into @code{'$'}.  Most System V targets should
1254 define this to @code{'.'}.
1255
1256 @item DBX_PARM_SYMBOL_CLASS
1257 Hook for the @code{SYMBOL_CLASS} of a parameter when decoding DBX symbol
1258 information.  In the i960, parameters can be stored as locals or as
1259 args, depending on the type of the debug record.
1260
1261 @item DECR_PC_AFTER_BREAK
1262 Define this to be the amount by which to decrement the PC after the
1263 program encounters a breakpoint.  This is often the number of bytes in
1264 BREAKPOINT, though not always.  For most targets this value will be 0.
1265
1266 @item DECR_PC_AFTER_HW_BREAK
1267 Similarly, for hardware breakpoints.
1268
1269 @item DISABLE_UNSETTABLE_BREAK addr
1270 If defined, this should evaluate to 1 if @var{addr} is in a shared
1271 library in which breakpoints cannot be set and so should be disabled.
1272
1273 @item DO_REGISTERS_INFO
1274 If defined, use this to print the value of a register or all registers.
1275
1276 @item END_OF_TEXT_DEFAULT
1277 This is an expression that should designate the end of the text section
1278 (? FIXME ?)
1279
1280 @item EXTRACT_RETURN_VALUE(type,regbuf,valbuf)
1281 Define this to extract a function's return value of type @var{type} from
1282 the raw register state @var{regbuf} and copy that, in virtual format,
1283 into @var{valbuf}.
1284
1285 @item EXTRACT_STRUCT_VALUE_ADDRESS(regbuf)
1286 Define this to extract from an array @var{regbuf} containing the (raw)
1287 register state, the address in which a function should return its
1288 structure value, as a CORE_ADDR (or an expression that can be used as
1289 one).
1290
1291 @item FLOAT_INFO
1292 If defined, then the `info float' command will print information about
1293 the processor's floating point unit.
1294
1295 @item FP_REGNUM
1296 The number of the frame pointer register.
1297
1298 @item FRAMELESS_FUNCTION_INVOCATION(fi, frameless)
1299 Define this to set the variable @var{frameless} to 1 if the function
1300 invocation represented by @var{fi} does not have a stack frame
1301 associated with it.  Otherwise set it to 0.
1302
1303 @item FRAME_ARGS_ADDRESS_CORRECT
1304 stack.c
1305
1306 @item FRAME_CHAIN(frame)
1307 Given @var{frame}, return a pointer to the calling frame.
1308
1309 @item FRAME_CHAIN_COMBINE(chain,frame)
1310 Define this to take the frame chain pointer and the frame's nominal
1311 address and produce the nominal address of the caller's frame.
1312 Presently only defined for HP PA.
1313
1314 @item FRAME_CHAIN_VALID(chain,thisframe)
1315
1316 Define this to be an expression that returns zero if the given frame is
1317 an outermost frame, with no caller, and nonzero otherwise.  Three common
1318 definitions are available. @code{default_frame_chain_valid} (the
1319 default) is nonzero if the chain pointer is nonzero and given frame's PC
1320 is not inside the startup file (such as @file{crt0.o}).
1321 @code{alternate_frame_chain_valid} is nonzero if the chain pointer is
1322 nonzero and the given frame's PC is not in @code{main()} or a known
1323 entry point function (such as @code{_start()}).
1324
1325 @item FRAME_INIT_SAVED_REGS(frame)
1326 See @file{frame.h}.  Determines the address of all registers in the
1327 current stack frame storing each in @code{frame->saved_regs}.  Space for
1328 @code{frame->saved_regs} shall be allocated by
1329 @code{FRAME_INIT_SAVED_REGS} using either
1330 @code{frame_saved_regs_zalloc} or @code{frame_obstack_alloc}.
1331
1332 @var{FRAME_FIND_SAVED_REGS} and @var{EXTRA_FRAME_INFO} are deprecated.
1333
1334 @item FRAME_NUM_ARGS (val, fi)
1335 For the frame described by @var{fi}, set @var{val} to the number of arguments
1336 that are being passed.
1337
1338 @item FRAME_SAVED_PC(frame)
1339 Given @var{frame}, return the pc saved there.  That is, the return
1340 address.
1341
1342 @item FUNCTION_EPILOGUE_SIZE
1343 For some COFF targets, the @code{x_sym.x_misc.x_fsize} field of the
1344 function end symbol is 0.  For such targets, you must define
1345 @code{FUNCTION_EPILOGUE_SIZE} to expand into the standard size of a
1346 function's epilogue.
1347
1348 @item GCC_COMPILED_FLAG_SYMBOL
1349 @item GCC2_COMPILED_FLAG_SYMBOL
1350 If defined, these are the names of the symbols that GDB will look for to
1351 detect that GCC compiled the file.  The default symbols are
1352 @code{gcc_compiled.} and @code{gcc2_compiled.}, respectively. (Currently
1353 only defined for the Delta 68.)
1354
1355 @item GDB_TARGET_IS_HPPA
1356 This determines whether horrible kludge code in dbxread.c and
1357 partial-stab.h is used to mangle multiple-symbol-table files from
1358 HPPA's.  This should all be ripped out, and a scheme like elfread.c
1359 used.
1360
1361 @item GDB_TARGET_IS_MACH386
1362 @item GDB_TARGET_IS_SUN3
1363 @item GDB_TARGET_IS_SUN386
1364 Kludges that should go away.
1365
1366 @item GET_LONGJMP_TARGET
1367 For most machines, this is a target-dependent parameter.  On the
1368 DECstation and the Iris, this is a native-dependent parameter, since
1369 <setjmp.h> is needed to define it.
1370
1371 This macro determines the target PC address that longjmp() will jump to,
1372 assuming that we have just stopped at a longjmp breakpoint.  It takes a
1373 CORE_ADDR * as argument, and stores the target PC value through this
1374 pointer.  It examines the current state of the machine as needed.
1375
1376 @item GET_SAVED_REGISTER
1377 Define this if you need to supply your own definition for the function
1378 @code{get_saved_register}.  Currently this is only done for the a29k.
1379
1380 @item HAVE_REGISTER_WINDOWS
1381 Define this if the target has register windows.
1382 @item REGISTER_IN_WINDOW_P (regnum)
1383 Define this to be an expression that is 1 if the given register is in
1384 the window.
1385
1386 @item IBM6000_TARGET
1387 Shows that we are configured for an IBM RS/6000 target.  This
1388 conditional should be eliminated (FIXME) and replaced by
1389 feature-specific macros.  It was introduced in haste and we are
1390 repenting at leisure.
1391
1392 @item IEEE_FLOAT
1393 Define this if the target system uses IEEE-format floating point numbers.
1394
1395 @item INIT_EXTRA_FRAME_INFO (fromleaf, frame)
1396 If additional information about the frame is required this should be
1397 stored in @code{frame->extra_info}.  Space for @code{frame->extra_info}
1398 is allocated using @code{frame_obstack_alloc}.
1399
1400 @item INIT_FRAME_PC (fromleaf, prev)
1401 This is a C statement that sets the pc of the frame pointed to by
1402 @var{prev}.  [By default...]
1403
1404 @item INNER_THAN (lhs,rhs)
1405 Returns non-zero if stack address @var{lhs} is inner than (nearer to the
1406 stack top) stack address @var{rhs}. Define this as @code{lhs < rhs} if
1407 the target's stack grows downward in memory, or @code{lhs > rsh} if the
1408 stack grows upward.
1409
1410 @item IN_SIGTRAMP (pc, name)
1411 Define this to return true if the given @var{pc} and/or @var{name}
1412 indicates that the current function is a sigtramp.
1413
1414 @item SIGTRAMP_START (pc)
1415 @item SIGTRAMP_END (pc)
1416 Define these to be the start and end address of the sigtramp for the
1417 given @var{pc}.  On machines where the address is just a compile time
1418 constant, the macro expansion will typically just ignore the supplied
1419 @var{pc}.
1420
1421 @item IN_SOLIB_CALL_TRAMPOLINE pc name
1422 Define this to evaluate to nonzero if the program is stopped in the
1423 trampoline that connects to a shared library.
1424
1425 @item IN_SOLIB_RETURN_TRAMPOLINE pc name
1426 Define this to evaluate to nonzero if the program is stopped in the
1427 trampoline that returns from a shared library.
1428
1429 @item IS_TRAPPED_INTERNALVAR (name)
1430 This is an ugly hook to allow the specification of special actions that
1431 should occur as a side-effect of setting the value of a variable
1432 internal to GDB.  Currently only used by the h8500.  Note that this
1433 could be either a host or target conditional.
1434
1435 @item NEED_TEXT_START_END
1436 Define this if GDB should determine the start and end addresses of the
1437 text section.  (Seems dubious.)
1438
1439 @item NO_HIF_SUPPORT
1440 (Specific to the a29k.)
1441
1442 @item SOFTWARE_SINGLE_STEP_P
1443 Define this as 1 if the target does not have a hardware single-step
1444 mechanism. The macro @code{SOFTWARE_SINGLE_STEP} must also be defined.
1445
1446 @item SOFTWARE_SINGLE_STEP(signal,insert_breapoints_p)
1447 A function that inserts or removes (dependant on
1448 @var{insert_breapoints_p}) breakpoints at each possible destinations of
1449 the next instruction. See @code{sparc-tdep.c} and @code{rs6000-tdep.c}
1450 for examples.
1451
1452 @item PCC_SOL_BROKEN
1453 (Used only in the Convex target.)
1454
1455 @item PC_IN_CALL_DUMMY
1456 inferior.h
1457
1458 @item PC_LOAD_SEGMENT
1459 If defined, print information about the load segment for the program
1460 counter.  (Defined only for the RS/6000.)
1461
1462 @item PC_REGNUM
1463 If the program counter is kept in a register, then define this macro to
1464 be the number of that register.  This need be defined only if
1465 @code{TARGET_WRITE_PC} is not defined.
1466
1467 @item NPC_REGNUM
1468 The number of the ``next program counter'' register, if defined.
1469
1470 @item NNPC_REGNUM
1471 The number of the ``next next program counter'' register, if defined.
1472 Currently, this is only defined for the Motorola 88K.
1473
1474 @item PRINT_REGISTER_HOOK (regno)
1475 If defined, this must be a function that prints the contents of the
1476 given register to standard output.
1477
1478 @item PRINT_TYPELESS_INTEGER
1479 This is an obscure substitute for @code{print_longest} that seems to
1480 have been defined for the Convex target.
1481
1482 @item PROCESS_LINENUMBER_HOOK
1483 A hook defined for XCOFF reading.
1484
1485 @item PROLOGUE_FIRSTLINE_OVERLAP
1486 (Only used in unsupported Convex configuration.)
1487
1488 @item PS_REGNUM
1489 If defined, this is the number of the processor status register.  (This
1490 definition is only used in generic code when parsing "$ps".)
1491
1492 @item POP_FRAME
1493 Used in @samp{call_function_by_hand} to remove an artificial stack
1494 frame.
1495
1496 @item PUSH_ARGUMENTS (nargs, args, sp, struct_return, struct_addr)
1497 Define this to push arguments onto the stack for inferior function call.
1498
1499 @item PUSH_DUMMY_FRAME
1500 Used in @samp{call_function_by_hand} to create an artificial stack frame.
1501
1502 @item REGISTER_BYTES
1503 The total amount of space needed to store GDB's copy of the machine's
1504 register state.
1505
1506 @item REGISTER_NAME(i)
1507 Return the name of register @var{i} as a string.  May return @var{NULL}
1508 or @var{NUL} to indicate that register @var{i} is not valid.
1509
1510 @item REG_STRUCT_HAS_ADDR (gcc_p, type)
1511 Define this to return 1 if the given type will be passed by pointer
1512 rather than directly.
1513
1514 @item SDB_REG_TO_REGNUM
1515 Define this to convert sdb register numbers into GDB regnums.  If not
1516 defined, no conversion will be done.
1517
1518 @item SHIFT_INST_REGS
1519 (Only used for m88k targets.)
1520
1521 @item SKIP_PROLOGUE (pc)
1522 A C statement that advances the @var{pc} across any function entry
1523 prologue instructions so as to reach ``real'' code.
1524
1525 @item SKIP_PROLOGUE_FRAMELESS_P
1526 A C statement that should behave similarly, but that can stop as soon as
1527 the function is known to have a frame.  If not defined,
1528 @code{SKIP_PROLOGUE} will be used instead.
1529
1530 @item SKIP_TRAMPOLINE_CODE (pc)
1531 If the target machine has trampoline code that sits between callers and
1532 the functions being called, then define this macro to return a new PC
1533 that is at the start of the real function.
1534
1535 @item SP_REGNUM
1536 Define this to be the number of the register that serves as the stack
1537 pointer.
1538
1539 @item STAB_REG_TO_REGNUM
1540 Define this to convert stab register numbers (as gotten from `r'
1541 declarations) into GDB regnums.  If not defined, no conversion will be
1542 done.
1543
1544 @item STACK_ALIGN (addr)
1545 Define this to adjust the address to the alignment required for the
1546 processor's stack.
1547
1548 @item STEP_SKIPS_DELAY (addr)
1549 Define this to return true if the address is of an instruction with a
1550 delay slot.  If a breakpoint has been placed in the instruction's delay
1551 slot, GDB will single-step over that instruction before resuming
1552 normally.  Currently only defined for the Mips.
1553
1554 @item STORE_RETURN_VALUE (type, valbuf)
1555 A C expression that stores a function return value of type @var{type},
1556 where @var{valbuf} is the address of the value to be stored.
1557
1558 @item SUN_FIXED_LBRAC_BUG
1559 (Used only for Sun-3 and Sun-4 targets.)
1560
1561 @item SYMBOL_RELOADING_DEFAULT
1562 The default value of the `symbol-reloading' variable.  (Never defined in
1563 current sources.)
1564
1565 @item TARGET_BYTE_ORDER
1566 The ordering of bytes in the target.  This must be defined to be either
1567 @code{BIG_ENDIAN} or @code{LITTLE_ENDIAN}.
1568
1569 @item TARGET_CHAR_BIT
1570 Number of bits in a char; defaults to 8.
1571
1572 @item TARGET_COMPLEX_BIT
1573 Number of bits in a complex number; defaults to @code{2 * TARGET_FLOAT_BIT}.
1574
1575 @item TARGET_DOUBLE_BIT
1576 Number of bits in a double float; defaults to @code{8 * TARGET_CHAR_BIT}.
1577
1578 @item TARGET_DOUBLE_COMPLEX_BIT
1579 Number of bits in a double complex; defaults to @code{2 * TARGET_DOUBLE_BIT}.
1580
1581 @item TARGET_FLOAT_BIT
1582 Number of bits in a float; defaults to @code{4 * TARGET_CHAR_BIT}.
1583
1584 @item TARGET_INT_BIT
1585 Number of bits in an integer; defaults to @code{4 * TARGET_CHAR_BIT}.
1586
1587 @item TARGET_LONG_BIT
1588 Number of bits in a long integer; defaults to @code{4 * TARGET_CHAR_BIT}.
1589
1590 @item TARGET_LONG_DOUBLE_BIT
1591 Number of bits in a long double float;
1592 defaults to @code{2 * TARGET_DOUBLE_BIT}.
1593
1594 @item TARGET_LONG_LONG_BIT
1595 Number of bits in a long long integer; defaults to @code{2 * TARGET_LONG_BIT}.
1596
1597 @item TARGET_PTR_BIT
1598 Number of bits in a pointer; defaults to @code{TARGET_INT_BIT}.
1599
1600 @item TARGET_SHORT_BIT
1601 Number of bits in a short integer; defaults to @code{2 * TARGET_CHAR_BIT}.
1602
1603 @item TARGET_READ_PC
1604 @item TARGET_WRITE_PC (val, pid)
1605 @item TARGET_READ_SP
1606 @item TARGET_WRITE_SP
1607 @item TARGET_READ_FP
1608 @item TARGET_WRITE_FP
1609 These change the behavior of @code{read_pc}, @code{write_pc},
1610 @code{read_sp}, @code{write_sp}, @code{read_fp} and @code{write_fp}.
1611 For most targets, these may be left undefined.  GDB will call the read
1612 and write register functions with the relevant @code{_REGNUM} argument.
1613
1614 These macros are useful when a target keeps one of these registers in a
1615 hard to get at place; for example, part in a segment register and part
1616 in an ordinary register.
1617
1618 @item TARGET_VIRTUAL_FRAME_POINTER(pc,regp,offsetp)
1619 Returns a @code{(register, offset)} pair representing the virtual
1620 frame pointer in use at the code address @code{"pc"}.  If virtual
1621 frame pointers are not used, a default definition simply returns
1622 @code{FP_REGNUM}, with an offset of zero.
1623
1624 @item USE_STRUCT_CONVENTION (gcc_p, type)
1625 If defined, this must be an expression that is nonzero if a value of the
1626 given @var{type} being returned from a function must have space
1627 allocated for it on the stack.  @var{gcc_p} is true if the function
1628 being considered is known to have been compiled by GCC; this is helpful
1629 for systems where GCC is known to use different calling convention than
1630 other compilers.
1631
1632 @item VARIABLES_INSIDE_BLOCK (desc, gcc_p)
1633 For dbx-style debugging information, if the compiler puts variable
1634 declarations inside LBRAC/RBRAC blocks, this should be defined to be
1635 nonzero.  @var{desc} is the value of @code{n_desc} from the
1636 @code{N_RBRAC} symbol, and @var{gcc_p} is true if GDB has noticed the
1637 presence of either the @code{GCC_COMPILED_SYMBOL} or the
1638 @code{GCC2_COMPILED_SYMBOL}.  By default, this is 0.
1639
1640 @item OS9K_VARIABLES_INSIDE_BLOCK (desc, gcc_p)
1641 Similarly, for OS/9000.  Defaults to 1.
1642
1643 @end table
1644
1645 Motorola M68K target conditionals.
1646
1647 @table @code
1648
1649 @item BPT_VECTOR
1650 Define this to be the 4-bit location of the breakpoint trap vector.  If
1651 not defined, it will default to @code{0xf}.
1652
1653 @item REMOTE_BPT_VECTOR
1654 Defaults to @code{1}.
1655
1656 @end table
1657
1658 @section Adding a New Target
1659
1660 The following files define a target to GDB:
1661
1662 @table @file
1663
1664 @item gdb/config/@var{arch}/@var{ttt}.mt
1665 Contains a Makefile fragment specific to this target.  Specifies what
1666 object files are needed for target @var{ttt}, by defining
1667 @samp{TDEPFILES=@dots{}}.  Also specifies the header file which
1668 describes @var{ttt}, by defining @samp{TM_FILE= tm-@var{ttt}.h}.  You
1669 can also define @samp{TM_CFLAGS}, @samp{TM_CLIBS}, @samp{TM_CDEPS}, but
1670 these are now deprecated and may go away in future versions of GDB.
1671
1672 @item gdb/config/@var{arch}/tm-@var{ttt}.h
1673 (@file{tm.h} is a link to this file, created by configure).  Contains
1674 macro definitions about the target machine's registers, stack frame
1675 format and instructions.
1676
1677 @item gdb/@var{ttt}-tdep.c
1678 Contains any miscellaneous code required for this target machine.  On
1679 some machines it doesn't exist at all.  Sometimes the macros in
1680 @file{tm-@var{ttt}.h} become very complicated, so they are implemented
1681 as functions here instead, and the macro is simply defined to call the
1682 function.  This is vastly preferable, since it is easier to understand
1683 and debug.
1684
1685 @item gdb/config/@var{arch}/tm-@var{arch}.h
1686 This often exists to describe the basic layout of the target machine's
1687 processor chip (registers, stack, etc).  If used, it is included by
1688 @file{tm-@var{ttt}.h}.  It can be shared among many targets that use the
1689 same processor.
1690
1691 @item gdb/@var{arch}-tdep.c
1692 Similarly, there are often common subroutines that are shared by all
1693 target machines that use this particular architecture.
1694
1695 @end table
1696
1697 If you are adding a new operating system for an existing CPU chip, add a
1698 @file{config/tm-@var{os}.h} file that describes the operating system
1699 facilities that are unusual (extra symbol table info; the breakpoint
1700 instruction needed; etc).  Then write a @file{@var{arch}/tm-@var{os}.h}
1701 that just @code{#include}s @file{tm-@var{arch}.h} and
1702 @file{config/tm-@var{os}.h}.
1703
1704
1705 @node Target Vector Definition
1706
1707 @chapter Target Vector Definition
1708
1709 The target vector defines the interface between GDB's abstract handling
1710 of target systems, and the nitty-gritty code that actually exercises
1711 control over a process or a serial port.  GDB includes some 30-40
1712 different target vectors; however, each configuration of GDB includes
1713 only a few of them.
1714
1715 @section File Targets
1716
1717 Both executables and core files have target vectors.
1718
1719 @section Standard Protocol and Remote Stubs
1720
1721 GDB's file @file{remote.c} talks a serial protocol to code that runs in
1722 the target system.  GDB provides several sample ``stubs'' that can be
1723 integrated into target programs or operating systems for this purpose;
1724 they are named @file{*-stub.c}.
1725
1726 The GDB user's manual describes how to put such a stub into your target
1727 code.  What follows is a discussion of integrating the SPARC stub into a
1728 complicated operating system (rather than a simple program), by Stu
1729 Grossman, the author of this stub.
1730
1731 The trap handling code in the stub assumes the following upon entry to
1732 trap_low:
1733
1734 @enumerate
1735
1736 @item %l1 and %l2 contain pc and npc respectively at the time of the trap
1737
1738 @item traps are disabled
1739
1740 @item you are in the correct trap window
1741
1742 @end enumerate
1743
1744 As long as your trap handler can guarantee those conditions, then there
1745 is no reason why you shouldn't be able to `share' traps with the stub.
1746 The stub has no requirement that it be jumped to directly from the
1747 hardware trap vector.  That is why it calls @code{exceptionHandler()},
1748 which is provided by the external environment.  For instance, this could
1749 setup the hardware traps to actually execute code which calls the stub
1750 first, and then transfers to its own trap handler.
1751
1752 For the most point, there probably won't be much of an issue with
1753 `sharing' traps, as the traps we use are usually not used by the kernel,
1754 and often indicate unrecoverable error conditions.  Anyway, this is all
1755 controlled by a table, and is trivial to modify.  The most important
1756 trap for us is for @code{ta 1}.  Without that, we can't single step or
1757 do breakpoints.  Everything else is unnecessary for the proper operation
1758 of the debugger/stub.
1759
1760 From reading the stub, it's probably not obvious how breakpoints work.
1761 They are simply done by deposit/examine operations from GDB.
1762
1763 @section ROM Monitor Interface
1764
1765 @section Custom Protocols
1766
1767 @section Transport Layer
1768
1769 @section Builtin Simulator
1770
1771
1772 @node Native Debugging
1773
1774 @chapter Native Debugging
1775
1776 Several files control GDB's configuration for native support:
1777
1778 @table @file
1779
1780 @item gdb/config/@var{arch}/@var{xyz}.mh
1781 Specifies Makefile fragments needed when hosting @emph{or native} on
1782 machine @var{xyz}.  In particular, this lists the required
1783 native-dependent object files, by defining @samp{NATDEPFILES=@dots{}}.
1784 Also specifies the header file which describes native support on
1785 @var{xyz}, by defining @samp{NAT_FILE= nm-@var{xyz}.h}.  You can also
1786 define @samp{NAT_CFLAGS}, @samp{NAT_ADD_FILES}, @samp{NAT_CLIBS},
1787 @samp{NAT_CDEPS}, etc.; see @file{Makefile.in}.
1788
1789 @item gdb/config/@var{arch}/nm-@var{xyz}.h
1790 (@file{nm.h} is a link to this file, created by configure).  Contains C
1791 macro definitions describing the native system environment, such as
1792 child process control and core file support.
1793
1794 @item gdb/@var{xyz}-nat.c
1795 Contains any miscellaneous C code required for this native support of
1796 this machine.  On some machines it doesn't exist at all.
1797
1798 @end table
1799
1800 There are some ``generic'' versions of routines that can be used by
1801 various systems.  These can be customized in various ways by macros
1802 defined in your @file{nm-@var{xyz}.h} file.  If these routines work for
1803 the @var{xyz} host, you can just include the generic file's name (with
1804 @samp{.o}, not @samp{.c}) in @code{NATDEPFILES}.
1805
1806 Otherwise, if your machine needs custom support routines, you will need
1807 to write routines that perform the same functions as the generic file.
1808 Put them into @code{@var{xyz}-nat.c}, and put @code{@var{xyz}-nat.o}
1809 into @code{NATDEPFILES}.
1810
1811 @table @file
1812
1813 @item inftarg.c
1814 This contains the @emph{target_ops vector} that supports Unix child
1815 processes on systems which use ptrace and wait to control the child.
1816
1817 @item procfs.c
1818 This contains the @emph{target_ops vector} that supports Unix child
1819 processes on systems which use /proc to control the child.
1820
1821 @item fork-child.c
1822 This does the low-level grunge that uses Unix system calls to do a "fork
1823 and exec" to start up a child process.
1824
1825 @item infptrace.c
1826 This is the low level interface to inferior processes for systems using
1827 the Unix @code{ptrace} call in a vanilla way.
1828
1829 @end table
1830
1831 @section Native core file Support
1832
1833 @table @file
1834
1835 @item core-aout.c::fetch_core_registers()
1836 Support for reading registers out of a core file.  This routine calls
1837 @code{register_addr()}, see below.  Now that BFD is used to read core
1838 files, virtually all machines should use @code{core-aout.c}, and should
1839 just provide @code{fetch_core_registers} in @code{@var{xyz}-nat.c} (or
1840 @code{REGISTER_U_ADDR} in @code{nm-@var{xyz}.h}).
1841
1842 @item core-aout.c::register_addr()
1843 If your @code{nm-@var{xyz}.h} file defines the macro
1844 @code{REGISTER_U_ADDR(addr, blockend, regno)}, it should be defined to
1845 set @code{addr} to the offset within the @samp{user} struct of GDB
1846 register number @code{regno}.  @code{blockend} is the offset within the
1847 ``upage'' of @code{u.u_ar0}.  If @code{REGISTER_U_ADDR} is defined,
1848 @file{core-aout.c} will define the @code{register_addr()} function and
1849 use the macro in it.  If you do not define @code{REGISTER_U_ADDR}, but
1850 you are using the standard @code{fetch_core_registers()}, you will need
1851 to define your own version of @code{register_addr()}, put it into your
1852 @code{@var{xyz}-nat.c} file, and be sure @code{@var{xyz}-nat.o} is in
1853 the @code{NATDEPFILES} list.  If you have your own
1854 @code{fetch_core_registers()}, you may not need a separate
1855 @code{register_addr()}.  Many custom @code{fetch_core_registers()}
1856 implementations simply locate the registers themselves.@refill
1857
1858 @end table
1859
1860 When making GDB run native on a new operating system, to make it
1861 possible to debug core files, you will need to either write specific
1862 code for parsing your OS's core files, or customize
1863 @file{bfd/trad-core.c}.  First, use whatever @code{#include} files your
1864 machine uses to define the struct of registers that is accessible
1865 (possibly in the u-area) in a core file (rather than
1866 @file{machine/reg.h}), and an include file that defines whatever header
1867 exists on a core file (e.g. the u-area or a @samp{struct core}).  Then
1868 modify @code{trad_unix_core_file_p()} to use these values to set up the
1869 section information for the data segment, stack segment, any other
1870 segments in the core file (perhaps shared library contents or control
1871 information), ``registers'' segment, and if there are two discontiguous
1872 sets of registers (e.g.  integer and float), the ``reg2'' segment.  This
1873 section information basically delimits areas in the core file in a
1874 standard way, which the section-reading routines in BFD know how to seek
1875 around in.
1876
1877 Then back in GDB, you need a matching routine called
1878 @code{fetch_core_registers()}.  If you can use the generic one, it's in
1879 @file{core-aout.c}; if not, it's in your @file{@var{xyz}-nat.c} file.
1880 It will be passed a char pointer to the entire ``registers'' segment,
1881 its length, and a zero; or a char pointer to the entire ``regs2''
1882 segment, its length, and a 2.  The routine should suck out the supplied
1883 register values and install them into GDB's ``registers'' array.
1884
1885 If your system uses @file{/proc} to control processes, and uses ELF
1886 format core files, then you may be able to use the same routines for
1887 reading the registers out of processes and out of core files.
1888
1889 @section ptrace
1890
1891 @section /proc
1892
1893 @section win32
1894
1895 @section shared libraries
1896
1897 @section Native Conditionals
1898
1899 When GDB is configured and compiled, various macros are defined or left
1900 undefined, to control compilation when the host and target systems are
1901 the same.  These macros should be defined (or left undefined) in
1902 @file{nm-@var{system}.h}.
1903
1904 @table @code
1905
1906 @item ATTACH_DETACH
1907 If defined, then GDB will include support for the @code{attach} and
1908 @code{detach} commands.
1909
1910 @item CHILD_PREPARE_TO_STORE
1911 If the machine stores all registers at once in the child process, then
1912 define this to ensure that all values are correct.  This usually entails
1913 a read from the child.
1914
1915 [Note that this is incorrectly defined in @file{xm-@var{system}.h} files
1916 currently.]
1917
1918 @item FETCH_INFERIOR_REGISTERS
1919 Define this if the native-dependent code will provide its own routines
1920 @code{fetch_inferior_registers} and @code{store_inferior_registers} in
1921 @file{@var{HOST}-nat.c}.  If this symbol is @emph{not} defined, and
1922 @file{infptrace.c} is included in this configuration, the default
1923 routines in @file{infptrace.c} are used for these functions.
1924
1925 @item FILES_INFO_HOOK
1926 (Only defined for Convex.)
1927
1928 @item FP0_REGNUM
1929 This macro is normally defined to be the number of the first floating
1930 point register, if the machine has such registers.  As such, it would
1931 appear only in target-specific code.  However, /proc support uses this
1932 to decide whether floats are in use on this target.
1933
1934 @item GET_LONGJMP_TARGET
1935 For most machines, this is a target-dependent parameter.  On the
1936 DECstation and the Iris, this is a native-dependent parameter, since
1937 <setjmp.h> is needed to define it.
1938
1939 This macro determines the target PC address that longjmp() will jump to,
1940 assuming that we have just stopped at a longjmp breakpoint.  It takes a
1941 CORE_ADDR * as argument, and stores the target PC value through this
1942 pointer.  It examines the current state of the machine as needed.
1943
1944 @item KERNEL_U_ADDR
1945 Define this to the address of the @code{u} structure (the ``user
1946 struct'', also known as the ``u-page'') in kernel virtual memory.  GDB
1947 needs to know this so that it can subtract this address from absolute
1948 addresses in the upage, that are obtained via ptrace or from core files.
1949 On systems that don't need this value, set it to zero.
1950
1951 @item KERNEL_U_ADDR_BSD
1952 Define this to cause GDB to determine the address of @code{u} at
1953 runtime, by using Berkeley-style @code{nlist} on the kernel's image in
1954 the root directory.
1955
1956 @item KERNEL_U_ADDR_HPUX
1957 Define this to cause GDB to determine the address of @code{u} at
1958 runtime, by using HP-style @code{nlist} on the kernel's image in the
1959 root directory.
1960
1961 @item ONE_PROCESS_WRITETEXT
1962 Define this to be able to, when a breakpoint insertion fails, warn the
1963 user that another process may be running with the same executable.
1964
1965 @item PROC_NAME_FMT
1966 Defines the format for the name of a @file{/proc} device.  Should be
1967 defined in @file{nm.h} @emph{only} in order to override the default
1968 definition in @file{procfs.c}.
1969
1970 @item PTRACE_FP_BUG
1971 mach386-xdep.c
1972
1973 @item PTRACE_ARG3_TYPE
1974 The type of the third argument to the @code{ptrace} system call, if it
1975 exists and is different from @code{int}.
1976
1977 @item REGISTER_U_ADDR
1978 Defines the offset of the registers in the ``u area''.
1979
1980 @item SHELL_COMMAND_CONCAT
1981 If defined, is a string to prefix on the shell command used to start the
1982 inferior.
1983
1984 @item SHELL_FILE
1985 If defined, this is the name of the shell to use to run the inferior.
1986 Defaults to @code{"/bin/sh"}.
1987
1988 @item SOLIB_ADD (filename, from_tty, targ)
1989 Define this to expand into an expression that will cause the symbols in
1990 @var{filename} to be added to GDB's symbol table.
1991
1992 @item SOLIB_CREATE_INFERIOR_HOOK
1993 Define this to expand into any shared-library-relocation code that you
1994 want to be run just after the child process has been forked.
1995
1996 @item START_INFERIOR_TRAPS_EXPECTED
1997 When starting an inferior, GDB normally expects to trap twice; once when
1998 the shell execs, and once when the program itself execs.  If the actual
1999 number of traps is something other than 2, then define this macro to
2000 expand into the number expected.
2001
2002 @item SVR4_SHARED_LIBS
2003 Define this to indicate that SVR4-style shared libraries are in use.
2004
2005 @item USE_PROC_FS
2006 This determines whether small routines in @file{*-tdep.c}, which
2007 translate register values between GDB's internal representation and the
2008 /proc representation, are compiled.
2009
2010 @item U_REGS_OFFSET
2011 This is the offset of the registers in the upage.  It need only be
2012 defined if the generic ptrace register access routines in
2013 @file{infptrace.c} are being used (that is, @file{infptrace.c} is
2014 configured in, and @code{FETCH_INFERIOR_REGISTERS} is not defined).  If
2015 the default value from @file{infptrace.c} is good enough, leave it
2016 undefined.
2017
2018 The default value means that u.u_ar0 @emph{points to} the location of
2019 the registers.  I'm guessing that @code{#define U_REGS_OFFSET 0} means
2020 that u.u_ar0 @emph{is} the location of the registers.
2021
2022 @item CLEAR_SOLIB
2023 objfiles.c
2024
2025 @item DEBUG_PTRACE
2026 Define this to debug ptrace calls.
2027
2028 @end table
2029
2030
2031 @node Support Libraries
2032
2033 @chapter Support Libraries
2034
2035 @section BFD
2036
2037 BFD provides support for GDB in several ways:
2038
2039 @table @emph
2040
2041 @item identifying executable and core files
2042 BFD will identify a variety of file types, including a.out, coff, and
2043 several variants thereof, as well as several kinds of core files.
2044
2045 @item access to sections of files
2046 BFD parses the file headers to determine the names, virtual addresses,
2047 sizes, and file locations of all the various named sections in files
2048 (such as the text section or the data section).  GDB simply calls BFD to
2049 read or write section X at byte offset Y for length Z.
2050
2051 @item specialized core file support
2052 BFD provides routines to determine the failing command name stored in a
2053 core file, the signal with which the program failed, and whether a core
2054 file matches (i.e. could be a core dump of) a particular executable
2055 file.
2056
2057 @item locating the symbol information
2058 GDB uses an internal interface of BFD to determine where to find the
2059 symbol information in an executable file or symbol-file.  GDB itself
2060 handles the reading of symbols, since BFD does not ``understand'' debug
2061 symbols, but GDB uses BFD's cached information to find the symbols,
2062 string table, etc.
2063
2064 @end table
2065
2066 @section opcodes
2067
2068 The opcodes library provides GDB's disassembler.  (It's a separate
2069 library because it's also used in binutils, for @file{objdump}).
2070
2071 @section readline
2072
2073 @section mmalloc
2074
2075 @section libiberty
2076
2077 @section gnu-regex
2078
2079 Regex conditionals.
2080
2081 @table @code
2082
2083 @item C_ALLOCA
2084
2085 @item NFAILURES
2086
2087 @item RE_NREGS
2088
2089 @item SIGN_EXTEND_CHAR
2090
2091 @item SWITCH_ENUM_BUG
2092
2093 @item SYNTAX_TABLE
2094
2095 @item Sword
2096
2097 @item sparc
2098
2099 @end table
2100
2101 @section include
2102
2103 @node Coding
2104
2105 @chapter Coding
2106
2107 This chapter covers topics that are lower-level than the major
2108 algorithms of GDB.
2109
2110 @section Cleanups
2111
2112 Cleanups are a structured way to deal with things that need to be done
2113 later.  When your code does something (like @code{malloc} some memory,
2114 or open a file) that needs to be undone later (e.g. free the memory or
2115 close the file), it can make a cleanup.  The cleanup will be done at
2116 some future point: when the command is finished, when an error occurs,
2117 or when your code decides it's time to do cleanups.
2118
2119 You can also discard cleanups, that is, throw them away without doing
2120 what they say.  This is only done if you ask that it be done.
2121
2122 Syntax:
2123
2124 @table @code
2125
2126 @item struct cleanup *@var{old_chain};
2127 Declare a variable which will hold a cleanup chain handle.
2128
2129 @item @var{old_chain} = make_cleanup (@var{function}, @var{arg});
2130 Make a cleanup which will cause @var{function} to be called with
2131 @var{arg} (a @code{char *}) later.  The result, @var{old_chain}, is a
2132 handle that can be passed to @code{do_cleanups} or
2133 @code{discard_cleanups} later.  Unless you are going to call
2134 @code{do_cleanups} or @code{discard_cleanups} yourself, you can ignore
2135 the result from @code{make_cleanup}.
2136
2137 @item do_cleanups (@var{old_chain});
2138 Perform all cleanups done since @code{make_cleanup} returned
2139 @var{old_chain}.  E.g.:
2140 @example
2141 make_cleanup (a, 0);
2142 old = make_cleanup (b, 0);
2143 do_cleanups (old);
2144 @end example
2145 @noindent
2146 will call @code{b()} but will not call @code{a()}.  The cleanup that
2147 calls @code{a()} will remain in the cleanup chain, and will be done
2148 later unless otherwise discarded.@refill
2149
2150 @item discard_cleanups (@var{old_chain});
2151 Same as @code{do_cleanups} except that it just removes the cleanups from
2152 the chain and does not call the specified functions.
2153
2154 @end table
2155
2156 Some functions, e.g. @code{fputs_filtered()} or @code{error()}, specify
2157 that they ``should not be called when cleanups are not in place''.  This
2158 means that any actions you need to reverse in the case of an error or
2159 interruption must be on the cleanup chain before you call these
2160 functions, since they might never return to your code (they
2161 @samp{longjmp} instead).
2162
2163 @section Wrapping Output Lines
2164
2165 Output that goes through @code{printf_filtered} or @code{fputs_filtered}
2166 or @code{fputs_demangled} needs only to have calls to @code{wrap_here}
2167 added in places that would be good breaking points.  The utility
2168 routines will take care of actually wrapping if the line width is
2169 exceeded.
2170
2171 The argument to @code{wrap_here} is an indentation string which is
2172 printed @emph{only} if the line breaks there.  This argument is saved
2173 away and used later.  It must remain valid until the next call to
2174 @code{wrap_here} or until a newline has been printed through the
2175 @code{*_filtered} functions.  Don't pass in a local variable and then
2176 return!
2177
2178 It is usually best to call @code{wrap_here()} after printing a comma or
2179 space.  If you call it before printing a space, make sure that your
2180 indentation properly accounts for the leading space that will print if
2181 the line wraps there.
2182
2183 Any function or set of functions that produce filtered output must
2184 finish by printing a newline, to flush the wrap buffer, before switching
2185 to unfiltered (``@code{printf}'') output.  Symbol reading routines that
2186 print warnings are a good example.
2187
2188 @section Coding Style
2189
2190 GDB follows the GNU coding standards, as described in
2191 @file{etc/standards.texi}.  This file is also available for anonymous
2192 FTP from GNU archive sites.  There are some additional considerations
2193 for GDB maintainers that reflect the unique environment and style of GDB
2194 maintenance.  If you follow these guidelines, GDB will be more
2195 consistent and easier to maintain.
2196
2197 GDB's policy on the use of prototypes is that prototypes are used to
2198 @emph{declare} functions but never to @emph{define} them.  Simple macros
2199 are used in the declarations, so that a non-ANSI compiler can compile
2200 GDB without trouble.  The simple macro calls are used like this:
2201
2202 @example @code
2203 extern int memory_remove_breakpoint PARAMS ((CORE_ADDR, char *));
2204 @end example
2205
2206 Note the double parentheses around the parameter types.  This allows an
2207 arbitrary number of parameters to be described, without freaking out the
2208 C preprocessor.  When the function has no parameters, it should be
2209 described like:
2210
2211 @example @code
2212 void noprocess PARAMS ((void));
2213 @end example
2214
2215 The @code{PARAMS} macro expands to its argument in ANSI C, or to a
2216 simple @code{()} in traditional C.
2217
2218 All external functions should have a @code{PARAMS} declaration in a
2219 header file that callers include.  All static functions should have such
2220 a declaration near the top of their source file.
2221
2222 We don't have a gcc option that will properly check that these rules
2223 have been followed, but it's GDB policy, and we periodically check it
2224 using the tools available (plus manual labor), and clean up any
2225 remnants.
2226
2227 @section Clean Design
2228
2229 In addition to getting the syntax right, there's the little question of
2230 semantics.  Some things are done in certain ways in GDB because long
2231 experience has shown that the more obvious ways caused various kinds of
2232 trouble.
2233
2234 You can't assume the byte order of anything that comes from a target
2235 (including @var{value}s, object files, and instructions).  Such things
2236 must be byte-swapped using @code{SWAP_TARGET_AND_HOST} in GDB, or one of
2237 the swap routines defined in @file{bfd.h}, such as @code{bfd_get_32}.
2238
2239 You can't assume that you know what interface is being used to talk to
2240 the target system.  All references to the target must go through the
2241 current @code{target_ops} vector.
2242
2243 You can't assume that the host and target machines are the same machine
2244 (except in the ``native'' support modules).  In particular, you can't
2245 assume that the target machine's header files will be available on the
2246 host machine.  Target code must bring along its own header files --
2247 written from scratch or explicitly donated by their owner, to avoid
2248 copyright problems.
2249
2250 Insertion of new @code{#ifdef}'s will be frowned upon.  It's much better
2251 to write the code portably than to conditionalize it for various
2252 systems.
2253
2254 New @code{#ifdef}'s which test for specific compilers or manufacturers
2255 or operating systems are unacceptable.  All @code{#ifdef}'s should test
2256 for features.  The information about which configurations contain which
2257 features should be segregated into the configuration files.  Experience
2258 has proven far too often that a feature unique to one particular system
2259 often creeps into other systems; and that a conditional based on some
2260 predefined macro for your current system will become worthless over
2261 time, as new versions of your system come out that behave differently
2262 with regard to this feature.
2263
2264 Adding code that handles specific architectures, operating systems,
2265 target interfaces, or hosts, is not acceptable in generic code.  If a
2266 hook is needed at that point, invent a generic hook and define it for
2267 your configuration, with something like:
2268
2269 @example
2270 #ifdef  WRANGLE_SIGNALS
2271    WRANGLE_SIGNALS (signo);
2272 #endif
2273 @end example
2274
2275 In your host, target, or native configuration file, as appropriate,
2276 define @code{WRANGLE_SIGNALS} to do the machine-dependent thing.  Take a
2277 bit of care in defining the hook, so that it can be used by other ports
2278 in the future, if they need a hook in the same place.
2279
2280 If the hook is not defined, the code should do whatever "most" machines
2281 want.  Using @code{#ifdef}, as above, is the preferred way to do this,
2282 but sometimes that gets convoluted, in which case use
2283
2284 @example
2285 #ifndef SPECIAL_FOO_HANDLING
2286 #define SPECIAL_FOO_HANDLING(pc, sp) (0)
2287 #endif
2288 @end example
2289
2290 where the macro is used or in an appropriate header file.
2291
2292 Whether to include a @dfn{small} hook, a hook around the exact pieces of
2293 code which are system-dependent, or whether to replace a whole function
2294 with a hook depends on the case.  A good example of this dilemma can be
2295 found in @code{get_saved_register}.  All machines that GDB 2.8 ran on
2296 just needed the @code{FRAME_FIND_SAVED_REGS} hook to find the saved
2297 registers.  Then the SPARC and Pyramid came along, and
2298 @code{HAVE_REGISTER_WINDOWS} and @code{REGISTER_IN_WINDOW_P} were
2299 introduced.  Then the 29k and 88k required the @code{GET_SAVED_REGISTER}
2300 hook.  The first three are examples of small hooks; the latter replaces
2301 a whole function.  In this specific case, it is useful to have both
2302 kinds; it would be a bad idea to replace all the uses of the small hooks
2303 with @code{GET_SAVED_REGISTER}, since that would result in much
2304 duplicated code.  Other times, duplicating a few lines of code here or
2305 there is much cleaner than introducing a large number of small hooks.
2306
2307 Another way to generalize GDB along a particular interface is with an
2308 attribute struct.  For example, GDB has been generalized to handle
2309 multiple kinds of remote interfaces -- not by #ifdef's everywhere, but
2310 by defining the "target_ops" structure and having a current target (as
2311 well as a stack of targets below it, for memory references).  Whenever
2312 something needs to be done that depends on which remote interface we are
2313 using, a flag in the current target_ops structure is tested (e.g.
2314 `target_has_stack'), or a function is called through a pointer in the
2315 current target_ops structure.  In this way, when a new remote interface
2316 is added, only one module needs to be touched -- the one that actually
2317 implements the new remote interface.  Other examples of
2318 attribute-structs are BFD access to multiple kinds of object file
2319 formats, or GDB's access to multiple source languages.
2320
2321 Please avoid duplicating code.  For example, in GDB 3.x all the code
2322 interfacing between @code{ptrace} and the rest of GDB was duplicated in
2323 @file{*-dep.c}, and so changing something was very painful.  In GDB 4.x,
2324 these have all been consolidated into @file{infptrace.c}.
2325 @file{infptrace.c} can deal with variations between systems the same way
2326 any system-independent file would (hooks, #if defined, etc.), and
2327 machines which are radically different don't need to use infptrace.c at
2328 all.
2329
2330 @emph{Do} write code that doesn't depend on the sizes of C data types,
2331 the format of the host's floating point numbers, the alignment of anything,
2332 or the order of evaluation of expressions.  In short, follow good
2333 programming practices for writing portable C code.
2334
2335
2336 @node Porting GDB
2337
2338 @chapter Porting GDB
2339
2340 Most of the work in making GDB compile on a new machine is in specifying
2341 the configuration of the machine.  This is done in a dizzying variety of
2342 header files and configuration scripts, which we hope to make more
2343 sensible soon.  Let's say your new host is called an @var{xyz} (e.g.
2344 @samp{sun4}), and its full three-part configuration name is
2345 @code{@var{arch}-@var{xvend}-@var{xos}} (e.g.  @samp{sparc-sun-sunos4}).
2346 In particular:
2347
2348 In the top level directory, edit @file{config.sub} and add @var{arch},
2349 @var{xvend}, and @var{xos} to the lists of supported architectures,
2350 vendors, and operating systems near the bottom of the file.  Also, add
2351 @var{xyz} as an alias that maps to
2352 @code{@var{arch}-@var{xvend}-@var{xos}}.  You can test your changes by
2353 running
2354
2355 @example
2356 ./config.sub @var{xyz}
2357 @end example
2358 @noindent
2359 and
2360 @example
2361 ./config.sub @code{@var{arch}-@var{xvend}-@var{xos}}
2362 @end example
2363 @noindent
2364 which should both respond with @code{@var{arch}-@var{xvend}-@var{xos}}
2365 and no error messages.
2366
2367 You need to port BFD, if that hasn't been done already.  Porting BFD is
2368 beyond the scope of this manual.
2369
2370 To configure GDB itself, edit @file{gdb/configure.host} to recognize
2371 your system and set @code{gdb_host} to @var{xyz}, and (unless your
2372 desired target is already available) also edit @file{gdb/configure.tgt},
2373 setting @code{gdb_target} to something appropriate (for instance,
2374 @var{xyz}).
2375
2376 Finally, you'll need to specify and define GDB's host-, native-, and
2377 target-dependent @file{.h} and @file{.c} files used for your
2378 configuration.
2379
2380 @section Configuring GDB for Release
2381
2382 From the top level directory (containing @file{gdb}, @file{bfd},
2383 @file{libiberty}, and so on):
2384 @example
2385 make -f Makefile.in gdb.tar.gz
2386 @end example
2387
2388 This will properly configure, clean, rebuild any files that are
2389 distributed pre-built (e.g. @file{c-exp.tab.c} or @file{refcard.ps}),
2390 and will then make a tarfile.  (If the top level directory has already
2391 been configured, you can just do @code{make gdb.tar.gz} instead.)
2392
2393 This procedure requires:
2394 @itemize @bullet
2395 @item symbolic links
2396 @item @code{makeinfo} (texinfo2 level)
2397 @item @TeX{}
2398 @item @code{dvips}
2399 @item @code{yacc} or @code{bison}
2400 @end itemize
2401 @noindent
2402 @dots{} and the usual slew of utilities (@code{sed}, @code{tar}, etc.).
2403
2404 @subheading TEMPORARY RELEASE PROCEDURE FOR DOCUMENTATION
2405
2406 @file{gdb.texinfo} is currently marked up using the texinfo-2 macros,
2407 which are not yet a default for anything (but we have to start using
2408 them sometime).
2409
2410 For making paper, the only thing this implies is the right generation of
2411 @file{texinfo.tex} needs to be included in the distribution.
2412
2413 For making info files, however, rather than duplicating the texinfo2
2414 distribution, generate @file{gdb-all.texinfo} locally, and include the
2415 files @file{gdb.info*} in the distribution.  Note the plural;
2416 @code{makeinfo} will split the document into one overall file and five
2417 or so included files.
2418
2419 @node Hints
2420
2421 @chapter Hints
2422
2423 Check the @file{README} file, it often has useful information that does not
2424 appear anywhere else in the directory.
2425
2426 @menu
2427 * Getting Started::             Getting started working on GDB
2428 * Debugging GDB::               Debugging GDB with itself
2429 @end menu
2430
2431 @node Getting Started,,, Hints
2432
2433 @section Getting Started
2434
2435 GDB is a large and complicated program, and if you first starting to
2436 work on it, it can be hard to know where to start.  Fortunately, if you
2437 know how to go about it, there are ways to figure out what is going on.
2438
2439 This manual, the GDB Internals manual, has information which applies
2440 generally to many parts of GDB.
2441
2442 Information about particular functions or data structures are located in
2443 comments with those functions or data structures.  If you run across a
2444 function or a global variable which does not have a comment correctly
2445 explaining what is does, this can be thought of as a bug in GDB; feel
2446 free to submit a bug report, with a suggested comment if you can figure
2447 out what the comment should say.  If you find a comment which is
2448 actually wrong, be especially sure to report that.
2449
2450 Comments explaining the function of macros defined in host, target, or
2451 native dependent files can be in several places.  Sometimes they are
2452 repeated every place the macro is defined.  Sometimes they are where the
2453 macro is used.  Sometimes there is a header file which supplies a
2454 default definition of the macro, and the comment is there.  This manual
2455 also documents all the available macros.
2456 @c (@pxref{Host Conditionals}, @pxref{Target
2457 @c Conditionals}, @pxref{Native Conditionals}, and @pxref{Obsolete
2458 @c Conditionals})
2459
2460 Start with the header files.  Once you some idea of how GDB's internal
2461 symbol tables are stored (see @file{symtab.h}, @file{gdbtypes.h}), you
2462 will find it much easier to understand the code which uses and creates
2463 those symbol tables.
2464
2465 You may wish to process the information you are getting somehow, to
2466 enhance your understanding of it.  Summarize it, translate it to another
2467 language, add some (perhaps trivial or non-useful) feature to GDB, use
2468 the code to predict what a test case would do and write the test case
2469 and verify your prediction, etc.  If you are reading code and your eyes
2470 are starting to glaze over, this is a sign you need to use a more active
2471 approach.
2472
2473 Once you have a part of GDB to start with, you can find more
2474 specifically the part you are looking for by stepping through each
2475 function with the @code{next} command.  Do not use @code{step} or you
2476 will quickly get distracted; when the function you are stepping through
2477 calls another function try only to get a big-picture understanding
2478 (perhaps using the comment at the beginning of the function being
2479 called) of what it does.  This way you can identify which of the
2480 functions being called by the function you are stepping through is the
2481 one which you are interested in.  You may need to examine the data
2482 structures generated at each stage, with reference to the comments in
2483 the header files explaining what the data structures are supposed to
2484 look like.
2485
2486 Of course, this same technique can be used if you are just reading the
2487 code, rather than actually stepping through it.  The same general
2488 principle applies---when the code you are looking at calls something
2489 else, just try to understand generally what the code being called does,
2490 rather than worrying about all its details.
2491
2492 A good place to start when tracking down some particular area is with a
2493 command which invokes that feature.  Suppose you want to know how
2494 single-stepping works.  As a GDB user, you know that the @code{step}
2495 command invokes single-stepping.  The command is invoked via command
2496 tables (see @file{command.h}); by convention the function which actually
2497 performs the command is formed by taking the name of the command and
2498 adding @samp{_command}, or in the case of an @code{info} subcommand,
2499 @samp{_info}.  For example, the @code{step} command invokes the
2500 @code{step_command} function and the @code{info display} command invokes
2501 @code{display_info}.  When this convention is not followed, you might
2502 have to use @code{grep} or @kbd{M-x tags-search} in emacs, or run GDB on
2503 itself and set a breakpoint in @code{execute_command}.
2504
2505 If all of the above fail, it may be appropriate to ask for information
2506 on @code{bug-gdb}.  But @emph{never} post a generic question like ``I was
2507 wondering if anyone could give me some tips about understanding
2508 GDB''---if we had some magic secret we would put it in this manual.
2509 Suggestions for improving the manual are always welcome, of course.
2510
2511 @node Debugging GDB,,,Hints
2512
2513 @section Debugging GDB with itself
2514
2515 If GDB is limping on your machine, this is the preferred way to get it
2516 fully functional.  Be warned that in some ancient Unix systems, like
2517 Ultrix 4.2, a program can't be running in one process while it is being
2518 debugged in another.  Rather than typing the command @code{@w{./gdb
2519 ./gdb}}, which works on Suns and such, you can copy @file{gdb} to
2520 @file{gdb2} and then type @code{@w{./gdb ./gdb2}}.
2521
2522 When you run GDB in the GDB source directory, it will read a
2523 @file{.gdbinit} file that sets up some simple things to make debugging
2524 gdb easier.  The @code{info} command, when executed without a subcommand
2525 in a GDB being debugged by gdb, will pop you back up to the top level
2526 gdb.  See @file{.gdbinit} for details.
2527
2528 If you use emacs, you will probably want to do a @code{make TAGS} after
2529 you configure your distribution; this will put the machine dependent
2530 routines for your local machine where they will be accessed first by
2531 @kbd{M-.}
2532
2533 Also, make sure that you've either compiled GDB with your local cc, or
2534 have run @code{fixincludes} if you are compiling with gcc.
2535
2536 @section Submitting Patches
2537
2538 Thanks for thinking of offering your changes back to the community of
2539 GDB users.  In general we like to get well designed enhancements.
2540 Thanks also for checking in advance about the best way to transfer the
2541 changes.
2542
2543 The GDB maintainers will only install ``cleanly designed'' patches.  You
2544 may not always agree on what is clean design.
2545 @c @pxref{Coding Style}, @pxref{Clean Design}.
2546
2547 If the maintainers don't have time to put the patch in when it arrives,
2548 or if there is any question about a patch, it goes into a large queue
2549 with everyone else's patches and bug reports.
2550
2551 The legal issue is that to incorporate substantial changes requires a
2552 copyright assignment from you and/or your employer, granting ownership
2553 of the changes to the Free Software Foundation.  You can get the
2554 standard document for doing this by sending mail to
2555 @code{gnu@@prep.ai.mit.edu} and asking for it.  I recommend that people
2556 write in "All programs owned by the Free Software Foundation" as "NAME
2557 OF PROGRAM", so that changes in many programs (not just GDB, but GAS,
2558 Emacs, GCC, etc) can be contributed with only one piece of legalese
2559 pushed through the bureacracy and filed with the FSF.  I can't start
2560 merging changes until this paperwork is received by the FSF (their
2561 rules, which I follow since I maintain it for them).
2562
2563 Technically, the easiest way to receive changes is to receive each
2564 feature as a small context diff or unidiff, suitable for "patch".
2565 Each message sent to me should include the changes to C code and
2566 header files for a single feature, plus ChangeLog entries for each
2567 directory where files were modified, and diffs for any changes needed
2568 to the manuals (gdb/doc/gdb.texi or gdb/doc/gdbint.texi).  If there
2569 are a lot of changes for a single feature, they can be split down
2570 into multiple messages.
2571
2572 In this way, if I read and like the feature, I can add it to the
2573 sources with a single patch command, do some testing, and check it in.
2574 If you leave out the ChangeLog, I have to write one.  If you leave
2575 out the doc, I have to puzzle out what needs documenting.  Etc.
2576
2577 The reason to send each change in a separate message is that I will
2578 not install some of the changes.  They'll be returned to you with
2579 questions or comments.  If I'm doing my job, my message back to you
2580 will say what you have to fix in order to make the change acceptable.
2581 The reason to have separate messages for separate features is so
2582 that other changes (which I @emph{am} willing to accept) can be installed
2583 while one or more changes are being reworked.  If multiple features
2584 are sent in a single message, I tend to not put in the effort to sort
2585 out the acceptable changes from the unacceptable, so none of the
2586 features get installed until all are acceptable.
2587
2588 If this sounds painful or authoritarian, well, it is.  But I get a lot
2589 of bug reports and a lot of patches, and most of them don't get
2590 installed because I don't have the time to finish the job that the bug
2591 reporter or the contributor could have done.  Patches that arrive
2592 complete, working, and well designed, tend to get installed on the day
2593 they arrive.  The others go into a queue and get installed if and when
2594 I scan back over the queue -- which can literally take months
2595 sometimes.  It's in both our interests to make patch installation easy
2596 -- you get your changes installed, and I make some forward progress on
2597 GDB in a normal 12-hour day (instead of them having to wait until I
2598 have a 14-hour or 16-hour day to spend cleaning up patches before I
2599 can install them).
2600
2601 Please send patches directly to the GDB maintainers at
2602 @code{gdb-patches@@cygnus.com}.
2603
2604 @section Obsolete Conditionals
2605
2606 Fragments of old code in GDB sometimes reference or set the following
2607 configuration macros.  They should not be used by new code, and old uses
2608 should be removed as those parts of the debugger are otherwise touched.
2609
2610 @table @code
2611
2612 @item STACK_END_ADDR
2613 This macro used to define where the end of the stack appeared, for use
2614 in interpreting core file formats that don't record this address in the
2615 core file itself.  This information is now configured in BFD, and GDB
2616 gets the info portably from there.  The values in GDB's configuration
2617 files should be moved into BFD configuration files (if needed there),
2618 and deleted from all of GDB's config files.
2619
2620 Any @file{@var{foo}-xdep.c} file that references STACK_END_ADDR
2621 is so old that it has never been converted to use BFD.  Now that's old!
2622
2623 @item PYRAMID_CONTROL_FRAME_DEBUGGING
2624 pyr-xdep.c
2625 @item PYRAMID_CORE
2626 pyr-xdep.c
2627 @item PYRAMID_PTRACE
2628 pyr-xdep.c
2629
2630 @item REG_STACK_SEGMENT
2631 exec.c
2632
2633 @end table
2634
2635
2636 @contents
2637 @bye