gcc/doc/cpp.texi

   1 \input texinfo
   2 @setfilename cpp.info
   3 @settitle The C Preprocessor
   4 @setchapternewpage off
   5 @c @smallbook
   6 @c @cropmarks
   7 @c @finalout
   8
   9 @include gcc-common.texi
  10
  11 @copying
  12 @c man begin COPYRIGHT
  13 Copyright @copyright{} 1987, 1989, 1991, 1992, 1993, 1994, 1995, 1996,
  14 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007,
  15 2008, 2009, 2010
  16 Free Software Foundation, Inc.
  17
  18 Permission is granted to copy, distribute and/or modify this document
  19 under the terms of the GNU Free Documentation License, Version 1.2 or
  20 any later version published by the Free Software Foundation.  A copy of
  21 the license is included in the
  22 @c man end
  23 section entitled ``GNU Free Documentation License''.
  24 @ignore
  25 @c man begin COPYRIGHT
  26 man page gfdl(7).
  27 @c man end
  28 @end ignore
  29
  30 @c man begin COPYRIGHT
  31 This manual contains no Invariant Sections.  The Front-Cover Texts are
  32 (a) (see below), and the Back-Cover Texts are (b) (see below).
  33
  34 (a) The FSF's Front-Cover Text is:
  35
  36      A GNU Manual
  37
  38 (b) The FSF's Back-Cover Text is:
  39
  40      You have freedom to copy and modify this GNU Manual, like GNU
  41      software.  Copies published by the Free Software Foundation raise
  42      funds for GNU development.
  43 @c man end
  44 @end copying
  45
  46 @c Create a separate index for command line options.
  47 @defcodeindex op
  48 @syncodeindex vr op
  49
  50 @c Used in cppopts.texi and cppenv.texi.
  51 @set cppmanual
  52
  53 @ifinfo
  54 @dircategory Software development
  55 @direntry
  56 * Cpp: (cpp).                  The GNU C preprocessor.
  57 @end direntry
  58 @end ifinfo
  59
  60 @titlepage
  61 @title The C Preprocessor
  62 @versionsubtitle
  63 @author Richard M. Stallman, Zachary Weinberg
  64 @page
  65 @c There is a fill at the bottom of the page, so we need a filll to
  66 @c override it.
  67 @vskip 0pt plus 1filll
  68 @insertcopying
  69 @end titlepage
  70 @contents
  71 @page
  72
  73 @ifnottex
  74 @node Top
  75 @top
  76 The C preprocessor implements the macro language used to transform C,
  77 C++, and Objective-C programs before they are compiled.  It can also be
  78 useful on its own.
  79
  80 @menu
  81 * Overview::
  82 * Header Files::
  83 * Macros::
  84 * Conditionals::
  85 * Diagnostics::
  86 * Line Control::
  87 * Pragmas::
  88 * Other Directives::
  89 * Preprocessor Output::
  90 * Traditional Mode::
  91 * Implementation Details::
  92 * Invocation::
  93 * Environment Variables::
  94 * GNU Free Documentation License::
  95 * Index of Directives::
  96 * Option Index::
  97 * Concept Index::
  98
  99 @detailmenu
 100  --- The Detailed Node Listing ---
 101
 102 Overview
 103
 104 * Character sets::
 105 * Initial processing::
 106 * Tokenization::
 107 * The preprocessing language::
 108
 109 Header Files
 110
 111 * Include Syntax::
 112 * Include Operation::
 113 * Search Path::
 114 * Once-Only Headers::
 115 * Alternatives to Wrapper #ifndef::
 116 * Computed Includes::
 117 * Wrapper Headers::
 118 * System Headers::
 119
 120 Macros
 121
 122 * Object-like Macros::
 123 * Function-like Macros::
 124 * Macro Arguments::
 125 * Stringification::
 126 * Concatenation::
 127 * Variadic Macros::
 128 * Predefined Macros::
 129 * Undefining and Redefining Macros::
 130 * Directives Within Macro Arguments::
 131 * Macro Pitfalls::
 132
 133 Predefined Macros
 134
 135 * Standard Predefined Macros::
 136 * Common Predefined Macros::
 137 * System-specific Predefined Macros::
 138 * C++ Named Operators::
 139
 140 Macro Pitfalls
 141
 142 * Misnesting::
 143 * Operator Precedence Problems::
 144 * Swallowing the Semicolon::
 145 * Duplication of Side Effects::
 146 * Self-Referential Macros::
 147 * Argument Prescan::
 148 * Newlines in Arguments::
 149
 150 Conditionals
 151
 152 * Conditional Uses::
 153 * Conditional Syntax::
 154 * Deleted Code::
 155
 156 Conditional Syntax
 157
 158 * Ifdef::
 159 * If::
 160 * Defined::
 161 * Else::
 162 * Elif::
 163
 164 Implementation Details
 165
 166 * Implementation-defined behavior::
 167 * Implementation limits::
 168 * Obsolete Features::
 169 * Differences from previous versions::
 170
 171 Obsolete Features
 172
 173 * Obsolete Features::
 174
 175 @end detailmenu
 176 @end menu
 177
 178 @insertcopying
 179 @end ifnottex
 180
 181 @node Overview
 182 @chapter Overview
 183 @c man begin DESCRIPTION
 184 The C preprocessor, often known as @dfn{cpp}, is a @dfn{macro processor}
 185 that is used automatically by the C compiler to transform your program
 186 before compilation.  It is called a macro processor because it allows
 187 you to define @dfn{macros}, which are brief abbreviations for longer
 188 constructs.
 189
 190 The C preprocessor is intended to be used only with C, C++, and
 191 Objective-C source code.  In the past, it has been abused as a general
 192 text processor.  It will choke on input which does not obey C's lexical
 193 rules.  For example, apostrophes will be interpreted as the beginning of
 194 character constants, and cause errors.  Also, you cannot rely on it
 195 preserving characteristics of the input which are not significant to
 196 C-family languages.  If a Makefile is preprocessed, all the hard tabs
 197 will be removed, and the Makefile will not work.
 198
 199 Having said that, you can often get away with using cpp on things which
 200 are not C@.  Other Algol-ish programming languages are often safe
 201 (Pascal, Ada, etc.) So is assembly, with caution.  @option{-traditional-cpp}
 202 mode preserves more white space, and is otherwise more permissive.  Many
 203 of the problems can be avoided by writing C or C++ style comments
 204 instead of native language comments, and keeping macros simple.
 205
 206 Wherever possible, you should use a preprocessor geared to the language
 207 you are writing in.  Modern versions of the GNU assembler have macro
 208 facilities.  Most high level programming languages have their own
 209 conditional compilation and inclusion mechanism.  If all else fails,
 210 try a true general text processor, such as GNU M4.
 211
 212 C preprocessors vary in some details.  This manual discusses the GNU C
 213 preprocessor, which provides a small superset of the features of ISO
 214 Standard C@.  In its default mode, the GNU C preprocessor does not do a
 215 few things required by the standard.  These are features which are
 216 rarely, if ever, used, and may cause surprising changes to the meaning
 217 of a program which does not expect them.  To get strict ISO Standard C,
 218 you should use the @option{-std=c90}, @option{-std=c99} or
 219 @option{-std=c1x} options, depending
 220 on which version of the standard you want.  To get all the mandatory
 221 diagnostics, you must also use @option{-pedantic}.  @xref{Invocation}.
 222
 223 This manual describes the behavior of the ISO preprocessor.  To
 224 minimize gratuitous differences, where the ISO preprocessor's
 225 behavior does not conflict with traditional semantics, the
 226 traditional preprocessor should behave the same way.  The various
 227 differences that do exist are detailed in the section @ref{Traditional
 228 Mode}.
 229
 230 For clarity, unless noted otherwise, references to @samp{CPP} in this
 231 manual refer to GNU CPP@.
 232 @c man end
 233
 234 @menu
 235 * Character sets::
 236 * Initial processing::
 237 * Tokenization::
 238 * The preprocessing language::
 239 @end menu
 240
 241 @node Character sets
 242 @section Character sets
 243
 244 Source code character set processing in C and related languages is
 245 rather complicated.  The C standard discusses two character sets, but
 246 there are really at least four.
 247
 248 The files input to CPP might be in any character set at all.  CPP's
 249 very first action, before it even looks for line boundaries, is to
 250 convert the file into the character set it uses for internal
 251 processing.  That set is what the C standard calls the @dfn{source}
 252 character set.  It must be isomorphic with ISO 10646, also known as
 253 Unicode.  CPP uses the UTF-8 encoding of Unicode.
 254
 255 The character sets of the input files are specified using the
 256 @option{-finput-charset=} option.
 257
 258 All preprocessing work (the subject of the rest of this manual) is
 259 carried out in the source character set.  If you request textual
 260 output from the preprocessor with the @option{-E} option, it will be
 261 in UTF-8.
 262
 263 After preprocessing is complete, string and character constants are
 264 converted again, into the @dfn{execution} character set.  This
 265 character set is under control of the user; the default is UTF-8,
 266 matching the source character set.  Wide string and character
 267 constants have their own character set, which is not called out
 268 specifically in the standard.  Again, it is under control of the user.
 269 The default is UTF-16 or UTF-32, whichever fits in the target's
 270 @code{wchar_t} type, in the target machine's byte
 271 order.@footnote{UTF-16 does not meet the requirements of the C
 272 standard for a wide character set, but the choice of 16-bit
 273 @code{wchar_t} is enshrined in some system ABIs so we cannot fix
 274 this.}  Octal and hexadecimal escape sequences do not undergo
 275 conversion; @t{'\x12'} has the value 0x12 regardless of the currently
 276 selected execution character set.  All other escapes are replaced by
 277 the character in the source character set that they represent, then
 278 converted to the execution character set, just like unescaped
 279 characters.
 280
 281 Unless the experimental @option{-fextended-identifiers} option is used,
 282 GCC does not permit the use of characters outside the ASCII range, nor
 283 @samp{\u} and @samp{\U} escapes, in identifiers.  Even with that
 284 option, characters outside the ASCII range can only be specified with
 285 the @samp{\u} and @samp{\U} escapes, not used directly in identifiers.
 286
 287 @node Initial processing
 288 @section Initial processing
 289
 290 The preprocessor performs a series of textual transformations on its
 291 input.  These happen before all other processing.  Conceptually, they
 292 happen in a rigid order, and the entire file is run through each
 293 transformation before the next one begins.  CPP actually does them
 294 all at once, for performance reasons.  These transformations correspond
 295 roughly to the first three ``phases of translation'' described in the C
 296 standard.
 297
 298 @enumerate
 299 @item
 300 @cindex line endings
 301 The input file is read into memory and broken into lines.
 302
 303 Different systems use different conventions to indicate the end of a
 304 line.  GCC accepts the ASCII control sequences @kbd{LF}, @kbd{@w{CR
 305 LF}} and @kbd{CR} as end-of-line markers.  These are the canonical
 306 sequences used by Unix, DOS and VMS, and the classic Mac OS (before
 307 OSX) respectively.  You may therefore safely copy source code written
 308 on any of those systems to a different one and use it without
 309 conversion.  (GCC may lose track of the current line number if a file
 310 doesn't consistently use one convention, as sometimes happens when it
 311 is edited on computers with different conventions that share a network
 312 file system.)
 313
 314 If the last line of any input file lacks an end-of-line marker, the end
 315 of the file is considered to implicitly supply one.  The C standard says
 316 that this condition provokes undefined behavior, so GCC will emit a
 317 warning message.
 318
 319 @item
 320 @cindex trigraphs
 321 @anchor{trigraphs}If trigraphs are enabled, they are replaced by their
 322 corresponding single characters.  By default GCC ignores trigraphs,
 323 but if you request a strictly conforming mode with the @option{-std}
 324 option, or you specify the @option{-trigraphs} option, then it
 325 converts them.
 326
 327 These are nine three-character sequences, all starting with @samp{??},
 328 that are defined by ISO C to stand for single characters.  They permit
 329 obsolete systems that lack some of C's punctuation to use C@.  For
 330 example, @samp{??/} stands for @samp{\}, so @t{'??/n'} is a character
 331 constant for a newline.
 332
 333 Trigraphs are not popular and many compilers implement them
 334 incorrectly.  Portable code should not rely on trigraphs being either
 335 converted or ignored.  With @option{-Wtrigraphs} GCC will warn you
 336 when a trigraph may change the meaning of your program if it were
 337 converted.  @xref{Wtrigraphs}.
 338
 339 In a string constant, you can prevent a sequence of question marks
 340 from being confused with a trigraph by inserting a backslash between
 341 the question marks, or by separating the string literal at the
 342 trigraph and making use of string literal concatenation.  @t{"(??\?)"}
 343 is the string @samp{(???)}, not @samp{(?]}.  Traditional C compilers
 344 do not recognize these idioms.
 345
 346 The nine trigraphs and their replacements are
 347
 348 @smallexample
 349 Trigraph:       ??(  ??)  ??<  ??>  ??=  ??/  ??'  ??!  ??-
 350 Replacement:      [    ]    @{    @}    #    \    ^    |    ~
 351 @end smallexample
 352
 353 @item
 354 @cindex continued lines
 355 @cindex backslash-newline
 356 Continued lines are merged into one long line.
 357
 358 A continued line is a line which ends with a backslash, @samp{\}.  The
 359 backslash is removed and the following line is joined with the current
 360 one.  No space is inserted, so you may split a line anywhere, even in
 361 the middle of a word.  (It is generally more readable to split lines
 362 only at white space.)
 363
 364 The trailing backslash on a continued line is commonly referred to as a
 365 @dfn{backslash-newline}.
 366
 367 If there is white space between a backslash and the end of a line, that
 368 is still a continued line.  However, as this is usually the result of an
 369 editing mistake, and many compilers will not accept it as a continued
 370 line, GCC will warn you about it.
 371
 372 @item
 373 @cindex comments
 374 @cindex line comments
 375 @cindex block comments
 376 All comments are replaced with single spaces.
 377
 378 There are two kinds of comments.  @dfn{Block comments} begin with
 379 @samp{/*} and continue until the next @samp{*/}.  Block comments do not
 380 nest:
 381
 382 @smallexample
 383 /* @r{this is} /* @r{one comment} */ @r{text outside comment}
 384 @end smallexample
 385
 386 @dfn{Line comments} begin with @samp{//} and continue to the end of the
 387 current line.  Line comments do not nest either, but it does not matter,
 388 because they would end in the same place anyway.
 389
 390 @smallexample
 391 // @r{this is} // @r{one comment}
 392 @r{text outside comment}
 393 @end smallexample
 394 @end enumerate
 395
 396 It is safe to put line comments inside block comments, or vice versa.
 397
 398 @smallexample
 399 @group
 400 /* @r{block comment}
 401    // @r{contains line comment}
 402    @r{yet more comment}
 403  */ @r{outside comment}
 404
 405 // @r{line comment} /* @r{contains block comment} */
 406 @end group
 407 @end smallexample
 408
 409 But beware of commenting out one end of a block comment with a line
 410 comment.
 411
 412 @smallexample
 413 @group
 414  // @r{l.c.}  /* @r{block comment begins}
 415     @r{oops! this isn't a comment anymore} */
 416 @end group
 417 @end smallexample
 418
 419 Comments are not recognized within string literals.
 420 @t{@w{"/* blah */"}} is the string constant @samp{@w{/* blah */}}, not
 421 an empty string.
 422
 423 Line comments are not in the 1989 edition of the C standard, but they
 424 are recognized by GCC as an extension.  In C++ and in the 1999 edition
 425 of the C standard, they are an official part of the language.
 426
 427 Since these transformations happen before all other processing, you can
 428 split a line mechanically with backslash-newline anywhere.  You can
 429 comment out the end of a line.  You can continue a line comment onto the
 430 next line with backslash-newline.  You can even split @samp{/*},
 431 @samp{*/}, and @samp{//} onto multiple lines with backslash-newline.
 432 For example:
 433
 434 @smallexample
 435 @group
 436 /\
 437 *
 438 */ # /*
 439 */ defi\
 440 ne FO\
 441 O 10\
 442 20
 443 @end group
 444 @end smallexample
 445
 446 @noindent
 447 is equivalent to @code{@w{#define FOO 1020}}.  All these tricks are
 448 extremely confusing and should not be used in code intended to be
 449 readable.
 450
 451 There is no way to prevent a backslash at the end of a line from being
 452 interpreted as a backslash-newline.  This cannot affect any correct
 453 program, however.
 454
 455 @node Tokenization
 456 @section Tokenization
 457
 458 @cindex tokens
 459 @cindex preprocessing tokens
 460 After the textual transformations are finished, the input file is
 461 converted into a sequence of @dfn{preprocessing tokens}.  These mostly
 462 correspond to the syntactic tokens used by the C compiler, but there are
 463 a few differences.  White space separates tokens; it is not itself a
 464 token of any kind.  Tokens do not have to be separated by white space,
 465 but it is often necessary to avoid ambiguities.
 466
 467 When faced with a sequence of characters that has more than one possible
 468 tokenization, the preprocessor is greedy.  It always makes each token,
 469 starting from the left, as big as possible before moving on to the next
 470 token.  For instance, @code{a+++++b} is interpreted as
 471 @code{@w{a ++ ++ + b}}, not as @code{@w{a ++ + ++ b}}, even though the
 472 latter tokenization could be part of a valid C program and the former
 473 could not.
 474
 475 Once the input file is broken into tokens, the token boundaries never
 476 change, except when the @samp{##} preprocessing operator is used to paste
 477 tokens together.  @xref{Concatenation}.  For example,
 478
 479 @smallexample
 480 @group
 481 #define foo() bar
 482 foo()baz
 483      @expansion{} bar baz
 484 @emph{not}
 485      @expansion{} barbaz
 486 @end group
 487 @end smallexample
 488
 489 The compiler does not re-tokenize the preprocessor's output.  Each
 490 preprocessing token becomes one compiler token.
 491
 492 @cindex identifiers
 493 Preprocessing tokens fall into five broad classes: identifiers,
 494 preprocessing numbers, string literals, punctuators, and other.  An
 495 @dfn{identifier} is the same as an identifier in C: any sequence of
 496 letters, digits, or underscores, which begins with a letter or
 497 underscore.  Keywords of C have no significance to the preprocessor;
 498 they are ordinary identifiers.  You can define a macro whose name is a
 499 keyword, for instance.  The only identifier which can be considered a
 500 preprocessing keyword is @code{defined}.  @xref{Defined}.
 501
 502 This is mostly true of other languages which use the C preprocessor.
 503 However, a few of the keywords of C++ are significant even in the
 504 preprocessor.  @xref{C++ Named Operators}.
 505
 506 In the 1999 C standard, identifiers may contain letters which are not
 507 part of the ``basic source character set'', at the implementation's
 508 discretion (such as accented Latin letters, Greek letters, or Chinese
 509 ideograms).  This may be done with an extended character set, or the
 510 @samp{\u} and @samp{\U} escape sequences.  The implementation of this
 511 feature in GCC is experimental; such characters are only accepted in
 512 the @samp{\u} and @samp{\U} forms and only if
 513 @option{-fextended-identifiers} is used.
 514
 515 As an extension, GCC treats @samp{$} as a letter.  This is for
 516 compatibility with some systems, such as VMS, where @samp{$} is commonly
 517 used in system-defined function and object names.  @samp{$} is not a
 518 letter in strictly conforming mode, or if you specify the @option{-$}
 519 option.  @xref{Invocation}.
 520
 521 @cindex numbers
 522 @cindex preprocessing numbers
 523 A @dfn{preprocessing number} has a rather bizarre definition.  The
 524 category includes all the normal integer and floating point constants
 525 one expects of C, but also a number of other things one might not
 526 initially recognize as a number.  Formally, preprocessing numbers begin
 527 with an optional period, a required decimal digit, and then continue
 528 with any sequence of letters, digits, underscores, periods, and
 529 exponents.  Exponents are the two-character sequences @samp{e+},
 530 @samp{e-}, @samp{E+}, @samp{E-}, @samp{p+}, @samp{p-}, @samp{P+}, and
 531 @samp{P-}.  (The exponents that begin with @samp{p} or @samp{P} are new
 532 to C99.  They are used for hexadecimal floating-point constants.)
 533
 534 The purpose of this unusual definition is to isolate the preprocessor
 535 from the full complexity of numeric constants.  It does not have to
 536 distinguish between lexically valid and invalid floating-point numbers,
 537 which is complicated.  The definition also permits you to split an
 538 identifier at any position and get exactly two tokens, which can then be
 539 pasted back together with the @samp{##} operator.
 540
 541 It's possible for preprocessing numbers to cause programs to be
 542 misinterpreted.  For example, @code{0xE+12} is a preprocessing number
 543 which does not translate to any valid numeric constant, therefore a
 544 syntax error.  It does not mean @code{@w{0xE + 12}}, which is what you
 545 might have intended.
 546
 547 @cindex string literals
 548 @cindex string constants
 549 @cindex character constants
 550 @cindex header file names
 551 @c the @: prevents makeinfo from turning '' into ".
 552 @dfn{String literals} are string constants, character constants, and
 553 header file names (the argument of @samp{#include}).@footnote{The C
 554 standard uses the term @dfn{string literal} to refer only to what we are
 555 calling @dfn{string constants}.}  String constants and character
 556 constants are straightforward: @t{"@dots{}"} or @t{'@dots{}'}.  In
 557 either case embedded quotes should be escaped with a backslash:
 558 @t{'\'@:'} is the character constant for @samp{'}.  There is no limit on
 559 the length of a character constant, but the value of a character
 560 constant that contains more than one character is
 561 implementation-defined.  @xref{Implementation Details}.
 562
 563 Header file names either look like string constants, @t{"@dots{}"}, or are
 564 written with angle brackets instead, @t{<@dots{}>}.  In either case,
 565 backslash is an ordinary character.  There is no way to escape the
 566 closing quote or angle bracket.  The preprocessor looks for the header
 567 file in different places depending on which form you use.  @xref{Include
 568 Operation}.
 569
 570 No string literal may extend past the end of a line.  Older versions
 571 of GCC accepted multi-line string constants.  You may use continued
 572 lines instead, or string constant concatenation.  @xref{Differences
 573 from previous versions}.
 574
 575 @cindex punctuators
 576 @cindex digraphs
 577 @cindex alternative tokens
 578 @dfn{Punctuators} are all the usual bits of punctuation which are
 579 meaningful to C and C++.  All but three of the punctuation characters in
 580 ASCII are C punctuators.  The exceptions are @samp{@@}, @samp{$}, and
 581 @samp{`}.  In addition, all the two- and three-character operators are
 582 punctuators.  There are also six @dfn{digraphs}, which the C++ standard
 583 calls @dfn{alternative tokens}, which are merely alternate ways to spell
 584 other punctuators.  This is a second attempt to work around missing
 585 punctuation in obsolete systems.  It has no negative side effects,
 586 unlike trigraphs, but does not cover as much ground.  The digraphs and
 587 their corresponding normal punctuators are:
 588
 589 @smallexample
 590 Digraph:        <%  %>  <:  :>  %:  %:%:
 591 Punctuator:      @{   @}   [   ]   #    ##
 592 @end smallexample
 593
 594 @cindex other tokens
 595 Any other single character is considered ``other''.  It is passed on to
 596 the preprocessor's output unmolested.  The C compiler will almost
 597 certainly reject source code containing ``other'' tokens.  In ASCII, the
 598 only other characters are @samp{@@}, @samp{$}, @samp{`}, and control
 599 characters other than NUL (all bits zero).  (Note that @samp{$} is
 600 normally considered a letter.)  All characters with the high bit set
 601 (numeric range 0x7F--0xFF) are also ``other'' in the present
 602 implementation.  This will change when proper support for international
 603 character sets is added to GCC@.
 604
 605 NUL is a special case because of the high probability that its
 606 appearance is accidental, and because it may be invisible to the user
 607 (many terminals do not display NUL at all).  Within comments, NULs are
 608 silently ignored, just as any other character would be.  In running
 609 text, NUL is considered white space.  For example, these two directives
 610 have the same meaning.
 611
 612 @smallexample
 613 #define X^@@1
 614 #define X 1
 615 @end smallexample
 616
 617 @noindent
 618 (where @samp{^@@} is ASCII NUL)@.  Within string or character constants,
 619 NULs are preserved.  In the latter two cases the preprocessor emits a
 620 warning message.
 621
 622 @node The preprocessing language
 623 @section The preprocessing language
 624 @cindex directives
 625 @cindex preprocessing directives
 626 @cindex directive line
 627 @cindex directive name
 628
 629 After tokenization, the stream of tokens may simply be passed straight
 630 to the compiler's parser.  However, if it contains any operations in the
 631 @dfn{preprocessing language}, it will be transformed first.  This stage
 632 corresponds roughly to the standard's ``translation phase 4'' and is
 633 what most people think of as the preprocessor's job.
 634
 635 The preprocessing language consists of @dfn{directives} to be executed
 636 and @dfn{macros} to be expanded.  Its primary capabilities are:
 637
 638 @itemize @bullet
 639 @item
 640 Inclusion of header files.  These are files of declarations that can be
 641 substituted into your program.
 642
 643 @item
 644 Macro expansion.  You can define @dfn{macros}, which are abbreviations
 645 for arbitrary fragments of C code.  The preprocessor will replace the
 646 macros with their definitions throughout the program.  Some macros are
 647 automatically defined for you.
 648
 649 @item
 650 Conditional compilation.  You can include or exclude parts of the
 651 program according to various conditions.
 652
 653 @item
 654 Line control.  If you use a program to combine or rearrange source files
 655 into an intermediate file which is then compiled, you can use line
 656 control to inform the compiler where each source line originally came
 657 from.
 658
 659 @item
 660 Diagnostics.  You can detect problems at compile time and issue errors
 661 or warnings.
 662 @end itemize
 663
 664 There are a few more, less useful, features.
 665
 666 Except for expansion of predefined macros, all these operations are
 667 triggered with @dfn{preprocessing directives}.  Preprocessing directives
 668 are lines in your program that start with @samp{#}.  Whitespace is
 669 allowed before and after the @samp{#}.  The @samp{#} is followed by an
 670 identifier, the @dfn{directive name}.  It specifies the operation to
 671 perform.  Directives are commonly referred to as @samp{#@var{name}}
 672 where @var{name} is the directive name.  For example, @samp{#define} is
 673 the directive that defines a macro.
 674
 675 The @samp{#} which begins a directive cannot come from a macro
 676 expansion.  Also, the directive name is not macro expanded.  Thus, if
 677 @code{foo} is defined as a macro expanding to @code{define}, that does
 678 not make @samp{#foo} a valid preprocessing directive.
 679
 680 The set of valid directive names is fixed.  Programs cannot define new
 681 preprocessing directives.
 682
 683 Some directives require arguments; these make up the rest of the
 684 directive line and must be separated from the directive name by
 685 whitespace.  For example, @samp{#define} must be followed by a macro
 686 name and the intended expansion of the macro.
 687
 688 A preprocessing directive cannot cover more than one line.  The line
 689 may, however, be continued with backslash-newline, or by a block comment
 690 which extends past the end of the line.  In either case, when the
 691 directive is processed, the continuations have already been merged with
 692 the first line to make one long line.
 693
 694 @node Header Files
 695 @chapter Header Files
 696
 697 @cindex header file
 698 A header file is a file containing C declarations and macro definitions
 699 (@pxref{Macros}) to be shared between several source files.  You request
 700 the use of a header file in your program by @dfn{including} it, with the
 701 C preprocessing directive @samp{#include}.
 702
 703 Header files serve two purposes.
 704
 705 @itemize @bullet
 706 @item
 707 @cindex system header files
 708 System header files declare the interfaces to parts of the operating
 709 system.  You include them in your program to supply the definitions and
 710 declarations you need to invoke system calls and libraries.
 711
 712 @item
 713 Your own header files contain declarations for interfaces between the
 714 source files of your program.  Each time you have a group of related
 715 declarations and macro definitions all or most of which are needed in
 716 several different source files, it is a good idea to create a header
 717 file for them.
 718 @end itemize
 719
 720 Including a header file produces the same results as copying the header
 721 file into each source file that needs it.  Such copying would be
 722 time-consuming and error-prone.  With a header file, the related
 723 declarations appear in only one place.  If they need to be changed, they
 724 can be changed in one place, and programs that include the header file
 725 will automatically use the new version when next recompiled.  The header
 726 file eliminates the labor of finding and changing all the copies as well
 727 as the risk that a failure to find one copy will result in
 728 inconsistencies within a program.
 729
 730 In C, the usual convention is to give header files names that end with
 731 @file{.h}.  It is most portable to use only letters, digits, dashes, and
 732 underscores in header file names, and at most one dot.
 733
 734 @menu
 735 * Include Syntax::
 736 * Include Operation::
 737 * Search Path::
 738 * Once-Only Headers::
 739 * Alternatives to Wrapper #ifndef::
 740 * Computed Includes::
 741 * Wrapper Headers::
 742 * System Headers::
 743 @end menu
 744
 745 @node Include Syntax
 746 @section Include Syntax
 747
 748 @findex #include
 749 Both user and system header files are included using the preprocessing
 750 directive @samp{#include}.  It has two variants:
 751
 752 @table @code
 753 @item #include <@var{file}>
 754 This variant is used for system header files.  It searches for a file
 755 named @var{file} in a standard list of system directories.  You can prepend
 756 directories to this list with the @option{-I} option (@pxref{Invocation}).
 757
 758 @item #include "@var{file}"
 759 This variant is used for header files of your own program.  It
 760 searches for a file named @var{file} first in the directory containing
 761 the current file, then in the quote directories and then the same
 762 directories used for @code{<@var{file}>}.  You can prepend directories
 763 to the list of quote directories with the @option{-iquote} option.
 764 @end table
 765
 766 The argument of @samp{#include}, whether delimited with quote marks or
 767 angle brackets, behaves like a string constant in that comments are not
 768 recognized, and macro names are not expanded.  Thus, @code{@w{#include
 769 <x/*y>}} specifies inclusion of a system header file named @file{x/*y}.
 770
 771 However, if backslashes occur within @var{file}, they are considered
 772 ordinary text characters, not escape characters.  None of the character
 773 escape sequences appropriate to string constants in C are processed.
 774 Thus, @code{@w{#include "x\n\\y"}} specifies a filename containing three
 775 backslashes.  (Some systems interpret @samp{\} as a pathname separator.
 776 All of these also interpret @samp{/} the same way.  It is most portable
 777 to use only @samp{/}.)
 778
 779 It is an error if there is anything (other than comments) on the line
 780 after the file name.
 781
 782 @node Include Operation
 783 @section Include Operation
 784
 785 The @samp{#include} directive works by directing the C preprocessor to
 786 scan the specified file as input before continuing with the rest of the
 787 current file.  The output from the preprocessor contains the output
 788 already generated, followed by the output resulting from the included
 789 file, followed by the output that comes from the text after the
 790 @samp{#include} directive.  For example, if you have a header file
 791 @file{header.h} as follows,
 792
 793 @smallexample
 794 char *test (void);
 795 @end smallexample
 796
 797 @noindent
 798 and a main program called @file{program.c} that uses the header file,
 799 like this,
 800
 801 @smallexample
 802 int x;
 803 #include "header.h"
 804
 805 int
 806 main (void)
 807 @{
 808   puts (test ());
 809 @}
 810 @end smallexample
 811
 812 @noindent
 813 the compiler will see the same token stream as it would if
 814 @file{program.c} read
 815
 816 @smallexample
 817 int x;
 818 char *test (void);
 819
 820 int
 821 main (void)
 822 @{
 823   puts (test ());
 824 @}
 825 @end smallexample
 826
 827 Included files are not limited to declarations and macro definitions;
 828 those are merely the typical uses.  Any fragment of a C program can be
 829 included from another file.  The include file could even contain the
 830 beginning of a statement that is concluded in the containing file, or
 831 the end of a statement that was started in the including file.  However,
 832 an included file must consist of complete tokens.  Comments and string
 833 literals which have not been closed by the end of an included file are
 834 invalid.  For error recovery, they are considered to end at the end of
 835 the file.
 836
 837 To avoid confusion, it is best if header files contain only complete
 838 syntactic units---function declarations or definitions, type
 839 declarations, etc.
 840
 841 The line following the @samp{#include} directive is always treated as a
 842 separate line by the C preprocessor, even if the included file lacks a
 843 final newline.
 844
 845 @node Search Path
 846 @section Search Path
 847
 848 GCC looks in several different places for headers.  On a normal Unix
 849 system, if you do not instruct it otherwise, it will look for headers
 850 requested with @code{@w{#include <@var{file}>}} in:
 851
 852 @smallexample
 853 /usr/local/include
 854 @var{libdir}/gcc/@var{target}/@var{version}/include
 855 /usr/@var{target}/include
 856 /usr/include
 857 @end smallexample
 858
 859 For C++ programs, it will also look in @file{/usr/include/g++-v3},
 860 first.  In the above, @var{target} is the canonical name of the system
 861 GCC was configured to compile code for; often but not always the same as
 862 the canonical name of the system it runs on.  @var{version} is the
 863 version of GCC in use.
 864
 865 You can add to this list with the @option{-I@var{dir}} command line
 866 option.  All the directories named by @option{-I} are searched, in
 867 left-to-right order, @emph{before} the default directories.  The only
 868 exception is when @file{dir} is already searched by default.  In
 869 this case, the option is ignored and the search order for system
 870 directories remains unchanged.
 871
 872 Duplicate directories are removed from the quote and bracket search
 873 chains before the two chains are merged to make the final search chain.
 874 Thus, it is possible for a directory to occur twice in the final search
 875 chain if it was specified in both the quote and bracket chains.
 876
 877 You can prevent GCC from searching any of the default directories with
 878 the @option{-nostdinc} option.  This is useful when you are compiling an
 879 operating system kernel or some other program that does not use the
 880 standard C library facilities, or the standard C library itself.
 881 @option{-I} options are not ignored as described above when
 882 @option{-nostdinc} is in effect.
 883
 884 GCC looks for headers requested with @code{@w{#include "@var{file}"}}
 885 first in the directory containing the current file, then in the
 886 directories as specified by @option{-iquote} options, then in the same
 887 places it would have looked for a header requested with angle
 888 brackets.  For example, if @file{/usr/include/sys/stat.h} contains
 889 @code{@w{#include "types.h"}}, GCC looks for @file{types.h} first in
 890 @file{/usr/include/sys}, then in its usual search path.
 891
 892 @samp{#line} (@pxref{Line Control}) does not change GCC's idea of the
 893 directory containing the current file.
 894
 895 You may put @option{-I-} at any point in your list of @option{-I} options.
 896 This has two effects.  First, directories appearing before the
 897 @option{-I-} in the list are searched only for headers requested with
 898 quote marks.  Directories after @option{-I-} are searched for all
 899 headers.  Second, the directory containing the current file is not
 900 searched for anything, unless it happens to be one of the directories
 901 named by an @option{-I} switch.  @option{-I-} is deprecated, @option{-iquote}
 902 should be used instead.
 903
 904 @option{-I. -I-} is not the same as no @option{-I} options at all, and does
 905 not cause the same behavior for @samp{<>} includes that @samp{""}
 906 includes get with no special options.  @option{-I.} searches the
 907 compiler's current working directory for header files.  That may or may
 908 not be the same as the directory containing the current file.
 909
 910 If you need to look for headers in a directory named @file{-}, write
 911 @option{-I./-}.
 912
 913 There are several more ways to adjust the header search path.  They are
 914 generally less useful.  @xref{Invocation}.
 915
 916 @node Once-Only Headers
 917 @section Once-Only Headers
 918 @cindex repeated inclusion
 919 @cindex including just once
 920 @cindex wrapper @code{#ifndef}
 921
 922 If a header file happens to be included twice, the compiler will process
 923 its contents twice.  This is very likely to cause an error, e.g.@: when the
 924 compiler sees the same structure definition twice.  Even if it does not,
 925 it will certainly waste time.
 926
 927 The standard way to prevent this is to enclose the entire real contents
 928 of the file in a conditional, like this:
 929
 930 @smallexample
 931 @group
 932 /* File foo.  */
 933 #ifndef FILE_FOO_SEEN
 934 #define FILE_FOO_SEEN
 935
 936 @var{the entire file}
 937
 938 #endif /* !FILE_FOO_SEEN */
 939 @end group
 940 @end smallexample
 941
 942 This construct is commonly known as a @dfn{wrapper #ifndef}.
 943 When the header is included again, the conditional will be false,
 944 because @code{FILE_FOO_SEEN} is defined.  The preprocessor will skip
 945 over the entire contents of the file, and the compiler will not see it
 946 twice.
 947
 948 CPP optimizes even further.  It remembers when a header file has a
 949 wrapper @samp{#ifndef}.  If a subsequent @samp{#include} specifies that
 950 header, and the macro in the @samp{#ifndef} is still defined, it does
 951 not bother to rescan the file at all.
 952
 953 You can put comments outside the wrapper.  They will not interfere with
 954 this optimization.
 955
 956 @cindex controlling macro
 957 @cindex guard macro
 958 The macro @code{FILE_FOO_SEEN} is called the @dfn{controlling macro} or
 959 @dfn{guard macro}.  In a user header file, the macro name should not
 960 begin with @samp{_}.  In a system header file, it should begin with
 961 @samp{__} to avoid conflicts with user programs.  In any kind of header
 962 file, the macro name should contain the name of the file and some
 963 additional text, to avoid conflicts with other header files.
 964
 965 @node Alternatives to Wrapper #ifndef
 966 @section Alternatives to Wrapper #ifndef
 967
 968 CPP supports two more ways of indicating that a header file should be
 969 read only once.  Neither one is as portable as a wrapper @samp{#ifndef}
 970 and we recommend you do not use them in new programs, with the caveat
 971 that @samp{#import} is standard practice in Objective-C.
 972
 973 @findex #import
 974 CPP supports a variant of @samp{#include} called @samp{#import} which
 975 includes a file, but does so at most once.  If you use @samp{#import}
 976 instead of @samp{#include}, then you don't need the conditionals
 977 inside the header file to prevent multiple inclusion of the contents.
 978 @samp{#import} is standard in Objective-C, but is considered a
 979 deprecated extension in C and C++.
 980
 981 @samp{#import} is not a well designed feature.  It requires the users of
 982 a header file to know that it should only be included once.  It is much
 983 better for the header file's implementor to write the file so that users
 984 don't need to know this.  Using a wrapper @samp{#ifndef} accomplishes
 985 this goal.
 986
 987 In the present implementation, a single use of @samp{#import} will
 988 prevent the file from ever being read again, by either @samp{#import} or
 989 @samp{#include}.  You should not rely on this; do not use both
 990 @samp{#import} and @samp{#include} to refer to the same header file.
 991
 992 Another way to prevent a header file from being included more than once
 993 is with the @samp{#pragma once} directive.  If @samp{#pragma once} is
 994 seen when scanning a header file, that file will never be read again, no
 995 matter what.
 996
 997 @samp{#pragma once} does not have the problems that @samp{#import} does,
 998 but it is not recognized by all preprocessors, so you cannot rely on it
 999 in a portable program.
1000
1001 @node Computed Includes
1002 @section Computed Includes
1003 @cindex computed includes
1004 @cindex macros in include
1005
1006 Sometimes it is necessary to select one of several different header
1007 files to be included into your program.  They might specify
1008 configuration parameters to be used on different sorts of operating
1009 systems, for instance.  You could do this with a series of conditionals,
1010
1011 @smallexample
1012 #if SYSTEM_1
1013 # include "system_1.h"
1014 #elif SYSTEM_2
1015 # include "system_2.h"
1016 #elif SYSTEM_3
1017 @dots{}
1018 #endif
1019 @end smallexample
1020
1021 That rapidly becomes tedious.  Instead, the preprocessor offers the
1022 ability to use a macro for the header name.  This is called a
1023 @dfn{computed include}.  Instead of writing a header name as the direct
1024 argument of @samp{#include}, you simply put a macro name there instead:
1025
1026 @smallexample
1027 #define SYSTEM_H "system_1.h"
1028 @dots{}
1029 #include SYSTEM_H
1030 @end smallexample
1031
1032 @noindent
1033 @code{SYSTEM_H} will be expanded, and the preprocessor will look for
1034 @file{system_1.h} as if the @samp{#include} had been written that way
1035 originally.  @code{SYSTEM_H} could be defined by your Makefile with a
1036 @option{-D} option.
1037
1038 You must be careful when you define the macro.  @samp{#define} saves
1039 tokens, not text.  The preprocessor has no way of knowing that the macro
1040 will be used as the argument of @samp{#include}, so it generates
1041 ordinary tokens, not a header name.  This is unlikely to cause problems
1042 if you use double-quote includes, which are close enough to string
1043 constants.  If you use angle brackets, however, you may have trouble.
1044
1045 The syntax of a computed include is actually a bit more general than the
1046 above.  If the first non-whitespace character after @samp{#include} is
1047 not @samp{"} or @samp{<}, then the entire line is macro-expanded
1048 like running text would be.
1049
1050 If the line expands to a single string constant, the contents of that
1051 string constant are the file to be included.  CPP does not re-examine the
1052 string for embedded quotes, but neither does it process backslash
1053 escapes in the string.  Therefore
1054
1055 @smallexample
1056 #define HEADER "a\"b"
1057 #include HEADER
1058 @end smallexample
1059
1060 @noindent
1061 looks for a file named @file{a\"b}.  CPP searches for the file according
1062 to the rules for double-quoted includes.
1063
1064 If the line expands to a token stream beginning with a @samp{<} token
1065 and including a @samp{>} token, then the tokens between the @samp{<} and
1066 the first @samp{>} are combined to form the filename to be included.
1067 Any whitespace between tokens is reduced to a single space; then any
1068 space after the initial @samp{<} is retained, but a trailing space
1069 before the closing @samp{>} is ignored.  CPP searches for the file
1070 according to the rules for angle-bracket includes.
1071
1072 In either case, if there are any tokens on the line after the file name,
1073 an error occurs and the directive is not processed.  It is also an error
1074 if the result of expansion does not match either of the two expected
1075 forms.
1076
1077 These rules are implementation-defined behavior according to the C
1078 standard.  To minimize the risk of different compilers interpreting your
1079 computed includes differently, we recommend you use only a single
1080 object-like macro which expands to a string constant.  This will also
1081 minimize confusion for people reading your program.
1082
1083 @node Wrapper Headers
1084 @section Wrapper Headers
1085 @cindex wrapper headers
1086 @cindex overriding a header file
1087 @findex #include_next
1088
1089 Sometimes it is necessary to adjust the contents of a system-provided
1090 header file without editing it directly.  GCC's @command{fixincludes}
1091 operation does this, for example.  One way to do that would be to create
1092 a new header file with the same name and insert it in the search path
1093 before the original header.  That works fine as long as you're willing
1094 to replace the old header entirely.  But what if you want to refer to
1095 the old header from the new one?
1096
1097 You cannot simply include the old header with @samp{#include}.  That
1098 will start from the beginning, and find your new header again.  If your
1099 header is not protected from multiple inclusion (@pxref{Once-Only
1100 Headers}), it will recurse infinitely and cause a fatal error.
1101
1102 You could include the old header with an absolute pathname:
1103 @smallexample
1104 #include "/usr/include/old-header.h"
1105 @end smallexample
1106 @noindent
1107 This works, but is not clean; should the system headers ever move, you
1108 would have to edit the new headers to match.
1109
1110 There is no way to solve this problem within the C standard, but you can
1111 use the GNU extension @samp{#include_next}.  It means, ``Include the
1112 @emph{next} file with this name''.  This directive works like
1113 @samp{#include} except in searching for the specified file: it starts
1114 searching the list of header file directories @emph{after} the directory
1115 in which the current file was found.
1116
1117 Suppose you specify @option{-I /usr/local/include}, and the list of
1118 directories to search also includes @file{/usr/include}; and suppose
1119 both directories contain @file{signal.h}.  Ordinary @code{@w{#include
1120 <signal.h>}} finds the file under @file{/usr/local/include}.  If that
1121 file contains @code{@w{#include_next <signal.h>}}, it starts searching
1122 after that directory, and finds the file in @file{/usr/include}.
1123
1124 @samp{#include_next} does not distinguish between @code{<@var{file}>}
1125 and @code{"@var{file}"} inclusion, nor does it check that the file you
1126 specify has the same name as the current file.  It simply looks for the
1127 file named, starting with the directory in the search path after the one
1128 where the current file was found.
1129
1130 The use of @samp{#include_next} can lead to great confusion.  We
1131 recommend it be used only when there is no other alternative.  In
1132 particular, it should not be used in the headers belonging to a specific
1133 program; it should be used only to make global corrections along the
1134 lines of @command{fixincludes}.
1135
1136 @node System Headers
1137 @section System Headers
1138 @cindex system header files
1139
1140 The header files declaring interfaces to the operating system and
1141 runtime libraries often cannot be written in strictly conforming C@.
1142 Therefore, GCC gives code found in @dfn{system headers} special
1143 treatment.  All warnings, other than those generated by @samp{#warning}
1144 (@pxref{Diagnostics}), are suppressed while GCC is processing a system
1145 header.  Macros defined in a system header are immune to a few warnings
1146 wherever they are expanded.  This immunity is granted on an ad-hoc
1147 basis, when we find that a warning generates lots of false positives
1148 because of code in macros defined in system headers.
1149
1150 Normally, only the headers found in specific directories are considered
1151 system headers.  These directories are determined when GCC is compiled.
1152 There are, however, two ways to make normal headers into system headers.
1153
1154 The @option{-isystem} command line option adds its argument to the list of
1155 directories to search for headers, just like @option{-I}.  Any headers
1156 found in that directory will be considered system headers.
1157
1158 All directories named by @option{-isystem} are searched @emph{after} all
1159 directories named by @option{-I}, no matter what their order was on the
1160 command line.  If the same directory is named by both @option{-I} and
1161 @option{-isystem}, the @option{-I} option is ignored.  GCC provides an
1162 informative message when this occurs if @option{-v} is used.
1163
1164 @findex #pragma GCC system_header
1165 There is also a directive, @code{@w{#pragma GCC system_header}}, which
1166 tells GCC to consider the rest of the current include file a system
1167 header, no matter where it was found.  Code that comes before the
1168 @samp{#pragma} in the file will not be affected.  @code{@w{#pragma GCC
1169 system_header}} has no effect in the primary source file.
1170
1171 On very old systems, some of the pre-defined system header directories
1172 get even more special treatment.  GNU C++ considers code in headers
1173 found in those directories to be surrounded by an @code{@w{extern "C"}}
1174 block.  There is no way to request this behavior with a @samp{#pragma},
1175 or from the command line.
1176
1177 @node Macros
1178 @chapter Macros
1179
1180 A @dfn{macro} is a fragment of code which has been given a name.
1181 Whenever the name is used, it is replaced by the contents of the macro.
1182 There are two kinds of macros.  They differ mostly in what they look
1183 like when they are used.  @dfn{Object-like} macros resemble data objects
1184 when used, @dfn{function-like} macros resemble function calls.
1185
1186 You may define any valid identifier as a macro, even if it is a C
1187 keyword.  The preprocessor does not know anything about keywords.  This
1188 can be useful if you wish to hide a keyword such as @code{const} from an
1189 older compiler that does not understand it.  However, the preprocessor
1190 operator @code{defined} (@pxref{Defined}) can never be defined as a
1191 macro, and C++'s named operators (@pxref{C++ Named Operators}) cannot be
1192 macros when you are compiling C++.
1193
1194 @menu
1195 * Object-like Macros::
1196 * Function-like Macros::
1197 * Macro Arguments::
1198 * Stringification::
1199 * Concatenation::
1200 * Variadic Macros::
1201 * Predefined Macros::
1202 * Undefining and Redefining Macros::
1203 * Directives Within Macro Arguments::
1204 * Macro Pitfalls::
1205 @end menu
1206
1207 @node Object-like Macros
1208 @section Object-like Macros
1209 @cindex object-like macro
1210 @cindex symbolic constants
1211 @cindex manifest constants
1212
1213 An @dfn{object-like macro} is a simple identifier which will be replaced
1214 by a code fragment.  It is called object-like because it looks like a
1215 data object in code that uses it.  They are most commonly used to give
1216 symbolic names to numeric constants.
1217
1218 @findex #define
1219 You create macros with the @samp{#define} directive.  @samp{#define} is
1220 followed by the name of the macro and then the token sequence it should
1221 be an abbreviation for, which is variously referred to as the macro's
1222 @dfn{body}, @dfn{expansion} or @dfn{replacement list}.  For example,
1223
1224 @smallexample
1225 #define BUFFER_SIZE 1024
1226 @end smallexample
1227
1228 @noindent
1229 defines a macro named @code{BUFFER_SIZE} as an abbreviation for the
1230 token @code{1024}.  If somewhere after this @samp{#define} directive
1231 there comes a C statement of the form
1232
1233 @smallexample
1234 foo = (char *) malloc (BUFFER_SIZE);
1235 @end smallexample
1236
1237 @noindent
1238 then the C preprocessor will recognize and @dfn{expand} the macro
1239 @code{BUFFER_SIZE}.  The C compiler will see the same tokens as it would
1240 if you had written
1241
1242 @smallexample
1243 foo = (char *) malloc (1024);
1244 @end smallexample
1245
1246 By convention, macro names are written in uppercase.  Programs are
1247 easier to read when it is possible to tell at a glance which names are
1248 macros.
1249
1250 The macro's body ends at the end of the @samp{#define} line.  You may
1251 continue the definition onto multiple lines, if necessary, using
1252 backslash-newline.  When the macro is expanded, however, it will all
1253 come out on one line.  For example,
1254
1255 @smallexample
1256 #define NUMBERS 1, \
1257                 2, \
1258                 3
1259 int x[] = @{ NUMBERS @};
1260      @expansion{} int x[] = @{ 1, 2, 3 @};
1261 @end smallexample
1262
1263 @noindent
1264 The most common visible consequence of this is surprising line numbers
1265 in error messages.
1266
1267 There is no restriction on what can go in a macro body provided it
1268 decomposes into valid preprocessing tokens.  Parentheses need not
1269 balance, and the body need not resemble valid C code.  (If it does not,
1270 you may get error messages from the C compiler when you use the macro.)
1271
1272 The C preprocessor scans your program sequentially.  Macro definitions
1273 take effect at the place you write them.  Therefore, the following input
1274 to the C preprocessor
1275
1276 @smallexample
1277 foo = X;
1278 #define X 4
1279 bar = X;
1280 @end smallexample
1281
1282 @noindent
1283 produces
1284
1285 @smallexample
1286 foo = X;
1287 bar = 4;
1288 @end smallexample
1289
1290 When the preprocessor expands a macro name, the macro's expansion
1291 replaces the macro invocation, then the expansion is examined for more
1292 macros to expand.  For example,
1293
1294 @smallexample
1295 @group
1296 #define TABLESIZE BUFSIZE
1297 #define BUFSIZE 1024
1298 TABLESIZE
1299      @expansion{} BUFSIZE
1300      @expansion{} 1024
1301 @end group
1302 @end smallexample
1303
1304 @noindent
1305 @code{TABLESIZE} is expanded first to produce @code{BUFSIZE}, then that
1306 macro is expanded to produce the final result, @code{1024}.
1307
1308 Notice that @code{BUFSIZE} was not defined when @code{TABLESIZE} was
1309 defined.  The @samp{#define} for @code{TABLESIZE} uses exactly the
1310 expansion you specify---in this case, @code{BUFSIZE}---and does not
1311 check to see whether it too contains macro names.  Only when you
1312 @emph{use} @code{TABLESIZE} is the result of its expansion scanned for
1313 more macro names.
1314
1315 This makes a difference if you change the definition of @code{BUFSIZE}
1316 at some point in the source file.  @code{TABLESIZE}, defined as shown,
1317 will always expand using the definition of @code{BUFSIZE} that is
1318 currently in effect:
1319
1320 @smallexample
1321 #define BUFSIZE 1020
1322 #define TABLESIZE BUFSIZE
1323 #undef BUFSIZE
1324 #define BUFSIZE 37
1325 @end smallexample
1326
1327 @noindent
1328 Now @code{TABLESIZE} expands (in two stages) to @code{37}.
1329
1330 If the expansion of a macro contains its own name, either directly or
1331 via intermediate macros, it is not expanded again when the expansion is
1332 examined for more macros.  This prevents infinite recursion.
1333 @xref{Self-Referential Macros}, for the precise details.
1334
1335 @node Function-like Macros
1336 @section Function-like Macros
1337 @cindex function-like macros
1338
1339 You can also define macros whose use looks like a function call.  These
1340 are called @dfn{function-like macros}.  To define a function-like macro,
1341 you use the same @samp{#define} directive, but you put a pair of
1342 parentheses immediately after the macro name.  For example,
1343
1344 @smallexample
1345 #define lang_init()  c_init()
1346 lang_init()
1347      @expansion{} c_init()
1348 @end smallexample
1349
1350 A function-like macro is only expanded if its name appears with a pair
1351 of parentheses after it.  If you write just the name, it is left alone.
1352 This can be useful when you have a function and a macro of the same
1353 name, and you wish to use the function sometimes.
1354
1355 @smallexample
1356 extern void foo(void);
1357 #define foo() /* @r{optimized inline version} */
1358 @dots{}
1359   foo();
1360   funcptr = foo;
1361 @end smallexample
1362
1363 Here the call to @code{foo()} will use the macro, but the function
1364 pointer will get the address of the real function.  If the macro were to
1365 be expanded, it would cause a syntax error.
1366
1367 If you put spaces between the macro name and the parentheses in the
1368 macro definition, that does not define a function-like macro, it defines
1369 an object-like macro whose expansion happens to begin with a pair of
1370 parentheses.
1371
1372 @smallexample
1373 #define lang_init ()    c_init()
1374 lang_init()
1375      @expansion{} () c_init()()
1376 @end smallexample
1377
1378 The first two pairs of parentheses in this expansion come from the
1379 macro.  The third is the pair that was originally after the macro
1380 invocation.  Since @code{lang_init} is an object-like macro, it does not
1381 consume those parentheses.
1382
1383 @node Macro Arguments
1384 @section Macro Arguments
1385 @cindex arguments
1386 @cindex macros with arguments
1387 @cindex arguments in macro definitions
1388
1389 Function-like macros can take @dfn{arguments}, just like true functions.
1390 To define a macro that uses arguments, you insert @dfn{parameters}
1391 between the pair of parentheses in the macro definition that make the
1392 macro function-like.  The parameters must be valid C identifiers,
1393 separated by commas and optionally whitespace.
1394
1395 To invoke a macro that takes arguments, you write the name of the macro
1396 followed by a list of @dfn{actual arguments} in parentheses, separated
1397 by commas.  The invocation of the macro need not be restricted to a
1398 single logical line---it can cross as many lines in the source file as
1399 you wish.  The number of arguments you give must match the number of
1400 parameters in the macro definition.  When the macro is expanded, each
1401 use of a parameter in its body is replaced by the tokens of the
1402 corresponding argument.  (You need not use all of the parameters in the
1403 macro body.)
1404
1405 As an example, here is a macro that computes the minimum of two numeric
1406 values, as it is defined in many C programs, and some uses.
1407
1408 @smallexample
1409 #define min(X, Y)  ((X) < (Y) ? (X) : (Y))
1410   x = min(a, b);          @expansion{}  x = ((a) < (b) ? (a) : (b));
1411   y = min(1, 2);          @expansion{}  y = ((1) < (2) ? (1) : (2));
1412   z = min(a + 28, *p);    @expansion{}  z = ((a + 28) < (*p) ? (a + 28) : (*p));
1413 @end smallexample
1414
1415 @noindent
1416 (In this small example you can already see several of the dangers of
1417 macro arguments.  @xref{Macro Pitfalls}, for detailed explanations.)
1418
1419 Leading and trailing whitespace in each argument is dropped, and all
1420 whitespace between the tokens of an argument is reduced to a single
1421 space.  Parentheses within each argument must balance; a comma within
1422 such parentheses does not end the argument.  However, there is no
1423 requirement for square brackets or braces to balance, and they do not
1424 prevent a comma from separating arguments.  Thus,
1425
1426 @smallexample
1427 macro (array[x = y, x + 1])
1428 @end smallexample
1429
1430 @noindent
1431 passes two arguments to @code{macro}: @code{array[x = y} and @code{x +
1432 1]}.  If you want to supply @code{array[x = y, x + 1]} as an argument,
1433 you can write it as @code{array[(x = y, x + 1)]}, which is equivalent C
1434 code.
1435
1436 All arguments to a macro are completely macro-expanded before they are
1437 substituted into the macro body.  After substitution, the complete text
1438 is scanned again for macros to expand, including the arguments.  This rule
1439 may seem strange, but it is carefully designed so you need not worry
1440 about whether any function call is actually a macro invocation.  You can
1441 run into trouble if you try to be too clever, though.  @xref{Argument
1442 Prescan}, for detailed discussion.
1443
1444 For example, @code{min (min (a, b), c)} is first expanded to
1445
1446 @smallexample
1447   min (((a) < (b) ? (a) : (b)), (c))
1448 @end smallexample
1449
1450 @noindent
1451 and then to
1452
1453 @smallexample
1454 @group
1455 ((((a) < (b) ? (a) : (b))) < (c)
1456  ? (((a) < (b) ? (a) : (b)))
1457  : (c))
1458 @end group
1459 @end smallexample
1460
1461 @noindent
1462 (Line breaks shown here for clarity would not actually be generated.)
1463
1464 @cindex empty macro arguments
1465 You can leave macro arguments empty; this is not an error to the
1466 preprocessor (but many macros will then expand to invalid code).
1467 You cannot leave out arguments entirely; if a macro takes two arguments,
1468 there must be exactly one comma at the top level of its argument list.
1469 Here are some silly examples using @code{min}:
1470
1471 @smallexample
1472 min(, b)        @expansion{} ((   ) < (b) ? (   ) : (b))
1473 min(a, )        @expansion{} ((a  ) < ( ) ? (a  ) : ( ))
1474 min(,)          @expansion{} ((   ) < ( ) ? (   ) : ( ))
1475 min((,),)       @expansion{} (((,)) < ( ) ? ((,)) : ( ))
1476
1477 min()      @error{} macro "min" requires 2 arguments, but only 1 given
1478 min(,,)    @error{} macro "min" passed 3 arguments, but takes just 2
1479 @end smallexample
1480
1481 Whitespace is not a preprocessing token, so if a macro @code{foo} takes
1482 one argument, @code{@w{foo ()}} and @code{@w{foo ( )}} both supply it an
1483 empty argument.  Previous GNU preprocessor implementations and
1484 documentation were incorrect on this point, insisting that a
1485 function-like macro that takes a single argument be passed a space if an
1486 empty argument was required.
1487
1488 Macro parameters appearing inside string literals are not replaced by
1489 their corresponding actual arguments.
1490
1491 @smallexample
1492 #define foo(x) x, "x"
1493 foo(bar)        @expansion{} bar, "x"
1494 @end smallexample
1495
1496 @node Stringification
1497 @section Stringification
1498 @cindex stringification
1499 @cindex @samp{#} operator
1500
1501 Sometimes you may want to convert a macro argument into a string
1502 constant.  Parameters are not replaced inside string constants, but you
1503 can use the @samp{#} preprocessing operator instead.  When a macro
1504 parameter is used with a leading @samp{#}, the preprocessor replaces it
1505 with the literal text of the actual argument, converted to a string
1506 constant.  Unlike normal parameter replacement, the argument is not
1507 macro-expanded first.  This is called @dfn{stringification}.
1508
1509 There is no way to combine an argument with surrounding text and
1510 stringify it all together.  Instead, you can write a series of adjacent
1511 string constants and stringified arguments.  The preprocessor will
1512 replace the stringified arguments with string constants.  The C
1513 compiler will then combine all the adjacent string constants into one
1514 long string.
1515
1516 Here is an example of a macro definition that uses stringification:
1517
1518 @smallexample
1519 @group
1520 #define WARN_IF(EXP) \
1521 do @{ if (EXP) \
1522         fprintf (stderr, "Warning: " #EXP "\n"); @} \
1523 while (0)
1524 WARN_IF (x == 0);
1525      @expansion{} do @{ if (x == 0)
1526            fprintf (stderr, "Warning: " "x == 0" "\n"); @} while (0);
1527 @end group
1528 @end smallexample
1529
1530 @noindent
1531 The argument for @code{EXP} is substituted once, as-is, into the
1532 @code{if} statement, and once, stringified, into the argument to
1533 @code{fprintf}.  If @code{x} were a macro, it would be expanded in the
1534 @code{if} statement, but not in the string.
1535
1536 The @code{do} and @code{while (0)} are a kludge to make it possible to
1537 write @code{WARN_IF (@var{arg});}, which the resemblance of
1538 @code{WARN_IF} to a function would make C programmers want to do; see
1539 @ref{Swallowing the Semicolon}.
1540
1541 Stringification in C involves more than putting double-quote characters
1542 around the fragment.  The preprocessor backslash-escapes the quotes
1543 surrounding embedded string constants, and all backslashes within string and
1544 character constants, in order to get a valid C string constant with the
1545 proper contents.  Thus, stringifying @code{@w{p = "foo\n";}} results in
1546 @t{@w{"p = \"foo\\n\";"}}.  However, backslashes that are not inside string
1547 or character constants are not duplicated: @samp{\n} by itself
1548 stringifies to @t{"\n"}.
1549
1550 All leading and trailing whitespace in text being stringified is
1551 ignored.  Any sequence of whitespace in the middle of the text is
1552 converted to a single space in the stringified result.  Comments are
1553 replaced by whitespace long before stringification happens, so they
1554 never appear in stringified text.
1555
1556 There is no way to convert a macro argument into a character constant.
1557
1558 If you want to stringify the result of expansion of a macro argument,
1559 you have to use two levels of macros.
1560
1561 @smallexample
1562 #define xstr(s) str(s)
1563 #define str(s) #s
1564 #define foo 4
1565 str (foo)
1566      @expansion{} "foo"
1567 xstr (foo)
1568      @expansion{} xstr (4)
1569      @expansion{} str (4)
1570      @expansion{} "4"
1571 @end smallexample
1572
1573 @code{s} is stringified when it is used in @code{str}, so it is not
1574 macro-expanded first.  But @code{s} is an ordinary argument to
1575 @code{xstr}, so it is completely macro-expanded before @code{xstr}
1576 itself is expanded (@pxref{Argument Prescan}).  Therefore, by the time
1577 @code{str} gets to its argument, it has already been macro-expanded.
1578
1579 @node Concatenation
1580 @section Concatenation
1581 @cindex concatenation
1582 @cindex token pasting
1583 @cindex token concatenation
1584 @cindex @samp{##} operator
1585
1586 It is often useful to merge two tokens into one while expanding macros.
1587 This is called @dfn{token pasting} or @dfn{token concatenation}.  The
1588 @samp{##} preprocessing operator performs token pasting.  When a macro
1589 is expanded, the two tokens on either side of each @samp{##} operator
1590 are combined into a single token, which then replaces the @samp{##} and
1591 the two original tokens in the macro expansion.  Usually both will be
1592 identifiers, or one will be an identifier and the other a preprocessing
1593 number.  When pasted, they make a longer identifier.  This isn't the
1594 only valid case.  It is also possible to concatenate two numbers (or a
1595 number and a name, such as @code{1.5} and @code{e3}) into a number.
1596 Also, multi-character operators such as @code{+=} can be formed by
1597 token pasting.
1598
1599 However, two tokens that don't together form a valid token cannot be
1600 pasted together.  For example, you cannot concatenate @code{x} with
1601 @code{+} in either order.  If you try, the preprocessor issues a warning
1602 and emits the two tokens.  Whether it puts white space between the
1603 tokens is undefined.  It is common to find unnecessary uses of @samp{##}
1604 in complex macros.  If you get this warning, it is likely that you can
1605 simply remove the @samp{##}.
1606
1607 Both the tokens combined by @samp{##} could come from the macro body,
1608 but you could just as well write them as one token in the first place.
1609 Token pasting is most useful when one or both of the tokens comes from a
1610 macro argument.  If either of the tokens next to an @samp{##} is a
1611 parameter name, it is replaced by its actual argument before @samp{##}
1612 executes.  As with stringification, the actual argument is not
1613 macro-expanded first.  If the argument is empty, that @samp{##} has no
1614 effect.
1615
1616 Keep in mind that the C preprocessor converts comments to whitespace
1617 before macros are even considered.  Therefore, you cannot create a
1618 comment by concatenating @samp{/} and @samp{*}.  You can put as much
1619 whitespace between @samp{##} and its operands as you like, including
1620 comments, and you can put comments in arguments that will be
1621 concatenated.  However, it is an error if @samp{##} appears at either
1622 end of a macro body.
1623
1624 Consider a C program that interprets named commands.  There probably
1625 needs to be a table of commands, perhaps an array of structures declared
1626 as follows:
1627
1628 @smallexample
1629 @group
1630 struct command
1631 @{
1632   char *name;
1633   void (*function) (void);
1634 @};
1635 @end group
1636
1637 @group
1638 struct command commands[] =
1639 @{
1640   @{ "quit", quit_command @},
1641   @{ "help", help_command @},
1642   @dots{}
1643 @};
1644 @end group
1645 @end smallexample
1646
1647 It would be cleaner not to have to give each command name twice, once in
1648 the string constant and once in the function name.  A macro which takes the
1649 name of a command as an argument can make this unnecessary.  The string
1650 constant can be created with stringification, and the function name by
1651 concatenating the argument with @samp{_command}.  Here is how it is done:
1652
1653 @smallexample
1654 #define COMMAND(NAME)  @{ #NAME, NAME ## _command @}
1655
1656 struct command commands[] =
1657 @{
1658   COMMAND (quit),
1659   COMMAND (help),
1660   @dots{}
1661 @};
1662 @end smallexample
1663
1664 @node Variadic Macros
1665 @section Variadic Macros
1666 @cindex variable number of arguments
1667 @cindex macros with variable arguments
1668 @cindex variadic macros
1669
1670 A macro can be declared to accept a variable number of arguments much as
1671 a function can.  The syntax for defining the macro is similar to that of
1672 a function.  Here is an example:
1673
1674 @smallexample
1675 #define eprintf(@dots{}) fprintf (stderr, __VA_ARGS__)
1676 @end smallexample
1677
1678 This kind of macro is called @dfn{variadic}.  When the macro is invoked,
1679 all the tokens in its argument list after the last named argument (this
1680 macro has none), including any commas, become the @dfn{variable
1681 argument}.  This sequence of tokens replaces the identifier
1682 @code{@w{__VA_ARGS__}} in the macro body wherever it appears.  Thus, we
1683 have this expansion:
1684
1685 @smallexample
1686 eprintf ("%s:%d: ", input_file, lineno)
1687      @expansion{}  fprintf (stderr, "%s:%d: ", input_file, lineno)
1688 @end smallexample
1689
1690 The variable argument is completely macro-expanded before it is inserted
1691 into the macro expansion, just like an ordinary argument.  You may use
1692 the @samp{#} and @samp{##} operators to stringify the variable argument
1693 or to paste its leading or trailing token with another token.  (But see
1694 below for an important special case for @samp{##}.)
1695
1696 If your macro is complicated, you may want a more descriptive name for
1697 the variable argument than @code{@w{__VA_ARGS__}}.  CPP permits
1698 this, as an extension.  You may write an argument name immediately
1699 before the @samp{@dots{}}; that name is used for the variable argument.
1700 The @code{eprintf} macro above could be written
1701
1702 @smallexample
1703 #define eprintf(args@dots{}) fprintf (stderr, args)
1704 @end smallexample
1705
1706 @noindent
1707 using this extension.  You cannot use @code{@w{__VA_ARGS__}} and this
1708 extension in the same macro.
1709
1710 You can have named arguments as well as variable arguments in a variadic
1711 macro.  We could define @code{eprintf} like this, instead:
1712
1713 @smallexample
1714 #define eprintf(format, @dots{}) fprintf (stderr, format, __VA_ARGS__)
1715 @end smallexample
1716
1717 @noindent
1718 This formulation looks more descriptive, but unfortunately it is less
1719 flexible: you must now supply at least one argument after the format
1720 string.  In standard C, you cannot omit the comma separating the named
1721 argument from the variable arguments.  Furthermore, if you leave the
1722 variable argument empty, you will get a syntax error, because
1723 there will be an extra comma after the format string.
1724
1725 @smallexample
1726 eprintf("success!\n", );
1727      @expansion{} fprintf(stderr, "success!\n", );
1728 @end smallexample
1729
1730 GNU CPP has a pair of extensions which deal with this problem.  First,
1731 you are allowed to leave the variable argument out entirely:
1732
1733 @smallexample
1734 eprintf ("success!\n")
1735      @expansion{} fprintf(stderr, "success!\n", );
1736 @end smallexample
1737
1738 @noindent
1739 Second, the @samp{##} token paste operator has a special meaning when
1740 placed between a comma and a variable argument.  If you write
1741
1742 @smallexample
1743 #define eprintf(format, @dots{}) fprintf (stderr, format, ##__VA_ARGS__)
1744 @end smallexample
1745
1746 @noindent
1747 and the variable argument is left out when the @code{eprintf} macro is
1748 used, then the comma before the @samp{##} will be deleted.  This does
1749 @emph{not} happen if you pass an empty argument, nor does it happen if
1750 the token preceding @samp{##} is anything other than a comma.
1751
1752 @smallexample
1753 eprintf ("success!\n")
1754      @expansion{} fprintf(stderr, "success!\n");
1755 @end smallexample
1756
1757 @noindent
1758 The above explanation is ambiguous about the case where the only macro
1759 parameter is a variable arguments parameter, as it is meaningless to
1760 try to distinguish whether no argument at all is an empty argument or
1761 a missing argument.  In this case the C99 standard is clear that the
1762 comma must remain, however the existing GCC extension used to swallow
1763 the comma.  So CPP retains the comma when conforming to a specific C
1764 standard, and drops it otherwise.
1765
1766 C99 mandates that the only place the identifier @code{@w{__VA_ARGS__}}
1767 can appear is in the replacement list of a variadic macro.  It may not
1768 be used as a macro name, macro argument name, or within a different type
1769 of macro.  It may also be forbidden in open text; the standard is
1770 ambiguous.  We recommend you avoid using it except for its defined
1771 purpose.
1772
1773 Variadic macros are a new feature in C99.  GNU CPP has supported them
1774 for a long time, but only with a named variable argument
1775 (@samp{args@dots{}}, not @samp{@dots{}} and @code{@w{__VA_ARGS__}}).  If you are
1776 concerned with portability to previous versions of GCC, you should use
1777 only named variable arguments.  On the other hand, if you are concerned
1778 with portability to other conforming implementations of C99, you should
1779 use only @code{@w{__VA_ARGS__}}.
1780
1781 Previous versions of CPP implemented the comma-deletion extension
1782 much more generally.  We have restricted it in this release to minimize
1783 the differences from C99.  To get the same effect with both this and
1784 previous versions of GCC, the token preceding the special @samp{##} must
1785 be a comma, and there must be white space between that comma and
1786 whatever comes immediately before it:
1787
1788 @smallexample
1789 #define eprintf(format, args@dots{}) fprintf (stderr, format , ##args)
1790 @end smallexample
1791
1792 @noindent
1793 @xref{Differences from previous versions}, for the gory details.
1794
1795 @node Predefined Macros
1796 @section Predefined Macros
1797
1798 @cindex predefined macros
1799 Several object-like macros are predefined; you use them without
1800 supplying their definitions.  They fall into three classes: standard,
1801 common, and system-specific.
1802
1803 In C++, there is a fourth category, the named operators.  They act like
1804 predefined macros, but you cannot undefine them.
1805
1806 @menu
1807 * Standard Predefined Macros::
1808 * Common Predefined Macros::
1809 * System-specific Predefined Macros::
1810 * C++ Named Operators::
1811 @end menu
1812
1813 @node Standard Predefined Macros
1814 @subsection Standard Predefined Macros
1815 @cindex standard predefined macros.
1816
1817 The standard predefined macros are specified by the relevant
1818 language standards, so they are available with all compilers that
1819 implement those standards.  Older compilers may not provide all of
1820 them.  Their names all start with double underscores.
1821
1822 @table @code
1823 @item __FILE__
1824 This macro expands to the name of the current input file, in the form of
1825 a C string constant.  This is the path by which the preprocessor opened
1826 the file, not the short name specified in @samp{#include} or as the
1827 input file name argument.  For example,
1828 @code{"/usr/local/include/myheader.h"} is a possible expansion of this
1829 macro.
1830
1831 @item __LINE__
1832 This macro expands to the current input line number, in the form of a
1833 decimal integer constant.  While we call it a predefined macro, it's
1834 a pretty strange macro, since its ``definition'' changes with each
1835 new line of source code.
1836 @end table
1837
1838 @code{__FILE__} and @code{__LINE__} are useful in generating an error
1839 message to report an inconsistency detected by the program; the message
1840 can state the source line at which the inconsistency was detected.  For
1841 example,
1842
1843 @smallexample
1844 fprintf (stderr, "Internal error: "
1845                  "negative string length "
1846                  "%d at %s, line %d.",
1847          length, __FILE__, __LINE__);
1848 @end smallexample
1849
1850 An @samp{#include} directive changes the expansions of @code{__FILE__}
1851 and @code{__LINE__} to correspond to the included file.  At the end of
1852 that file, when processing resumes on the input file that contained
1853 the @samp{#include} directive, the expansions of @code{__FILE__} and
1854 @code{__LINE__} revert to the values they had before the
1855 @samp{#include} (but @code{__LINE__} is then incremented by one as
1856 processing moves to the line after the @samp{#include}).
1857
1858 A @samp{#line} directive changes @code{__LINE__}, and may change
1859 @code{__FILE__} as well.  @xref{Line Control}.
1860
1861 C99 introduces @code{__func__}, and GCC has provided @code{__FUNCTION__}
1862 for a long time.  Both of these are strings containing the name of the
1863 current function (there are slight semantic differences; see the GCC
1864 manual).  Neither of them is a macro; the preprocessor does not know the
1865 name of the current function.  They tend to be useful in conjunction
1866 with @code{__FILE__} and @code{__LINE__}, though.
1867
1868 @table @code
1869
1870 @item __DATE__
1871 This macro expands to a string constant that describes the date on which
1872 the preprocessor is being run.  The string constant contains eleven
1873 characters and looks like @code{@w{"Feb 12 1996"}}.  If the day of the
1874 month is less than 10, it is padded with a space on the left.
1875
1876 If GCC cannot determine the current date, it will emit a warning message
1877 (once per compilation) and @code{__DATE__} will expand to
1878 @code{@w{"??? ?? ????"}}.
1879
1880 @item __TIME__
1881 This macro expands to a string constant that describes the time at
1882 which the preprocessor is being run.  The string constant contains
1883 eight characters and looks like @code{"23:59:01"}.
1884
1885 If GCC cannot determine the current time, it will emit a warning message
1886 (once per compilation) and @code{__TIME__} will expand to
1887 @code{"??:??:??"}.
1888
1889 @item __STDC__
1890 In normal operation, this macro expands to the constant 1, to signify
1891 that this compiler conforms to ISO Standard C@.  If GNU CPP is used with
1892 a compiler other than GCC, this is not necessarily true; however, the
1893 preprocessor always conforms to the standard unless the
1894 @option{-traditional-cpp} option is used.
1895
1896 This macro is not defined if the @option{-traditional-cpp} option is used.
1897
1898 On some hosts, the system compiler uses a different convention, where
1899 @code{__STDC__} is normally 0, but is 1 if the user specifies strict
1900 conformance to the C Standard.  CPP follows the host convention when
1901 processing system header files, but when processing user files
1902 @code{__STDC__} is always 1.  This has been reported to cause problems;
1903 for instance, some versions of Solaris provide X Windows headers that
1904 expect @code{__STDC__} to be either undefined or 1.  @xref{Invocation}.
1905
1906 @item __STDC_VERSION__
1907 This macro expands to the C Standard's version number, a long integer
1908 constant of the form @code{@var{yyyy}@var{mm}L} where @var{yyyy} and
1909 @var{mm} are the year and month of the Standard version.  This signifies
1910 which version of the C Standard the compiler conforms to.  Like
1911 @code{__STDC__}, this is not necessarily accurate for the entire
1912 implementation, unless GNU CPP is being used with GCC@.
1913
1914 The value @code{199409L} signifies the 1989 C standard as amended in
1915 1994, which is the current default; the value @code{199901L} signifies
1916 the 1999 revision of the C standard.  Support for the 1999 revision is
1917 not yet complete.
1918
1919 This macro is not defined if the @option{-traditional-cpp} option is
1920 used, nor when compiling C++ or Objective-C@.
1921
1922 @item __STDC_HOSTED__
1923 This macro is defined, with value 1, if the compiler's target is a
1924 @dfn{hosted environment}.  A hosted environment has the complete
1925 facilities of the standard C library available.
1926
1927 @item __cplusplus
1928 This macro is defined when the C++ compiler is in use.  You can use
1929 @code{__cplusplus} to test whether a header is compiled by a C compiler
1930 or a C++ compiler.  This macro is similar to @code{__STDC_VERSION__}, in
1931 that it expands to a version number.  A fully conforming implementation
1932 of the 1998 C++ standard will define this macro to @code{199711L}.  The
1933 GNU C++ compiler is not yet fully conforming, so it uses @code{1}
1934 instead.  It is hoped to complete the implementation of standard C++
1935 in the near future.
1936
1937 @item __OBJC__
1938 This macro is defined, with value 1, when the Objective-C compiler is in
1939 use.  You can use @code{__OBJC__} to test whether a header is compiled
1940 by a C compiler or an Objective-C compiler.
1941
1942 @item __ASSEMBLER__
1943 This macro is defined with value 1 when preprocessing assembly
1944 language.
1945
1946 @end table
1947
1948 @node Common Predefined Macros
1949 @subsection Common Predefined Macros
1950 @cindex common predefined macros
1951
1952 The common predefined macros are GNU C extensions.  They are available
1953 with the same meanings regardless of the machine or operating system on
1954 which you are using GNU C or GNU Fortran.  Their names all start with
1955 double underscores.
1956
1957 @table @code
1958
1959 @item __COUNTER__
1960 This macro expands to sequential integral values starting from 0.  In
1961 conjunction with the @code{##} operator, this provides a convenient means to
1962 generate unique identifiers.  Care must be taken to ensure that
1963 @code{__COUNTER__} is not expanded prior to inclusion of precompiled headers
1964 which use it.  Otherwise, the precompiled headers will not be used.
1965
1966 @item __GFORTRAN__
1967 The GNU Fortran compiler defines this.
1968
1969 @item __GNUC__
1970 @itemx __GNUC_MINOR__
1971 @itemx __GNUC_PATCHLEVEL__
1972 These macros are defined by all GNU compilers that use the C
1973 preprocessor: C, C++, Objective-C and Fortran.  Their values are the major
1974 version, minor version, and patch level of the compiler, as integer
1975 constants.  For example, GCC 3.2.1 will define @code{__GNUC__} to 3,
1976 @code{__GNUC_MINOR__} to 2, and @code{__GNUC_PATCHLEVEL__} to 1.  These
1977 macros are also defined if you invoke the preprocessor directly.
1978
1979 @code{__GNUC_PATCHLEVEL__} is new to GCC 3.0; it is also present in the
1980 widely-used development snapshots leading up to 3.0 (which identify
1981 themselves as GCC 2.96 or 2.97, depending on which snapshot you have).
1982
1983 If all you need to know is whether or not your program is being compiled
1984 by GCC, or a non-GCC compiler that claims to accept the GNU C dialects,
1985 you can simply test @code{__GNUC__}.  If you need to write code
1986 which depends on a specific version, you must be more careful.  Each
1987 time the minor version is increased, the patch level is reset to zero;
1988 each time the major version is increased (which happens rarely), the
1989 minor version and patch level are reset.  If you wish to use the
1990 predefined macros directly in the conditional, you will need to write it
1991 like this:
1992
1993 @smallexample
1994 /* @r{Test for GCC > 3.2.0} */
1995 #if __GNUC__ > 3 || \
1996     (__GNUC__ == 3 && (__GNUC_MINOR__ > 2 || \
1997                        (__GNUC_MINOR__ == 2 && \
1998                         __GNUC_PATCHLEVEL__ > 0))
1999 @end smallexample
2000
2001 @noindent
2002 Another approach is to use the predefined macros to
2003 calculate a single number, then compare that against a threshold:
2004
2005 @smallexample
2006 #define GCC_VERSION (__GNUC__ * 10000 \
2007                      + __GNUC_MINOR__ * 100 \
2008                      + __GNUC_PATCHLEVEL__)
2009 @dots{}
2010 /* @r{Test for GCC > 3.2.0} */
2011 #if GCC_VERSION > 30200
2012 @end smallexample
2013
2014 @noindent
2015 Many people find this form easier to understand.
2016
2017 @item __GNUG__
2018 The GNU C++ compiler defines this.  Testing it is equivalent to
2019 testing @code{@w{(__GNUC__ && __cplusplus)}}.
2020
2021 @item __STRICT_ANSI__
2022 GCC defines this macro if and only if the @option{-ansi} switch, or a
2023 @option{-std} switch specifying strict conformance to some version of ISO C,
2024 was specified when GCC was invoked.  It is defined to @samp{1}.
2025 This macro exists primarily to direct GNU libc's header files to
2026 restrict their definitions to the minimal set found in the 1989 C
2027 standard.
2028
2029 @item __BASE_FILE__
2030 This macro expands to the name of the main input file, in the form
2031 of a C string constant.  This is the source file that was specified
2032 on the command line of the preprocessor or C compiler.
2033
2034 @item __INCLUDE_LEVEL__
2035 This macro expands to a decimal integer constant that represents the
2036 depth of nesting in include files.  The value of this macro is
2037 incremented on every @samp{#include} directive and decremented at the
2038 end of every included file.  It starts out at 0, its value within the
2039 base file specified on the command line.
2040
2041 @item __ELF__
2042 This macro is defined if the target uses the ELF object format.
2043
2044 @item __VERSION__
2045 This macro expands to a string constant which describes the version of
2046 the compiler in use.  You should not rely on its contents having any
2047 particular form, but it can be counted on to contain at least the
2048 release number.
2049
2050 @item __OPTIMIZE__
2051 @itemx __OPTIMIZE_SIZE__
2052 @itemx __NO_INLINE__
2053 These macros describe the compilation mode.  @code{__OPTIMIZE__} is
2054 defined in all optimizing compilations.  @code{__OPTIMIZE_SIZE__} is
2055 defined if the compiler is optimizing for size, not speed.
2056 @code{__NO_INLINE__} is defined if no functions will be inlined into
2057 their callers (when not optimizing, or when inlining has been
2058 specifically disabled by @option{-fno-inline}).
2059
2060 These macros cause certain GNU header files to provide optimized
2061 definitions, using macros or inline functions, of system library
2062 functions.  You should not use these macros in any way unless you make
2063 sure that programs will execute with the same effect whether or not they
2064 are defined.  If they are defined, their value is 1.
2065
2066 @item __GNUC_GNU_INLINE__
2067 GCC defines this macro if functions declared @code{inline} will be
2068 handled in GCC's traditional gnu90 mode.  Object files will contain
2069 externally visible definitions of all functions declared @code{inline}
2070 without @code{extern} or @code{static}.  They will not contain any
2071 definitions of any functions declared @code{extern inline}.
2072
2073 @item __GNUC_STDC_INLINE__
2074 GCC defines this macro if functions declared @code{inline} will be
2075 handled according to the ISO C99 standard.  Object files will contain
2076 externally visible definitions of all functions declared @code{extern
2077 inline}.  They will not contain definitions of any functions declared
2078 @code{inline} without @code{extern}.
2079
2080 If this macro is defined, GCC supports the @code{gnu_inline} function
2081 attribute as a way to always get the gnu90 behavior.  Support for
2082 this and @code{__GNUC_GNU_INLINE__} was added in GCC 4.1.3.  If
2083 neither macro is defined, an older version of GCC is being used:
2084 @code{inline} functions will be compiled in gnu90 mode, and the
2085 @code{gnu_inline} function attribute will not be recognized.
2086
2087 @item __CHAR_UNSIGNED__
2088 GCC defines this macro if and only if the data type @code{char} is
2089 unsigned on the target machine.  It exists to cause the standard header
2090 file @file{limits.h} to work correctly.  You should not use this macro
2091 yourself; instead, refer to the standard macros defined in @file{limits.h}.
2092
2093 @item __WCHAR_UNSIGNED__
2094 Like @code{__CHAR_UNSIGNED__}, this macro is defined if and only if the
2095 data type @code{wchar_t} is unsigned and the front-end is in C++ mode.
2096
2097 @item __REGISTER_PREFIX__
2098 This macro expands to a single token (not a string constant) which is
2099 the prefix applied to CPU register names in assembly language for this
2100 target.  You can use it to write assembly that is usable in multiple
2101 environments.  For example, in the @code{m68k-aout} environment it
2102 expands to nothing, but in the @code{m68k-coff} environment it expands
2103 to a single @samp{%}.
2104
2105 @item __USER_LABEL_PREFIX__
2106 This macro expands to a single token which is the prefix applied to
2107 user labels (symbols visible to C code) in assembly.  For example, in
2108 the @code{m68k-aout} environment it expands to an @samp{_}, but in the
2109 @code{m68k-coff} environment it expands to nothing.
2110
2111 This macro will have the correct definition even if
2112 @option{-f(no-)underscores} is in use, but it will not be correct if
2113 target-specific options that adjust this prefix are used (e.g.@: the
2114 OSF/rose @option{-mno-underscores} option).
2115
2116 @item __SIZE_TYPE__
2117 @itemx __PTRDIFF_TYPE__
2118 @itemx __WCHAR_TYPE__
2119 @itemx __WINT_TYPE__
2120 @itemx __INTMAX_TYPE__
2121 @itemx __UINTMAX_TYPE__
2122 @itemx __SIG_ATOMIC_TYPE__
2123 @itemx __INT8_TYPE__
2124 @itemx __INT16_TYPE__
2125 @itemx __INT32_TYPE__
2126 @itemx __INT64_TYPE__
2127 @itemx __UINT8_TYPE__
2128 @itemx __UINT16_TYPE__
2129 @itemx __UINT32_TYPE__
2130 @itemx __UINT64_TYPE__
2131 @itemx __INT_LEAST8_TYPE__
2132 @itemx __INT_LEAST16_TYPE__
2133 @itemx __INT_LEAST32_TYPE__
2134 @itemx __INT_LEAST64_TYPE__
2135 @itemx __UINT_LEAST8_TYPE__
2136 @itemx __UINT_LEAST16_TYPE__
2137 @itemx __UINT_LEAST32_TYPE__
2138 @itemx __UINT_LEAST64_TYPE__
2139 @itemx __INT_FAST8_TYPE__
2140 @itemx __INT_FAST16_TYPE__
2141 @itemx __INT_FAST32_TYPE__
2142 @itemx __INT_FAST64_TYPE__
2143 @itemx __UINT_FAST8_TYPE__
2144 @itemx __UINT_FAST16_TYPE__
2145 @itemx __UINT_FAST32_TYPE__
2146 @itemx __UINT_FAST64_TYPE__
2147 @itemx __INTPTR_TYPE__
2148 @itemx __UINTPTR_TYPE__
2149 These macros are defined to the correct underlying types for the
2150 @code{size_t}, @code{ptrdiff_t}, @code{wchar_t}, @code{wint_t},
2151 @code{intmax_t}, @code{uintmax_t}, @code{sig_atomic_t}, @code{int8_t},
2152 @code{int16_t}, @code{int32_t}, @code{int64_t}, @code{uint8_t},
2153 @code{uint16_t}, @code{uint32_t}, @code{uint64_t},
2154 @code{int_least8_t}, @code{int_least16_t}, @code{int_least32_t},
2155 @code{int_least64_t}, @code{uint_least8_t}, @code{uint_least16_t},
2156 @code{uint_least32_t}, @code{uint_least64_t}, @code{int_fast8_t},
2157 @code{int_fast16_t}, @code{int_fast32_t}, @code{int_fast64_t},
2158 @code{uint_fast8_t}, @code{uint_fast16_t}, @code{uint_fast32_t},
2159 @code{uint_fast64_t}, @code{intptr_t}, and @code{uintptr_t} typedefs,
2160 respectively.  They exist to make the standard header files
2161 @file{stddef.h}, @file{stdint.h}, and @file{wchar.h} work correctly.
2162 You should not use these macros directly; instead, include the
2163 appropriate headers and use the typedefs.  Some of these macros may
2164 not be defined on particular systems if GCC does not provide a
2165 @file{stdint.h} header on those systems.
2166
2167 @item __CHAR_BIT__
2168 Defined to the number of bits used in the representation of the
2169 @code{char} data type.  It exists to make the standard header given
2170 numerical limits work correctly.  You should not use
2171 this macro directly; instead, include the appropriate headers.
2172
2173 @item __SCHAR_MAX__
2174 @itemx __WCHAR_MAX__
2175 @itemx __SHRT_MAX__
2176 @itemx __INT_MAX__
2177 @itemx __LONG_MAX__
2178 @itemx __LONG_LONG_MAX__
2179 @itemx __WINT_MAX__
2180 @itemx __SIZE_MAX__
2181 @itemx __PTRDIFF_MAX__
2182 @itemx __INTMAX_MAX__
2183 @itemx __UINTMAX_MAX__
2184 @itemx __SIG_ATOMIC_MAX__
2185 @itemx __INT8_MAX__
2186 @itemx __INT16_MAX__
2187 @itemx __INT32_MAX__
2188 @itemx __INT64_MAX__
2189 @itemx __UINT8_MAX__
2190 @itemx __UINT16_MAX__
2191 @itemx __UINT32_MAX__
2192 @itemx __UINT64_MAX__
2193 @itemx __INT_LEAST8_MAX__
2194 @itemx __INT_LEAST16_MAX__
2195 @itemx __INT_LEAST32_MAX__
2196 @itemx __INT_LEAST64_MAX__
2197 @itemx __UINT_LEAST8_MAX__
2198 @itemx __UINT_LEAST16_MAX__
2199 @itemx __UINT_LEAST32_MAX__
2200 @itemx __UINT_LEAST64_MAX__
2201 @itemx __INT_FAST8_MAX__
2202 @itemx __INT_FAST16_MAX__
2203 @itemx __INT_FAST32_MAX__
2204 @itemx __INT_FAST64_MAX__
2205 @itemx __UINT_FAST8_MAX__
2206 @itemx __UINT_FAST16_MAX__
2207 @itemx __UINT_FAST32_MAX__
2208 @itemx __UINT_FAST64_MAX__
2209 @itemx __INTPTR_MAX__
2210 @itemx __UINTPTR_MAX__
2211 @itemx __WCHAR_MIN__
2212 @itemx __WINT_MIN__
2213 @itemx __SIG_ATOMIC_MIN__
2214 Defined to the maximum value of the @code{signed char}, @code{wchar_t},
2215 @code{signed short},
2216 @code{signed int}, @code{signed long}, @code{signed long long},
2217 @code{wint_t}, @code{size_t}, @code{ptrdiff_t},
2218 @code{intmax_t}, @code{uintmax_t}, @code{sig_atomic_t}, @code{int8_t},
2219 @code{int16_t}, @code{int32_t}, @code{int64_t}, @code{uint8_t},
2220 @code{uint16_t}, @code{uint32_t}, @code{uint64_t},
2221 @code{int_least8_t}, @code{int_least16_t}, @code{int_least32_t},
2222 @code{int_least64_t}, @code{uint_least8_t}, @code{uint_least16_t},
2223 @code{uint_least32_t}, @code{uint_least64_t}, @code{int_fast8_t},
2224 @code{int_fast16_t}, @code{int_fast32_t}, @code{int_fast64_t},
2225 @code{uint_fast8_t}, @code{uint_fast16_t}, @code{uint_fast32_t},
2226 @code{uint_fast64_t}, @code{intptr_t}, and @code{uintptr_t} types and
2227 to the minimum value of the @code{wchar_t}, @code{wint_t}, and
2228 @code{sig_atomic_t} types respectively.  They exist to make the
2229 standard header given numerical limits work correctly.  You should not
2230 use these macros directly; instead, include the appropriate headers.
2231 Some of these macros may not be defined on particular systems if GCC
2232 does not provide a @file{stdint.h} header on those systems.
2233
2234 @item __INT8_C
2235 @itemx __INT16_C
2236 @itemx __INT32_C
2237 @itemx __INT64_C
2238 @itemx __UINT8_C
2239 @itemx __UINT16_C
2240 @itemx __UINT32_C
2241 @itemx __UINT64_C
2242 @itemx __INTMAX_C
2243 @itemx __UINTMAX_C
2244 Defined to implementations of the standard @file{stdint.h} macros with
2245 the same names without the leading @code{__}.  They exist the make the
2246 implementation of that header work correctly.  You should not use
2247 these macros directly; instead, include the appropriate headers.  Some
2248 of these macros may not be defined on particular systems if GCC does
2249 not provide a @file{stdint.h} header on those systems.
2250
2251 @item __SIZEOF_INT__
2252 @itemx __SIZEOF_LONG__
2253 @itemx __SIZEOF_LONG_LONG__
2254 @itemx __SIZEOF_SHORT__
2255 @itemx __SIZEOF_POINTER__
2256 @itemx __SIZEOF_FLOAT__
2257 @itemx __SIZEOF_DOUBLE__
2258 @itemx __SIZEOF_LONG_DOUBLE__
2259 @itemx __SIZEOF_SIZE_T__
2260 @itemx __SIZEOF_WCHAR_T__
2261 @itemx __SIZEOF_WINT_T__
2262 @itemx __SIZEOF_PTRDIFF_T__
2263 Defined to the number of bytes of the C standard data types: @code{int},
2264 @code{long}, @code{long long}, @code{short}, @code{void *}, @code{float},
2265 @code{double}, @code{long double}, @code{size_t}, @code{wchar_t}, @code{wint_t}
2266 and @code{ptrdiff_t}.
2267
2268 @item __DEPRECATED
2269 This macro is defined, with value 1, when compiling a C++ source file
2270 with warnings about deprecated constructs enabled.  These warnings are
2271 enabled by default, but can be disabled with @option{-Wno-deprecated}.
2272
2273 @item __EXCEPTIONS
2274 This macro is defined, with value 1, when compiling a C++ source file
2275 with exceptions enabled.  If @option{-fno-exceptions} is used when
2276 compiling the file, then this macro is not defined.
2277
2278 @item __GXX_RTTI
2279 This macro is defined, with value 1, when compiling a C++ source file
2280 with runtime type identification enabled.  If @option{-fno-rtti} is
2281 used when compiling the file, then this macro is not defined.
2282
2283 @item __USING_SJLJ_EXCEPTIONS__
2284 This macro is defined, with value 1, if the compiler uses the old
2285 mechanism based on @code{setjmp} and @code{longjmp} for exception
2286 handling.
2287
2288 @item __GXX_EXPERIMENTAL_CXX0X__
2289 This macro is defined when compiling a C++ source file with the option
2290 @option{-std=c++0x} or @option{-std=gnu++0x}. It indicates that some
2291 features likely to be included in C++0x are available. Note that these
2292 features are experimental, and may change or be removed in future
2293 versions of GCC.
2294
2295 @item __GXX_WEAK__
2296 This macro is defined when compiling a C++ source file.  It has the
2297 value 1 if the compiler will use weak symbols, COMDAT sections, or
2298 other similar techniques to collapse symbols with ``vague linkage''
2299 that are defined in multiple translation units.  If the compiler will
2300 not collapse such symbols, this macro is defined with value 0.  In
2301 general, user code should not need to make use of this macro; the
2302 purpose of this macro is to ease implementation of the C++ runtime
2303 library provided with G++.
2304
2305 @item __NEXT_RUNTIME__
2306 This macro is defined, with value 1, if (and only if) the NeXT runtime
2307 (as in @option{-fnext-runtime}) is in use for Objective-C@.  If the GNU
2308 runtime is used, this macro is not defined, so that you can use this
2309 macro to determine which runtime (NeXT or GNU) is being used.
2310
2311 @item __LP64__
2312 @itemx _LP64
2313 These macros are defined, with value 1, if (and only if) the compilation
2314 is for a target where @code{long int} and pointer both use 64-bits and
2315 @code{int} uses 32-bit.
2316
2317 @item __SSP__
2318 This macro is defined, with value 1, when @option{-fstack-protector} is in
2319 use.
2320
2321 @item __SSP_ALL__
2322 This macro is defined, with value 2, when @option{-fstack-protector-all} is
2323 in use.
2324
2325 @item __TIMESTAMP__
2326 This macro expands to a string constant that describes the date and time
2327 of the last modification of the current source file. The string constant
2328 contains abbreviated day of the week, month, day of the month, time in
2329 hh:mm:ss form, year and looks like @code{@w{"Sun Sep 16 01:03:52 1973"}}.
2330 If the day of the month is less than 10, it is padded with a space on the left.
2331
2332 If GCC cannot determine the current date, it will emit a warning message
2333 (once per compilation) and @code{__TIMESTAMP__} will expand to
2334 @code{@w{"??? ??? ?? ??:??:?? ????"}}.
2335
2336 @item __GCC_HAVE_SYNC_COMPARE_AND_SWAP_1
2337 @itemx __GCC_HAVE_SYNC_COMPARE_AND_SWAP_2
2338 @itemx __GCC_HAVE_SYNC_COMPARE_AND_SWAP_4
2339 @itemx __GCC_HAVE_SYNC_COMPARE_AND_SWAP_8
2340 @itemx __GCC_HAVE_SYNC_COMPARE_AND_SWAP_16
2341 These macros are defined when the target processor supports atomic compare
2342 and swap operations on operands 1, 2, 4, 8 or 16 bytes in length, respectively.
2343
2344 @item __GCC_HAVE_DWARF2_CFI_ASM
2345 This macro is defined when the compiler is emitting Dwarf2 CFI directives
2346 to the assembler.  When this is defined, it is possible to emit those same
2347 directives in inline assembly.
2348 @end table
2349
2350 @node System-specific Predefined Macros
2351 @subsection System-specific Predefined Macros
2352
2353 @cindex system-specific predefined macros
2354 @cindex predefined macros, system-specific
2355 @cindex reserved namespace
2356
2357 The C preprocessor normally predefines several macros that indicate what
2358 type of system and machine is in use.  They are obviously different on
2359 each target supported by GCC@.  This manual, being for all systems and
2360 machines, cannot tell you what their names are, but you can use
2361 @command{cpp -dM} to see them all.  @xref{Invocation}.  All system-specific
2362 predefined macros expand to the constant 1, so you can test them with
2363 either @samp{#ifdef} or @samp{#if}.
2364
2365 The C standard requires that all system-specific macros be part of the
2366 @dfn{reserved namespace}.  All names which begin with two underscores,
2367 or an underscore and a capital letter, are reserved for the compiler and
2368 library to use as they wish.  However, historically system-specific
2369 macros have had names with no special prefix; for instance, it is common
2370 to find @code{unix} defined on Unix systems.  For all such macros, GCC
2371 provides a parallel macro with two underscores added at the beginning
2372 and the end.  If @code{unix} is defined, @code{__unix__} will be defined
2373 too.  There will never be more than two underscores; the parallel of
2374 @code{_mips} is @code{__mips__}.
2375
2376 When the @option{-ansi} option, or any @option{-std} option that
2377 requests strict conformance, is given to the compiler, all the
2378 system-specific predefined macros outside the reserved namespace are
2379 suppressed.  The parallel macros, inside the reserved namespace, remain
2380 defined.
2381
2382 We are slowly phasing out all predefined macros which are outside the
2383 reserved namespace.  You should never use them in new programs, and we
2384 encourage you to correct older code to use the parallel macros whenever
2385 you find it.  We don't recommend you use the system-specific macros that
2386 are in the reserved namespace, either.  It is better in the long run to
2387 check specifically for features you need, using a tool such as
2388 @command{autoconf}.
2389
2390 @node C++ Named Operators
2391 @subsection C++ Named Operators
2392 @cindex named operators
2393 @cindex C++ named operators
2394 @cindex iso646.h
2395
2396 In C++, there are eleven keywords which are simply alternate spellings
2397 of operators normally written with punctuation.  These keywords are
2398 treated as such even in the preprocessor.  They function as operators in
2399 @samp{#if}, and they cannot be defined as macros or poisoned.  In C, you
2400 can request that those keywords take their C++ meaning by including
2401 @file{iso646.h}.  That header defines each one as a normal object-like
2402 macro expanding to the appropriate punctuator.
2403
2404 These are the named operators and their corresponding punctuators:
2405
2406 @multitable {Named Operator} {Punctuator}
2407 @item Named Operator @tab Punctuator
2408 @item @code{and}    @tab @code{&&}
2409 @item @code{and_eq} @tab @code{&=}
2410 @item @code{bitand} @tab @code{&}
2411 @item @code{bitor}  @tab @code{|}
2412 @item @code{compl}  @tab @code{~}
2413 @item @code{not}    @tab @code{!}
2414 @item @code{not_eq} @tab @code{!=}
2415 @item @code{or}     @tab @code{||}
2416 @item @code{or_eq}  @tab @code{|=}
2417 @item @code{xor}    @tab @code{^}
2418 @item @code{xor_eq} @tab @code{^=}
2419 @end multitable
2420
2421 @node Undefining and Redefining Macros
2422 @section Undefining and Redefining Macros
2423 @cindex undefining macros
2424 @cindex redefining macros
2425 @findex #undef
2426
2427 If a macro ceases to be useful, it may be @dfn{undefined} with the
2428 @samp{#undef} directive.  @samp{#undef} takes a single argument, the
2429 name of the macro to undefine.  You use the bare macro name, even if the
2430 macro is function-like.  It is an error if anything appears on the line
2431 after the macro name.  @samp{#undef} has no effect if the name is not a
2432 macro.
2433
2434 @smallexample
2435 #define FOO 4
2436 x = FOO;        @expansion{} x = 4;
2437 #undef FOO
2438 x = FOO;        @expansion{} x = FOO;
2439 @end smallexample
2440
2441 Once a macro has been undefined, that identifier may be @dfn{redefined}
2442 as a macro by a subsequent @samp{#define} directive.  The new definition
2443 need not have any resemblance to the old definition.
2444
2445 However, if an identifier which is currently a macro is redefined, then
2446 the new definition must be @dfn{effectively the same} as the old one.
2447 Two macro definitions are effectively the same if:
2448 @itemize @bullet
2449 @item Both are the same type of macro (object- or function-like).
2450 @item All the tokens of the replacement list are the same.
2451 @item If there are any parameters, they are the same.
2452 @item Whitespace appears in the same places in both.  It need not be
2453 exactly the same amount of whitespace, though.  Remember that comments
2454 count as whitespace.
2455 @end itemize
2456
2457 @noindent
2458 These definitions are effectively the same:
2459 @smallexample
2460 #define FOUR (2 + 2)
2461 #define FOUR         (2    +    2)
2462 #define FOUR (2 /* @r{two} */ + 2)
2463 @end smallexample
2464 @noindent
2465 but these are not:
2466 @smallexample
2467 #define FOUR (2 + 2)
2468 #define FOUR ( 2+2 )
2469 #define FOUR (2 * 2)
2470 #define FOUR(score,and,seven,years,ago) (2 + 2)
2471 @end smallexample
2472
2473 If a macro is redefined with a definition that is not effectively the
2474 same as the old one, the preprocessor issues a warning and changes the
2475 macro to use the new definition.  If the new definition is effectively
2476 the same, the redefinition is silently ignored.  This allows, for
2477 instance, two different headers to define a common macro.  The
2478 preprocessor will only complain if the definitions do not match.
2479
2480 @node Directives Within Macro Arguments
2481 @section Directives Within Macro Arguments
2482 @cindex macro arguments and directives
2483
2484 Occasionally it is convenient to use preprocessor directives within
2485 the arguments of a macro.  The C and C++ standards declare that
2486 behavior in these cases is undefined.
2487
2488 Versions of CPP prior to 3.2 would reject such constructs with an
2489 error message.  This was the only syntactic difference between normal
2490 functions and function-like macros, so it seemed attractive to remove
2491 this limitation, and people would often be surprised that they could
2492 not use macros in this way.  Moreover, sometimes people would use
2493 conditional compilation in the argument list to a normal library
2494 function like @samp{printf}, only to find that after a library upgrade
2495 @samp{printf} had changed to be a function-like macro, and their code
2496 would no longer compile.  So from version 3.2 we changed CPP to
2497 successfully process arbitrary directives within macro arguments in
2498 exactly the same way as it would have processed the directive were the
2499 function-like macro invocation not present.
2500
2501 If, within a macro invocation, that macro is redefined, then the new
2502 definition takes effect in time for argument pre-expansion, but the
2503 original definition is still used for argument replacement.  Here is a
2504 pathological example:
2505
2506 @smallexample
2507 #define f(x) x x
2508 f (1
2509 #undef f
2510 #define f 2
2511 f)
2512 @end smallexample
2513
2514 @noindent
2515 which expands to
2516
2517 @smallexample
2518 1 2 1 2
2519 @end smallexample
2520
2521 @noindent
2522 with the semantics described above.
2523
2524 @node Macro Pitfalls
2525 @section Macro Pitfalls
2526 @cindex problems with macros
2527 @cindex pitfalls of macros
2528
2529 In this section we describe some special rules that apply to macros and
2530 macro expansion, and point out certain cases in which the rules have
2531 counter-intuitive consequences that you must watch out for.
2532
2533 @menu
2534 * Misnesting::
2535 * Operator Precedence Problems::
2536 * Swallowing the Semicolon::
2537 * Duplication of Side Effects::
2538 * Self-Referential Macros::
2539 * Argument Prescan::
2540 * Newlines in Arguments::
2541 @end menu
2542
2543 @node Misnesting
2544 @subsection Misnesting
2545
2546 When a macro is called with arguments, the arguments are substituted
2547 into the macro body and the result is checked, together with the rest of
2548 the input file, for more macro calls.  It is possible to piece together
2549 a macro call coming partially from the macro body and partially from the
2550 arguments.  For example,
2551
2552 @smallexample
2553 #define twice(x) (2*(x))
2554 #define call_with_1(x) x(1)
2555 call_with_1 (twice)
2556      @expansion{} twice(1)
2557      @expansion{} (2*(1))
2558 @end smallexample
2559
2560 Macro definitions do not have to have balanced parentheses.  By writing
2561 an unbalanced open parenthesis in a macro body, it is possible to create
2562 a macro call that begins inside the macro body but ends outside of it.
2563 For example,
2564
2565 @smallexample
2566 #define strange(file) fprintf (file, "%s %d",
2567 @dots{}
2568 strange(stderr) p, 35)
2569      @expansion{} fprintf (stderr, "%s %d", p, 35)
2570 @end smallexample
2571
2572 The ability to piece together a macro call can be useful, but the use of
2573 unbalanced open parentheses in a macro body is just confusing, and
2574 should be avoided.
2575
2576 @node Operator Precedence Problems
2577 @subsection Operator Precedence Problems
2578 @cindex parentheses in macro bodies
2579
2580 You may have noticed that in most of the macro definition examples shown
2581 above, each occurrence of a macro argument name had parentheses around
2582 it.  In addition, another pair of parentheses usually surround the
2583 entire macro definition.  Here is why it is best to write macros that
2584 way.
2585
2586 Suppose you define a macro as follows,
2587
2588 @smallexample
2589 #define ceil_div(x, y) (x + y - 1) / y
2590 @end smallexample
2591
2592 @noindent
2593 whose purpose is to divide, rounding up.  (One use for this operation is
2594 to compute how many @code{int} objects are needed to hold a certain
2595 number of @code{char} objects.)  Then suppose it is used as follows:
2596
2597 @smallexample
2598 a = ceil_div (b & c, sizeof (int));
2599      @expansion{} a = (b & c + sizeof (int) - 1) / sizeof (int);
2600 @end smallexample
2601
2602 @noindent
2603 This does not do what is intended.  The operator-precedence rules of
2604 C make it equivalent to this:
2605
2606 @smallexample
2607 a = (b & (c + sizeof (int) - 1)) / sizeof (int);
2608 @end smallexample
2609
2610 @noindent
2611 What we want is this:
2612
2613 @smallexample
2614 a = ((b & c) + sizeof (int) - 1)) / sizeof (int);
2615 @end smallexample
2616
2617 @noindent
2618 Defining the macro as
2619
2620 @smallexample
2621 #define ceil_div(x, y) ((x) + (y) - 1) / (y)
2622 @end smallexample
2623
2624 @noindent
2625 provides the desired result.
2626
2627 Unintended grouping can result in another way.  Consider @code{sizeof
2628 ceil_div(1, 2)}.  That has the appearance of a C expression that would
2629 compute the size of the type of @code{ceil_div (1, 2)}, but in fact it
2630 means something very different.  Here is what it expands to:
2631
2632 @smallexample
2633 sizeof ((1) + (2) - 1) / (2)
2634 @end smallexample
2635
2636 @noindent
2637 This would take the size of an integer and divide it by two.  The
2638 precedence rules have put the division outside the @code{sizeof} when it
2639 was intended to be inside.
2640
2641 Parentheses around the entire macro definition prevent such problems.
2642 Here, then, is the recommended way to define @code{ceil_div}:
2643
2644 @smallexample
2645 #define ceil_div(x, y) (((x) + (y) - 1) / (y))
2646 @end smallexample
2647
2648 @node Swallowing the Semicolon
2649 @subsection Swallowing the Semicolon
2650 @cindex semicolons (after macro calls)
2651
2652 Often it is desirable to define a macro that expands into a compound
2653 statement.  Consider, for example, the following macro, that advances a
2654 pointer (the argument @code{p} says where to find it) across whitespace
2655 characters:
2656
2657 @smallexample
2658 #define SKIP_SPACES(p, limit)  \
2659 @{ char *lim = (limit);         \
2660   while (p < lim) @{            \
2661     if (*p++ != ' ') @{         \
2662       p--; break; @}@}@}
2663 @end smallexample
2664
2665 @noindent
2666 Here backslash-newline is used to split the macro definition, which must
2667 be a single logical line, so that it resembles the way such code would
2668 be laid out if not part of a macro definition.
2669
2670 A call to this macro might be @code{SKIP_SPACES (p, lim)}.  Strictly
2671 speaking, the call expands to a compound statement, which is a complete
2672 statement with no need for a semicolon to end it.  However, since it
2673 looks like a function call, it minimizes confusion if you can use it
2674 like a function call, writing a semicolon afterward, as in
2675 @code{SKIP_SPACES (p, lim);}
2676
2677 This can cause trouble before @code{else} statements, because the
2678 semicolon is actually a null statement.  Suppose you write
2679
2680 @smallexample
2681 if (*p != 0)
2682   SKIP_SPACES (p, lim);
2683 else @dots{}
2684 @end smallexample
2685
2686 @noindent
2687 The presence of two statements---the compound statement and a null
2688 statement---in between the @code{if} condition and the @code{else}
2689 makes invalid C code.
2690
2691 The definition of the macro @code{SKIP_SPACES} can be altered to solve
2692 this problem, using a @code{do @dots{} while} statement.  Here is how:
2693
2694 @smallexample
2695 #define SKIP_SPACES(p, limit)     \
2696 do @{ char *lim = (limit);         \
2697      while (p < lim) @{            \
2698        if (*p++ != ' ') @{         \
2699          p--; break; @}@}@}          \
2700 while (0)
2701 @end smallexample
2702
2703 Now @code{SKIP_SPACES (p, lim);} expands into
2704
2705 @smallexample
2706 do @{@dots{}@} while (0);
2707 @end smallexample
2708
2709 @noindent
2710 which is one statement.  The loop executes exactly once; most compilers
2711 generate no extra code for it.
2712
2713 @node Duplication of Side Effects
2714 @subsection Duplication of Side Effects
2715
2716 @cindex side effects (in macro arguments)
2717 @cindex unsafe macros
2718 Many C programs define a macro @code{min}, for ``minimum'', like this:
2719
2720 @smallexample
2721 #define min(X, Y)  ((X) < (Y) ? (X) : (Y))
2722 @end smallexample
2723
2724 When you use this macro with an argument containing a side effect,
2725 as shown here,
2726
2727 @smallexample
2728 next = min (x + y, foo (z));
2729 @end smallexample
2730
2731 @noindent
2732 it expands as follows:
2733
2734 @smallexample
2735 next = ((x + y) < (foo (z)) ? (x + y) : (foo (z)));
2736 @end smallexample
2737
2738 @noindent
2739 where @code{x + y} has been substituted for @code{X} and @code{foo (z)}
2740 for @code{Y}.
2741
2742 The function @code{foo} is used only once in the statement as it appears
2743 in the program, but the expression @code{foo (z)} has been substituted
2744 twice into the macro expansion.  As a result, @code{foo} might be called
2745 two times when the statement is executed.  If it has side effects or if
2746 it takes a long time to compute, the results might not be what you
2747 intended.  We say that @code{min} is an @dfn{unsafe} macro.
2748
2749 The best solution to this problem is to define @code{min} in a way that
2750 computes the value of @code{foo (z)} only once.  The C language offers
2751 no standard way to do this, but it can be done with GNU extensions as
2752 follows:
2753
2754 @smallexample
2755 #define min(X, Y)                \
2756 (@{ typeof (X) x_ = (X);          \
2757    typeof (Y) y_ = (Y);          \
2758    (x_ < y_) ? x_ : y_; @})
2759 @end smallexample
2760
2761 The @samp{(@{ @dots{} @})} notation produces a compound statement that
2762 acts as an expression.  Its value is the value of its last statement.
2763 This permits us to define local variables and assign each argument to
2764 one.  The local variables have underscores after their names to reduce
2765 the risk of conflict with an identifier of wider scope (it is impossible
2766 to avoid this entirely).  Now each argument is evaluated exactly once.
2767
2768 If you do not wish to use GNU C extensions, the only solution is to be
2769 careful when @emph{using} the macro @code{min}.  For example, you can
2770 calculate the value of @code{foo (z)}, save it in a variable, and use
2771 that variable in @code{min}:
2772
2773 @smallexample
2774 @group
2775 #define min(X, Y)  ((X) < (Y) ? (X) : (Y))
2776 @dots{}
2777 @{
2778   int tem = foo (z);
2779   next = min (x + y, tem);
2780 @}
2781 @end group
2782 @end smallexample
2783
2784 @noindent
2785 (where we assume that @code{foo} returns type @code{int}).
2786
2787 @node Self-Referential Macros
2788 @subsection Self-Referential Macros
2789 @cindex self-reference
2790
2791 A @dfn{self-referential} macro is one whose name appears in its
2792 definition.  Recall that all macro definitions are rescanned for more
2793 macros to replace.  If the self-reference were considered a use of the
2794 macro, it would produce an infinitely large expansion.  To prevent this,
2795 the self-reference is not considered a macro call.  It is passed into
2796 the preprocessor output unchanged.  Consider an example:
2797
2798 @smallexample
2799 #define foo (4 + foo)
2800 @end smallexample
2801
2802 @noindent
2803 where @code{foo} is also a variable in your program.
2804
2805 Following the ordinary rules, each reference to @code{foo} will expand
2806 into @code{(4 + foo)}; then this will be rescanned and will expand into
2807 @code{(4 + (4 + foo))}; and so on until the computer runs out of memory.
2808
2809 The self-reference rule cuts this process short after one step, at
2810 @code{(4 + foo)}.  Therefore, this macro definition has the possibly
2811 useful effect of causing the program to add 4 to the value of @code{foo}
2812 wherever @code{foo} is referred to.
2813
2814 In most cases, it is a bad idea to take advantage of this feature.  A
2815 person reading the program who sees that @code{foo} is a variable will
2816 not expect that it is a macro as well.  The reader will come across the
2817 identifier @code{foo} in the program and think its value should be that
2818 of the variable @code{foo}, whereas in fact the value is four greater.
2819
2820 One common, useful use of self-reference is to create a macro which
2821 expands to itself.  If you write
2822
2823 @smallexample
2824 #define EPERM EPERM
2825 @end smallexample
2826
2827 @noindent
2828 then the macro @code{EPERM} expands to @code{EPERM}.  Effectively, it is
2829 left alone by the preprocessor whenever it's used in running text.  You
2830 can tell that it's a macro with @samp{#ifdef}.  You might do this if you
2831 want to define numeric constants with an @code{enum}, but have
2832 @samp{#ifdef} be true for each constant.
2833
2834 If a macro @code{x} expands to use a macro @code{y}, and the expansion of
2835 @code{y} refers to the macro @code{x}, that is an @dfn{indirect
2836 self-reference} of @code{x}.  @code{x} is not expanded in this case
2837 either.  Thus, if we have
2838
2839 @smallexample
2840 #define x (4 + y)
2841 #define y (2 * x)
2842 @end smallexample
2843
2844 @noindent
2845 then @code{x} and @code{y} expand as follows:
2846
2847 @smallexample
2848 @group
2849 x    @expansion{} (4 + y)
2850      @expansion{} (4 + (2 * x))
2851
2852 y    @expansion{} (2 * x)
2853      @expansion{} (2 * (4 + y))
2854 @end group
2855 @end smallexample
2856
2857 @noindent
2858 Each macro is expanded when it appears in the definition of the other
2859 macro, but not when it indirectly appears in its own definition.
2860
2861 @node Argument Prescan
2862 @subsection Argument Prescan
2863 @cindex expansion of arguments
2864 @cindex macro argument expansion
2865 @cindex prescan of macro arguments
2866
2867 Macro arguments are completely macro-expanded before they are
2868 substituted into a macro body, unless they are stringified or pasted
2869 with other tokens.  After substitution, the entire macro body, including
2870 the substituted arguments, is scanned again for macros to be expanded.
2871 The result is that the arguments are scanned @emph{twice} to expand
2872 macro calls in them.
2873
2874 Most of the time, this has no effect.  If the argument contained any
2875 macro calls, they are expanded during the first scan.  The result
2876 therefore contains no macro calls, so the second scan does not change
2877 it.  If the argument were substituted as given, with no prescan, the
2878 single remaining scan would find the same macro calls and produce the
2879 same results.
2880
2881 You might expect the double scan to change the results when a
2882 self-referential macro is used in an argument of another macro
2883 (@pxref{Self-Referential Macros}): the self-referential macro would be
2884 expanded once in the first scan, and a second time in the second scan.
2885 However, this is not what happens.  The self-references that do not
2886 expand in the first scan are marked so that they will not expand in the
2887 second scan either.
2888
2889 You might wonder, ``Why mention the prescan, if it makes no difference?
2890 And why not skip it and make the preprocessor faster?''  The answer is
2891 that the prescan does make a difference in three special cases:
2892
2893 @itemize @bullet
2894 @item
2895 Nested calls to a macro.
2896
2897 We say that @dfn{nested} calls to a macro occur when a macro's argument
2898 contains a call to that very macro.  For example, if @code{f} is a macro
2899 that expects one argument, @code{f (f (1))} is a nested pair of calls to
2900 @code{f}.  The desired expansion is made by expanding @code{f (1)} and
2901 substituting that into the definition of @code{f}.  The prescan causes
2902 the expected result to happen.  Without the prescan, @code{f (1)} itself
2903 would be substituted as an argument, and the inner use of @code{f} would
2904 appear during the main scan as an indirect self-reference and would not
2905 be expanded.
2906
2907 @item
2908 Macros that call other macros that stringify or concatenate.
2909
2910 If an argument is stringified or concatenated, the prescan does not
2911 occur.  If you @emph{want} to expand a macro, then stringify or
2912 concatenate its expansion, you can do that by causing one macro to call
2913 another macro that does the stringification or concatenation.  For
2914 instance, if you have
2915
2916 @smallexample
2917 #define AFTERX(x) X_ ## x
2918 #define XAFTERX(x) AFTERX(x)
2919 #define TABLESIZE 1024
2920 #define BUFSIZE TABLESIZE
2921 @end smallexample
2922
2923 then @code{AFTERX(BUFSIZE)} expands to @code{X_BUFSIZE}, and
2924 @code{XAFTERX(BUFSIZE)} expands to @code{X_1024}.  (Not to
2925 @code{X_TABLESIZE}.  Prescan always does a complete expansion.)
2926
2927 @item
2928 Macros used in arguments, whose expansions contain unshielded commas.
2929
2930 This can cause a macro expanded on the second scan to be called with the
2931 wrong number of arguments.  Here is an example:
2932
2933 @smallexample
2934 #define foo  a,b
2935 #define bar(x) lose(x)
2936 #define lose(x) (1 + (x))
2937 @end smallexample
2938
2939 We would like @code{bar(foo)} to turn into @code{(1 + (foo))}, which
2940 would then turn into @code{(1 + (a,b))}.  Instead, @code{bar(foo)}
2941 expands into @code{lose(a,b)}, and you get an error because @code{lose}
2942 requires a single argument.  In this case, the problem is easily solved
2943 by the same parentheses that ought to be used to prevent misnesting of
2944 arithmetic operations:
2945
2946 @smallexample
2947 #define foo (a,b)
2948 @exdent or
2949 #define bar(x) lose((x))
2950 @end smallexample
2951
2952 The extra pair of parentheses prevents the comma in @code{foo}'s
2953 definition from being interpreted as an argument separator.
2954
2955 @end itemize
2956
2957 @node Newlines in Arguments
2958 @subsection Newlines in Arguments
2959 @cindex newlines in macro arguments
2960
2961 The invocation of a function-like macro can extend over many logical
2962 lines.  However, in the present implementation, the entire expansion
2963 comes out on one line.  Thus line numbers emitted by the compiler or
2964 debugger refer to the line the invocation started on, which might be
2965 different to the line containing the argument causing the problem.
2966
2967 Here is an example illustrating this:
2968
2969 @smallexample
2970 #define ignore_second_arg(a,b,c) a; c
2971
2972 ignore_second_arg (foo (),
2973                    ignored (),
2974                    syntax error);
2975 @end smallexample
2976
2977 @noindent
2978 The syntax error triggered by the tokens @code{syntax error} results in
2979 an error message citing line three---the line of ignore_second_arg---
2980 even though the problematic code comes from line five.
2981
2982 We consider this a bug, and intend to fix it in the near future.
2983
2984 @node Conditionals
2985 @chapter Conditionals
2986 @cindex conditionals
2987
2988 A @dfn{conditional} is a directive that instructs the preprocessor to
2989 select whether or not to include a chunk of code in the final token
2990 stream passed to the compiler.  Preprocessor conditionals can test
2991 arithmetic expressions, or whether a name is defined as a macro, or both
2992 simultaneously using the special @code{defined} operator.
2993
2994 A conditional in the C preprocessor resembles in some ways an @code{if}
2995 statement in C, but it is important to understand the difference between
2996 them.  The condition in an @code{if} statement is tested during the
2997 execution of your program.  Its purpose is to allow your program to
2998 behave differently from run to run, depending on the data it is
2999 operating on.  The condition in a preprocessing conditional directive is
3000 tested when your program is compiled.  Its purpose is to allow different
3001 code to be included in the program depending on the situation at the
3002 time of compilation.
3003
3004 However, the distinction is becoming less clear.  Modern compilers often
3005 do test @code{if} statements when a program is compiled, if their
3006 conditions are known not to vary at run time, and eliminate code which
3007 can never be executed.  If you can count on your compiler to do this,
3008 you may find that your program is more readable if you use @code{if}
3009 statements with constant conditions (perhaps determined by macros).  Of
3010 course, you can only use this to exclude code, not type definitions or
3011 other preprocessing directives, and you can only do it if the code
3012 remains syntactically valid when it is not to be used.
3013
3014 GCC version 3 eliminates this kind of never-executed code even when
3015 not optimizing.  Older versions did it only when optimizing.
3016
3017 @menu
3018 * Conditional Uses::
3019 * Conditional Syntax::
3020 * Deleted Code::
3021 @end menu
3022
3023 @node Conditional Uses
3024 @section Conditional Uses
3025
3026 There are three general reasons to use a conditional.
3027
3028 @itemize @bullet
3029 @item
3030 A program may need to use different code depending on the machine or
3031 operating system it is to run on.  In some cases the code for one
3032 operating system may be erroneous on another operating system; for
3033 example, it might refer to data types or constants that do not exist on
3034 the other system.  When this happens, it is not enough to avoid
3035 executing the invalid code.  Its mere presence will cause the compiler
3036 to reject the program.  With a preprocessing conditional, the offending
3037 code can be effectively excised from the program when it is not valid.
3038
3039 @item
3040 You may want to be able to compile the same source file into two
3041 different programs.  One version might make frequent time-consuming
3042 consistency checks on its intermediate data, or print the values of
3043 those data for debugging, and the other not.
3044
3045 @item
3046 A conditional whose condition is always false is one way to exclude code
3047 from the program but keep it as a sort of comment for future reference.
3048 @end itemize
3049
3050 Simple programs that do not need system-specific logic or complex
3051 debugging hooks generally will not need to use preprocessing
3052 conditionals.
3053
3054 @node Conditional Syntax
3055 @section Conditional Syntax
3056
3057 @findex #if
3058 A conditional in the C preprocessor begins with a @dfn{conditional
3059 directive}: @samp{#if}, @samp{#ifdef} or @samp{#ifndef}.
3060
3061 @menu
3062 * Ifdef::
3063 * If::
3064 * Defined::
3065 * Else::
3066 * Elif::
3067 @end menu
3068
3069 @node Ifdef
3070 @subsection Ifdef
3071 @findex #ifdef
3072 @findex #endif
3073
3074 The simplest sort of conditional is
3075
3076 @smallexample
3077 @group
3078 #ifdef @var{MACRO}
3079
3080 @var{controlled text}
3081
3082 #endif /* @var{MACRO} */
3083 @end group
3084 @end smallexample
3085
3086 @cindex conditional group
3087 This block is called a @dfn{conditional group}.  @var{controlled text}
3088 will be included in the output of the preprocessor if and only if
3089 @var{MACRO} is defined.  We say that the conditional @dfn{succeeds} if
3090 @var{MACRO} is defined, @dfn{fails} if it is not.
3091
3092 The @var{controlled text} inside of a conditional can include
3093 preprocessing directives.  They are executed only if the conditional
3094 succeeds.  You can nest conditional groups inside other conditional
3095 groups, but they must be completely nested.  In other words,
3096 @samp{#endif} always matches the nearest @samp{#ifdef} (or
3097 @samp{#ifndef}, or @samp{#if}).  Also, you cannot start a conditional
3098 group in one file and end it in another.
3099
3100 Even if a conditional fails, the @var{controlled text} inside it is
3101 still run through initial transformations and tokenization.  Therefore,
3102 it must all be lexically valid C@.  Normally the only way this matters is
3103 that all comments and string literals inside a failing conditional group
3104 must still be properly ended.
3105
3106 The comment following the @samp{#endif} is not required, but it is a
3107 good practice if there is a lot of @var{controlled text}, because it
3108 helps people match the @samp{#endif} to the corresponding @samp{#ifdef}.
3109 Older programs sometimes put @var{MACRO} directly after the
3110 @samp{#endif} without enclosing it in a comment.  This is invalid code
3111 according to the C standard.  CPP accepts it with a warning.  It
3112 never affects which @samp{#ifndef} the @samp{#endif} matches.
3113
3114 @findex #ifndef
3115 Sometimes you wish to use some code if a macro is @emph{not} defined.
3116 You can do this by writing @samp{#ifndef} instead of @samp{#ifdef}.
3117 One common use of @samp{#ifndef} is to include code only the first
3118 time a header file is included.  @xref{Once-Only Headers}.
3119
3120 Macro definitions can vary between compilations for several reasons.
3121 Here are some samples.
3122
3123 @itemize @bullet
3124 @item
3125 Some macros are predefined on each kind of machine
3126 (@pxref{System-specific Predefined Macros}).  This allows you to provide
3127 code specially tuned for a particular machine.
3128
3129 @item
3130 System header files define more macros, associated with the features
3131 they implement.  You can test these macros with conditionals to avoid
3132 using a system feature on a machine where it is not implemented.
3133
3134 @item
3135 Macros can be defined or undefined with the @option{-D} and @option{-U}
3136 command line options when you compile the program.  You can arrange to
3137 compile the same source file into two different programs by choosing a
3138 macro name to specify which program you want, writing conditionals to
3139 test whether or how this macro is defined, and then controlling the
3140 state of the macro with command line options, perhaps set in the
3141 Makefile.  @xref{Invocation}.
3142
3143 @item
3144 Your program might have a special header file (often called
3145 @file{config.h}) that is adjusted when the program is compiled.  It can
3146 define or not define macros depending on the features of the system and
3147 the desired capabilities of the program.  The adjustment can be
3148 automated by a tool such as @command{autoconf}, or done by hand.
3149 @end itemize
3150
3151 @node If
3152 @subsection If
3153
3154 The @samp{#if} directive allows you to test the value of an arithmetic
3155 expression, rather than the mere existence of one macro.  Its syntax is
3156
3157 @smallexample
3158 @group
3159 #if @var{expression}
3160
3161 @var{controlled text}
3162
3163 #endif /* @var{expression} */
3164 @end group
3165 @end smallexample
3166
3167 @var{expression} is a C expression of integer type, subject to stringent
3168 restrictions.  It may contain
3169
3170 @itemize @bullet
3171 @item
3172 Integer constants.
3173
3174 @item
3175 Character constants, which are interpreted as they would be in normal
3176 code.
3177
3178 @item
3179 Arithmetic operators for addition, subtraction, multiplication,
3180 division, bitwise operations, shifts, comparisons, and logical
3181 operations (@code{&&} and @code{||}).  The latter two obey the usual
3182 short-circuiting rules of standard C@.
3183
3184 @item
3185 Macros.  All macros in the expression are expanded before actual
3186 computation of the expression's value begins.
3187
3188 @item
3189 Uses of the @code{defined} operator, which lets you check whether macros
3190 are defined in the middle of an @samp{#if}.
3191
3192 @item
3193 Identifiers that are not macros, which are all considered to be the
3194 number zero.  This allows you to write @code{@w{#if MACRO}} instead of
3195 @code{@w{#ifdef MACRO}}, if you know that MACRO, when defined, will
3196 always have a nonzero value.  Function-like macros used without their
3197 function call parentheses are also treated as zero.
3198
3199 In some contexts this shortcut is undesirable.  The @option{-Wundef}
3200 option causes GCC to warn whenever it encounters an identifier which is
3201 not a macro in an @samp{#if}.
3202 @end itemize
3203
3204 The preprocessor does not know anything about types in the language.
3205 Therefore, @code{sizeof} operators are not recognized in @samp{#if}, and
3206 neither are @code{enum} constants.  They will be taken as identifiers
3207 which are not macros, and replaced by zero.  In the case of
3208 @code{sizeof}, this is likely to cause the expression to be invalid.
3209
3210 The preprocessor calculates the value of @var{expression}.  It carries
3211 out all calculations in the widest integer type known to the compiler;
3212 on most machines supported by GCC this is 64 bits.  This is not the same
3213 rule as the compiler uses to calculate the value of a constant
3214 expression, and may give different results in some cases.  If the value
3215 comes out to be nonzero, the @samp{#if} succeeds and the @var{controlled
3216 text} is included; otherwise it is skipped.
3217
3218 @node Defined
3219 @subsection Defined
3220
3221 @cindex @code{defined}
3222 The special operator @code{defined} is used in @samp{#if} and
3223 @samp{#elif} expressions to test whether a certain name is defined as a
3224 macro.  @code{defined @var{name}} and @code{defined (@var{name})} are
3225 both expressions whose value is 1 if @var{name} is defined as a macro at
3226 the current point in the program, and 0 otherwise.  Thus,  @code{@w{#if
3227 defined MACRO}} is precisely equivalent to @code{@w{#ifdef MACRO}}.
3228
3229 @code{defined} is useful when you wish to test more than one macro for
3230 existence at once.  For example,
3231
3232 @smallexample
3233 #if defined (__vax__) || defined (__ns16000__)
3234 @end smallexample
3235
3236 @noindent
3237 would succeed if either of the names @code{__vax__} or
3238 @code{__ns16000__} is defined as a macro.
3239
3240 Conditionals written like this:
3241
3242 @smallexample
3243 #if defined BUFSIZE && BUFSIZE >= 1024
3244 @end smallexample
3245
3246 @noindent
3247 can generally be simplified to just @code{@w{#if BUFSIZE >= 1024}},
3248 since if @code{BUFSIZE} is not defined, it will be interpreted as having
3249 the value zero.
3250
3251 If the @code{defined} operator appears as a result of a macro expansion,
3252 the C standard says the behavior is undefined.  GNU cpp treats it as a
3253 genuine @code{defined} operator and evaluates it normally.  It will warn
3254 wherever your code uses this feature if you use the command-line option
3255 @option{-pedantic}, since other compilers may handle it differently.
3256
3257 @node Else
3258 @subsection Else
3259
3260 @findex #else
3261 The @samp{#else} directive can be added to a conditional to provide
3262 alternative text to be used if the condition fails.  This is what it
3263 looks like:
3264
3265 @smallexample
3266 @group
3267 #if @var{expression}
3268 @var{text-if-true}
3269 #else /* Not @var{expression} */
3270 @var{text-if-false}
3271 #endif /* Not @var{expression} */
3272 @end group
3273 @end smallexample
3274
3275 @noindent
3276 If @var{expression} is nonzero, the @var{text-if-true} is included and
3277 the @var{text-if-false} is skipped.  If @var{expression} is zero, the
3278 opposite happens.
3279
3280 You can use @samp{#else} with @samp{#ifdef} and @samp{#ifndef}, too.
3281
3282 @node Elif
3283 @subsection Elif
3284
3285 @findex #elif
3286 One common case of nested conditionals is used to check for more than two
3287 possible alternatives.  For example, you might have
3288
3289 @smallexample
3290 #if X == 1
3291 @dots{}
3292 #else /* X != 1 */
3293 #if X == 2
3294 @dots{}
3295 #else /* X != 2 */
3296 @dots{}
3297 #endif /* X != 2 */
3298 #endif /* X != 1 */
3299 @end smallexample
3300
3301 Another conditional directive, @samp{#elif}, allows this to be
3302 abbreviated as follows:
3303
3304 @smallexample
3305 #if X == 1
3306 @dots{}
3307 #elif X == 2
3308 @dots{}
3309 #else /* X != 2 and X != 1*/
3310 @dots{}
3311 #endif /* X != 2 and X != 1*/
3312 @end smallexample
3313
3314 @samp{#elif} stands for ``else if''.  Like @samp{#else}, it goes in the
3315 middle of a conditional group and subdivides it; it does not require a
3316 matching @samp{#endif} of its own.  Like @samp{#if}, the @samp{#elif}
3317 directive includes an expression to be tested.  The text following the
3318 @samp{#elif} is processed only if the original @samp{#if}-condition
3319 failed and the @samp{#elif} condition succeeds.
3320
3321 More than one @samp{#elif} can go in the same conditional group.  Then
3322 the text after each @samp{#elif} is processed only if the @samp{#elif}
3323 condition succeeds after the original @samp{#if} and all previous
3324 @samp{#elif} directives within it have failed.
3325
3326 @samp{#else} is allowed after any number of @samp{#elif} directives, but
3327 @samp{#elif} may not follow @samp{#else}.
3328
3329 @node Deleted Code
3330 @section Deleted Code
3331 @cindex commenting out code
3332
3333 If you replace or delete a part of the program but want to keep the old
3334 code around for future reference, you often cannot simply comment it
3335 out.  Block comments do not nest, so the first comment inside the old
3336 code will end the commenting-out.  The probable result is a flood of
3337 syntax errors.
3338
3339 One way to avoid this problem is to use an always-false conditional
3340 instead.  For instance, put @code{#if 0} before the deleted code and
3341 @code{#endif} after it.  This works even if the code being turned
3342 off contains conditionals, but they must be entire conditionals
3343 (balanced @samp{#if} and @samp{#endif}).
3344
3345 Some people use @code{#ifdef notdef} instead.  This is risky, because
3346 @code{notdef} might be accidentally defined as a macro, and then the
3347 conditional would succeed.  @code{#if 0} can be counted on to fail.
3348
3349 Do not use @code{#if 0} for comments which are not C code.  Use a real
3350 comment, instead.  The interior of @code{#if 0} must consist of complete
3351 tokens; in particular, single-quote characters must balance.  Comments
3352 often contain unbalanced single-quote characters (known in English as
3353 apostrophes).  These confuse @code{#if 0}.  They don't confuse
3354 @samp{/*}.
3355
3356 @node Diagnostics
3357 @chapter Diagnostics
3358 @cindex diagnostic
3359 @cindex reporting errors
3360 @cindex reporting warnings
3361
3362 @findex #error
3363 The directive @samp{#error} causes the preprocessor to report a fatal
3364 error.  The tokens forming the rest of the line following @samp{#error}
3365 are used as the error message.
3366
3367 You would use @samp{#error} inside of a conditional that detects a
3368 combination of parameters which you know the program does not properly
3369 support.  For example, if you know that the program will not run
3370 properly on a VAX, you might write
3371
3372 @smallexample
3373 @group
3374 #ifdef __vax__
3375 #error "Won't work on VAXen.  See comments at get_last_object."
3376 #endif
3377 @end group
3378 @end smallexample
3379
3380 If you have several configuration parameters that must be set up by
3381 the installation in a consistent way, you can use conditionals to detect
3382 an inconsistency and report it with @samp{#error}.  For example,
3383
3384 @smallexample
3385 #if !defined(UNALIGNED_INT_ASM_OP) && defined(DWARF2_DEBUGGING_INFO)
3386 #error "DWARF2_DEBUGGING_INFO requires UNALIGNED_INT_ASM_OP."
3387 #endif
3388 @end smallexample
3389
3390 @findex #warning
3391 The directive @samp{#warning} is like @samp{#error}, but causes the
3392 preprocessor to issue a warning and continue preprocessing.  The tokens
3393 following @samp{#warning} are used as the warning message.
3394
3395 You might use @samp{#warning} in obsolete header files, with a message
3396 directing the user to the header file which should be used instead.
3397
3398 Neither @samp{#error} nor @samp{#warning} macro-expands its argument.
3399 Internal whitespace sequences are each replaced with a single space.
3400 The line must consist of complete tokens.  It is wisest to make the
3401 argument of these directives be a single string constant; this avoids
3402 problems with apostrophes and the like.
3403
3404 @node Line Control
3405 @chapter Line Control
3406 @cindex line control
3407
3408 The C preprocessor informs the C compiler of the location in your source
3409 code where each token came from.  Presently, this is just the file name
3410 and line number.  All the tokens resulting from macro expansion are
3411 reported as having appeared on the line of the source file where the
3412 outermost macro was used.  We intend to be more accurate in the future.
3413
3414 If you write a program which generates source code, such as the
3415 @command{bison} parser generator, you may want to adjust the preprocessor's
3416 notion of the current file name and line number by hand.  Parts of the
3417 output from @command{bison} are generated from scratch, other parts come
3418 from a standard parser file.  The rest are copied verbatim from
3419 @command{bison}'s input.  You would like compiler error messages and
3420 symbolic debuggers to be able to refer to @code{bison}'s input file.
3421
3422 @findex #line
3423 @command{bison} or any such program can arrange this by writing
3424 @samp{#line} directives into the output file.  @samp{#line} is a
3425 directive that specifies the original line number and source file name
3426 for subsequent input in the current preprocessor input file.
3427 @samp{#line} has three variants:
3428
3429 @table @code
3430 @item #line @var{linenum}
3431 @var{linenum} is a non-negative decimal integer constant.  It specifies
3432 the line number which should be reported for the following line of
3433 input.  Subsequent lines are counted from @var{linenum}.
3434
3435 @item #line @var{linenum} @var{filename}
3436 @var{linenum} is the same as for the first form, and has the same
3437 effect.  In addition, @var{filename} is a string constant.  The
3438 following line and all subsequent lines are reported to come from the
3439 file it specifies, until something else happens to change that.
3440 @var{filename} is interpreted according to the normal rules for a string
3441 constant: backslash escapes are interpreted.  This is different from
3442 @samp{#include}.
3443
3444 Previous versions of CPP did not interpret escapes in @samp{#line};
3445 we have changed it because the standard requires they be interpreted,
3446 and most other compilers do.
3447
3448 @item #line @var{anything else}
3449 @var{anything else} is checked for macro calls, which are expanded.
3450 The result should match one of the above two forms.
3451 @end table
3452
3453 @samp{#line} directives alter the results of the @code{__FILE__} and
3454 @code{__LINE__} predefined macros from that point on.  @xref{Standard
3455 Predefined Macros}.  They do not have any effect on @samp{#include}'s
3456 idea of the directory containing the current file.  This is a change
3457 from GCC 2.95.  Previously, a file reading
3458
3459 @smallexample
3460 #line 1 "../src/gram.y"
3461 #include "gram.h"
3462 @end smallexample
3463
3464 would search for @file{gram.h} in @file{../src}, then the @option{-I}
3465 chain; the directory containing the physical source file would not be
3466 searched.  In GCC 3.0 and later, the @samp{#include} is not affected by
3467 the presence of a @samp{#line} referring to a different directory.
3468
3469 We made this change because the old behavior caused problems when
3470 generated source files were transported between machines.  For instance,
3471 it is common practice to ship generated parsers with a source release,
3472 so that people building the distribution do not need to have yacc or
3473 Bison installed.  These files frequently have @samp{#line} directives
3474 referring to the directory tree of the system where the distribution was
3475 created.  If GCC tries to search for headers in those directories, the
3476 build is likely to fail.
3477
3478 The new behavior can cause failures too, if the generated file is not
3479 in the same directory as its source and it attempts to include a header
3480 which would be visible searching from the directory containing the
3481 source file.  However, this problem is easily solved with an additional
3482 @option{-I} switch on the command line.  The failures caused by the old
3483 semantics could sometimes be corrected only by editing the generated
3484 files, which is difficult and error-prone.
3485
3486 @node Pragmas
3487 @chapter Pragmas
3488
3489 The @samp{#pragma} directive is the method specified by the C standard
3490 for providing additional information to the compiler, beyond what is
3491 conveyed in the language itself.  Three forms of this directive
3492 (commonly known as @dfn{pragmas}) are specified by the 1999 C standard.
3493 A C compiler is free to attach any meaning it likes to other pragmas.
3494
3495 GCC has historically preferred to use extensions to the syntax of the
3496 language, such as @code{__attribute__}, for this purpose.  However, GCC
3497 does define a few pragmas of its own.  These mostly have effects on the
3498 entire translation unit or source file.
3499
3500 In GCC version 3, all GNU-defined, supported pragmas have been given a
3501 @code{GCC} prefix.  This is in line with the @code{STDC} prefix on all
3502 pragmas defined by C99.  For backward compatibility, pragmas which were
3503 recognized by previous versions are still recognized without the
3504 @code{GCC} prefix, but that usage is deprecated.  Some older pragmas are
3505 deprecated in their entirety.  They are not recognized with the
3506 @code{GCC} prefix.  @xref{Obsolete Features}.
3507
3508 @cindex @code{_Pragma}
3509 C99 introduces the @code{@w{_Pragma}} operator.  This feature addresses a
3510 major problem with @samp{#pragma}: being a directive, it cannot be
3511 produced as the result of macro expansion.  @code{@w{_Pragma}} is an
3512 operator, much like @code{sizeof} or @code{defined}, and can be embedded
3513 in a macro.
3514
3515 Its syntax is @code{@w{_Pragma (@var{string-literal})}}, where
3516 @var{string-literal} can be either a normal or wide-character string
3517 literal.  It is destringized, by replacing all @samp{\\} with a single
3518 @samp{\} and all @samp{\"} with a @samp{"}.  The result is then
3519 processed as if it had appeared as the right hand side of a
3520 @samp{#pragma} directive.  For example,
3521
3522 @smallexample
3523 _Pragma ("GCC dependency \"parse.y\"")
3524 @end smallexample
3525
3526 @noindent
3527 has the same effect as @code{#pragma GCC dependency "parse.y"}.  The
3528 same effect could be achieved using macros, for example
3529
3530 @smallexample
3531 #define DO_PRAGMA(x) _Pragma (#x)
3532 DO_PRAGMA (GCC dependency "parse.y")
3533 @end smallexample
3534
3535 The standard is unclear on where a @code{_Pragma} operator can appear.
3536 The preprocessor does not accept it within a preprocessing conditional
3537 directive like @samp{#if}.  To be safe, you are probably best keeping it
3538 out of directives other than @samp{#define}, and putting it on a line of
3539 its own.
3540
3541 This manual documents the pragmas which are meaningful to the
3542 preprocessor itself.  Other pragmas are meaningful to the C or C++
3543 compilers.  They are documented in the GCC manual.
3544
3545 GCC plugins may provide their own pragmas.
3546
3547 @ftable @code
3548 @item #pragma GCC dependency
3549 @code{#pragma GCC dependency} allows you to check the relative dates of
3550 the current file and another file.  If the other file is more recent than
3551 the current file, a warning is issued.  This is useful if the current
3552 file is derived from the other file, and should be regenerated.  The
3553 other file is searched for using the normal include search path.
3554 Optional trailing text can be used to give more information in the
3555 warning message.
3556
3557 @smallexample
3558 #pragma GCC dependency "parse.y"
3559 #pragma GCC dependency "/usr/include/time.h" rerun fixincludes
3560 @end smallexample
3561
3562 @item #pragma GCC poison
3563 Sometimes, there is an identifier that you want to remove completely
3564 from your program, and make sure that it never creeps back in.  To
3565 enforce this, you can @dfn{poison} the identifier with this pragma.
3566 @code{#pragma GCC poison} is followed by a list of identifiers to
3567 poison.  If any of those identifiers appears anywhere in the source
3568 after the directive, it is a hard error.  For example,
3569
3570 @smallexample
3571 #pragma GCC poison printf sprintf fprintf
3572 sprintf(some_string, "hello");
3573 @end smallexample
3574
3575 @noindent
3576 will produce an error.
3577
3578 If a poisoned identifier appears as part of the expansion of a macro
3579 which was defined before the identifier was poisoned, it will @emph{not}
3580 cause an error.  This lets you poison an identifier without worrying
3581 about system headers defining macros that use it.
3582
3583 For example,
3584
3585 @smallexample
3586 #define strrchr rindex
3587 #pragma GCC poison rindex
3588 strrchr(some_string, 'h');
3589 @end smallexample
3590
3591 @noindent
3592 will not produce an error.
3593
3594 @item #pragma GCC system_header
3595 This pragma takes no arguments.  It causes the rest of the code in the
3596 current file to be treated as if it came from a system header.
3597 @xref{System Headers}.
3598
3599 @end ftable
3600
3601 @node Other Directives
3602 @chapter Other Directives
3603
3604 @findex #ident
3605 @findex #sccs
3606 The @samp{#ident} directive takes one argument, a string constant.  On
3607 some systems, that string constant is copied into a special segment of
3608 the object file.  On other systems, the directive is ignored.  The
3609 @samp{#sccs} directive is a synonym for @samp{#ident}.
3610
3611 These directives are not part of the C standard, but they are not
3612 official GNU extensions either.  What historical information we have
3613 been able to find, suggests they originated with System V@.
3614
3615 @cindex null directive
3616 The @dfn{null directive} consists of a @samp{#} followed by a newline,
3617 with only whitespace (including comments) in between.  A null directive
3618 is understood as a preprocessing directive but has no effect on the
3619 preprocessor output.  The primary significance of the existence of the
3620 null directive is that an input line consisting of just a @samp{#} will
3621 produce no output, rather than a line of output containing just a
3622 @samp{#}.  Supposedly some old C programs contain such lines.
3623
3624 @node Preprocessor Output
3625 @chapter Preprocessor Output
3626
3627 When the C preprocessor is used with the C, C++, or Objective-C
3628 compilers, it is integrated into the compiler and communicates a stream
3629 of binary tokens directly to the compiler's parser.  However, it can
3630 also be used in the more conventional standalone mode, where it produces
3631 textual output.
3632 @c FIXME: Document the library interface.
3633
3634 @cindex output format
3635 The output from the C preprocessor looks much like the input, except
3636 that all preprocessing directive lines have been replaced with blank
3637 lines and all comments with spaces.  Long runs of blank lines are
3638 discarded.
3639
3640 The ISO standard specifies that it is implementation defined whether a
3641 preprocessor preserves whitespace between tokens, or replaces it with
3642 e.g.@: a single space.  In GNU CPP, whitespace between tokens is collapsed
3643 to become a single space, with the exception that the first token on a
3644 non-directive line is preceded with sufficient spaces that it appears in
3645 the same column in the preprocessed output that it appeared in the
3646 original source file.  This is so the output is easy to read.
3647 @xref{Differences from previous versions}.  CPP does not insert any
3648 whitespace where there was none in the original source, except where
3649 necessary to prevent an accidental token paste.
3650
3651 @cindex linemarkers
3652 Source file name and line number information is conveyed by lines
3653 of the form
3654
3655 @smallexample
3656 # @var{linenum} @var{filename} @var{flags}
3657 @end smallexample
3658
3659 @noindent
3660 These are called @dfn{linemarkers}.  They are inserted as needed into
3661 the output (but never within a string or character constant).  They mean
3662 that the following line originated in file @var{filename} at line
3663 @var{linenum}.  @var{filename} will never contain any non-printing
3664 characters; they are replaced with octal escape sequences.
3665
3666 After the file name comes zero or more flags, which are @samp{1},
3667 @samp{2}, @samp{3}, or @samp{4}.  If there are multiple flags, spaces
3668 separate them.  Here is what the flags mean:
3669
3670 @table @samp
3671 @item 1
3672 This indicates the start of a new file.
3673 @item 2
3674 This indicates returning to a file (after having included another file).
3675 @item 3
3676 This indicates that the following text comes from a system header file,
3677 so certain warnings should be suppressed.
3678 @item 4
3679 This indicates that the following text should be treated as being
3680 wrapped in an implicit @code{extern "C"} block.
3681 @c maybe cross reference NO_IMPLICIT_EXTERN_C
3682 @end table
3683
3684 As an extension, the preprocessor accepts linemarkers in non-assembler
3685 input files.  They are treated like the corresponding @samp{#line}
3686 directive, (@pxref{Line Control}), except that trailing flags are
3687 permitted, and are interpreted with the meanings described above.  If
3688 multiple flags are given, they must be in ascending order.
3689
3690 Some directives may be duplicated in the output of the preprocessor.
3691 These are @samp{#ident} (always), @samp{#pragma} (only if the
3692 preprocessor does not handle the pragma itself), and @samp{#define} and
3693 @samp{#undef} (with certain debugging options).  If this happens, the
3694 @samp{#} of the directive will always be in the first column, and there
3695 will be no space between the @samp{#} and the directive name.  If macro
3696 expansion happens to generate tokens which might be mistaken for a
3697 duplicated directive, a space will be inserted between the @samp{#} and
3698 the directive name.
3699
3700 @node Traditional Mode
3701 @chapter Traditional Mode
3702
3703 Traditional (pre-standard) C preprocessing is rather different from
3704 the preprocessing specified by the standard.  When GCC is given the
3705 @option{-traditional-cpp} option, it attempts to emulate a traditional
3706 preprocessor.
3707
3708 GCC versions 3.2 and later only support traditional mode semantics in
3709 the preprocessor, and not in the compiler front ends.  This chapter
3710 outlines the traditional preprocessor semantics we implemented.
3711
3712 The implementation does not correspond precisely to the behavior of
3713 earlier versions of GCC, nor to any true traditional preprocessor.
3714 After all, inconsistencies among traditional implementations were a
3715 major motivation for C standardization.  However, we intend that it
3716 should be compatible with true traditional preprocessors in all ways
3717 that actually matter.
3718
3719 @menu
3720 * Traditional lexical analysis::
3721 * Traditional macros::
3722 * Traditional miscellany::
3723 * Traditional warnings::
3724 @end menu
3725
3726 @node Traditional lexical analysis
3727 @section Traditional lexical analysis
3728
3729 The traditional preprocessor does not decompose its input into tokens
3730 the same way a standards-conforming preprocessor does.  The input is
3731 simply treated as a stream of text with minimal internal form.
3732
3733 This implementation does not treat trigraphs (@pxref{trigraphs})
3734 specially since they were an invention of the standards committee.  It
3735 handles arbitrarily-positioned escaped newlines properly and splices
3736 the lines as you would expect; many traditional preprocessors did not
3737 do this.
3738
3739 The form of horizontal whitespace in the input file is preserved in
3740 the output.  In particular, hard tabs remain hard tabs.  This can be
3741 useful if, for example, you are preprocessing a Makefile.
3742
3743 Traditional CPP only recognizes C-style block comments, and treats the
3744 @samp{/*} sequence as introducing a comment only if it lies outside
3745 quoted text.  Quoted text is introduced by the usual single and double
3746 quotes, and also by an initial @samp{<} in a @code{#include}
3747 directive.
3748
3749 Traditionally, comments are completely removed and are not replaced
3750 with a space.  Since a traditional compiler does its own tokenization
3751 of the output of the preprocessor, this means that comments can
3752 effectively be used as token paste operators.  However, comments
3753 behave like separators for text handled by the preprocessor itself,
3754 since it doesn't re-lex its input.  For example, in
3755
3756 @smallexample
3757 #if foo/**/bar
3758 @end smallexample
3759
3760 @noindent
3761 @samp{foo} and @samp{bar} are distinct identifiers and expanded
3762 separately if they happen to be macros.  In other words, this
3763 directive is equivalent to
3764
3765 @smallexample
3766 #if foo bar
3767 @end smallexample
3768
3769 @noindent
3770 rather than
3771
3772 @smallexample
3773 #if foobar
3774 @end smallexample
3775
3776 Generally speaking, in traditional mode an opening quote need not have
3777 a matching closing quote.  In particular, a macro may be defined with
3778 replacement text that contains an unmatched quote.  Of course, if you
3779 attempt to compile preprocessed output containing an unmatched quote
3780 you will get a syntax error.
3781
3782 However, all preprocessing directives other than @code{#define}
3783 require matching quotes.  For example:
3784
3785 @smallexample
3786 #define m This macro's fine and has an unmatched quote
3787 "/* This is not a comment.  */
3788 /* @r{This is a comment.  The following #include directive
3789    is ill-formed.}  */
3790 #include <stdio.h
3791 @end smallexample
3792
3793 Just as for the ISO preprocessor, what would be a closing quote can be
3794 escaped with a backslash to prevent the quoted text from closing.
3795
3796 @node Traditional macros
3797 @section Traditional macros
3798
3799 The major difference between traditional and ISO macros is that the
3800 former expand to text rather than to a token sequence.  CPP removes
3801 all leading and trailing horizontal whitespace from a macro's
3802 replacement text before storing it, but preserves the form of internal
3803 whitespace.
3804
3805 One consequence is that it is legitimate for the replacement text to
3806 contain an unmatched quote (@pxref{Traditional lexical analysis}).  An
3807 unclosed string or character constant continues into the text
3808 following the macro call.  Similarly, the text at the end of a macro's
3809 expansion can run together with the text after the macro invocation to
3810 produce a single token.
3811
3812 Normally comments are removed from the replacement text after the
3813 macro is expanded, but if the @option{-CC} option is passed on the
3814 command line comments are preserved.  (In fact, the current
3815 implementation removes comments even before saving the macro
3816 replacement text, but it careful to do it in such a way that the
3817 observed effect is identical even in the function-like macro case.)
3818
3819 The ISO stringification operator @samp{#} and token paste operator
3820 @samp{##} have no special meaning.  As explained later, an effect
3821 similar to these operators can be obtained in a different way.  Macro
3822 names that are embedded in quotes, either from the main file or after
3823 macro replacement, do not expand.
3824
3825 CPP replaces an unquoted object-like macro name with its replacement
3826 text, and then rescans it for further macros to replace.  Unlike
3827 standard macro expansion, traditional macro expansion has no provision
3828 to prevent recursion.  If an object-like macro appears unquoted in its
3829 replacement text, it will be replaced again during the rescan pass,
3830 and so on @emph{ad infinitum}.  GCC detects when it is expanding
3831 recursive macros, emits an error message, and continues after the
3832 offending macro invocation.
3833
3834 @smallexample
3835 #define PLUS +
3836 #define INC(x) PLUS+x
3837 INC(foo);
3838      @expansion{} ++foo;
3839 @end smallexample
3840
3841 Function-like macros are similar in form but quite different in
3842 behavior to their ISO counterparts.  Their arguments are contained
3843 within parentheses, are comma-separated, and can cross physical lines.
3844 Commas within nested parentheses are not treated as argument
3845 separators.  Similarly, a quote in an argument cannot be left
3846 unclosed; a following comma or parenthesis that comes before the
3847 closing quote is treated like any other character.  There is no
3848 facility for handling variadic macros.
3849
3850 This implementation removes all comments from macro arguments, unless
3851 the @option{-C} option is given.  The form of all other horizontal
3852 whitespace in arguments is preserved, including leading and trailing
3853 whitespace.  In particular
3854
3855 @smallexample
3856 f( )
3857 @end smallexample
3858
3859 @noindent
3860 is treated as an invocation of the macro @samp{f} with a single
3861 argument consisting of a single space.  If you want to invoke a
3862 function-like macro that takes no arguments, you must not leave any
3863 whitespace between the parentheses.
3864
3865 If a macro argument crosses a new line, the new line is replaced with
3866 a space when forming the argument.  If the previous line contained an
3867 unterminated quote, the following line inherits the quoted state.
3868
3869 Traditional preprocessors replace parameters in the replacement text
3870 with their arguments regardless of whether the parameters are within
3871 quotes or not.  This provides a way to stringize arguments.  For
3872 example
3873
3874 @smallexample
3875 #define str(x) "x"
3876 str(/* @r{A comment} */some text )
3877      @expansion{} "some text "
3878 @end smallexample
3879
3880 @noindent
3881 Note that the comment is removed, but that the trailing space is
3882 preserved.  Here is an example of using a comment to effect token
3883 pasting.
3884
3885 @smallexample
3886 #define suffix(x) foo_/**/x
3887 suffix(bar)
3888      @expansion{} foo_bar
3889 @end smallexample
3890
3891 @node Traditional miscellany
3892 @section Traditional miscellany
3893
3894 Here are some things to be aware of when using the traditional
3895 preprocessor.
3896
3897 @itemize @bullet
3898 @item
3899 Preprocessing directives are recognized only when their leading
3900 @samp{#} appears in the first column.  There can be no whitespace
3901 between the beginning of the line and the @samp{#}, but whitespace can
3902 follow the @samp{#}.
3903
3904 @item
3905 A true traditional C preprocessor does not recognize @samp{#error} or
3906 @samp{#pragma}, and may not recognize @samp{#elif}.  CPP supports all
3907 the directives in traditional mode that it supports in ISO mode,
3908 including extensions, with the exception that the effects of
3909 @samp{#pragma GCC poison} are undefined.
3910
3911 @item
3912 __STDC__ is not defined.
3913
3914 @item
3915 If you use digraphs the behavior is undefined.
3916
3917 @item
3918 If a line that looks like a directive appears within macro arguments,
3919 the behavior is undefined.
3920
3921 @end itemize
3922
3923 @node Traditional warnings
3924 @section Traditional warnings
3925 You can request warnings about features that did not exist, or worked
3926 differently, in traditional C with the @option{-Wtraditional} option.
3927 GCC does not warn about features of ISO C which you must use when you
3928 are using a conforming compiler, such as the @samp{#} and @samp{##}
3929 operators.
3930
3931 Presently @option{-Wtraditional} warns about:
3932
3933 @itemize @bullet
3934 @item
3935 Macro parameters that appear within string literals in the macro body.
3936 In traditional C macro replacement takes place within string literals,
3937 but does not in ISO C@.
3938
3939 @item
3940 In traditional C, some preprocessor directives did not exist.
3941 Traditional preprocessors would only consider a line to be a directive
3942 if the @samp{#} appeared in column 1 on the line.  Therefore
3943 @option{-Wtraditional} warns about directives that traditional C
3944 understands but would ignore because the @samp{#} does not appear as the
3945 first character on the line.  It also suggests you hide directives like
3946 @samp{#pragma} not understood by traditional C by indenting them.  Some
3947 traditional implementations would not recognize @samp{#elif}, so it
3948 suggests avoiding it altogether.
3949
3950 @item
3951 A function-like macro that appears without an argument list.  In some
3952 traditional preprocessors this was an error.  In ISO C it merely means
3953 that the macro is not expanded.
3954
3955 @item
3956 The unary plus operator.  This did not exist in traditional C@.
3957
3958 @item
3959 The @samp{U} and @samp{LL} integer constant suffixes, which were not
3960 available in traditional C@.  (Traditional C does support the @samp{L}
3961 suffix for simple long integer constants.)  You are not warned about
3962 uses of these suffixes in macros defined in system headers.  For
3963 instance, @code{UINT_MAX} may well be defined as @code{4294967295U}, but
3964 you will not be warned if you use @code{UINT_MAX}.
3965
3966 You can usually avoid the warning, and the related warning about
3967 constants which are so large that they are unsigned, by writing the
3968 integer constant in question in hexadecimal, with no U suffix.  Take
3969 care, though, because this gives the wrong result in exotic cases.
3970 @end itemize
3971
3972 @node Implementation Details
3973 @chapter Implementation Details
3974
3975 Here we document details of how the preprocessor's implementation
3976 affects its user-visible behavior.  You should try to avoid undue
3977 reliance on behavior described here, as it is possible that it will
3978 change subtly in future implementations.
3979
3980 Also documented here are obsolete features and changes from previous
3981 versions of CPP@.
3982
3983 @menu
3984 * Implementation-defined behavior::
3985 * Implementation limits::
3986 * Obsolete Features::
3987 * Differences from previous versions::
3988 @end menu
3989
3990 @node Implementation-defined behavior
3991 @section Implementation-defined behavior
3992 @cindex implementation-defined behavior
3993
3994 This is how CPP behaves in all the cases which the C standard
3995 describes as @dfn{implementation-defined}.  This term means that the
3996 implementation is free to do what it likes, but must document its choice
3997 and stick to it.
3998 @c FIXME: Check the C++ standard for more implementation-defined stuff.
3999
4000 @itemize @bullet
4001 @need 1000
4002 @item The mapping of physical source file multi-byte characters to the
4003 execution character set.
4004
4005 The input character set can be specified using the
4006 @option{-finput-charset} option, while the execution character set may
4007 be controlled using the @option{-fexec-charset} and
4008 @option{-fwide-exec-charset} options.
4009
4010 @item Identifier characters.
4011 @anchor{Identifier characters}
4012
4013 The C and C++ standards allow identifiers to be composed of @samp{_}
4014 and the alphanumeric characters.  C++ and C99 also allow universal
4015 character names, and C99 further permits implementation-defined
4016 characters.  GCC currently only permits universal character names if
4017 @option{-fextended-identifiers} is used, because the implementation of
4018 universal character names in identifiers is experimental.
4019
4020 GCC allows the @samp{$} character in identifiers as an extension for
4021 most targets.  This is true regardless of the @option{std=} switch,
4022 since this extension cannot conflict with standards-conforming
4023 programs.  When preprocessing assembler, however, dollars are not
4024 identifier characters by default.
4025
4026 Currently the targets that by default do not permit @samp{$} are AVR,
4027 IP2K, MMIX, MIPS Irix 3, ARM aout, and PowerPC targets for the AIX
4028 operating system.
4029
4030 You can override the default with @option{-fdollars-in-identifiers} or
4031 @option{fno-dollars-in-identifiers}.  @xref{fdollars-in-identifiers}.
4032
4033 @item Non-empty sequences of whitespace characters.
4034
4035 In textual output, each whitespace sequence is collapsed to a single
4036 space.  For aesthetic reasons, the first token on each non-directive
4037 line of output is preceded with sufficient spaces that it appears in the
4038 same column as it did in the original source file.
4039
4040 @item The numeric value of character constants in preprocessor expressions.
4041
4042 The preprocessor and compiler interpret character constants in the
4043 same way; i.e.@: escape sequences such as @samp{\a} are given the
4044 values they would have on the target machine.
4045
4046 The compiler evaluates a multi-character character constant a character
4047 at a time, shifting the previous value left by the number of bits per
4048 target character, and then or-ing in the bit-pattern of the new
4049 character truncated to the width of a target character.  The final
4050 bit-pattern is given type @code{int}, and is therefore signed,
4051 regardless of whether single characters are signed or not (a slight
4052 change from versions 3.1 and earlier of GCC)@.  If there are more
4053 characters in the constant than would fit in the target @code{int} the
4054 compiler issues a warning, and the excess leading characters are
4055 ignored.
4056
4057 For example, @code{'ab'} for a target with an 8-bit @code{char} would be
4058 interpreted as @w{@samp{(int) ((unsigned char) 'a' * 256 + (unsigned char)
4059 'b')}}, and @code{'\234a'} as @w{@samp{(int) ((unsigned char) '\234' *
4060 256 + (unsigned char) 'a')}}.
4061
4062 @item Source file inclusion.
4063
4064 For a discussion on how the preprocessor locates header files,
4065 @ref{Include Operation}.
4066
4067 @item Interpretation of the filename resulting from a macro-expanded
4068 @samp{#include} directive.
4069
4070 @xref{Computed Includes}.
4071
4072 @item Treatment of a @samp{#pragma} directive that after macro-expansion
4073 results in a standard pragma.
4074
4075 No macro expansion occurs on any @samp{#pragma} directive line, so the
4076 question does not arise.
4077
4078 Note that GCC does not yet implement any of the standard
4079 pragmas.
4080
4081 @end itemize
4082
4083 @node Implementation limits
4084 @section Implementation limits
4085 @cindex implementation limits
4086
4087 CPP has a small number of internal limits.  This section lists the
4088 limits which the C standard requires to be no lower than some minimum,
4089 and all the others known.  It is intended that there should be as few limits
4090 as possible.  If you encounter an undocumented or inconvenient limit,
4091 please report that as a bug.  @xref{Bugs, , Reporting Bugs, gcc, Using
4092 the GNU Compiler Collection (GCC)}.
4093
4094 Where we say something is limited @dfn{only by available memory}, that
4095 means that internal data structures impose no intrinsic limit, and space
4096 is allocated with @code{malloc} or equivalent.  The actual limit will
4097 therefore depend on many things, such as the size of other things
4098 allocated by the compiler at the same time, the amount of memory
4099 consumed by other processes on the same computer, etc.
4100
4101 @itemize @bullet
4102
4103 @item Nesting levels of @samp{#include} files.
4104
4105 We impose an arbitrary limit of 200 levels, to avoid runaway recursion.
4106 The standard requires at least 15 levels.
4107
4108 @item Nesting levels of conditional inclusion.
4109
4110 The C standard mandates this be at least 63.  CPP is limited only by
4111 available memory.
4112
4113 @item Levels of parenthesized expressions within a full expression.
4114
4115 The C standard requires this to be at least 63.  In preprocessor
4116 conditional expressions, it is limited only by available memory.
4117
4118 @item Significant initial characters in an identifier or macro name.
4119
4120 The preprocessor treats all characters as significant.  The C standard
4121 requires only that the first 63 be significant.
4122
4123 @item Number of macros simultaneously defined in a single translation unit.
4124
4125 The standard requires at least 4095 be possible.  CPP is limited only
4126 by available memory.
4127
4128 @item Number of parameters in a macro definition and arguments in a macro call.
4129
4130 We allow @code{USHRT_MAX}, which is no smaller than 65,535.  The minimum
4131 required by the standard is 127.
4132
4133 @item Number of characters on a logical source line.
4134
4135 The C standard requires a minimum of 4096 be permitted.  CPP places
4136 no limits on this, but you may get incorrect column numbers reported in
4137 diagnostics for lines longer than 65,535 characters.
4138
4139 @item Maximum size of a source file.
4140
4141 The standard does not specify any lower limit on the maximum size of a
4142 source file.  GNU cpp maps files into memory, so it is limited by the
4143 available address space.  This is generally at least two gigabytes.
4144 Depending on the operating system, the size of physical memory may or
4145 may not be a limitation.
4146
4147 @end itemize
4148
4149 @node Obsolete Features
4150 @section Obsolete Features
4151
4152 CPP has some features which are present mainly for compatibility with
4153 older programs.  We discourage their use in new code.  In some cases,
4154 we plan to remove the feature in a future version of GCC@.
4155
4156 @subsection Assertions
4157 @cindex assertions
4158
4159 @dfn{Assertions} are a deprecated alternative to macros in writing
4160 conditionals to test what sort of computer or system the compiled
4161 program will run on.  Assertions are usually predefined, but you can
4162 define them with preprocessing directives or command-line options.
4163
4164 Assertions were intended to provide a more systematic way to describe
4165 the compiler's target system.  However, in practice they are just as
4166 unpredictable as the system-specific predefined macros.  In addition, they
4167 are not part of any standard, and only a few compilers support them.
4168 Therefore, the use of assertions is @strong{less} portable than the use
4169 of system-specific predefined macros.  We recommend you do not use them at
4170 all.
4171
4172 @cindex predicates
4173 An assertion looks like this:
4174
4175 @smallexample
4176 #@var{predicate} (@var{answer})
4177 @end smallexample
4178
4179 @noindent
4180 @var{predicate} must be a single identifier.  @var{answer} can be any
4181 sequence of tokens; all characters are significant except for leading
4182 and trailing whitespace, and differences in internal whitespace
4183 sequences are ignored.  (This is similar to the rules governing macro
4184 redefinition.)  Thus, @code{(x + y)} is different from @code{(x+y)} but
4185 equivalent to @code{@w{( x + y )}}.  Parentheses do not nest inside an
4186 answer.
4187
4188 @cindex testing predicates
4189 To test an assertion, you write it in an @samp{#if}.  For example, this
4190 conditional succeeds if either @code{vax} or @code{ns16000} has been
4191 asserted as an answer for @code{machine}.
4192
4193 @smallexample
4194 #if #machine (vax) || #machine (ns16000)
4195 @end smallexample
4196
4197 @noindent
4198 You can test whether @emph{any} answer is asserted for a predicate by
4199 omitting the answer in the conditional:
4200
4201 @smallexample
4202 #if #machine
4203 @end smallexample
4204
4205 @findex #assert
4206 Assertions are made with the @samp{#assert} directive.  Its sole
4207 argument is the assertion to make, without the leading @samp{#} that
4208 identifies assertions in conditionals.
4209
4210 @smallexample
4211 #assert @var{predicate} (@var{answer})
4212 @end smallexample
4213
4214 @noindent
4215 You may make several assertions with the same predicate and different
4216 answers.  Subsequent assertions do not override previous ones for the
4217 same predicate.  All the answers for any given predicate are
4218 simultaneously true.
4219
4220 @cindex assertions, canceling
4221 @findex #unassert
4222 Assertions can be canceled with the @samp{#unassert} directive.  It
4223 has the same syntax as @samp{#assert}.  In that form it cancels only the
4224 answer which was specified on the @samp{#unassert} line; other answers
4225 for that predicate remain true.  You can cancel an entire predicate by
4226 leaving out the answer:
4227
4228 @smallexample
4229 #unassert @var{predicate}
4230 @end smallexample
4231
4232 @noindent
4233 In either form, if no such assertion has been made, @samp{#unassert} has
4234 no effect.
4235
4236 You can also make or cancel assertions using command line options.
4237 @xref{Invocation}.
4238
4239 @node Differences from previous versions
4240 @section Differences from previous versions
4241 @cindex differences from previous versions
4242
4243 This section details behavior which has changed from previous versions
4244 of CPP@.  We do not plan to change it again in the near future, but
4245 we do not promise not to, either.
4246
4247 The ``previous versions'' discussed here are 2.95 and before.  The
4248 behavior of GCC 3.0 is mostly the same as the behavior of the widely
4249 used 2.96 and 2.97 development snapshots.  Where there are differences,
4250 they generally represent bugs in the snapshots.
4251
4252 @itemize @bullet
4253
4254 @item -I- deprecated
4255
4256 This option has been deprecated in 4.0.  @option{-iquote} is meant to
4257 replace the need for this option.
4258
4259 @item Order of evaluation of @samp{#} and @samp{##} operators
4260
4261 The standard does not specify the order of evaluation of a chain of
4262 @samp{##} operators, nor whether @samp{#} is evaluated before, after, or
4263 at the same time as @samp{##}.  You should therefore not write any code
4264 which depends on any specific ordering.  It is possible to guarantee an
4265 ordering, if you need one, by suitable use of nested macros.
4266
4267 An example of where this might matter is pasting the arguments @samp{1},
4268 @samp{e} and @samp{-2}.  This would be fine for left-to-right pasting,
4269 but right-to-left pasting would produce an invalid token @samp{e-2}.
4270
4271 GCC 3.0 evaluates @samp{#} and @samp{##} at the same time and strictly
4272 left to right.  Older versions evaluated all @samp{#} operators first,
4273 then all @samp{##} operators, in an unreliable order.
4274
4275 @item The form of whitespace between tokens in preprocessor output
4276
4277 @xref{Preprocessor Output}, for the current textual format.  This is
4278 also the format used by stringification.  Normally, the preprocessor
4279 communicates tokens directly to the compiler's parser, and whitespace
4280 does not come up at all.
4281
4282 Older versions of GCC preserved all whitespace provided by the user and
4283 inserted lots more whitespace of their own, because they could not
4284 accurately predict when extra spaces were needed to prevent accidental
4285 token pasting.
4286
4287 @item Optional argument when invoking rest argument macros
4288
4289 As an extension, GCC permits you to omit the variable arguments entirely
4290 when you use a variable argument macro.  This is forbidden by the 1999 C
4291 standard, and will provoke a pedantic warning with GCC 3.0.  Previous
4292 versions accepted it silently.
4293
4294 @item @samp{##} swallowing preceding text in rest argument macros
4295
4296 Formerly, in a macro expansion, if @samp{##} appeared before a variable
4297 arguments parameter, and the set of tokens specified for that argument
4298 in the macro invocation was empty, previous versions of CPP would
4299 back up and remove the preceding sequence of non-whitespace characters
4300 (@strong{not} the preceding token).  This extension is in direct
4301 conflict with the 1999 C standard and has been drastically pared back.
4302
4303 In the current version of the preprocessor, if @samp{##} appears between
4304 a comma and a variable arguments parameter, and the variable argument is
4305 omitted entirely, the comma will be removed from the expansion.  If the
4306 variable argument is empty, or the token before @samp{##} is not a
4307 comma, then @samp{##} behaves as a normal token paste.
4308
4309 @item @samp{#line} and @samp{#include}
4310
4311 The @samp{#line} directive used to change GCC's notion of the
4312 ``directory containing the current file'', used by @samp{#include} with
4313 a double-quoted header file name.  In 3.0 and later, it does not.
4314 @xref{Line Control}, for further explanation.
4315
4316 @item Syntax of @samp{#line}
4317
4318 In GCC 2.95 and previous, the string constant argument to @samp{#line}
4319 was treated the same way as the argument to @samp{#include}: backslash
4320 escapes were not honored, and the string ended at the second @samp{"}.
4321 This is not compliant with the C standard.  In GCC 3.0, an attempt was
4322 made to correct the behavior, so that the string was treated as a real
4323 string constant, but it turned out to be buggy.  In 3.1, the bugs have
4324 been fixed.  (We are not fixing the bugs in 3.0 because they affect
4325 relatively few people and the fix is quite invasive.)
4326
4327 @end itemize
4328
4329 @node Invocation
4330 @chapter Invocation
4331 @cindex invocation
4332 @cindex command line
4333
4334 Most often when you use the C preprocessor you will not have to invoke it
4335 explicitly: the C compiler will do so automatically.  However, the
4336 preprocessor is sometimes useful on its own.  All the options listed
4337 here are also acceptable to the C compiler and have the same meaning,
4338 except that the C compiler has different rules for specifying the output
4339 file.
4340
4341 @emph{Note:} Whether you use the preprocessor by way of @command{gcc}
4342 or @command{cpp}, the @dfn{compiler driver} is run first.  This
4343 program's purpose is to translate your command into invocations of the
4344 programs that do the actual work.  Their command line interfaces are
4345 similar but not identical to the documented interface, and may change
4346 without notice.
4347
4348 @ignore
4349 @c man begin SYNOPSIS
4350 cpp [@option{-D}@var{macro}[=@var{defn}]@dots{}] [@option{-U}@var{macro}]
4351     [@option{-I}@var{dir}@dots{}] [@option{-iquote}@var{dir}@dots{}]
4352     [@option{-W}@var{warn}@dots{}]
4353     [@option{-M}|@option{-MM}] [@option{-MG}] [@option{-MF} @var{filename}]
4354     [@option{-MP}] [@option{-MQ} @var{target}@dots{}]
4355     [@option{-MT} @var{target}@dots{}]
4356     [@option{-P}] [@option{-fno-working-directory}]
4357     [@option{-x} @var{language}] [@option{-std=}@var{standard}]
4358     @var{infile} @var{outfile}
4359
4360 Only the most useful options are listed here; see below for the remainder.
4361 @c man end
4362 @c man begin SEEALSO
4363 gpl(7), gfdl(7), fsf-funding(7),
4364 gcc(1), as(1), ld(1), and the Info entries for @file{cpp}, @file{gcc}, and
4365 @file{binutils}.
4366 @c man end
4367 @end ignore
4368
4369 @c man begin OPTIONS
4370 The C preprocessor expects two file names as arguments, @var{infile} and
4371 @var{outfile}.  The preprocessor reads @var{infile} together with any
4372 other files it specifies with @samp{#include}.  All the output generated
4373 by the combined input files is written in @var{outfile}.
4374
4375 Either @var{infile} or @var{outfile} may be @option{-}, which as
4376 @var{infile} means to read from standard input and as @var{outfile}
4377 means to write to standard output.  Also, if either file is omitted, it
4378 means the same as if @option{-} had been specified for that file.
4379
4380 Unless otherwise noted, or the option ends in @samp{=}, all options
4381 which take an argument may have that argument appear either immediately
4382 after the option, or with a space between option and argument:
4383 @option{-Ifoo} and @option{-I foo} have the same effect.
4384
4385 @cindex grouping options
4386 @cindex options, grouping
4387 Many options have multi-letter names; therefore multiple single-letter
4388 options may @emph{not} be grouped: @option{-dM} is very different from
4389 @w{@samp{-d -M}}.
4390
4391 @cindex options
4392 @include cppopts.texi
4393 @c man end
4394
4395 @node Environment Variables
4396 @chapter Environment Variables
4397 @cindex environment variables
4398 @c man begin ENVIRONMENT
4399
4400 This section describes the environment variables that affect how CPP
4401 operates.  You can use them to specify directories or prefixes to use
4402 when searching for include files, or to control dependency output.
4403
4404 Note that you can also specify places to search using options such as
4405 @option{-I}, and control dependency output with options like
4406 @option{-M} (@pxref{Invocation}).  These take precedence over
4407 environment variables, which in turn take precedence over the
4408 configuration of GCC@.
4409
4410 @include cppenv.texi
4411 @c man end
4412
4413 @page
4414 @include fdl.texi
4415
4416 @page
4417 @node Index of Directives
4418 @unnumbered Index of Directives
4419 @printindex fn
4420
4421 @node Option Index
4422 @unnumbered Option Index
4423 @noindent
4424 CPP's command line options and environment variables are indexed here
4425 without any initial @samp{-} or @samp{--}.
4426 @printindex op
4427
4428 @page
4429 @node Concept Index
4430 @unnumbered Concept Index
4431 @printindex cp
4432
4433 @bye