gcc/doc/md.texi

   1 @c Copyright (C) 1988, 1989, 1992, 1993, 1994, 1996, 1998, 1999, 2000, 2001,
   2 @c 2002, 2003, 2004, 2005, 2006 Free Software Foundation, Inc.
   3 @c This is part of the GCC manual.
   4 @c For copying conditions, see the file gcc.texi.
   5
   6 @ifset INTERNALS
   7 @node Machine Desc
   8 @chapter Machine Descriptions
   9 @cindex machine descriptions
  10
  11 A machine description has two parts: a file of instruction patterns
  12 (@file{.md} file) and a C header file of macro definitions.
  13
  14 The @file{.md} file for a target machine contains a pattern for each
  15 instruction that the target machine supports (or at least each instruction
  16 that is worth telling the compiler about).  It may also contain comments.
  17 A semicolon causes the rest of the line to be a comment, unless the semicolon
  18 is inside a quoted string.
  19
  20 See the next chapter for information on the C header file.
  21
  22 @menu
  23 * Overview::            How the machine description is used.
  24 * Patterns::            How to write instruction patterns.
  25 * Example::             An explained example of a @code{define_insn} pattern.
  26 * RTL Template::        The RTL template defines what insns match a pattern.
  27 * Output Template::     The output template says how to make assembler code
  28                           from such an insn.
  29 * Output Statement::    For more generality, write C code to output
  30                           the assembler code.
  31 * Predicates::          Controlling what kinds of operands can be used
  32                           for an insn.
  33 * Constraints::         Fine-tuning operand selection.
  34 * Standard Names::      Names mark patterns to use for code generation.
  35 * Pattern Ordering::    When the order of patterns makes a difference.
  36 * Dependent Patterns::  Having one pattern may make you need another.
  37 * Jump Patterns::       Special considerations for patterns for jump insns.
  38 * Looping Patterns::    How to define patterns for special looping insns.
  39 * Insn Canonicalizations::Canonicalization of Instructions
  40 * Expander Definitions::Generating a sequence of several RTL insns
  41                           for a standard operation.
  42 * Insn Splitting::      Splitting Instructions into Multiple Instructions.
  43 * Including Patterns::      Including Patterns in Machine Descriptions.
  44 * Peephole Definitions::Defining machine-specific peephole optimizations.
  45 * Insn Attributes::     Specifying the value of attributes for generated insns.
  46 * Conditional Execution::Generating @code{define_insn} patterns for
  47                            predication.
  48 * Constant Definitions::Defining symbolic constants that can be used in the
  49                         md file.
  50 * Macros::              Using macros to generate patterns from a template.
  51 @end menu
  52
  53 @node Overview
  54 @section Overview of How the Machine Description is Used
  55
  56 There are three main conversions that happen in the compiler:
  57
  58 @enumerate
  59
  60 @item
  61 The front end reads the source code and builds a parse tree.
  62
  63 @item
  64 The parse tree is used to generate an RTL insn list based on named
  65 instruction patterns.
  66
  67 @item
  68 The insn list is matched against the RTL templates to produce assembler
  69 code.
  70
  71 @end enumerate
  72
  73 For the generate pass, only the names of the insns matter, from either a
  74 named @code{define_insn} or a @code{define_expand}.  The compiler will
  75 choose the pattern with the right name and apply the operands according
  76 to the documentation later in this chapter, without regard for the RTL
  77 template or operand constraints.  Note that the names the compiler looks
  78 for are hard-coded in the compiler---it will ignore unnamed patterns and
  79 patterns with names it doesn't know about, but if you don't provide a
  80 named pattern it needs, it will abort.
  81
  82 If a @code{define_insn} is used, the template given is inserted into the
  83 insn list.  If a @code{define_expand} is used, one of three things
  84 happens, based on the condition logic.  The condition logic may manually
  85 create new insns for the insn list, say via @code{emit_insn()}, and
  86 invoke @code{DONE}.  For certain named patterns, it may invoke @code{FAIL} to tell the
  87 compiler to use an alternate way of performing that task.  If it invokes
  88 neither @code{DONE} nor @code{FAIL}, the template given in the pattern
  89 is inserted, as if the @code{define_expand} were a @code{define_insn}.
  90
  91 Once the insn list is generated, various optimization passes convert,
  92 replace, and rearrange the insns in the insn list.  This is where the
  93 @code{define_split} and @code{define_peephole} patterns get used, for
  94 example.
  95
  96 Finally, the insn list's RTL is matched up with the RTL templates in the
  97 @code{define_insn} patterns, and those patterns are used to emit the
  98 final assembly code.  For this purpose, each named @code{define_insn}
  99 acts like it's unnamed, since the names are ignored.
 100
 101 @node Patterns
 102 @section Everything about Instruction Patterns
 103 @cindex patterns
 104 @cindex instruction patterns
 105
 106 @findex define_insn
 107 Each instruction pattern contains an incomplete RTL expression, with pieces
 108 to be filled in later, operand constraints that restrict how the pieces can
 109 be filled in, and an output pattern or C code to generate the assembler
 110 output, all wrapped up in a @code{define_insn} expression.
 111
 112 A @code{define_insn} is an RTL expression containing four or five operands:
 113
 114 @enumerate
 115 @item
 116 An optional name.  The presence of a name indicate that this instruction
 117 pattern can perform a certain standard job for the RTL-generation
 118 pass of the compiler.  This pass knows certain names and will use
 119 the instruction patterns with those names, if the names are defined
 120 in the machine description.
 121
 122 The absence of a name is indicated by writing an empty string
 123 where the name should go.  Nameless instruction patterns are never
 124 used for generating RTL code, but they may permit several simpler insns
 125 to be combined later on.
 126
 127 Names that are not thus known and used in RTL-generation have no
 128 effect; they are equivalent to no name at all.
 129
 130 For the purpose of debugging the compiler, you may also specify a
 131 name beginning with the @samp{*} character.  Such a name is used only
 132 for identifying the instruction in RTL dumps; it is entirely equivalent
 133 to having a nameless pattern for all other purposes.
 134
 135 @item
 136 The @dfn{RTL template} (@pxref{RTL Template}) is a vector of incomplete
 137 RTL expressions which show what the instruction should look like.  It is
 138 incomplete because it may contain @code{match_operand},
 139 @code{match_operator}, and @code{match_dup} expressions that stand for
 140 operands of the instruction.
 141
 142 If the vector has only one element, that element is the template for the
 143 instruction pattern.  If the vector has multiple elements, then the
 144 instruction pattern is a @code{parallel} expression containing the
 145 elements described.
 146
 147 @item
 148 @cindex pattern conditions
 149 @cindex conditions, in patterns
 150 A condition.  This is a string which contains a C expression that is
 151 the final test to decide whether an insn body matches this pattern.
 152
 153 @cindex named patterns and conditions
 154 For a named pattern, the condition (if present) may not depend on
 155 the data in the insn being matched, but only the target-machine-type
 156 flags.  The compiler needs to test these conditions during
 157 initialization in order to learn exactly which named instructions are
 158 available in a particular run.
 159
 160 @findex operands
 161 For nameless patterns, the condition is applied only when matching an
 162 individual insn, and only after the insn has matched the pattern's
 163 recognition template.  The insn's operands may be found in the vector
 164 @code{operands}.  For an insn where the condition has once matched, it
 165 can't be used to control register allocation, for example by excluding
 166 certain hard registers or hard register combinations.
 167
 168 @item
 169 The @dfn{output template}: a string that says how to output matching
 170 insns as assembler code.  @samp{%} in this string specifies where
 171 to substitute the value of an operand.  @xref{Output Template}.
 172
 173 When simple substitution isn't general enough, you can specify a piece
 174 of C code to compute the output.  @xref{Output Statement}.
 175
 176 @item
 177 Optionally, a vector containing the values of attributes for insns matching
 178 this pattern.  @xref{Insn Attributes}.
 179 @end enumerate
 180
 181 @node Example
 182 @section Example of @code{define_insn}
 183 @cindex @code{define_insn} example
 184
 185 Here is an actual example of an instruction pattern, for the 68000/68020.
 186
 187 @smallexample
 188 (define_insn "tstsi"
 189   [(set (cc0)
 190         (match_operand:SI 0 "general_operand" "rm"))]
 191   ""
 192   "*
 193 @{
 194   if (TARGET_68020 || ! ADDRESS_REG_P (operands[0]))
 195     return \"tstl %0\";
 196   return \"cmpl #0,%0\";
 197 @}")
 198 @end smallexample
 199
 200 @noindent
 201 This can also be written using braced strings:
 202
 203 @smallexample
 204 (define_insn "tstsi"
 205   [(set (cc0)
 206         (match_operand:SI 0 "general_operand" "rm"))]
 207   ""
 208 @{
 209   if (TARGET_68020 || ! ADDRESS_REG_P (operands[0]))
 210     return "tstl %0";
 211   return "cmpl #0,%0";
 212 @})
 213 @end smallexample
 214
 215 This is an instruction that sets the condition codes based on the value of
 216 a general operand.  It has no condition, so any insn whose RTL description
 217 has the form shown may be handled according to this pattern.  The name
 218 @samp{tstsi} means ``test a @code{SImode} value'' and tells the RTL generation
 219 pass that, when it is necessary to test such a value, an insn to do so
 220 can be constructed using this pattern.
 221
 222 The output control string is a piece of C code which chooses which
 223 output template to return based on the kind of operand and the specific
 224 type of CPU for which code is being generated.
 225
 226 @samp{"rm"} is an operand constraint.  Its meaning is explained below.
 227
 228 @node RTL Template
 229 @section RTL Template
 230 @cindex RTL insn template
 231 @cindex generating insns
 232 @cindex insns, generating
 233 @cindex recognizing insns
 234 @cindex insns, recognizing
 235
 236 The RTL template is used to define which insns match the particular pattern
 237 and how to find their operands.  For named patterns, the RTL template also
 238 says how to construct an insn from specified operands.
 239
 240 Construction involves substituting specified operands into a copy of the
 241 template.  Matching involves determining the values that serve as the
 242 operands in the insn being matched.  Both of these activities are
 243 controlled by special expression types that direct matching and
 244 substitution of the operands.
 245
 246 @table @code
 247 @findex match_operand
 248 @item (match_operand:@var{m} @var{n} @var{predicate} @var{constraint})
 249 This expression is a placeholder for operand number @var{n} of
 250 the insn.  When constructing an insn, operand number @var{n}
 251 will be substituted at this point.  When matching an insn, whatever
 252 appears at this position in the insn will be taken as operand
 253 number @var{n}; but it must satisfy @var{predicate} or this instruction
 254 pattern will not match at all.
 255
 256 Operand numbers must be chosen consecutively counting from zero in
 257 each instruction pattern.  There may be only one @code{match_operand}
 258 expression in the pattern for each operand number.  Usually operands
 259 are numbered in the order of appearance in @code{match_operand}
 260 expressions.  In the case of a @code{define_expand}, any operand numbers
 261 used only in @code{match_dup} expressions have higher values than all
 262 other operand numbers.
 263
 264 @var{predicate} is a string that is the name of a function that
 265 accepts two arguments, an expression and a machine mode.
 266 @xref{Predicates}.  During matching, the function will be called with
 267 the putative operand as the expression and @var{m} as the mode
 268 argument (if @var{m} is not specified, @code{VOIDmode} will be used,
 269 which normally causes @var{predicate} to accept any mode).  If it
 270 returns zero, this instruction pattern fails to match.
 271 @var{predicate} may be an empty string; then it means no test is to be
 272 done on the operand, so anything which occurs in this position is
 273 valid.
 274
 275 Most of the time, @var{predicate} will reject modes other than @var{m}---but
 276 not always.  For example, the predicate @code{address_operand} uses
 277 @var{m} as the mode of memory ref that the address should be valid for.
 278 Many predicates accept @code{const_int} nodes even though their mode is
 279 @code{VOIDmode}.
 280
 281 @var{constraint} controls reloading and the choice of the best register
 282 class to use for a value, as explained later (@pxref{Constraints}).
 283 If the constraint would be an empty string, it can be omitted.
 284
 285 People are often unclear on the difference between the constraint and the
 286 predicate.  The predicate helps decide whether a given insn matches the
 287 pattern.  The constraint plays no role in this decision; instead, it
 288 controls various decisions in the case of an insn which does match.
 289
 290 @findex match_scratch
 291 @item (match_scratch:@var{m} @var{n} @var{constraint})
 292 This expression is also a placeholder for operand number @var{n}
 293 and indicates that operand must be a @code{scratch} or @code{reg}
 294 expression.
 295
 296 When matching patterns, this is equivalent to
 297
 298 @smallexample
 299 (match_operand:@var{m} @var{n} "scratch_operand" @var{pred})
 300 @end smallexample
 301
 302 but, when generating RTL, it produces a (@code{scratch}:@var{m})
 303 expression.
 304
 305 If the last few expressions in a @code{parallel} are @code{clobber}
 306 expressions whose operands are either a hard register or
 307 @code{match_scratch}, the combiner can add or delete them when
 308 necessary.  @xref{Side Effects}.
 309
 310 @findex match_dup
 311 @item (match_dup @var{n})
 312 This expression is also a placeholder for operand number @var{n}.
 313 It is used when the operand needs to appear more than once in the
 314 insn.
 315
 316 In construction, @code{match_dup} acts just like @code{match_operand}:
 317 the operand is substituted into the insn being constructed.  But in
 318 matching, @code{match_dup} behaves differently.  It assumes that operand
 319 number @var{n} has already been determined by a @code{match_operand}
 320 appearing earlier in the recognition template, and it matches only an
 321 identical-looking expression.
 322
 323 Note that @code{match_dup} should not be used to tell the compiler that
 324 a particular register is being used for two operands (example:
 325 @code{add} that adds one register to another; the second register is
 326 both an input operand and the output operand).  Use a matching
 327 constraint (@pxref{Simple Constraints}) for those.  @code{match_dup} is for the cases where one
 328 operand is used in two places in the template, such as an instruction
 329 that computes both a quotient and a remainder, where the opcode takes
 330 two input operands but the RTL template has to refer to each of those
 331 twice; once for the quotient pattern and once for the remainder pattern.
 332
 333 @findex match_operator
 334 @item (match_operator:@var{m} @var{n} @var{predicate} [@var{operands}@dots{}])
 335 This pattern is a kind of placeholder for a variable RTL expression
 336 code.
 337
 338 When constructing an insn, it stands for an RTL expression whose
 339 expression code is taken from that of operand @var{n}, and whose
 340 operands are constructed from the patterns @var{operands}.
 341
 342 When matching an expression, it matches an expression if the function
 343 @var{predicate} returns nonzero on that expression @emph{and} the
 344 patterns @var{operands} match the operands of the expression.
 345
 346 Suppose that the function @code{commutative_operator} is defined as
 347 follows, to match any expression whose operator is one of the
 348 commutative arithmetic operators of RTL and whose mode is @var{mode}:
 349
 350 @smallexample
 351 int
 352 commutative_integer_operator (x, mode)
 353      rtx x;
 354      enum machine_mode mode;
 355 @{
 356   enum rtx_code code = GET_CODE (x);
 357   if (GET_MODE (x) != mode)
 358     return 0;
 359   return (GET_RTX_CLASS (code) == RTX_COMM_ARITH
 360           || code == EQ || code == NE);
 361 @}
 362 @end smallexample
 363
 364 Then the following pattern will match any RTL expression consisting
 365 of a commutative operator applied to two general operands:
 366
 367 @smallexample
 368 (match_operator:SI 3 "commutative_operator"
 369   [(match_operand:SI 1 "general_operand" "g")
 370    (match_operand:SI 2 "general_operand" "g")])
 371 @end smallexample
 372
 373 Here the vector @code{[@var{operands}@dots{}]} contains two patterns
 374 because the expressions to be matched all contain two operands.
 375
 376 When this pattern does match, the two operands of the commutative
 377 operator are recorded as operands 1 and 2 of the insn.  (This is done
 378 by the two instances of @code{match_operand}.)  Operand 3 of the insn
 379 will be the entire commutative expression: use @code{GET_CODE
 380 (operands[3])} to see which commutative operator was used.
 381
 382 The machine mode @var{m} of @code{match_operator} works like that of
 383 @code{match_operand}: it is passed as the second argument to the
 384 predicate function, and that function is solely responsible for
 385 deciding whether the expression to be matched ``has'' that mode.
 386
 387 When constructing an insn, argument 3 of the gen-function will specify
 388 the operation (i.e.@: the expression code) for the expression to be
 389 made.  It should be an RTL expression, whose expression code is copied
 390 into a new expression whose operands are arguments 1 and 2 of the
 391 gen-function.  The subexpressions of argument 3 are not used;
 392 only its expression code matters.
 393
 394 When @code{match_operator} is used in a pattern for matching an insn,
 395 it usually best if the operand number of the @code{match_operator}
 396 is higher than that of the actual operands of the insn.  This improves
 397 register allocation because the register allocator often looks at
 398 operands 1 and 2 of insns to see if it can do register tying.
 399
 400 There is no way to specify constraints in @code{match_operator}.  The
 401 operand of the insn which corresponds to the @code{match_operator}
 402 never has any constraints because it is never reloaded as a whole.
 403 However, if parts of its @var{operands} are matched by
 404 @code{match_operand} patterns, those parts may have constraints of
 405 their own.
 406
 407 @findex match_op_dup
 408 @item (match_op_dup:@var{m} @var{n}[@var{operands}@dots{}])
 409 Like @code{match_dup}, except that it applies to operators instead of
 410 operands.  When constructing an insn, operand number @var{n} will be
 411 substituted at this point.  But in matching, @code{match_op_dup} behaves
 412 differently.  It assumes that operand number @var{n} has already been
 413 determined by a @code{match_operator} appearing earlier in the
 414 recognition template, and it matches only an identical-looking
 415 expression.
 416
 417 @findex match_parallel
 418 @item (match_parallel @var{n} @var{predicate} [@var{subpat}@dots{}])
 419 This pattern is a placeholder for an insn that consists of a
 420 @code{parallel} expression with a variable number of elements.  This
 421 expression should only appear at the top level of an insn pattern.
 422
 423 When constructing an insn, operand number @var{n} will be substituted at
 424 this point.  When matching an insn, it matches if the body of the insn
 425 is a @code{parallel} expression with at least as many elements as the
 426 vector of @var{subpat} expressions in the @code{match_parallel}, if each
 427 @var{subpat} matches the corresponding element of the @code{parallel},
 428 @emph{and} the function @var{predicate} returns nonzero on the
 429 @code{parallel} that is the body of the insn.  It is the responsibility
 430 of the predicate to validate elements of the @code{parallel} beyond
 431 those listed in the @code{match_parallel}.
 432
 433 A typical use of @code{match_parallel} is to match load and store
 434 multiple expressions, which can contain a variable number of elements
 435 in a @code{parallel}.  For example,
 436
 437 @smallexample
 438 (define_insn ""
 439   [(match_parallel 0 "load_multiple_operation"
 440      [(set (match_operand:SI 1 "gpc_reg_operand" "=r")
 441            (match_operand:SI 2 "memory_operand" "m"))
 442       (use (reg:SI 179))
 443       (clobber (reg:SI 179))])]
 444   ""
 445   "loadm 0,0,%1,%2")
 446 @end smallexample
 447
 448 This example comes from @file{a29k.md}.  The function
 449 @code{load_multiple_operation} is defined in @file{a29k.c} and checks
 450 that subsequent elements in the @code{parallel} are the same as the
 451 @code{set} in the pattern, except that they are referencing subsequent
 452 registers and memory locations.
 453
 454 An insn that matches this pattern might look like:
 455
 456 @smallexample
 457 (parallel
 458  [(set (reg:SI 20) (mem:SI (reg:SI 100)))
 459   (use (reg:SI 179))
 460   (clobber (reg:SI 179))
 461   (set (reg:SI 21)
 462        (mem:SI (plus:SI (reg:SI 100)
 463                         (const_int 4))))
 464   (set (reg:SI 22)
 465        (mem:SI (plus:SI (reg:SI 100)
 466                         (const_int 8))))])
 467 @end smallexample
 468
 469 @findex match_par_dup
 470 @item (match_par_dup @var{n} [@var{subpat}@dots{}])
 471 Like @code{match_op_dup}, but for @code{match_parallel} instead of
 472 @code{match_operator}.
 473
 474 @end table
 475
 476 @node Output Template
 477 @section Output Templates and Operand Substitution
 478 @cindex output templates
 479 @cindex operand substitution
 480
 481 @cindex @samp{%} in template
 482 @cindex percent sign
 483 The @dfn{output template} is a string which specifies how to output the
 484 assembler code for an instruction pattern.  Most of the template is a
 485 fixed string which is output literally.  The character @samp{%} is used
 486 to specify where to substitute an operand; it can also be used to
 487 identify places where different variants of the assembler require
 488 different syntax.
 489
 490 In the simplest case, a @samp{%} followed by a digit @var{n} says to output
 491 operand @var{n} at that point in the string.
 492
 493 @samp{%} followed by a letter and a digit says to output an operand in an
 494 alternate fashion.  Four letters have standard, built-in meanings described
 495 below.  The machine description macro @code{PRINT_OPERAND} can define
 496 additional letters with nonstandard meanings.
 497
 498 @samp{%c@var{digit}} can be used to substitute an operand that is a
 499 constant value without the syntax that normally indicates an immediate
 500 operand.
 501
 502 @samp{%n@var{digit}} is like @samp{%c@var{digit}} except that the value of
 503 the constant is negated before printing.
 504
 505 @samp{%a@var{digit}} can be used to substitute an operand as if it were a
 506 memory reference, with the actual operand treated as the address.  This may
 507 be useful when outputting a ``load address'' instruction, because often the
 508 assembler syntax for such an instruction requires you to write the operand
 509 as if it were a memory reference.
 510
 511 @samp{%l@var{digit}} is used to substitute a @code{label_ref} into a jump
 512 instruction.
 513
 514 @samp{%=} outputs a number which is unique to each instruction in the
 515 entire compilation.  This is useful for making local labels to be
 516 referred to more than once in a single template that generates multiple
 517 assembler instructions.
 518
 519 @samp{%} followed by a punctuation character specifies a substitution that
 520 does not use an operand.  Only one case is standard: @samp{%%} outputs a
 521 @samp{%} into the assembler code.  Other nonstandard cases can be
 522 defined in the @code{PRINT_OPERAND} macro.  You must also define
 523 which punctuation characters are valid with the
 524 @code{PRINT_OPERAND_PUNCT_VALID_P} macro.
 525
 526 @cindex \
 527 @cindex backslash
 528 The template may generate multiple assembler instructions.  Write the text
 529 for the instructions, with @samp{\;} between them.
 530
 531 @cindex matching operands
 532 When the RTL contains two operands which are required by constraint to match
 533 each other, the output template must refer only to the lower-numbered operand.
 534 Matching operands are not always identical, and the rest of the compiler
 535 arranges to put the proper RTL expression for printing into the lower-numbered
 536 operand.
 537
 538 One use of nonstandard letters or punctuation following @samp{%} is to
 539 distinguish between different assembler languages for the same machine; for
 540 example, Motorola syntax versus MIT syntax for the 68000.  Motorola syntax
 541 requires periods in most opcode names, while MIT syntax does not.  For
 542 example, the opcode @samp{movel} in MIT syntax is @samp{move.l} in Motorola
 543 syntax.  The same file of patterns is used for both kinds of output syntax,
 544 but the character sequence @samp{%.} is used in each place where Motorola
 545 syntax wants a period.  The @code{PRINT_OPERAND} macro for Motorola syntax
 546 defines the sequence to output a period; the macro for MIT syntax defines
 547 it to do nothing.
 548
 549 @cindex @code{#} in template
 550 As a special case, a template consisting of the single character @code{#}
 551 instructs the compiler to first split the insn, and then output the
 552 resulting instructions separately.  This helps eliminate redundancy in the
 553 output templates.   If you have a @code{define_insn} that needs to emit
 554 multiple assembler instructions, and there is an matching @code{define_split}
 555 already defined, then you can simply use @code{#} as the output template
 556 instead of writing an output template that emits the multiple assembler
 557 instructions.
 558
 559 If the macro @code{ASSEMBLER_DIALECT} is defined, you can use construct
 560 of the form @samp{@{option0|option1|option2@}} in the templates.  These
 561 describe multiple variants of assembler language syntax.
 562 @xref{Instruction Output}.
 563
 564 @node Output Statement
 565 @section C Statements for Assembler Output
 566 @cindex output statements
 567 @cindex C statements for assembler output
 568 @cindex generating assembler output
 569
 570 Often a single fixed template string cannot produce correct and efficient
 571 assembler code for all the cases that are recognized by a single
 572 instruction pattern.  For example, the opcodes may depend on the kinds of
 573 operands; or some unfortunate combinations of operands may require extra
 574 machine instructions.
 575
 576 If the output control string starts with a @samp{@@}, then it is actually
 577 a series of templates, each on a separate line.  (Blank lines and
 578 leading spaces and tabs are ignored.)  The templates correspond to the
 579 pattern's constraint alternatives (@pxref{Multi-Alternative}).  For example,
 580 if a target machine has a two-address add instruction @samp{addr} to add
 581 into a register and another @samp{addm} to add a register to memory, you
 582 might write this pattern:
 583
 584 @smallexample
 585 (define_insn "addsi3"
 586   [(set (match_operand:SI 0 "general_operand" "=r,m")
 587         (plus:SI (match_operand:SI 1 "general_operand" "0,0")
 588                  (match_operand:SI 2 "general_operand" "g,r")))]
 589   ""
 590   "@@
 591    addr %2,%0
 592    addm %2,%0")
 593 @end smallexample
 594
 595 @cindex @code{*} in template
 596 @cindex asterisk in template
 597 If the output control string starts with a @samp{*}, then it is not an
 598 output template but rather a piece of C program that should compute a
 599 template.  It should execute a @code{return} statement to return the
 600 template-string you want.  Most such templates use C string literals, which
 601 require doublequote characters to delimit them.  To include these
 602 doublequote characters in the string, prefix each one with @samp{\}.
 603
 604 If the output control string is written as a brace block instead of a
 605 double-quoted string, it is automatically assumed to be C code.  In that
 606 case, it is not necessary to put in a leading asterisk, or to escape the
 607 doublequotes surrounding C string literals.
 608
 609 The operands may be found in the array @code{operands}, whose C data type
 610 is @code{rtx []}.
 611
 612 It is very common to select different ways of generating assembler code
 613 based on whether an immediate operand is within a certain range.  Be
 614 careful when doing this, because the result of @code{INTVAL} is an
 615 integer on the host machine.  If the host machine has more bits in an
 616 @code{int} than the target machine has in the mode in which the constant
 617 will be used, then some of the bits you get from @code{INTVAL} will be
 618 superfluous.  For proper results, you must carefully disregard the
 619 values of those bits.
 620
 621 @findex output_asm_insn
 622 It is possible to output an assembler instruction and then go on to output
 623 or compute more of them, using the subroutine @code{output_asm_insn}.  This
 624 receives two arguments: a template-string and a vector of operands.  The
 625 vector may be @code{operands}, or it may be another array of @code{rtx}
 626 that you declare locally and initialize yourself.
 627
 628 @findex which_alternative
 629 When an insn pattern has multiple alternatives in its constraints, often
 630 the appearance of the assembler code is determined mostly by which alternative
 631 was matched.  When this is so, the C code can test the variable
 632 @code{which_alternative}, which is the ordinal number of the alternative
 633 that was actually satisfied (0 for the first, 1 for the second alternative,
 634 etc.).
 635
 636 For example, suppose there are two opcodes for storing zero, @samp{clrreg}
 637 for registers and @samp{clrmem} for memory locations.  Here is how
 638 a pattern could use @code{which_alternative} to choose between them:
 639
 640 @smallexample
 641 (define_insn ""
 642   [(set (match_operand:SI 0 "general_operand" "=r,m")
 643         (const_int 0))]
 644   ""
 645   @{
 646   return (which_alternative == 0
 647           ? "clrreg %0" : "clrmem %0");
 648   @})
 649 @end smallexample
 650
 651 The example above, where the assembler code to generate was
 652 @emph{solely} determined by the alternative, could also have been specified
 653 as follows, having the output control string start with a @samp{@@}:
 654
 655 @smallexample
 656 @group
 657 (define_insn ""
 658   [(set (match_operand:SI 0 "general_operand" "=r,m")
 659         (const_int 0))]
 660   ""
 661   "@@
 662    clrreg %0
 663    clrmem %0")
 664 @end group
 665 @end smallexample
 666
 667 @node Predicates
 668 @section Predicates
 669 @cindex predicates
 670 @cindex operand predicates
 671 @cindex operator predicates
 672
 673 A predicate determines whether a @code{match_operand} or
 674 @code{match_operator} expression matches, and therefore whether the
 675 surrounding instruction pattern will be used for that combination of
 676 operands.  GCC has a number of machine-independent predicates, and you
 677 can define machine-specific predicates as needed.  By convention,
 678 predicates used with @code{match_operand} have names that end in
 679 @samp{_operand}, and those used with @code{match_operator} have names
 680 that end in @samp{_operator}.
 681
 682 All predicates are Boolean functions (in the mathematical sense) of
 683 two arguments: the RTL expression that is being considered at that
 684 position in the instruction pattern, and the machine mode that the
 685 @code{match_operand} or @code{match_operator} specifies.  In this
 686 section, the first argument is called @var{op} and the second argument
 687 @var{mode}.  Predicates can be called from C as ordinary two-argument
 688 functions; this can be useful in output templates or other
 689 machine-specific code.
 690
 691 Operand predicates can allow operands that are not actually acceptable
 692 to the hardware, as long as the constraints give reload the ability to
 693 fix them up (@pxref{Constraints}).  However, GCC will usually generate
 694 better code if the predicates specify the requirements of the machine
 695 instructions as closely as possible.  Reload cannot fix up operands
 696 that must be constants (``immediate operands''); you must use a
 697 predicate that allows only constants, or else enforce the requirement
 698 in the extra condition.
 699
 700 @cindex predicates and machine modes
 701 @cindex normal predicates
 702 @cindex special predicates
 703 Most predicates handle their @var{mode} argument in a uniform manner.
 704 If @var{mode} is @code{VOIDmode} (unspecified), then @var{op} can have
 705 any mode.  If @var{mode} is anything else, then @var{op} must have the
 706 same mode, unless @var{op} is a @code{CONST_INT} or integer
 707 @code{CONST_DOUBLE}.  These RTL expressions always have
 708 @code{VOIDmode}, so it would be counterproductive to check that their
 709 mode matches.  Instead, predicates that accept @code{CONST_INT} and/or
 710 integer @code{CONST_DOUBLE} check that the value stored in the
 711 constant will fit in the requested mode.
 712
 713 Predicates with this behavior are called @dfn{normal}.
 714 @command{genrecog} can optimize the instruction recognizer based on
 715 knowledge of how normal predicates treat modes.  It can also diagnose
 716 certain kinds of common errors in the use of normal predicates; for
 717 instance, it is almost always an error to use a normal predicate
 718 without specifying a mode.
 719
 720 Predicates that do something different with their @var{mode} argument
 721 are called @dfn{special}.  The generic predicates
 722 @code{address_operand} and @code{pmode_register_operand} are special
 723 predicates.  @command{genrecog} does not do any optimizations or
 724 diagnosis when special predicates are used.
 725
 726 @menu
 727 * Machine-Independent Predicates::  Predicates available to all back ends.
 728 * Defining Predicates::             How to write machine-specific predicate
 729                                     functions.
 730 @end menu
 731
 732 @node Machine-Independent Predicates
 733 @subsection Machine-Independent Predicates
 734 @cindex machine-independent predicates
 735 @cindex generic predicates
 736
 737 These are the generic predicates available to all back ends.  They are
 738 defined in @file{recog.c}.  The first category of predicates allow
 739 only constant, or @dfn{immediate}, operands.
 740
 741 @defun immediate_operand
 742 This predicate allows any sort of constant that fits in @var{mode}.
 743 It is an appropriate choice for instructions that take operands that
 744 must be constant.
 745 @end defun
 746
 747 @defun const_int_operand
 748 This predicate allows any @code{CONST_INT} expression that fits in
 749 @var{mode}.  It is an appropriate choice for an immediate operand that
 750 does not allow a symbol or label.
 751 @end defun
 752
 753 @defun const_double_operand
 754 This predicate accepts any @code{CONST_DOUBLE} expression that has
 755 exactly @var{mode}.  If @var{mode} is @code{VOIDmode}, it will also
 756 accept @code{CONST_INT}.  It is intended for immediate floating point
 757 constants.
 758 @end defun
 759
 760 @noindent
 761 The second category of predicates allow only some kind of machine
 762 register.
 763
 764 @defun register_operand
 765 This predicate allows any @code{REG} or @code{SUBREG} expression that
 766 is valid for @var{mode}.  It is often suitable for arithmetic
 767 instruction operands on a RISC machine.
 768 @end defun
 769
 770 @defun pmode_register_operand
 771 This is a slight variant on @code{register_operand} which works around
 772 a limitation in the machine-description reader.
 773
 774 @smallexample
 775 (match_operand @var{n} "pmode_register_operand" @var{constraint})
 776 @end smallexample
 777
 778 @noindent
 779 means exactly what
 780
 781 @smallexample
 782 (match_operand:P @var{n} "register_operand" @var{constraint})
 783 @end smallexample
 784
 785 @noindent
 786 would mean, if the machine-description reader accepted @samp{:P}
 787 mode suffixes.  Unfortunately, it cannot, because @code{Pmode} is an
 788 alias for some other mode, and might vary with machine-specific
 789 options.  @xref{Misc}.
 790 @end defun
 791
 792 @defun scratch_operand
 793 This predicate allows hard registers and @code{SCRATCH} expressions,
 794 but not pseudo-registers.  It is used internally by @code{match_scratch};
 795 it should not be used directly.
 796 @end defun
 797
 798 @noindent
 799 The third category of predicates allow only some kind of memory reference.
 800
 801 @defun memory_operand
 802 This predicate allows any valid reference to a quantity of mode
 803 @var{mode} in memory, as determined by the weak form of
 804 @code{GO_IF_LEGITIMATE_ADDRESS} (@pxref{Addressing Modes}).
 805 @end defun
 806
 807 @defun address_operand
 808 This predicate is a little unusual; it allows any operand that is a
 809 valid expression for the @emph{address} of a quantity of mode
 810 @var{mode}, again determined by the weak form of
 811 @code{GO_IF_LEGITIMATE_ADDRESS}.  To first order, if
 812 @samp{@w{(mem:@var{mode} (@var{exp}))}} is acceptable to
 813 @code{memory_operand}, then @var{exp} is acceptable to
 814 @code{address_operand}.  Note that @var{exp} does not necessarily have
 815 the mode @var{mode}.
 816 @end defun
 817
 818 @defun indirect_operand
 819 This is a stricter form of @code{memory_operand} which allows only
 820 memory references with a @code{general_operand} as the address
 821 expression.  New uses of this predicate are discouraged, because
 822 @code{general_operand} is very permissive, so it's hard to tell what
 823 an @code{indirect_operand} does or does not allow.  If a target has
 824 different requirements for memory operands for different instructions,
 825 it is better to define target-specific predicates which enforce the
 826 hardware's requirements explicitly.
 827 @end defun
 828
 829 @defun push_operand
 830 This predicate allows a memory reference suitable for pushing a value
 831 onto the stack.  This will be a @code{MEM} which refers to
 832 @code{stack_pointer_rtx}, with a side-effect in its address expression
 833 (@pxref{Incdec}); which one is determined by the
 834 @code{STACK_PUSH_CODE} macro (@pxref{Frame Layout}).
 835 @end defun
 836
 837 @defun pop_operand
 838 This predicate allows a memory reference suitable for popping a value
 839 off the stack.  Again, this will be a @code{MEM} referring to
 840 @code{stack_pointer_rtx}, with a side-effect in its address
 841 expression.  However, this time @code{STACK_POP_CODE} is expected.
 842 @end defun
 843
 844 @noindent
 845 The fourth category of predicates allow some combination of the above
 846 operands.
 847
 848 @defun nonmemory_operand
 849 This predicate allows any immediate or register operand valid for @var{mode}.
 850 @end defun
 851
 852 @defun nonimmediate_operand
 853 This predicate allows any register or memory operand valid for @var{mode}.
 854 @end defun
 855
 856 @defun general_operand
 857 This predicate allows any immediate, register, or memory operand
 858 valid for @var{mode}.
 859 @end defun
 860
 861 @noindent
 862 Finally, there is one generic operator predicate.
 863
 864 @defun comparison_operator
 865 This predicate matches any expression which performs an arithmetic
 866 comparison in @var{mode}; that is, @code{COMPARISON_P} is true for the
 867 expression code.
 868 @end defun
 869
 870 @node Defining Predicates
 871 @subsection Defining Machine-Specific Predicates
 872 @cindex defining predicates
 873 @findex define_predicate
 874 @findex define_special_predicate
 875
 876 Many machines have requirements for their operands that cannot be
 877 expressed precisely using the generic predicates.  You can define
 878 additional predicates using @code{define_predicate} and
 879 @code{define_special_predicate} expressions.  These expressions have
 880 three operands:
 881
 882 @itemize @bullet
 883 @item
 884 The name of the predicate, as it will be referred to in
 885 @code{match_operand} or @code{match_operator} expressions.
 886
 887 @item
 888 An RTL expression which evaluates to true if the predicate allows the
 889 operand @var{op}, false if it does not.  This expression can only use
 890 the following RTL codes:
 891
 892 @table @code
 893 @item MATCH_OPERAND
 894 When written inside a predicate expression, a @code{MATCH_OPERAND}
 895 expression evaluates to true if the predicate it names would allow
 896 @var{op}.  The operand number and constraint are ignored.  Due to
 897 limitations in @command{genrecog}, you can only refer to generic
 898 predicates and predicates that have already been defined.
 899
 900 @item MATCH_CODE
 901 This expression evaluates to true if @var{op} or a specified
 902 subexpression of @var{op} has one of a given list of RTX codes.
 903
 904 The first operand of this expression is a string constant containing a
 905 comma-separated list of RTX code names (in lower case).  These are the
 906 codes for which the @code{MATCH_CODE} will be true.
 907
 908 The second operand is a string constant which indicates what
 909 subexpression of @var{op} to examine.  If it is absent or the empty
 910 string, @var{op} itself is examined.  Otherwise, the string constant
 911 must be a sequence of digits and/or lowercase letters.  Each character
 912 indicates a subexpression to extract from the current expression; for
 913 the first character this is @var{op}, for the second and subsequent
 914 characters it is the result of the previous character.  A digit
 915 @var{n} extracts @samp{@w{XEXP (@var{e}, @var{n})}}; a letter @var{l}
 916 extracts @samp{@w{XVECEXP (@var{e}, 0, @var{n})}} where @var{n} is the
 917 alphabetic ordinal of @var{l} (0 for `a', 1 for 'b', and so on).  The
 918 @code{MATCH_CODE} then examines the RTX code of the subexpression
 919 extracted by the complete string.  It is not possible to extract
 920 components of an @code{rtvec} that is not at position 0 within its RTX
 921 object.
 922
 923 @item MATCH_TEST
 924 This expression has one operand, a string constant containing a C
 925 expression.  The predicate's arguments, @var{op} and @var{mode}, are
 926 available with those names in the C expression.  The @code{MATCH_TEST}
 927 evaluates to true if the C expression evaluates to a nonzero value.
 928 @code{MATCH_TEST} expressions must not have side effects.
 929
 930 @item  AND
 931 @itemx IOR
 932 @itemx NOT
 933 @itemx IF_THEN_ELSE
 934 The basic @samp{MATCH_} expressions can be combined using these
 935 logical operators, which have the semantics of the C operators
 936 @samp{&&}, @samp{||}, @samp{!}, and @samp{@w{? :}} respectively.  As
 937 in Common Lisp, you may give an @code{AND} or @code{IOR} expression an
 938 arbitrary number of arguments; this has exactly the same effect as
 939 writing a chain of two-argument @code{AND} or @code{IOR} expressions.
 940 @end table
 941
 942 @item
 943 An optional block of C code, which should execute
 944 @samp{@w{return true}} if the predicate is found to match and
 945 @samp{@w{return false}} if it does not.  It must not have any side
 946 effects.  The predicate arguments, @var{op} and @var{mode}, are
 947 available with those names.
 948
 949 If a code block is present in a predicate definition, then the RTL
 950 expression must evaluate to true @emph{and} the code block must
 951 execute @samp{@w{return true}} for the predicate to allow the operand.
 952 The RTL expression is evaluated first; do not re-check anything in the
 953 code block that was checked in the RTL expression.
 954 @end itemize
 955
 956 The program @command{genrecog} scans @code{define_predicate} and
 957 @code{define_special_predicate} expressions to determine which RTX
 958 codes are possibly allowed.  You should always make this explicit in
 959 the RTL predicate expression, using @code{MATCH_OPERAND} and
 960 @code{MATCH_CODE}.
 961
 962 Here is an example of a simple predicate definition, from the IA64
 963 machine description:
 964
 965 @smallexample
 966 @group
 967 ;; @r{True if @var{op} is a @code{SYMBOL_REF} which refers to the sdata section.}
 968 (define_predicate "small_addr_symbolic_operand"
 969   (and (match_code "symbol_ref")
 970        (match_test "SYMBOL_REF_SMALL_ADDR_P (op)")))
 971 @end group
 972 @end smallexample
 973
 974 @noindent
 975 And here is another, showing the use of the C block.
 976
 977 @smallexample
 978 @group
 979 ;; @r{True if @var{op} is a register operand that is (or could be) a GR reg.}
 980 (define_predicate "gr_register_operand"
 981   (match_operand 0 "register_operand")
 982 @{
 983   unsigned int regno;
 984   if (GET_CODE (op) == SUBREG)
 985     op = SUBREG_REG (op);
 986
 987   regno = REGNO (op);
 988   return (regno >= FIRST_PSEUDO_REGISTER || GENERAL_REGNO_P (regno));
 989 @})
 990 @end group
 991 @end smallexample
 992
 993 Predicates written with @code{define_predicate} automatically include
 994 a test that @var{mode} is @code{VOIDmode}, or @var{op} has the same
 995 mode as @var{mode}, or @var{op} is a @code{CONST_INT} or
 996 @code{CONST_DOUBLE}.  They do @emph{not} check specifically for
 997 integer @code{CONST_DOUBLE}, nor do they test that the value of either
 998 kind of constant fits in the requested mode.  This is because
 999 target-specific predicates that take constants usually have to do more
1000 stringent value checks anyway.  If you need the exact same treatment
1001 of @code{CONST_INT} or @code{CONST_DOUBLE} that the generic predicates
1002 provide, use a @code{MATCH_OPERAND} subexpression to call
1003 @code{const_int_operand}, @code{const_double_operand}, or
1004 @code{immediate_operand}.
1005
1006 Predicates written with @code{define_special_predicate} do not get any
1007 automatic mode checks, and are treated as having special mode handling
1008 by @command{genrecog}.
1009
1010 The program @command{genpreds} is responsible for generating code to
1011 test predicates.  It also writes a header file containing function
1012 declarations for all machine-specific predicates.  It is not necessary
1013 to declare these predicates in @file{@var{cpu}-protos.h}.
1014 @end ifset
1015
1016 @c Most of this node appears by itself (in a different place) even
1017 @c when the INTERNALS flag is clear.  Passages that require the internals
1018 @c manual's context are conditionalized to appear only in the internals manual.
1019 @ifset INTERNALS
1020 @node Constraints
1021 @section Operand Constraints
1022 @cindex operand constraints
1023 @cindex constraints
1024
1025 Each @code{match_operand} in an instruction pattern can specify
1026 constraints for the operands allowed.  The constraints allow you to
1027 fine-tune matching within the set of operands allowed by the
1028 predicate.
1029
1030 @end ifset
1031 @ifclear INTERNALS
1032 @node Constraints
1033 @section Constraints for @code{asm} Operands
1034 @cindex operand constraints, @code{asm}
1035 @cindex constraints, @code{asm}
1036 @cindex @code{asm} constraints
1037
1038 Here are specific details on what constraint letters you can use with
1039 @code{asm} operands.
1040 @end ifclear
1041 Constraints can say whether
1042 an operand may be in a register, and which kinds of register; whether the
1043 operand can be a memory reference, and which kinds of address; whether the
1044 operand may be an immediate constant, and which possible values it may
1045 have.  Constraints can also require two operands to match.
1046
1047 @ifset INTERNALS
1048 @menu
1049 * Simple Constraints::  Basic use of constraints.
1050 * Multi-Alternative::   When an insn has two alternative constraint-patterns.
1051 * Class Preferences::   Constraints guide which hard register to put things in.
1052 * Modifiers::           More precise control over effects of constraints.
1053 * Machine Constraints:: Existing constraints for some particular machines.
1054 * Define Constraints::  How to define machine-specific constraints.
1055 * C Constraint Interface:: How to test constraints from C code.
1056 @end menu
1057 @end ifset
1058
1059 @ifclear INTERNALS
1060 @menu
1061 * Simple Constraints::  Basic use of constraints.
1062 * Multi-Alternative::   When an insn has two alternative constraint-patterns.
1063 * Modifiers::           More precise control over effects of constraints.
1064 * Machine Constraints:: Special constraints for some particular machines.
1065 @end menu
1066 @end ifclear
1067
1068 @node Simple Constraints
1069 @subsection Simple Constraints
1070 @cindex simple constraints
1071
1072 The simplest kind of constraint is a string full of letters, each of
1073 which describes one kind of operand that is permitted.  Here are
1074 the letters that are allowed:
1075
1076 @table @asis
1077 @item whitespace
1078 Whitespace characters are ignored and can be inserted at any position
1079 except the first.  This enables each alternative for different operands to
1080 be visually aligned in the machine description even if they have different
1081 number of constraints and modifiers.
1082
1083 @cindex @samp{m} in constraint
1084 @cindex memory references in constraints
1085 @item @samp{m}
1086 A memory operand is allowed, with any kind of address that the machine
1087 supports in general.
1088
1089 @cindex offsettable address
1090 @cindex @samp{o} in constraint
1091 @item @samp{o}
1092 A memory operand is allowed, but only if the address is
1093 @dfn{offsettable}.  This means that adding a small integer (actually,
1094 the width in bytes of the operand, as determined by its machine mode)
1095 may be added to the address and the result is also a valid memory
1096 address.
1097
1098 @cindex autoincrement/decrement addressing
1099 For example, an address which is constant is offsettable; so is an
1100 address that is the sum of a register and a constant (as long as a
1101 slightly larger constant is also within the range of address-offsets
1102 supported by the machine); but an autoincrement or autodecrement
1103 address is not offsettable.  More complicated indirect/indexed
1104 addresses may or may not be offsettable depending on the other
1105 addressing modes that the machine supports.
1106
1107 Note that in an output operand which can be matched by another
1108 operand, the constraint letter @samp{o} is valid only when accompanied
1109 by both @samp{<} (if the target machine has predecrement addressing)
1110 and @samp{>} (if the target machine has preincrement addressing).
1111
1112 @cindex @samp{V} in constraint
1113 @item @samp{V}
1114 A memory operand that is not offsettable.  In other words, anything that
1115 would fit the @samp{m} constraint but not the @samp{o} constraint.
1116
1117 @cindex @samp{<} in constraint
1118 @item @samp{<}
1119 A memory operand with autodecrement addressing (either predecrement or
1120 postdecrement) is allowed.
1121
1122 @cindex @samp{>} in constraint
1123 @item @samp{>}
1124 A memory operand with autoincrement addressing (either preincrement or
1125 postincrement) is allowed.
1126
1127 @cindex @samp{r} in constraint
1128 @cindex registers in constraints
1129 @item @samp{r}
1130 A register operand is allowed provided that it is in a general
1131 register.
1132
1133 @cindex constants in constraints
1134 @cindex @samp{i} in constraint
1135 @item @samp{i}
1136 An immediate integer operand (one with constant value) is allowed.
1137 This includes symbolic constants whose values will be known only at
1138 assembly time or later.
1139
1140 @cindex @samp{n} in constraint
1141 @item @samp{n}
1142 An immediate integer operand with a known numeric value is allowed.
1143 Many systems cannot support assembly-time constants for operands less
1144 than a word wide.  Constraints for these operands should use @samp{n}
1145 rather than @samp{i}.
1146
1147 @cindex @samp{I} in constraint
1148 @item @samp{I}, @samp{J}, @samp{K}, @dots{} @samp{P}
1149 Other letters in the range @samp{I} through @samp{P} may be defined in
1150 a machine-dependent fashion to permit immediate integer operands with
1151 explicit integer values in specified ranges.  For example, on the
1152 68000, @samp{I} is defined to stand for the range of values 1 to 8.
1153 This is the range permitted as a shift count in the shift
1154 instructions.
1155
1156 @cindex @samp{E} in constraint
1157 @item @samp{E}
1158 An immediate floating operand (expression code @code{const_double}) is
1159 allowed, but only if the target floating point format is the same as
1160 that of the host machine (on which the compiler is running).
1161
1162 @cindex @samp{F} in constraint
1163 @item @samp{F}
1164 An immediate floating operand (expression code @code{const_double} or
1165 @code{const_vector}) is allowed.
1166
1167 @cindex @samp{G} in constraint
1168 @cindex @samp{H} in constraint
1169 @item @samp{G}, @samp{H}
1170 @samp{G} and @samp{H} may be defined in a machine-dependent fashion to
1171 permit immediate floating operands in particular ranges of values.
1172
1173 @cindex @samp{s} in constraint
1174 @item @samp{s}
1175 An immediate integer operand whose value is not an explicit integer is
1176 allowed.
1177
1178 This might appear strange; if an insn allows a constant operand with a
1179 value not known at compile time, it certainly must allow any known
1180 value.  So why use @samp{s} instead of @samp{i}?  Sometimes it allows
1181 better code to be generated.
1182
1183 For example, on the 68000 in a fullword instruction it is possible to
1184 use an immediate operand; but if the immediate value is between @minus{}128
1185 and 127, better code results from loading the value into a register and
1186 using the register.  This is because the load into the register can be
1187 done with a @samp{moveq} instruction.  We arrange for this to happen
1188 by defining the letter @samp{K} to mean ``any integer outside the
1189 range @minus{}128 to 127'', and then specifying @samp{Ks} in the operand
1190 constraints.
1191
1192 @cindex @samp{g} in constraint
1193 @item @samp{g}
1194 Any register, memory or immediate integer operand is allowed, except for
1195 registers that are not general registers.
1196
1197 @cindex @samp{X} in constraint
1198 @item @samp{X}
1199 @ifset INTERNALS
1200 Any operand whatsoever is allowed, even if it does not satisfy
1201 @code{general_operand}.  This is normally used in the constraint of
1202 a @code{match_scratch} when certain alternatives will not actually
1203 require a scratch register.
1204 @end ifset
1205 @ifclear INTERNALS
1206 Any operand whatsoever is allowed.
1207 @end ifclear
1208
1209 @cindex @samp{0} in constraint
1210 @cindex digits in constraint
1211 @item @samp{0}, @samp{1}, @samp{2}, @dots{} @samp{9}
1212 An operand that matches the specified operand number is allowed.  If a
1213 digit is used together with letters within the same alternative, the
1214 digit should come last.
1215
1216 This number is allowed to be more than a single digit.  If multiple
1217 digits are encountered consecutively, they are interpreted as a single
1218 decimal integer.  There is scant chance for ambiguity, since to-date
1219 it has never been desirable that @samp{10} be interpreted as matching
1220 either operand 1 @emph{or} operand 0.  Should this be desired, one
1221 can use multiple alternatives instead.
1222
1223 @cindex matching constraint
1224 @cindex constraint, matching
1225 This is called a @dfn{matching constraint} and what it really means is
1226 that the assembler has only a single operand that fills two roles
1227 @ifset INTERNALS
1228 considered separate in the RTL insn.  For example, an add insn has two
1229 input operands and one output operand in the RTL, but on most CISC
1230 @end ifset
1231 @ifclear INTERNALS
1232 which @code{asm} distinguishes.  For example, an add instruction uses
1233 two input operands and an output operand, but on most CISC
1234 @end ifclear
1235 machines an add instruction really has only two operands, one of them an
1236 input-output operand:
1237
1238 @smallexample
1239 addl #35,r12
1240 @end smallexample
1241
1242 Matching constraints are used in these circumstances.
1243 More precisely, the two operands that match must include one input-only
1244 operand and one output-only operand.  Moreover, the digit must be a
1245 smaller number than the number of the operand that uses it in the
1246 constraint.
1247
1248 @ifset INTERNALS
1249 For operands to match in a particular case usually means that they
1250 are identical-looking RTL expressions.  But in a few special cases
1251 specific kinds of dissimilarity are allowed.  For example, @code{*x}
1252 as an input operand will match @code{*x++} as an output operand.
1253 For proper results in such cases, the output template should always
1254 use the output-operand's number when printing the operand.
1255 @end ifset
1256
1257 @cindex load address instruction
1258 @cindex push address instruction
1259 @cindex address constraints
1260 @cindex @samp{p} in constraint
1261 @item @samp{p}
1262 An operand that is a valid memory address is allowed.  This is
1263 for ``load address'' and ``push address'' instructions.
1264
1265 @findex address_operand
1266 @samp{p} in the constraint must be accompanied by @code{address_operand}
1267 as the predicate in the @code{match_operand}.  This predicate interprets
1268 the mode specified in the @code{match_operand} as the mode of the memory
1269 reference for which the address would be valid.
1270
1271 @cindex other register constraints
1272 @cindex extensible constraints
1273 @item @var{other-letters}
1274 Other letters can be defined in machine-dependent fashion to stand for
1275 particular classes of registers or other arbitrary operand types.
1276 @samp{d}, @samp{a} and @samp{f} are defined on the 68000/68020 to stand
1277 for data, address and floating point registers.
1278 @end table
1279
1280 @ifset INTERNALS
1281 In order to have valid assembler code, each operand must satisfy
1282 its constraint.  But a failure to do so does not prevent the pattern
1283 from applying to an insn.  Instead, it directs the compiler to modify
1284 the code so that the constraint will be satisfied.  Usually this is
1285 done by copying an operand into a register.
1286
1287 Contrast, therefore, the two instruction patterns that follow:
1288
1289 @smallexample
1290 (define_insn ""
1291   [(set (match_operand:SI 0 "general_operand" "=r")
1292         (plus:SI (match_dup 0)
1293                  (match_operand:SI 1 "general_operand" "r")))]
1294   ""
1295   "@dots{}")
1296 @end smallexample
1297
1298 @noindent
1299 which has two operands, one of which must appear in two places, and
1300
1301 @smallexample
1302 (define_insn ""
1303   [(set (match_operand:SI 0 "general_operand" "=r")
1304         (plus:SI (match_operand:SI 1 "general_operand" "0")
1305                  (match_operand:SI 2 "general_operand" "r")))]
1306   ""
1307   "@dots{}")
1308 @end smallexample
1309
1310 @noindent
1311 which has three operands, two of which are required by a constraint to be
1312 identical.  If we are considering an insn of the form
1313
1314 @smallexample
1315 (insn @var{n} @var{prev} @var{next}
1316   (set (reg:SI 3)
1317        (plus:SI (reg:SI 6) (reg:SI 109)))
1318   @dots{})
1319 @end smallexample
1320
1321 @noindent
1322 the first pattern would not apply at all, because this insn does not
1323 contain two identical subexpressions in the right place.  The pattern would
1324 say, ``That does not look like an add instruction; try other patterns''.
1325 The second pattern would say, ``Yes, that's an add instruction, but there
1326 is something wrong with it''.  It would direct the reload pass of the
1327 compiler to generate additional insns to make the constraint true.  The
1328 results might look like this:
1329
1330 @smallexample
1331 (insn @var{n2} @var{prev} @var{n}
1332   (set (reg:SI 3) (reg:SI 6))
1333   @dots{})
1334
1335 (insn @var{n} @var{n2} @var{next}
1336   (set (reg:SI 3)
1337        (plus:SI (reg:SI 3) (reg:SI 109)))
1338   @dots{})
1339 @end smallexample
1340
1341 It is up to you to make sure that each operand, in each pattern, has
1342 constraints that can handle any RTL expression that could be present for
1343 that operand.  (When multiple alternatives are in use, each pattern must,
1344 for each possible combination of operand expressions, have at least one
1345 alternative which can handle that combination of operands.)  The
1346 constraints don't need to @emph{allow} any possible operand---when this is
1347 the case, they do not constrain---but they must at least point the way to
1348 reloading any possible operand so that it will fit.
1349
1350 @itemize @bullet
1351 @item
1352 If the constraint accepts whatever operands the predicate permits,
1353 there is no problem: reloading is never necessary for this operand.
1354
1355 For example, an operand whose constraints permit everything except
1356 registers is safe provided its predicate rejects registers.
1357
1358 An operand whose predicate accepts only constant values is safe
1359 provided its constraints include the letter @samp{i}.  If any possible
1360 constant value is accepted, then nothing less than @samp{i} will do;
1361 if the predicate is more selective, then the constraints may also be
1362 more selective.
1363
1364 @item
1365 Any operand expression can be reloaded by copying it into a register.
1366 So if an operand's constraints allow some kind of register, it is
1367 certain to be safe.  It need not permit all classes of registers; the
1368 compiler knows how to copy a register into another register of the
1369 proper class in order to make an instruction valid.
1370
1371 @cindex nonoffsettable memory reference
1372 @cindex memory reference, nonoffsettable
1373 @item
1374 A nonoffsettable memory reference can be reloaded by copying the
1375 address into a register.  So if the constraint uses the letter
1376 @samp{o}, all memory references are taken care of.
1377
1378 @item
1379 A constant operand can be reloaded by allocating space in memory to
1380 hold it as preinitialized data.  Then the memory reference can be used
1381 in place of the constant.  So if the constraint uses the letters
1382 @samp{o} or @samp{m}, constant operands are not a problem.
1383
1384 @item
1385 If the constraint permits a constant and a pseudo register used in an insn
1386 was not allocated to a hard register and is equivalent to a constant,
1387 the register will be replaced with the constant.  If the predicate does
1388 not permit a constant and the insn is re-recognized for some reason, the
1389 compiler will crash.  Thus the predicate must always recognize any
1390 objects allowed by the constraint.
1391 @end itemize
1392
1393 If the operand's predicate can recognize registers, but the constraint does
1394 not permit them, it can make the compiler crash.  When this operand happens
1395 to be a register, the reload pass will be stymied, because it does not know
1396 how to copy a register temporarily into memory.
1397
1398 If the predicate accepts a unary operator, the constraint applies to the
1399 operand.  For example, the MIPS processor at ISA level 3 supports an
1400 instruction which adds two registers in @code{SImode} to produce a
1401 @code{DImode} result, but only if the registers are correctly sign
1402 extended.  This predicate for the input operands accepts a
1403 @code{sign_extend} of an @code{SImode} register.  Write the constraint
1404 to indicate the type of register that is required for the operand of the
1405 @code{sign_extend}.
1406 @end ifset
1407
1408 @node Multi-Alternative
1409 @subsection Multiple Alternative Constraints
1410 @cindex multiple alternative constraints
1411
1412 Sometimes a single instruction has multiple alternative sets of possible
1413 operands.  For example, on the 68000, a logical-or instruction can combine
1414 register or an immediate value into memory, or it can combine any kind of
1415 operand into a register; but it cannot combine one memory location into
1416 another.
1417
1418 These constraints are represented as multiple alternatives.  An alternative
1419 can be described by a series of letters for each operand.  The overall
1420 constraint for an operand is made from the letters for this operand
1421 from the first alternative, a comma, the letters for this operand from
1422 the second alternative, a comma, and so on until the last alternative.
1423 @ifset INTERNALS
1424 Here is how it is done for fullword logical-or on the 68000:
1425
1426 @smallexample
1427 (define_insn "iorsi3"
1428   [(set (match_operand:SI 0 "general_operand" "=m,d")
1429         (ior:SI (match_operand:SI 1 "general_operand" "%0,0")
1430                 (match_operand:SI 2 "general_operand" "dKs,dmKs")))]
1431   @dots{})
1432 @end smallexample
1433
1434 The first alternative has @samp{m} (memory) for operand 0, @samp{0} for
1435 operand 1 (meaning it must match operand 0), and @samp{dKs} for operand
1436 2.  The second alternative has @samp{d} (data register) for operand 0,
1437 @samp{0} for operand 1, and @samp{dmKs} for operand 2.  The @samp{=} and
1438 @samp{%} in the constraints apply to all the alternatives; their
1439 meaning is explained in the next section (@pxref{Class Preferences}).
1440 @end ifset
1441
1442 @c FIXME Is this ? and ! stuff of use in asm()?  If not, hide unless INTERNAL
1443 If all the operands fit any one alternative, the instruction is valid.
1444 Otherwise, for each alternative, the compiler counts how many instructions
1445 must be added to copy the operands so that that alternative applies.
1446 The alternative requiring the least copying is chosen.  If two alternatives
1447 need the same amount of copying, the one that comes first is chosen.
1448 These choices can be altered with the @samp{?} and @samp{!} characters:
1449
1450 @table @code
1451 @cindex @samp{?} in constraint
1452 @cindex question mark
1453 @item ?
1454 Disparage slightly the alternative that the @samp{?} appears in,
1455 as a choice when no alternative applies exactly.  The compiler regards
1456 this alternative as one unit more costly for each @samp{?} that appears
1457 in it.
1458
1459 @cindex @samp{!} in constraint
1460 @cindex exclamation point
1461 @item !
1462 Disparage severely the alternative that the @samp{!} appears in.
1463 This alternative can still be used if it fits without reloading,
1464 but if reloading is needed, some other alternative will be used.
1465 @end table
1466
1467 @ifset INTERNALS
1468 When an insn pattern has multiple alternatives in its constraints, often
1469 the appearance of the assembler code is determined mostly by which
1470 alternative was matched.  When this is so, the C code for writing the
1471 assembler code can use the variable @code{which_alternative}, which is
1472 the ordinal number of the alternative that was actually satisfied (0 for
1473 the first, 1 for the second alternative, etc.).  @xref{Output Statement}.
1474 @end ifset
1475
1476 @ifset INTERNALS
1477 @node Class Preferences
1478 @subsection Register Class Preferences
1479 @cindex class preference constraints
1480 @cindex register class preference constraints
1481
1482 @cindex voting between constraint alternatives
1483 The operand constraints have another function: they enable the compiler
1484 to decide which kind of hardware register a pseudo register is best
1485 allocated to.  The compiler examines the constraints that apply to the
1486 insns that use the pseudo register, looking for the machine-dependent
1487 letters such as @samp{d} and @samp{a} that specify classes of registers.
1488 The pseudo register is put in whichever class gets the most ``votes''.
1489 The constraint letters @samp{g} and @samp{r} also vote: they vote in
1490 favor of a general register.  The machine description says which registers
1491 are considered general.
1492
1493 Of course, on some machines all registers are equivalent, and no register
1494 classes are defined.  Then none of this complexity is relevant.
1495 @end ifset
1496
1497 @node Modifiers
1498 @subsection Constraint Modifier Characters
1499 @cindex modifiers in constraints
1500 @cindex constraint modifier characters
1501
1502 @c prevent bad page break with this line
1503 Here are constraint modifier characters.
1504
1505 @table @samp
1506 @cindex @samp{=} in constraint
1507 @item =
1508 Means that this operand is write-only for this instruction: the previous
1509 value is discarded and replaced by output data.
1510
1511 @cindex @samp{+} in constraint
1512 @item +
1513 Means that this operand is both read and written by the instruction.
1514
1515 When the compiler fixes up the operands to satisfy the constraints,
1516 it needs to know which operands are inputs to the instruction and
1517 which are outputs from it.  @samp{=} identifies an output; @samp{+}
1518 identifies an operand that is both input and output; all other operands
1519 are assumed to be input only.
1520
1521 If you specify @samp{=} or @samp{+} in a constraint, you put it in the
1522 first character of the constraint string.
1523
1524 @cindex @samp{&} in constraint
1525 @cindex earlyclobber operand
1526 @item &
1527 Means (in a particular alternative) that this operand is an
1528 @dfn{earlyclobber} operand, which is modified before the instruction is
1529 finished using the input operands.  Therefore, this operand may not lie
1530 in a register that is used as an input operand or as part of any memory
1531 address.
1532
1533 @samp{&} applies only to the alternative in which it is written.  In
1534 constraints with multiple alternatives, sometimes one alternative
1535 requires @samp{&} while others do not.  See, for example, the
1536 @samp{movdf} insn of the 68000.
1537
1538 An input operand can be tied to an earlyclobber operand if its only
1539 use as an input occurs before the early result is written.  Adding
1540 alternatives of this form often allows GCC to produce better code
1541 when only some of the inputs can be affected by the earlyclobber.
1542 See, for example, the @samp{mulsi3} insn of the ARM@.
1543
1544 @samp{&} does not obviate the need to write @samp{=}.
1545
1546 @cindex @samp{%} in constraint
1547 @item %
1548 Declares the instruction to be commutative for this operand and the
1549 following operand.  This means that the compiler may interchange the
1550 two operands if that is the cheapest way to make all operands fit the
1551 constraints.
1552 @ifset INTERNALS
1553 This is often used in patterns for addition instructions
1554 that really have only two operands: the result must go in one of the
1555 arguments.  Here for example, is how the 68000 halfword-add
1556 instruction is defined:
1557
1558 @smallexample
1559 (define_insn "addhi3"
1560   [(set (match_operand:HI 0 "general_operand" "=m,r")
1561      (plus:HI (match_operand:HI 1 "general_operand" "%0,0")
1562               (match_operand:HI 2 "general_operand" "di,g")))]
1563   @dots{})
1564 @end smallexample
1565 @end ifset
1566 GCC can only handle one commutative pair in an asm; if you use more,
1567 the compiler may fail.  Note that you need not use the modifier if
1568 the two alternatives are strictly identical; this would only waste
1569 time in the reload pass.  The modifier is not operational after
1570 register allocation, so the result of @code{define_peephole2}
1571 and @code{define_split}s performed after reload cannot rely on
1572 @samp{%} to make the intended insn match.
1573
1574 @cindex @samp{#} in constraint
1575 @item #
1576 Says that all following characters, up to the next comma, are to be
1577 ignored as a constraint.  They are significant only for choosing
1578 register preferences.
1579
1580 @cindex @samp{*} in constraint
1581 @item *
1582 Says that the following character should be ignored when choosing
1583 register preferences.  @samp{*} has no effect on the meaning of the
1584 constraint as a constraint, and no effect on reloading.
1585
1586 @ifset INTERNALS
1587 Here is an example: the 68000 has an instruction to sign-extend a
1588 halfword in a data register, and can also sign-extend a value by
1589 copying it into an address register.  While either kind of register is
1590 acceptable, the constraints on an address-register destination are
1591 less strict, so it is best if register allocation makes an address
1592 register its goal.  Therefore, @samp{*} is used so that the @samp{d}
1593 constraint letter (for data register) is ignored when computing
1594 register preferences.
1595
1596 @smallexample
1597 (define_insn "extendhisi2"
1598   [(set (match_operand:SI 0 "general_operand" "=*d,a")
1599         (sign_extend:SI
1600          (match_operand:HI 1 "general_operand" "0,g")))]
1601   @dots{})
1602 @end smallexample
1603 @end ifset
1604 @end table
1605
1606 @node Machine Constraints
1607 @subsection Constraints for Particular Machines
1608 @cindex machine specific constraints
1609 @cindex constraints, machine specific
1610
1611 Whenever possible, you should use the general-purpose constraint letters
1612 in @code{asm} arguments, since they will convey meaning more readily to
1613 people reading your code.  Failing that, use the constraint letters
1614 that usually have very similar meanings across architectures.  The most
1615 commonly used constraints are @samp{m} and @samp{r} (for memory and
1616 general-purpose registers respectively; @pxref{Simple Constraints}), and
1617 @samp{I}, usually the letter indicating the most common
1618 immediate-constant format.
1619
1620 Each architecture defines additional constraints.  These constraints
1621 are used by the compiler itself for instruction generation, as well as
1622 for @code{asm} statements; therefore, some of the constraints are not
1623 particularly useful for @code{asm}.  Here is a summary of some of the
1624 machine-dependent constraints available on some particular machines;
1625 it includes both constraints that are useful for @code{asm} and
1626 constraints that aren't.  The compiler source file mentioned in the
1627 table heading for each architecture is the definitive reference for
1628 the meanings of that architecture's constraints.
1629
1630 @table @emph
1631 @item ARM family---@file{config/arm/arm.h}
1632 @table @code
1633 @item f
1634 Floating-point register
1635
1636 @item w
1637 VFP floating-point register
1638
1639 @item F
1640 One of the floating-point constants 0.0, 0.5, 1.0, 2.0, 3.0, 4.0, 5.0
1641 or 10.0
1642
1643 @item G
1644 Floating-point constant that would satisfy the constraint @samp{F} if it
1645 were negated
1646
1647 @item I
1648 Integer that is valid as an immediate operand in a data processing
1649 instruction.  That is, an integer in the range 0 to 255 rotated by a
1650 multiple of 2
1651
1652 @item J
1653 Integer in the range @minus{}4095 to 4095
1654
1655 @item K
1656 Integer that satisfies constraint @samp{I} when inverted (ones complement)
1657
1658 @item L
1659 Integer that satisfies constraint @samp{I} when negated (twos complement)
1660
1661 @item M
1662 Integer in the range 0 to 32
1663
1664 @item Q
1665 A memory reference where the exact address is in a single register
1666 (`@samp{m}' is preferable for @code{asm} statements)
1667
1668 @item R
1669 An item in the constant pool
1670
1671 @item S
1672 A symbol in the text segment of the current file
1673
1674 @item Uv
1675 A memory reference suitable for VFP load/store insns (reg+constant offset)
1676
1677 @item Uy
1678 A memory reference suitable for iWMMXt load/store instructions.
1679
1680 @item Uq
1681 A memory reference suitable for the ARMv4 ldrsb instruction.
1682 @end table
1683
1684 @item AVR family---@file{config/avr/constraints.md}
1685 @table @code
1686 @item l
1687 Registers from r0 to r15
1688
1689 @item a
1690 Registers from r16 to r23
1691
1692 @item d
1693 Registers from r16 to r31
1694
1695 @item w
1696 Registers from r24 to r31.  These registers can be used in @samp{adiw} command
1697
1698 @item e
1699 Pointer register (r26--r31)
1700
1701 @item b
1702 Base pointer register (r28--r31)
1703
1704 @item q
1705 Stack pointer register (SPH:SPL)
1706
1707 @item t
1708 Temporary register r0
1709
1710 @item x
1711 Register pair X (r27:r26)
1712
1713 @item y
1714 Register pair Y (r29:r28)
1715
1716 @item z
1717 Register pair Z (r31:r30)
1718
1719 @item I
1720 Constant greater than @minus{}1, less than 64
1721
1722 @item J
1723 Constant greater than @minus{}64, less than 1
1724
1725 @item K
1726 Constant integer 2
1727
1728 @item L
1729 Constant integer 0
1730
1731 @item M
1732 Constant that fits in 8 bits
1733
1734 @item N
1735 Constant integer @minus{}1
1736
1737 @item O
1738 Constant integer 8, 16, or 24
1739
1740 @item P
1741 Constant integer 1
1742
1743 @item G
1744 A floating point constant 0.0
1745
1746 @item R
1747 Integer constant in the range -6 @dots{} 5.
1748
1749 @item Q
1750 A memory address based on Y or Z pointer with displacement.
1751 @end table
1752
1753 @item CRX Architecture---@file{config/crx/crx.h}
1754 @table @code
1755
1756 @item b
1757 Registers from r0 to r14 (registers without stack pointer)
1758
1759 @item l
1760 Register r16 (64-bit accumulator lo register)
1761
1762 @item h
1763 Register r17 (64-bit accumulator hi register)
1764
1765 @item k
1766 Register pair r16-r17. (64-bit accumulator lo-hi pair)
1767
1768 @item I
1769 Constant that fits in 3 bits
1770
1771 @item J
1772 Constant that fits in 4 bits
1773
1774 @item K
1775 Constant that fits in 5 bits
1776
1777 @item L
1778 Constant that is one of -1, 4, -4, 7, 8, 12, 16, 20, 32, 48
1779
1780 @item G
1781 Floating point constant that is legal for store immediate
1782 @end table
1783
1784 @item Hewlett-Packard PA-RISC---@file{config/pa/pa.h}
1785 @table @code
1786 @item a
1787 General register 1
1788
1789 @item f
1790 Floating point register
1791
1792 @item q
1793 Shift amount register
1794
1795 @item x
1796 Floating point register (deprecated)
1797
1798 @item y
1799 Upper floating point register (32-bit), floating point register (64-bit)
1800
1801 @item Z
1802 Any register
1803
1804 @item I
1805 Signed 11-bit integer constant
1806
1807 @item J
1808 Signed 14-bit integer constant
1809
1810 @item K
1811 Integer constant that can be deposited with a @code{zdepi} instruction
1812
1813 @item L
1814 Signed 5-bit integer constant
1815
1816 @item M
1817 Integer constant 0
1818
1819 @item N
1820 Integer constant that can be loaded with a @code{ldil} instruction
1821
1822 @item O
1823 Integer constant whose value plus one is a power of 2
1824
1825 @item P
1826 Integer constant that can be used for @code{and} operations in @code{depi}
1827 and @code{extru} instructions
1828
1829 @item S
1830 Integer constant 31
1831
1832 @item U
1833 Integer constant 63
1834
1835 @item G
1836 Floating-point constant 0.0
1837
1838 @item A
1839 A @code{lo_sum} data-linkage-table memory operand
1840
1841 @item Q
1842 A memory operand that can be used as the destination operand of an
1843 integer store instruction
1844
1845 @item R
1846 A scaled or unscaled indexed memory operand
1847
1848 @item T
1849 A memory operand for floating-point loads and stores
1850
1851 @item W
1852 A register indirect memory operand
1853 @end table
1854
1855 @item PowerPC and IBM RS6000---@file{config/rs6000/rs6000.h}
1856 @table @code
1857 @item b
1858 Address base register
1859
1860 @item f
1861 Floating point register
1862
1863 @item v
1864 Vector register
1865
1866 @item h
1867 @samp{MQ}, @samp{CTR}, or @samp{LINK} register
1868
1869 @item q
1870 @samp{MQ} register
1871
1872 @item c
1873 @samp{CTR} register
1874
1875 @item l
1876 @samp{LINK} register
1877
1878 @item x
1879 @samp{CR} register (condition register) number 0
1880
1881 @item y
1882 @samp{CR} register (condition register)
1883
1884 @item z
1885 @samp{FPMEM} stack memory for FPR-GPR transfers
1886
1887 @item I
1888 Signed 16-bit constant
1889
1890 @item J
1891 Unsigned 16-bit constant shifted left 16 bits (use @samp{L} instead for
1892 @code{SImode} constants)
1893
1894 @item K
1895 Unsigned 16-bit constant
1896
1897 @item L
1898 Signed 16-bit constant shifted left 16 bits
1899
1900 @item M
1901 Constant larger than 31
1902
1903 @item N
1904 Exact power of 2
1905
1906 @item O
1907 Zero
1908
1909 @item P
1910 Constant whose negation is a signed 16-bit constant
1911
1912 @item G
1913 Floating point constant that can be loaded into a register with one
1914 instruction per word
1915
1916 @item H
1917 Integer/Floating point constant that can be loaded into a register using
1918 three instructions
1919
1920 @item Q
1921 Memory operand that is an offset from a register (@samp{m} is preferable
1922 for @code{asm} statements)
1923
1924 @item Z
1925 Memory operand that is an indexed or indirect from a register (@samp{m} is
1926 preferable for @code{asm} statements)
1927
1928 @item R
1929 AIX TOC entry
1930
1931 @item a
1932 Address operand that is an indexed or indirect from a register (@samp{p} is
1933 preferable for @code{asm} statements)
1934
1935 @item S
1936 Constant suitable as a 64-bit mask operand
1937
1938 @item T
1939 Constant suitable as a 32-bit mask operand
1940
1941 @item U
1942 System V Release 4 small data area reference
1943
1944 @item t
1945 AND masks that can be performed by two rldic@{l, r@} instructions
1946
1947 @item W
1948 Vector constant that does not require memory
1949
1950 @end table
1951
1952 @item MorphoTech family---@file{config/mt/mt.h}
1953 @table @code
1954 @item I
1955 Constant for an arithmetic insn (16-bit signed integer).
1956
1957 @item J
1958 The constant 0.
1959
1960 @item K
1961 Constant for a logical insn (16-bit zero-extended integer).
1962
1963 @item L
1964 A constant that can be loaded with @code{lui} (i.e.@: the bottom 16
1965 bits are zero).
1966
1967 @item M
1968 A constant that takes two words to load (i.e.@: not matched by
1969 @code{I}, @code{K}, or @code{L}).
1970
1971 @item N
1972 Negative 16-bit constants other than -65536.
1973
1974 @item O
1975 A 15-bit signed integer constant.
1976
1977 @item P
1978 A positive 16-bit constant.
1979 @end table
1980
1981 @item Intel 386---@file{config/i386/constraints.md}
1982 @table @code
1983 @item R
1984 Legacy register---the eight integer registers available on all
1985 i386 processors (@code{a}, @code{b}, @code{c}, @code{d},
1986 @code{si}, @code{di}, @code{bp}, @code{sp}).
1987
1988 @item q
1989 Any register accessible as @code{@var{r}l}.  In 32-bit mode, @code{a},
1990 @code{b}, @code{c}, and @code{d}; in 64-bit mode, any integer register.
1991
1992 @item Q
1993 Any register accessible as @code{@var{r}h}: @code{a}, @code{b},
1994 @code{c}, and @code{d}.
1995
1996 @ifset INTERNALS
1997 @item l
1998 Any register that can be used as the index in a base+index memory
1999 access: that is, any general register except the stack pointer.
2000 @end ifset
2001
2002 @item a
2003 The @code{a} register.
2004
2005 @item b
2006 The @code{b} register.
2007
2008 @item c
2009 The @code{c} register.
2010
2011 @item d
2012 The @code{d} register.
2013
2014 @item S
2015 The @code{si} register.
2016
2017 @item D
2018 The @code{di} register.
2019
2020 @item A
2021 The @code{a} and @code{d} registers, as a pair (for instructions that
2022 return half the result in one and half in the other).
2023
2024 @item f
2025 Any 80387 floating-point (stack) register.
2026
2027 @item t
2028 Top of 80387 floating-point stack (@code{%st(0)}).
2029
2030 @item u
2031 Second from top of 80387 floating-point stack (@code{%st(1)}).
2032
2033 @item y
2034 Any MMX register.
2035
2036 @item x
2037 Any SSE register.
2038
2039 @ifset INTERNALS
2040 @item Y
2041 Any SSE2 register.
2042 @end ifset
2043
2044 @item I
2045 Integer constant in the range 0 @dots{} 31, for 32-bit shifts.
2046
2047 @item J
2048 Integer constant in the range 0 @dots{} 63, for 64-bit shifts.
2049
2050 @item K
2051 Signed 8-bit integer constant.
2052
2053 @item L
2054 @code{0xFF} or @code{0xFFFF}, for andsi as a zero-extending move.
2055
2056 @item M
2057 0, 1, 2, or 3 (shifts for the @code{lea} instruction).
2058
2059 @item N
2060 Unsigned 8-bit integer constant (for @code{in} and @code{out}
2061 instructions).
2062
2063 @ifset INTERNALS
2064 @item O
2065 Integer constant in the range 0 @dots{} 127, for 128-bit shifts.
2066 @end ifset
2067
2068 @item G
2069 Standard 80387 floating point constant.
2070
2071 @item C
2072 Standard SSE floating point constant.
2073
2074 @item e
2075 32-bit signed integer constant, or a symbolic reference known
2076 to fit that range (for immediate operands in sign-extending x86-64
2077 instructions).
2078
2079 @item Z
2080 32-bit unsigned integer constant, or a symbolic reference known
2081 to fit that range (for immediate operands in zero-extending x86-64
2082 instructions).
2083
2084 @end table
2085
2086 @item Intel IA-64---@file{config/ia64/ia64.h}
2087 @table @code
2088 @item a
2089 General register @code{r0} to @code{r3} for @code{addl} instruction
2090
2091 @item b
2092 Branch register
2093
2094 @item c
2095 Predicate register (@samp{c} as in ``conditional'')
2096
2097 @item d
2098 Application register residing in M-unit
2099
2100 @item e
2101 Application register residing in I-unit
2102
2103 @item f
2104 Floating-point register
2105
2106 @item m
2107 Memory operand.
2108 Remember that @samp{m} allows postincrement and postdecrement which
2109 require printing with @samp{%Pn} on IA-64.
2110 Use @samp{S} to disallow postincrement and postdecrement.
2111
2112 @item G
2113 Floating-point constant 0.0 or 1.0
2114
2115 @item I
2116 14-bit signed integer constant
2117
2118 @item J
2119 22-bit signed integer constant
2120
2121 @item K
2122 8-bit signed integer constant for logical instructions
2123
2124 @item L
2125 8-bit adjusted signed integer constant for compare pseudo-ops
2126
2127 @item M
2128 6-bit unsigned integer constant for shift counts
2129
2130 @item N
2131 9-bit signed integer constant for load and store postincrements
2132
2133 @item O
2134 The constant zero
2135
2136 @item P
2137 0 or @minus{}1 for @code{dep} instruction
2138
2139 @item Q
2140 Non-volatile memory for floating-point loads and stores
2141
2142 @item R
2143 Integer constant in the range 1 to 4 for @code{shladd} instruction
2144
2145 @item S
2146 Memory operand except postincrement and postdecrement
2147 @end table
2148
2149 @item FRV---@file{config/frv/frv.h}
2150 @table @code
2151 @item a
2152 Register in the class @code{ACC_REGS} (@code{acc0} to @code{acc7}).
2153
2154 @item b
2155 Register in the class @code{EVEN_ACC_REGS} (@code{acc0} to @code{acc7}).
2156
2157 @item c
2158 Register in the class @code{CC_REGS} (@code{fcc0} to @code{fcc3} and
2159 @code{icc0} to @code{icc3}).
2160
2161 @item d
2162 Register in the class @code{GPR_REGS} (@code{gr0} to @code{gr63}).
2163
2164 @item e
2165 Register in the class @code{EVEN_REGS} (@code{gr0} to @code{gr63}).
2166 Odd registers are excluded not in the class but through the use of a machine
2167 mode larger than 4 bytes.
2168
2169 @item f
2170 Register in the class @code{FPR_REGS} (@code{fr0} to @code{fr63}).
2171
2172 @item h
2173 Register in the class @code{FEVEN_REGS} (@code{fr0} to @code{fr63}).
2174 Odd registers are excluded not in the class but through the use of a machine
2175 mode larger than 4 bytes.
2176
2177 @item l
2178 Register in the class @code{LR_REG} (the @code{lr} register).
2179
2180 @item q
2181 Register in the class @code{QUAD_REGS} (@code{gr2} to @code{gr63}).
2182 Register numbers not divisible by 4 are excluded not in the class but through
2183 the use of a machine mode larger than 8 bytes.
2184
2185 @item t
2186 Register in the class @code{ICC_REGS} (@code{icc0} to @code{icc3}).
2187
2188 @item u
2189 Register in the class @code{FCC_REGS} (@code{fcc0} to @code{fcc3}).
2190
2191 @item v
2192 Register in the class @code{ICR_REGS} (@code{cc4} to @code{cc7}).
2193
2194 @item w
2195 Register in the class @code{FCR_REGS} (@code{cc0} to @code{cc3}).
2196
2197 @item x
2198 Register in the class @code{QUAD_FPR_REGS} (@code{fr0} to @code{fr63}).
2199 Register numbers not divisible by 4 are excluded not in the class but through
2200 the use of a machine mode larger than 8 bytes.
2201
2202 @item z
2203 Register in the class @code{SPR_REGS} (@code{lcr} and @code{lr}).
2204
2205 @item A
2206 Register in the class @code{QUAD_ACC_REGS} (@code{acc0} to @code{acc7}).
2207
2208 @item B
2209 Register in the class @code{ACCG_REGS} (@code{accg0} to @code{accg7}).
2210
2211 @item C
2212 Register in the class @code{CR_REGS} (@code{cc0} to @code{cc7}).
2213
2214 @item G
2215 Floating point constant zero
2216
2217 @item I
2218 6-bit signed integer constant
2219
2220 @item J
2221 10-bit signed integer constant
2222
2223 @item L
2224 16-bit signed integer constant
2225
2226 @item M
2227 16-bit unsigned integer constant
2228
2229 @item N
2230 12-bit signed integer constant that is negative---i.e.@: in the
2231 range of @minus{}2048 to @minus{}1
2232
2233 @item O
2234 Constant zero
2235
2236 @item P
2237 12-bit signed integer constant that is greater than zero---i.e.@: in the
2238 range of 1 to 2047.
2239
2240 @end table
2241
2242 @item Blackfin family---@file{config/bfin/bfin.h}
2243 @table @code
2244 @item a
2245 P register
2246
2247 @item d
2248 D register
2249
2250 @item z
2251 A call clobbered P register.
2252
2253 @item D
2254 Even-numbered D register
2255
2256 @item W
2257 Odd-numbered D register
2258
2259 @item e
2260 Accumulator register.
2261
2262 @item A
2263 Even-numbered accumulator register.
2264
2265 @item B
2266 Odd-numbered accumulator register.
2267
2268 @item b
2269 I register
2270
2271 @item v
2272 B register
2273
2274 @item f
2275 M register
2276
2277 @item c
2278 Registers used for circular buffering, i.e. I, B, or L registers.
2279
2280 @item C
2281 The CC register.
2282
2283 @item t
2284 LT0 or LT1.
2285
2286 @item k
2287 LC0 or LC1.
2288
2289 @item u
2290 LB0 or LB1.
2291
2292 @item x
2293 Any D, P, B, M, I or L register.
2294
2295 @item y
2296 Additional registers typically used only in prologues and epilogues: RETS,
2297 RETN, RETI, RETX, RETE, ASTAT, SEQSTAT and USP.
2298
2299 @item w
2300 Any register except accumulators or CC.
2301
2302 @item Ksh
2303 Signed 16 bit integer (in the range -32768 to 32767)
2304
2305 @item Kuh
2306 Unsigned 16 bit integer (in the range 0 to 65535)
2307
2308 @item Ks7
2309 Signed 7 bit integer (in the range -64 to 63)
2310
2311 @item Ku7
2312 Unsigned 7 bit integer (in the range 0 to 127)
2313
2314 @item Ku5
2315 Unsigned 5 bit integer (in the range 0 to 31)
2316
2317 @item Ks4
2318 Signed 4 bit integer (in the range -8 to 7)
2319
2320 @item Ks3
2321 Signed 3 bit integer (in the range -3 to 4)
2322
2323 @item Ku3
2324 Unsigned 3 bit integer (in the range 0 to 7)
2325
2326 @item P@var{n}
2327 Constant @var{n}, where @var{n} is a single-digit constant in the range 0 to 4.
2328
2329 @item PA
2330 An integer equal to one of the MACFLAG_XXX constants that is suitable for
2331 use with either accumulator.
2332
2333 @item PB
2334 An integer equal to one of the MACFLAG_XXX constants that is suitable for
2335 use only with accumulator A1.
2336
2337 @item M1
2338 Constant 255.
2339
2340 @item M2
2341 Constant 65535.
2342
2343 @item J
2344 An integer constant with exactly a single bit set.
2345
2346 @item L
2347 An integer constant with all bits set except exactly one.
2348
2349 @item H
2350
2351 @item Q
2352 Any SYMBOL_REF.
2353 @end table
2354
2355 @item M32C---@file{config/m32c/m32c.c}
2356 @table @code
2357 @item Rsp
2358 @itemx Rfb
2359 @itemx Rsb
2360 @samp{$sp}, @samp{$fb}, @samp{$sb}.
2361
2362 @item Rcr
2363 Any control register, when they're 16 bits wide (nothing if control
2364 registers are 24 bits wide)
2365
2366 @item Rcl
2367 Any control register, when they're 24 bits wide.
2368
2369 @item R0w
2370 @itemx R1w
2371 @itemx R2w
2372 @itemx R3w
2373 $r0, $r1, $r2, $r3.
2374
2375 @item R02
2376 $r0 or $r2, or $r2r0 for 32 bit values.
2377
2378 @item R13
2379 $r1 or $r3, or $r3r1 for 32 bit values.
2380
2381 @item Rdi
2382 A register that can hold a 64 bit value.
2383
2384 @item Rhl
2385 $r0 or $r1 (registers with addressable high/low bytes)
2386
2387 @item R23
2388 $r2 or $r3
2389
2390 @item Raa
2391 Address registers
2392
2393 @item Raw
2394 Address registers when they're 16 bits wide.
2395
2396 @item Ral
2397 Address registers when they're 24 bits wide.
2398
2399 @item Rqi
2400 Registers that can hold QI values.
2401
2402 @item Rad
2403 Registers that can be used with displacements ($a0, $a1, $sb).
2404
2405 @item Rsi
2406 Registers that can hold 32 bit values.
2407
2408 @item Rhi
2409 Registers that can hold 16 bit values.
2410
2411 @item Rhc
2412 Registers chat can hold 16 bit values, including all control
2413 registers.
2414
2415 @item Rra
2416 $r0 through R1, plus $a0 and $a1.
2417
2418 @item Rfl
2419 The flags register.
2420
2421 @item Rmm
2422 The memory-based pseudo-registers $mem0 through $mem15.
2423
2424 @item Rpi
2425 Registers that can hold pointers (16 bit registers for r8c, m16c; 24
2426 bit registers for m32cm, m32c).
2427
2428 @item Rpa
2429 Matches multiple registers in a PARALLEL to form a larger register.
2430 Used to match function return values.
2431
2432 @item Is3
2433 -8 @dots{} 7
2434
2435 @item IS1
2436 -128 @dots{} 127
2437
2438 @item IS2
2439 -32768 @dots{} 32767
2440
2441 @item IU2
2442 0 @dots{} 65535
2443
2444 @item In4
2445 -8 @dots{} -1 or 1 @dots{} 8
2446
2447 @item In5
2448 -16 @dots{} -1 or 1 @dots{} 16
2449
2450 @item In6
2451 -32 @dots{} -1 or 1 @dots{} 32
2452
2453 @item IM2
2454 -65536 @dots{} -1
2455
2456 @item Ilb
2457 An 8 bit value with exactly one bit set.
2458
2459 @item Ilw
2460 A 16 bit value with exactly one bit set.
2461
2462 @item Sd
2463 The common src/dest memory addressing modes.
2464
2465 @item Sa
2466 Memory addressed using $a0 or $a1.
2467
2468 @item Si
2469 Memory addressed with immediate addresses.
2470
2471 @item Ss
2472 Memory addressed using the stack pointer ($sp).
2473
2474 @item Sf
2475 Memory addressed using the frame base register ($fb).
2476
2477 @item Ss
2478 Memory addressed using the small base register ($sb).
2479
2480 @item S1
2481 $r1h
2482 @end table
2483
2484 @item MIPS---@file{config/mips/constraints.md}
2485 @table @code
2486 @item d
2487 An address register.  This is equivalent to @code{r} unless
2488 generating MIPS16 code.
2489
2490 @item f
2491 A floating-point register (if available).
2492
2493 @item h
2494 The @code{hi} register.
2495
2496 @item l
2497 The @code{lo} register.
2498
2499 @item x
2500 The @code{hi} and @code{lo} registers.
2501
2502 @item c
2503 A register suitable for use in an indirect jump.  This will always be
2504 @code{$25} for @option{-mabicalls}.
2505
2506 @item y
2507 Equivalent to @code{r}; retained for backwards compatibility.
2508
2509 @item z
2510 A floating-point condition code register.
2511
2512 @item I
2513 A signed 16-bit constant (for arithmetic instructions).
2514
2515 @item J
2516 Integer zero.
2517
2518 @item K
2519 An unsigned 16-bit constant (for logic instructions).
2520
2521 @item L
2522 A signed 32-bit constant in which the lower 16 bits are zero.
2523 Such constants can be loaded using @code{lui}.
2524
2525 @item M
2526 A constant that cannot be loaded using @code{lui}, @code{addiu}
2527 or @code{ori}.
2528
2529 @item N
2530 A constant in the range -65535 to -1 (inclusive).
2531
2532 @item O
2533 A signed 15-bit constant.
2534
2535 @item P
2536 A constant in the range 1 to 65535 (inclusive).
2537
2538 @item G
2539 Floating-point zero.
2540
2541 @item R
2542 An address that can be used in a non-macro load or store.
2543 @end table
2544
2545 @item Motorola 680x0---@file{config/m68k/m68k.h}
2546 @table @code
2547 @item a
2548 Address register
2549
2550 @item d
2551 Data register
2552
2553 @item f
2554 68881 floating-point register, if available
2555
2556 @item I
2557 Integer in the range 1 to 8
2558
2559 @item J
2560 16-bit signed number
2561
2562 @item K
2563 Signed number whose magnitude is greater than 0x80
2564
2565 @item L
2566 Integer in the range @minus{}8 to @minus{}1
2567
2568 @item M
2569 Signed number whose magnitude is greater than 0x100
2570
2571 @item G
2572 Floating point constant that is not a 68881 constant
2573 @end table
2574
2575 @item Motorola 68HC11 & 68HC12 families---@file{config/m68hc11/m68hc11.h}
2576 @table @code
2577 @item a
2578 Register `a'
2579
2580 @item b
2581 Register `b'
2582
2583 @item d
2584 Register `d'
2585
2586 @item q
2587 An 8-bit register
2588
2589 @item t
2590 Temporary soft register _.tmp
2591
2592 @item u
2593 A soft register _.d1 to _.d31
2594
2595 @item w
2596 Stack pointer register
2597
2598 @item x
2599 Register `x'
2600
2601 @item y
2602 Register `y'
2603
2604 @item z
2605 Pseudo register `z' (replaced by `x' or `y' at the end)
2606
2607 @item A
2608 An address register: x, y or z
2609
2610 @item B
2611 An address register: x or y
2612
2613 @item D
2614 Register pair (x:d) to form a 32-bit value
2615
2616 @item L
2617 Constants in the range @minus{}65536 to 65535
2618
2619 @item M
2620 Constants whose 16-bit low part is zero
2621
2622 @item N
2623 Constant integer 1 or @minus{}1
2624
2625 @item O
2626 Constant integer 16
2627
2628 @item P
2629 Constants in the range @minus{}8 to 2
2630
2631 @end table
2632
2633 @need 1000
2634 @item SPARC---@file{config/sparc/sparc.h}
2635 @table @code
2636 @item f
2637 Floating-point register on the SPARC-V8 architecture and
2638 lower floating-point register on the SPARC-V9 architecture.
2639
2640 @item e
2641 Floating-point register.  It is equivalent to @samp{f} on the
2642 SPARC-V8 architecture and contains both lower and upper
2643 floating-point registers on the SPARC-V9 architecture.
2644
2645 @item c
2646 Floating-point condition code register.
2647
2648 @item d
2649 Lower floating-point register.  It is only valid on the SPARC-V9
2650 architecture when the Visual Instruction Set is available.
2651
2652 @item b
2653 Floating-point register.  It is only valid on the SPARC-V9 architecture
2654 when the Visual Instruction Set is available.
2655
2656 @item h
2657 64-bit global or out register for the SPARC-V8+ architecture.
2658
2659 @item I
2660 Signed 13-bit constant
2661
2662 @item J
2663 Zero
2664
2665 @item K
2666 32-bit constant with the low 12 bits clear (a constant that can be
2667 loaded with the @code{sethi} instruction)
2668
2669 @item L
2670 A constant in the range supported by @code{movcc} instructions
2671
2672 @item M
2673 A constant in the range supported by @code{movrcc} instructions
2674
2675 @item N
2676 Same as @samp{K}, except that it verifies that bits that are not in the
2677 lower 32-bit range are all zero.  Must be used instead of @samp{K} for
2678 modes wider than @code{SImode}
2679
2680 @item O
2681 The constant 4096
2682
2683 @item G
2684 Floating-point zero
2685
2686 @item H
2687 Signed 13-bit constant, sign-extended to 32 or 64 bits
2688
2689 @item Q
2690 Floating-point constant whose integral representation can
2691 be moved into an integer register using a single sethi
2692 instruction
2693
2694 @item R
2695 Floating-point constant whose integral representation can
2696 be moved into an integer register using a single mov
2697 instruction
2698
2699 @item S
2700 Floating-point constant whose integral representation can
2701 be moved into an integer register using a high/lo_sum
2702 instruction sequence
2703
2704 @item T
2705 Memory address aligned to an 8-byte boundary
2706
2707 @item U
2708 Even register
2709
2710 @item W
2711 Memory address for @samp{e} constraint registers
2712
2713 @item Y
2714 Vector zero
2715
2716 @end table
2717
2718 @item SPU---@file{config/spu/spu.h}
2719 @table @code
2720 @item a
2721 An immediate which can be loaded with the il/ila/ilh/ilhu instructions.  const_int is treated as a 64 bit value.
2722
2723 @item c
2724 An immediate for and/xor/or instructions.  const_int is treated as a 64 bit value.
2725
2726 @item d
2727 An immediate for the @code{iohl} instruction.  const_int is treated as a 64 bit value.
2728
2729 @item f
2730 An immediate which can be loaded with @code{fsmbi}.
2731
2732 @item A
2733 An immediate which can be loaded with the il/ila/ilh/ilhu instructions.  const_int is treated as a 32 bit value.
2734
2735 @item B
2736 An immediate for most arithmetic instructions.  const_int is treated as a 32 bit value.
2737
2738 @item C
2739 An immediate for and/xor/or instructions.  const_int is treated as a 32 bit value.
2740
2741 @item D
2742 An immediate for the @code{iohl} instruction.  const_int is treated as a 32 bit value.
2743
2744 @item I
2745 A constant in the range [-64, 63] for shift/rotate instructions.
2746
2747 @item J
2748 An unsigned 7-bit constant for conversion/nop/channel instructions.
2749
2750 @item K
2751 A signed 10-bit constant for most arithmetic instructions.
2752
2753 @item M
2754 A signed 16 bit immediate for @code{stop}.
2755
2756 @item N
2757 An unsigned 16-bit constant for @code{iohl} and @code{fsmbi}.
2758
2759 @item O
2760 An unsigned 7-bit constant whose 3 least significant bits are 0.
2761
2762 @item P
2763 An unsigned 3-bit constant for 16-byte rotates and shifts
2764
2765 @item R
2766 Call operand, reg, for indirect calls
2767
2768 @item S
2769 Call operand, symbol, for relative calls.
2770
2771 @item T
2772 Call operand, const_int, for absolute calls.
2773
2774 @item U
2775 An immediate which can be loaded with the il/ila/ilh/ilhu instructions.  const_int is sign extended to 128 bit.
2776
2777 @item W
2778 An immediate for shift and rotate instructions.  const_int is treated as a 32 bit value.
2779
2780 @item Y
2781 An immediate for and/xor/or instructions.  const_int is sign extended as a 128 bit.
2782
2783 @item Z
2784 An immediate for the @code{iohl} instruction.  const_int is sign extended to 128 bit.
2785
2786 @end table
2787
2788 @item TMS320C3x/C4x---@file{config/c4x/c4x.h}
2789 @table @code
2790 @item a
2791 Auxiliary (address) register (ar0-ar7)
2792
2793 @item b
2794 Stack pointer register (sp)
2795
2796 @item c
2797 Standard (32-bit) precision integer register
2798
2799 @item f
2800 Extended (40-bit) precision register (r0-r11)
2801
2802 @item k
2803 Block count register (bk)
2804
2805 @item q
2806 Extended (40-bit) precision low register (r0-r7)
2807
2808 @item t
2809 Extended (40-bit) precision register (r0-r1)
2810
2811 @item u
2812 Extended (40-bit) precision register (r2-r3)
2813
2814 @item v
2815 Repeat count register (rc)
2816
2817 @item x
2818 Index register (ir0-ir1)
2819
2820 @item y
2821 Status (condition code) register (st)
2822
2823 @item z
2824 Data page register (dp)
2825
2826 @item G
2827 Floating-point zero
2828
2829 @item H
2830 Immediate 16-bit floating-point constant
2831
2832 @item I
2833 Signed 16-bit constant
2834
2835 @item J
2836 Signed 8-bit constant
2837
2838 @item K
2839 Signed 5-bit constant
2840
2841 @item L
2842 Unsigned 16-bit constant
2843
2844 @item M
2845 Unsigned 8-bit constant
2846
2847 @item N
2848 Ones complement of unsigned 16-bit constant
2849
2850 @item O
2851 High 16-bit constant (32-bit constant with 16 LSBs zero)
2852
2853 @item Q
2854 Indirect memory reference with signed 8-bit or index register displacement
2855
2856 @item R
2857 Indirect memory reference with unsigned 5-bit displacement
2858
2859 @item S
2860 Indirect memory reference with 1 bit or index register displacement
2861
2862 @item T
2863 Direct memory reference
2864
2865 @item U
2866 Symbolic address
2867
2868 @end table
2869
2870 @item S/390 and zSeries---@file{config/s390/s390.h}
2871 @table @code
2872 @item a
2873 Address register (general purpose register except r0)
2874
2875 @item c
2876 Condition code register
2877
2878 @item d
2879 Data register (arbitrary general purpose register)
2880
2881 @item f
2882 Floating-point register
2883
2884 @item I
2885 Unsigned 8-bit constant (0--255)
2886
2887 @item J
2888 Unsigned 12-bit constant (0--4095)
2889
2890 @item K
2891 Signed 16-bit constant (@minus{}32768--32767)
2892
2893 @item L
2894 Value appropriate as displacement.
2895 @table @code
2896        @item (0..4095)
2897        for short displacement
2898        @item (-524288..524287)
2899        for long displacement
2900 @end table
2901
2902 @item M
2903 Constant integer with a value of 0x7fffffff.
2904
2905 @item N
2906 Multiple letter constraint followed by 4 parameter letters.
2907 @table @code
2908          @item 0..9:
2909          number of the part counting from most to least significant
2910          @item H,Q:
2911          mode of the part
2912          @item D,S,H:
2913          mode of the containing operand
2914          @item 0,F:
2915          value of the other parts (F---all bits set)
2916 @end table
2917 The constraint matches if the specified part of a constant
2918 has a value different from it's other parts.
2919
2920 @item Q
2921 Memory reference without index register and with short displacement.
2922
2923 @item R
2924 Memory reference with index register and short displacement.
2925
2926 @item S
2927 Memory reference without index register but with long displacement.
2928
2929 @item T
2930 Memory reference with index register and long displacement.
2931
2932 @item U
2933 Pointer with short displacement.
2934
2935 @item W
2936 Pointer with long displacement.
2937
2938 @item Y
2939 Shift count operand.
2940
2941 @end table
2942
2943 @item Score family---@file{config/score/score.h}
2944 @table @code
2945 @item d
2946 Registers from r0 to r32.
2947
2948 @item e
2949 Registers from r0 to r16.
2950
2951 @item t
2952 r8---r11 or r22---r27 registers.
2953
2954 @item h
2955 hi register.
2956
2957 @item l
2958 lo register.
2959
2960 @item x
2961 hi + lo register.
2962
2963 @item q
2964 cnt register.
2965
2966 @item y
2967 lcb register.
2968
2969 @item z
2970 scb register.
2971
2972 @item a
2973 cnt + lcb + scb register.
2974
2975 @item c
2976 cr0---cr15 register.
2977
2978 @item b
2979 cp1 registers.
2980
2981 @item f
2982 cp2 registers.
2983
2984 @item i
2985 cp3 registers.
2986
2987 @item j
2988 cp1 + cp2 + cp3 registers.
2989
2990 @item I
2991 High 16-bit constant (32-bit constant with 16 LSBs zero).
2992
2993 @item J
2994 Unsigned 5 bit integer (in the range 0 to 31).
2995
2996 @item K
2997 Unsigned 16 bit integer (in the range 0 to 65535).
2998
2999 @item L
3000 Signed 16 bit integer (in the range @minus{}32768 to 32767).
3001
3002 @item M
3003 Unsigned 14 bit integer (in the range 0 to 16383).
3004
3005 @item N
3006 Signed 14 bit integer (in the range @minus{}8192 to 8191).
3007
3008 @item Z
3009 Any SYMBOL_REF.
3010 @end table
3011
3012 @item Xstormy16---@file{config/stormy16/stormy16.h}
3013 @table @code
3014 @item a
3015 Register r0.
3016
3017 @item b
3018 Register r1.
3019
3020 @item c
3021 Register r2.
3022
3023 @item d
3024 Register r8.
3025
3026 @item e
3027 Registers r0 through r7.
3028
3029 @item t
3030 Registers r0 and r1.
3031
3032 @item y
3033 The carry register.
3034
3035 @item z
3036 Registers r8 and r9.
3037
3038 @item I
3039 A constant between 0 and 3 inclusive.
3040
3041 @item J
3042 A constant that has exactly one bit set.
3043
3044 @item K
3045 A constant that has exactly one bit clear.
3046
3047 @item L
3048 A constant between 0 and 255 inclusive.
3049
3050 @item M
3051 A constant between @minus{}255 and 0 inclusive.
3052
3053 @item N
3054 A constant between @minus{}3 and 0 inclusive.
3055
3056 @item O
3057 A constant between 1 and 4 inclusive.
3058
3059 @item P
3060 A constant between @minus{}4 and @minus{}1 inclusive.
3061
3062 @item Q
3063 A memory reference that is a stack push.
3064
3065 @item R
3066 A memory reference that is a stack pop.
3067
3068 @item S
3069 A memory reference that refers to a constant address of known value.
3070
3071 @item T
3072 The register indicated by Rx (not implemented yet).
3073
3074 @item U
3075 A constant that is not between 2 and 15 inclusive.
3076
3077 @item Z
3078 The constant 0.
3079
3080 @end table
3081
3082 @item Xtensa---@file{config/xtensa/constraints.md}
3083 @table @code
3084 @item a
3085 General-purpose 32-bit register
3086
3087 @item b
3088 One-bit boolean register
3089
3090 @item A
3091 MAC16 40-bit accumulator register
3092
3093 @item I
3094 Signed 12-bit integer constant, for use in MOVI instructions
3095
3096 @item J
3097 Signed 8-bit integer constant, for use in ADDI instructions
3098
3099 @item K
3100 Integer constant valid for BccI instructions
3101
3102 @item L
3103 Unsigned constant valid for BccUI instructions
3104
3105 @end table
3106
3107 @end table
3108
3109 @ifset INTERNALS
3110 @node Define Constraints
3111 @subsection Defining Machine-Specific Constraints
3112 @cindex defining constraints
3113 @cindex constraints, defining
3114
3115 Machine-specific constraints fall into two categories: register and
3116 non-register constraints.  Within the latter category, constraints
3117 which allow subsets of all possible memory or address operands should
3118 be specially marked, to give @code{reload} more information.
3119
3120 Machine-specific constraints can be given names of arbitrary length,
3121 but they must be entirely composed of letters, digits, underscores
3122 (@samp{_}), and angle brackets (@samp{< >}).  Like C identifiers, they
3123 must begin with a letter or underscore.
3124
3125 In order to avoid ambiguity in operand constraint strings, no
3126 constraint can have a name that begins with any other constraint's
3127 name.  For example, if @code{x} is defined as a constraint name,
3128 @code{xy} may not be, and vice versa.  As a consequence of this rule,
3129 no constraint may begin with one of the generic constraint letters:
3130 @samp{E F V X g i m n o p r s}.
3131
3132 Register constraints correspond directly to register classes.
3133 @xref{Register Classes}.  There is thus not much flexibility in their
3134 definitions.
3135
3136 @deffn {MD Expression} define_register_constraint name regclass docstring
3137 All three arguments are string constants.
3138 @var{name} is the name of the constraint, as it will appear in
3139 @code{match_operand} expressions.  If @var{name} is a multi-letter
3140 constraint its length shall be the same for all constraints starting
3141 with the same letter.  @var{regclass} can be either the
3142 name of the corresponding register class (@pxref{Register Classes}),
3143 or a C expression which evaluates to the appropriate register class.
3144 If it is an expression, it must have no side effects, and it cannot
3145 look at the operand.  The usual use of expressions is to map some
3146 register constraints to @code{NO_REGS} when the register class
3147 is not available on a given subarchitecture.
3148
3149 @var{docstring} is a sentence documenting the meaning of the
3150 constraint.  Docstrings are explained further below.
3151 @end deffn
3152
3153 Non-register constraints are more like predicates: the constraint
3154 definition gives a Boolean expression which indicates whether the
3155 constraint matches.
3156
3157 @deffn {MD Expression} define_constraint name docstring exp
3158 The @var{name} and @var{docstring} arguments are the same as for
3159 @code{define_register_constraint}, but note that the docstring comes
3160 immediately after the name for these expressions.  @var{exp} is an RTL
3161 expression, obeying the same rules as the RTL expressions in predicate
3162 definitions.  @xref{Defining Predicates}, for details.  If it
3163 evaluates true, the constraint matches; if it evaluates false, it
3164 doesn't. Constraint expressions should indicate which RTL codes they
3165 might match, just like predicate expressions.
3166
3167 @code{match_test} C expressions have access to the
3168 following variables:
3169
3170 @table @var
3171 @item op
3172 The RTL object defining the operand.
3173 @item mode
3174 The machine mode of @var{op}.
3175 @item ival
3176 @samp{INTVAL (@var{op})}, if @var{op} is a @code{const_int}.
3177 @item hval
3178 @samp{CONST_DOUBLE_HIGH (@var{op})}, if @var{op} is an integer
3179 @code{const_double}.
3180 @item lval
3181 @samp{CONST_DOUBLE_LOW (@var{op})}, if @var{op} is an integer
3182 @code{const_double}.
3183 @item rval
3184 @samp{CONST_DOUBLE_REAL_VALUE (@var{op})}, if @var{op} is a floating-point
3185 @code{const_double}.
3186 @end table
3187
3188 The @var{*val} variables should only be used once another piece of the
3189 expression has verified that @var{op} is the appropriate kind of RTL
3190 object.
3191 @end deffn
3192
3193 Most non-register constraints should be defined with
3194 @code{define_constraint}.  The remaining two definition expressions
3195 are only appropriate for constraints that should be handled specially
3196 by @code{reload} if they fail to match.
3197
3198 @deffn {MD Expression} define_memory_constraint name docstring exp
3199 Use this expression for constraints that match a subset of all memory
3200 operands: that is, @code{reload} can make them match by converting the
3201 operand to the form @samp{@w{(mem (reg @var{X}))}}, where @var{X} is a
3202 base register (from the register class specified by
3203 @code{BASE_REG_CLASS}, @pxref{Register Classes}).
3204
3205 For example, on the S/390, some instructions do not accept arbitrary
3206 memory references, but only those that do not make use of an index
3207 register.  The constraint letter @samp{Q} is defined to represent a
3208 memory address of this type.  If @samp{Q} is defined with
3209 @code{define_memory_constraint}, a @samp{Q} constraint can handle any
3210 memory operand, because @code{reload} knows it can simply copy the
3211 memory address into a base register if required.  This is analogous to
3212 the way a @samp{o} constraint can handle any memory operand.
3213
3214 The syntax and semantics are otherwise identical to
3215 @code{define_constraint}.
3216 @end deffn
3217
3218 @deffn {MD Expression} define_address_constraint name docstring exp
3219 Use this expression for constraints that match a subset of all address
3220 operands: that is, @code{reload} can make the constraint match by
3221 converting the operand to the form @samp{@w{(reg @var{X})}}, again
3222 with @var{X} a base register.
3223
3224 Constraints defined with @code{define_address_constraint} can only be
3225 used with the @code{address_operand} predicate, or machine-specific
3226 predicates that work the same way.  They are treated analogously to
3227 the generic @samp{p} constraint.
3228
3229 The syntax and semantics are otherwise identical to
3230 @code{define_constraint}.
3231 @end deffn
3232
3233 For historical reasons, names beginning with the letters @samp{G H}
3234 are reserved for constraints that match only @code{const_double}s, and
3235 names beginning with the letters @samp{I J K L M N O P} are reserved
3236 for constraints that match only @code{const_int}s.  This may change in
3237 the future.  For the time being, constraints with these names must be
3238 written in a stylized form, so that @code{genpreds} can tell you did
3239 it correctly:
3240
3241 @smallexample
3242 @group
3243 (define_constraint "[@var{GHIJKLMNOP}]@dots{}"
3244   "@var{doc}@dots{}"
3245   (and (match_code "const_int")  ; @r{@code{const_double} for G/H}
3246        @var{condition}@dots{}))            ; @r{usually a @code{match_test}}
3247 @end group
3248 @end smallexample
3249 @c the semicolons line up in the formatted manual
3250
3251 It is fine to use names beginning with other letters for constraints
3252 that match @code{const_double}s or @code{const_int}s.
3253
3254 Each docstring in a constraint definition should be one or more complete
3255 sentences, marked up in Texinfo format.  @emph{They are currently unused.}
3256 In the future they will be copied into the GCC manual, in @ref{Machine
3257 Constraints}, replacing the hand-maintained tables currently found in
3258 that section.  Also, in the future the compiler may use this to give
3259 more helpful diagnostics when poor choice of @code{asm} constraints
3260 causes a reload failure.
3261
3262 If you put the pseudo-Texinfo directive @samp{@@internal} at the
3263 beginning of a docstring, then (in the future) it will appear only in
3264 the internals manual's version of the machine-specific constraint tables.
3265 Use this for constraints that should not appear in @code{asm} statements.
3266
3267 @node C Constraint Interface
3268 @subsection Testing constraints from C
3269 @cindex testing constraints
3270 @cindex constraints, testing
3271
3272 It is occasionally useful to test a constraint from C code rather than
3273 implicitly via the constraint string in a @code{match_operand}.  The
3274 generated file @file{tm_p.h} declares a few interfaces for working
3275 with machine-specific constraints.  None of these interfaces work with
3276 the generic constraints described in @ref{Simple Constraints}.  This
3277 may change in the future.
3278
3279 @strong{Warning:} @file{tm_p.h} may declare other functions that
3280 operate on constraints, besides the ones documented here.  Do not use
3281 those functions from machine-dependent code.  They exist to implement
3282 the old constraint interface that machine-independent components of
3283 the compiler still expect.  They will change or disappear in the
3284 future.
3285
3286 Some valid constraint names are not valid C identifiers, so there is a
3287 mangling scheme for referring to them from C@.  Constraint names that
3288 do not contain angle brackets or underscores are left unchanged.
3289 Underscores are doubled, each @samp{<} is replaced with @samp{_l}, and
3290 each @samp{>} with @samp{_g}.  Here are some examples:
3291
3292 @c the @c's prevent double blank lines in the printed manual.
3293 @example
3294 @multitable {Original} {Mangled}
3295 @item @strong{Original} @tab @strong{Mangled}  @c
3296 @item @code{x}     @tab @code{x}       @c
3297 @item @code{P42x}  @tab @code{P42x}    @c
3298 @item @code{P4_x}  @tab @code{P4__x}   @c
3299 @item @code{P4>x}  @tab @code{P4_gx}   @c
3300 @item @code{P4>>}  @tab @code{P4_g_g}  @c
3301 @item @code{P4_g>} @tab @code{P4__g_g} @c
3302 @end multitable
3303 @end example
3304
3305 Throughout this section, the variable @var{c} is either a constraint
3306 in the abstract sense, or a constant from @code{enum constraint_num};
3307 the variable @var{m} is a mangled constraint name (usually as part of
3308 a larger identifier).
3309
3310 @deftp Enum constraint_num
3311 For each machine-specific constraint, there is a corresponding
3312 enumeration constant: @samp{CONSTRAINT_} plus the mangled name of the
3313 constraint.  Functions that take an @code{enum constraint_num} as an
3314 argument expect one of these constants.
3315
3316 Machine-independent constraints do not have associated constants.
3317 This may change in the future.
3318 @end deftp
3319
3320 @deftypefun {inline bool} satisfies_constraint_@var{m} (rtx @var{exp})
3321 For each machine-specific, non-register constraint @var{m}, there is
3322 one of these functions; it returns @code{true} if @var{exp} satisfies the
3323 constraint.  These functions are only visible if @file{rtl.h} was included
3324 before @file{tm_p.h}.
3325 @end deftypefun
3326
3327 @deftypefun bool constraint_satisfied_p (rtx @var{exp}, enum constraint_num @var{c})
3328 Like the @code{satisfies_constraint_@var{m}} functions, but the
3329 constraint to test is given as an argument, @var{c}.  If @var{c}
3330 specifies a register constraint, this function will always return
3331 @code{false}.
3332 @end deftypefun
3333
3334 @deftypefun {enum reg_class} regclass_for_constraint (enum constraint_num @var{c})
3335 Returns the register class associated with @var{c}.  If @var{c} is not
3336 a register constraint, or those registers are not available for the
3337 currently selected subtarget, returns @code{NO_REGS}.
3338 @end deftypefun
3339
3340 Here is an example use of @code{satisfies_constraint_@var{m}}.  In
3341 peephole optimizations (@pxref{Peephole Definitions}), operand
3342 constraint strings are ignored, so if there are relevant constraints,
3343 they must be tested in the C condition.  In the example, the
3344 optimization is applied if operand 2 does @emph{not} satisfy the
3345 @samp{K} constraint.  (This is a simplified version of a peephole
3346 definition from the i386 machine description.)
3347
3348 @smallexample
3349 (define_peephole2
3350   [(match_scratch:SI 3 "r")
3351    (set (match_operand:SI 0 "register_operand" "")
3352         (mult:SI (match_operand:SI 1 "memory_operand" "")
3353                  (match_operand:SI 2 "immediate_operand" "")))]
3354
3355   "!satisfies_constraint_K (operands[2])"
3356
3357   [(set (match_dup 3) (match_dup 1))
3358    (set (match_dup 0) (mult:SI (match_dup 3) (match_dup 2)))]
3359
3360   "")
3361 @end smallexample
3362
3363 @node Standard Names
3364 @section Standard Pattern Names For Generation
3365 @cindex standard pattern names
3366 @cindex pattern names
3367 @cindex names, pattern
3368
3369 Here is a table of the instruction names that are meaningful in the RTL
3370 generation pass of the compiler.  Giving one of these names to an
3371 instruction pattern tells the RTL generation pass that it can use the
3372 pattern to accomplish a certain task.
3373
3374 @table @asis
3375 @cindex @code{mov@var{m}} instruction pattern
3376 @item @samp{mov@var{m}}
3377 Here @var{m} stands for a two-letter machine mode name, in lowercase.
3378 This instruction pattern moves data with that machine mode from operand
3379 1 to operand 0.  For example, @samp{movsi} moves full-word data.
3380
3381 If operand 0 is a @code{subreg} with mode @var{m} of a register whose
3382 own mode is wider than @var{m}, the effect of this instruction is
3383 to store the specified value in the part of the register that corresponds
3384 to mode @var{m}.  Bits outside of @var{m}, but which are within the
3385 same target word as the @code{subreg} are undefined.  Bits which are
3386 outside the target word are left unchanged.
3387
3388 This class of patterns is special in several ways.  First of all, each
3389 of these names up to and including full word size @emph{must} be defined,
3390 because there is no other way to copy a datum from one place to another.
3391 If there are patterns accepting operands in larger modes,
3392 @samp{mov@var{m}} must be defined for integer modes of those sizes.
3393
3394 Second, these patterns are not used solely in the RTL generation pass.
3395 Even the reload pass can generate move insns to copy values from stack
3396 slots into temporary registers.  When it does so, one of the operands is
3397 a hard register and the other is an operand that can need to be reloaded
3398 into a register.
3399
3400 @findex force_reg
3401 Therefore, when given such a pair of operands, the pattern must generate
3402 RTL which needs no reloading and needs no temporary registers---no
3403 registers other than the operands.  For example, if you support the
3404 pattern with a @code{define_expand}, then in such a case the
3405 @code{define_expand} mustn't call @code{force_reg} or any other such
3406 function which might generate new pseudo registers.
3407
3408 This requirement exists even for subword modes on a RISC machine where
3409 fetching those modes from memory normally requires several insns and
3410 some temporary registers.
3411
3412 @findex change_address
3413 During reload a memory reference with an invalid address may be passed
3414 as an operand.  Such an address will be replaced with a valid address
3415 later in the reload pass.  In this case, nothing may be done with the
3416 address except to use it as it stands.  If it is copied, it will not be
3417 replaced with a valid address.  No attempt should be made to make such
3418 an address into a valid address and no routine (such as
3419 @code{change_address}) that will do so may be called.  Note that
3420 @code{general_operand} will fail when applied to such an address.
3421
3422 @findex reload_in_progress
3423 The global variable @code{reload_in_progress} (which must be explicitly
3424 declared if required) can be used to determine whether such special
3425 handling is required.
3426
3427 The variety of operands that have reloads depends on the rest of the
3428 machine description, but typically on a RISC machine these can only be
3429 pseudo registers that did not get hard registers, while on other
3430 machines explicit memory references will get optional reloads.
3431
3432 If a scratch register is required to move an object to or from memory,
3433 it can be allocated using @code{gen_reg_rtx} prior to life analysis.
3434
3435 If there are cases which need scratch registers during or after reload,
3436 you must provide an appropriate secondary_reload target hook.
3437
3438 @findex no_new_pseudos
3439 The global variable @code{no_new_pseudos} can be used to determine if it
3440 is unsafe to create new pseudo registers.  If this variable is nonzero, then
3441 it is unsafe to call @code{gen_reg_rtx} to allocate a new pseudo.
3442
3443 The constraints on a @samp{mov@var{m}} must permit moving any hard
3444 register to any other hard register provided that
3445 @code{HARD_REGNO_MODE_OK} permits mode @var{m} in both registers and
3446 @code{REGISTER_MOVE_COST} applied to their classes returns a value of 2.
3447
3448 It is obligatory to support floating point @samp{mov@var{m}}
3449 instructions into and out of any registers that can hold fixed point
3450 values, because unions and structures (which have modes @code{SImode} or
3451 @code{DImode}) can be in those registers and they may have floating
3452 point members.
3453
3454 There may also be a need to support fixed point @samp{mov@var{m}}
3455 instructions in and out of floating point registers.  Unfortunately, I
3456 have forgotten why this was so, and I don't know whether it is still
3457 true.  If @code{HARD_REGNO_MODE_OK} rejects fixed point values in
3458 floating point registers, then the constraints of the fixed point
3459 @samp{mov@var{m}} instructions must be designed to avoid ever trying to
3460 reload into a floating point register.
3461
3462 @cindex @code{reload_in} instruction pattern
3463 @cindex @code{reload_out} instruction pattern
3464 @item @samp{reload_in@var{m}}
3465 @itemx @samp{reload_out@var{m}}
3466 These named patterns have been obsoleted by the target hook
3467 @code{secondary_reload}.
3468
3469 Like @samp{mov@var{m}}, but used when a scratch register is required to
3470 move between operand 0 and operand 1.  Operand 2 describes the scratch
3471 register.  See the discussion of the @code{SECONDARY_RELOAD_CLASS}
3472 macro in @pxref{Register Classes}.
3473
3474 There are special restrictions on the form of the @code{match_operand}s
3475 used in these patterns.  First, only the predicate for the reload
3476 operand is examined, i.e., @code{reload_in} examines operand 1, but not
3477 the predicates for operand 0 or 2.  Second, there may be only one
3478 alternative in the constraints.  Third, only a single register class
3479 letter may be used for the constraint; subsequent constraint letters
3480 are ignored.  As a special exception, an empty constraint string
3481 matches the @code{ALL_REGS} register class.  This may relieve ports
3482 of the burden of defining an @code{ALL_REGS} constraint letter just
3483 for these patterns.
3484
3485 @cindex @code{movstrict@var{m}} instruction pattern
3486 @item @samp{movstrict@var{m}}
3487 Like @samp{mov@var{m}} except that if operand 0 is a @code{subreg}
3488 with mode @var{m} of a register whose natural mode is wider,
3489 the @samp{movstrict@var{m}} instruction is guaranteed not to alter
3490 any of the register except the part which belongs to mode @var{m}.
3491
3492 @cindex @code{movmisalign@var{m}} instruction pattern
3493 @item @samp{movmisalign@var{m}}
3494 This variant of a move pattern is designed to load or store a value
3495 from a memory address that is not naturally aligned for its mode.
3496 For a store, the memory will be in operand 0; for a load, the memory
3497 will be in operand 1.  The other operand is guaranteed not to be a
3498 memory, so that it's easy to tell whether this is a load or store.
3499
3500 This pattern is used by the autovectorizer, and when expanding a
3501 @code{MISALIGNED_INDIRECT_REF} expression.
3502
3503 @cindex @code{load_multiple} instruction pattern
3504 @item @samp{load_multiple}
3505 Load several consecutive memory locations into consecutive registers.
3506 Operand 0 is the first of the consecutive registers, operand 1
3507 is the first memory location, and operand 2 is a constant: the
3508 number of consecutive registers.
3509
3510 Define this only if the target machine really has such an instruction;
3511 do not define this if the most efficient way of loading consecutive
3512 registers from memory is to do them one at a time.
3513
3514 On some machines, there are restrictions as to which consecutive
3515 registers can be stored into memory, such as particular starting or
3516 ending register numbers or only a range of valid counts.  For those
3517 machines, use a @code{define_expand} (@pxref{Expander Definitions})
3518 and make the pattern fail if the restrictions are not met.
3519
3520 Write the generated insn as a @code{parallel} with elements being a
3521 @code{set} of one register from the appropriate memory location (you may
3522 also need @code{use} or @code{clobber} elements).  Use a
3523 @code{match_parallel} (@pxref{RTL Template}) to recognize the insn.  See
3524 @file{rs6000.md} for examples of the use of this insn pattern.
3525
3526 @cindex @samp{store_multiple} instruction pattern
3527 @item @samp{store_multiple}
3528 Similar to @samp{load_multiple}, but store several consecutive registers
3529 into consecutive memory locations.  Operand 0 is the first of the
3530 consecutive memory locations, operand 1 is the first register, and
3531 operand 2 is a constant: the number of consecutive registers.
3532
3533 @cindex @code{vec_set@var{m}} instruction pattern
3534 @item @samp{vec_set@var{m}}
3535 Set given field in the vector value.  Operand 0 is the vector to modify,
3536 operand 1 is new value of field and operand 2 specify the field index.
3537
3538 @cindex @code{vec_extract@var{m}} instruction pattern
3539 @item @samp{vec_extract@var{m}}
3540 Extract given field from the vector value.  Operand 1 is the vector, operand 2
3541 specify field index and operand 0 place to store value into.
3542
3543 @cindex @code{vec_extract_even@var{m}} instruction pattern
3544 @item @samp{vec_extract_even@var{m}}
3545 Extract even elements from the input vectors (operand 1 and operand 2).
3546 The even elements of operand 2 are concatenated to the even elements of operand
3547 1 in their original order. The result is stored in operand 0.
3548 The output and input vectors should have the same modes.
3549
3550 @cindex @code{vec_extract_odd@var{m}} instruction pattern
3551 @item @samp{vec_extract_odd@var{m}}
3552 Extract odd elements from the input vectors (operand 1 and operand 2).
3553 The odd elements of operand 2 are concatenated to the odd elements of operand
3554 1 in their original order. The result is stored in operand 0.
3555 The output and input vectors should have the same modes.
3556
3557 @cindex @code{vec_interleave_high@var{m}} instruction pattern
3558 @item @samp{vec_interleave_high@var{m}}
3559 Merge high elements of the two input vectors into the output vector. The output
3560 and input vectors should have the same modes (@code{N} elements). The high
3561 @code{N/2} elements of the first input vector are interleaved with the high
3562 @code{N/2} elements of the second input vector.
3563
3564 @cindex @code{vec_interleave_low@var{m}} instruction pattern
3565 @item @samp{vec_interleave_low@var{m}}
3566 Merge low elements of the two input vectors into the output vector. The output
3567 and input vectors should have the same modes (@code{N} elements). The low
3568 @code{N/2} elements of the first input vector are interleaved with the low
3569 @code{N/2} elements of the second input vector.
3570
3571 @cindex @code{vec_init@var{m}} instruction pattern
3572 @item @samp{vec_init@var{m}}
3573 Initialize the vector to given values.  Operand 0 is the vector to initialize
3574 and operand 1 is parallel containing values for individual fields.
3575
3576 @cindex @code{push@var{m}1} instruction pattern
3577 @item @samp{push@var{m}1}
3578 Output a push instruction.  Operand 0 is value to push.  Used only when
3579 @code{PUSH_ROUNDING} is defined.  For historical reason, this pattern may be
3580 missing and in such case an @code{mov} expander is used instead, with a
3581 @code{MEM} expression forming the push operation.  The @code{mov} expander
3582 method is deprecated.
3583
3584 @cindex @code{add@var{m}3} instruction pattern
3585 @item @samp{add@var{m}3}
3586 Add operand 2 and operand 1, storing the result in operand 0.  All operands
3587 must have mode @var{m}.  This can be used even on two-address machines, by
3588 means of constraints requiring operands 1 and 0 to be the same location.
3589
3590 @cindex @code{sub@var{m}3} instruction pattern
3591 @cindex @code{mul@var{m}3} instruction pattern
3592 @cindex @code{div@var{m}3} instruction pattern
3593 @cindex @code{udiv@var{m}3} instruction pattern
3594 @cindex @code{mod@var{m}3} instruction pattern
3595 @cindex @code{umod@var{m}3} instruction pattern
3596 @cindex @code{umin@var{m}3} instruction pattern
3597 @cindex @code{umax@var{m}3} instruction pattern
3598 @cindex @code{and@var{m}3} instruction pattern
3599 @cindex @code{ior@var{m}3} instruction pattern
3600 @cindex @code{xor@var{m}3} instruction pattern
3601 @item @samp{sub@var{m}3}, @samp{mul@var{m}3}
3602 @itemx @samp{div@var{m}3}, @samp{udiv@var{m}3}
3603 @itemx @samp{mod@var{m}3}, @samp{umod@var{m}3}
3604 @itemx @samp{umin@var{m}3}, @samp{umax@var{m}3}
3605 @itemx @samp{and@var{m}3}, @samp{ior@var{m}3}, @samp{xor@var{m}3}
3606 Similar, for other arithmetic operations.
3607
3608 @cindex @code{min@var{m}3} instruction pattern
3609 @cindex @code{max@var{m}3} instruction pattern
3610 @item @samp{smin@var{m}3}, @samp{smax@var{m}3}
3611 Signed minimum and maximum operations.  When used with floating point,
3612 if both operands are zeros, or if either operand is @code{NaN}, then
3613 it is unspecified which of the two operands is returned as the result.
3614
3615 @cindex @code{reduc_smin_@var{m}} instruction pattern
3616 @cindex @code{reduc_smax_@var{m}} instruction pattern
3617 @item @samp{reduc_smin_@var{m}}, @samp{reduc_smax_@var{m}}
3618 Find the signed minimum/maximum of the elements of a vector. The vector is
3619 operand 1, and the scalar result is stored in the least significant bits of
3620 operand 0 (also a vector). The output and input vector should have the same
3621 modes.
3622
3623 @cindex @code{reduc_umin_@var{m}} instruction pattern
3624 @cindex @code{reduc_umax_@var{m}} instruction pattern
3625 @item @samp{reduc_umin_@var{m}}, @samp{reduc_umax_@var{m}}
3626 Find the unsigned minimum/maximum of the elements of a vector. The vector is
3627 operand 1, and the scalar result is stored in the least significant bits of
3628 operand 0 (also a vector). The output and input vector should have the same
3629 modes.
3630
3631 @cindex @code{reduc_splus_@var{m}} instruction pattern
3632 @item @samp{reduc_splus_@var{m}}
3633 Compute the sum of the signed elements of a vector. The vector is operand 1,
3634 and the scalar result is stored in the least significant bits of operand 0
3635 (also a vector). The output and input vector should have the same modes.
3636
3637 @cindex @code{reduc_uplus_@var{m}} instruction pattern
3638 @item @samp{reduc_uplus_@var{m}}
3639 Compute the sum of the unsigned elements of a vector. The vector is operand 1,
3640 and the scalar result is stored in the least significant bits of operand 0
3641 (also a vector). The output and input vector should have the same modes.
3642
3643 @cindex @code{sdot_prod@var{m}} instruction pattern
3644 @item @samp{sdot_prod@var{m}}
3645 @cindex @code{udot_prod@var{m}} instruction pattern
3646 @item @samp{udot_prod@var{m}}
3647 Compute the sum of the products of two signed/unsigned elements.
3648 Operand 1 and operand 2 are of the same mode. Their product, which is of a
3649 wider mode, is computed and added to operand 3. Operand 3 is of a mode equal or
3650 wider than the mode of the product. The result is placed in operand 0, which
3651 is of the same mode as operand 3.
3652
3653 @cindex @code{ssum_widen@var{m3}} instruction pattern
3654 @item @samp{ssum_widen@var{m3}}
3655 @cindex @code{usum_widen@var{m3}} instruction pattern
3656 @item @samp{usum_widen@var{m3}}
3657 Operands 0 and 2 are of the same mode, which is wider than the mode of
3658 operand 1. Add operand 1 to operand 2 and place the widened result in
3659 operand 0. (This is used express accumulation of elements into an accumulator
3660 of a wider mode.)
3661
3662 @cindex @code{vec_shl_@var{m}} instruction pattern
3663 @cindex @code{vec_shr_@var{m}} instruction pattern
3664 @item @samp{vec_shl_@var{m}}, @samp{vec_shr_@var{m}}
3665 Whole vector left/right shift in bits.
3666 Operand 1 is a vector to be shifted.
3667 Operand 2 is an integer shift amount in bits.
3668 Operand 0 is where the resulting shifted vector is stored.
3669 The output and input vectors should have the same modes.
3670
3671 @cindex @code{vec_pack_trunc_@var{m}} instruction pattern
3672 @item @samp{vec_pack_trunc_@var{m}}
3673 Narrow (demote) and merge the elements of two vectors. Operands 1 and 2
3674 are vectors of the same mode having N integral or floating point elements
3675 of size S.  Operand 0 is the resulting vector in which 2*N elements of
3676 size N/2 are concatenated after narrowing them down using truncation.
3677
3678 @cindex @code{vec_pack_ssat_@var{m}} instruction pattern
3679 @cindex @code{vec_pack_usat_@var{m}} instruction pattern
3680 @item @samp{vec_pack_ssat_@var{m}}, @samp{vec_pack_usat_@var{m}}
3681 Narrow (demote) and merge the elements of two vectors.  Operands 1 and 2
3682 are vectors of the same mode having N integral elements of size S.
3683 Operand 0 is the resulting vector in which the elements of the two input
3684 vectors are concatenated after narrowing them down using signed/unsigned
3685 saturating arithmetic.
3686
3687 @cindex @code{vec_pack_sfix_trunc_@var{m}} instruction pattern
3688 @cindex @code{vec_pack_ufix_trunc_@var{m}} instruction pattern
3689 @item @samp{vec_pack_sfix_trunc_@var{m}}, @samp{vec_pack_ufix_trunc_@var{m}}
3690 Narrow, convert to signed/unsigned integral type and merge the elements
3691 of two vectors.  Operands 1 and 2 are vectors of the same mode having N
3692 floating point elements of size S.  Operand 0 is the resulting vector
3693 in which 2*N elements of size N/2 are concatenated.
3694
3695 @cindex @code{vec_unpacks_hi_@var{m}} instruction pattern
3696 @cindex @code{vec_unpacks_lo_@var{m}} instruction pattern
3697 @item @samp{vec_unpacks_hi_@var{m}}, @samp{vec_unpacks_lo_@var{m}}
3698 Extract and widen (promote) the high/low part of a vector of signed
3699 integral or floating point elements.  The input vector (operand 1) has N
3700 elements of size S.  Widen (promote) the high/low elements of the vector
3701 using signed or floating point extension and place the resulting N/2
3702 values of size 2*S in the output vector (operand 0).
3703
3704 @cindex @code{vec_unpacku_hi_@var{m}} instruction pattern
3705 @cindex @code{vec_unpacku_lo_@var{m}} instruction pattern
3706 @item @samp{vec_unpacku_hi_@var{m}}, @samp{vec_unpacku_lo_@var{m}}
3707 Extract and widen (promote) the high/low part of a vector of unsigned
3708 integral elements.  The input vector (operand 1) has N elements of size S.
3709 Widen (promote) the high/low elements of the vector using zero extension and
3710 place the resulting N/2 values of size 2*S in the output vector (operand 0).
3711
3712 @cindex @code{vec_unpacks_float_hi_@var{m}} instruction pattern
3713 @cindex @code{vec_unpacks_float_lo_@var{m}} instruction pattern
3714 @cindex @code{vec_unpacku_float_hi_@var{m}} instruction pattern
3715 @cindex @code{vec_unpacku_float_lo_@var{m}} instruction pattern
3716 @item @samp{vec_unpacks_float_hi_@var{m}}, @samp{vec_unpacks_float_lo_@var{m}}
3717 @itemx @samp{vec_unpacku_float_hi_@var{m}}, @samp{vec_unpacku_float_lo_@var{m}}
3718 Extract, convert to floating point type and widen the high/low part of a
3719 vector of signed/unsigned integral elements.  The input vector (operand 1)
3720 has N elements of size S.  Convert the high/low elements of the vector using
3721 floating point conversion and place the resulting N/2 values of size 2*S in
3722 the output vector (operand 0).
3723
3724 @cindex @code{vec_widen_umult_hi_@var{m}} instruction pattern
3725 @cindex @code{vec_widen_umult_lo__@var{m}} instruction pattern
3726 @cindex @code{vec_widen_smult_hi_@var{m}} instruction pattern
3727 @cindex @code{vec_widen_smult_lo_@var{m}} instruction pattern
3728 @item @samp{vec_widen_umult_hi_@var{m}}, @samp{vec_widen_umult_lo_@var{m}}
3729 @itemx @samp{vec_widen_smult_hi_@var{m}}, @samp{vec_widen_smult_lo_@var{m}}
3730 Signed/Unsigned widening multiplication.  The two inputs (operands 1 and 2)
3731 are vectors with N signed/unsigned elements of size S.  Multiply the high/low
3732 elements of the two vectors, and put the N/2 products of size 2*S in the
3733 output vector (operand 0).
3734
3735 @cindex @code{mulhisi3} instruction pattern
3736 @item @samp{mulhisi3}
3737 Multiply operands 1 and 2, which have mode @code{HImode}, and store
3738 a @code{SImode} product in operand 0.
3739
3740 @cindex @code{mulqihi3} instruction pattern
3741 @cindex @code{mulsidi3} instruction pattern
3742 @item @samp{mulqihi3}, @samp{mulsidi3}
3743 Similar widening-multiplication instructions of other widths.
3744
3745 @cindex @code{umulqihi3} instruction pattern
3746 @cindex @code{umulhisi3} instruction pattern
3747 @cindex @code{umulsidi3} instruction pattern
3748 @item @samp{umulqihi3}, @samp{umulhisi3}, @samp{umulsidi3}
3749 Similar widening-multiplication instructions that do unsigned
3750 multiplication.
3751
3752 @cindex @code{usmulqihi3} instruction pattern
3753 @cindex @code{usmulhisi3} instruction pattern
3754 @cindex @code{usmulsidi3} instruction pattern
3755 @item @samp{usmulqihi3}, @samp{usmulhisi3}, @samp{usmulsidi3}
3756 Similar widening-multiplication instructions that interpret the first
3757 operand as unsigned and the second operand as signed, then do a signed
3758 multiplication.
3759
3760 @cindex @code{smul@var{m}3_highpart} instruction pattern
3761 @item @samp{smul@var{m}3_highpart}
3762 Perform a signed multiplication of operands 1 and 2, which have mode
3763 @var{m}, and store the most significant half of the product in operand 0.
3764 The least significant half of the product is discarded.
3765
3766 @cindex @code{umul@var{m}3_highpart} instruction pattern
3767 @item @samp{umul@var{m}3_highpart}
3768 Similar, but the multiplication is unsigned.
3769
3770 @cindex @code{madd@var{m}@var{n}4} instruction pattern
3771 @item @samp{madd@var{m}@var{n}4}
3772 Multiply operands 1 and 2, sign-extend them to mode @var{n}, add
3773 operand 3, and store the result in operand 0.  Operands 1 and 2
3774 have mode @var{m} and operands 0 and 3 have mode @var{n}.
3775 Both modes must be integer modes and @var{n} must be twice
3776 the size of @var{m}.
3777
3778 In other words, @code{madd@var{m}@var{n}4} is like
3779 @code{mul@var{m}@var{n}3} except that it also adds operand 3.
3780
3781 These instructions are not allowed to @code{FAIL}.
3782
3783 @cindex @code{umadd@var{m}@var{n}4} instruction pattern
3784 @item @samp{umadd@var{m}@var{n}4}
3785 Like @code{madd@var{m}@var{n}4}, but zero-extend the multiplication
3786 operands instead of sign-extending them.
3787
3788 @cindex @code{msub@var{m}@var{n}4} instruction pattern
3789 @item @samp{msub@var{m}@var{n}4}
3790 Multiply operands 1 and 2, sign-extend them to mode @var{n}, subtract the
3791 result from operand 3, and store the result in operand 0.  Operands 1 and 2
3792 have mode @var{m} and operands 0 and 3 have mode @var{n}.
3793 Both modes must be integer modes and @var{n} must be twice
3794 the size of @var{m}.
3795
3796 In other words, @code{msub@var{m}@var{n}4} is like
3797 @code{mul@var{m}@var{n}3} except that it also subtracts the result
3798 from operand 3.
3799
3800 These instructions are not allowed to @code{FAIL}.
3801
3802 @cindex @code{umsub@var{m}@var{n}4} instruction pattern
3803 @item @samp{umsub@var{m}@var{n}4}
3804 Like @code{msub@var{m}@var{n}4}, but zero-extend the multiplication
3805 operands instead of sign-extending them.
3806
3807 @cindex @code{divmod@var{m}4} instruction pattern
3808 @item @samp{divmod@var{m}4}
3809 Signed division that produces both a quotient and a remainder.
3810 Operand 1 is divided by operand 2 to produce a quotient stored
3811 in operand 0 and a remainder stored in operand 3.
3812
3813 For machines with an instruction that produces both a quotient and a
3814 remainder, provide a pattern for @samp{divmod@var{m}4} but do not
3815 provide patterns for @samp{div@var{m}3} and @samp{mod@var{m}3}.  This
3816 allows optimization in the relatively common case when both the quotient
3817 and remainder are computed.
3818
3819 If an instruction that just produces a quotient or just a remainder
3820 exists and is more efficient than the instruction that produces both,
3821 write the output routine of @samp{divmod@var{m}4} to call
3822 @code{find_reg_note} and look for a @code{REG_UNUSED} note on the
3823 quotient or remainder and generate the appropriate instruction.
3824
3825 @cindex @code{udivmod@var{m}4} instruction pattern
3826 @item @samp{udivmod@var{m}4}
3827 Similar, but does unsigned division.
3828
3829 @anchor{shift patterns}
3830 @cindex @code{ashl@var{m}3} instruction pattern
3831 @item @samp{ashl@var{m}3}
3832 Arithmetic-shift operand 1 left by a number of bits specified by operand
3833 2, and store the result in operand 0.  Here @var{m} is the mode of
3834 operand 0 and operand 1; operand 2's mode is specified by the
3835 instruction pattern, and the compiler will convert the operand to that
3836 mode before generating the instruction.  The meaning of out-of-range shift
3837 counts can optionally be specified by @code{TARGET_SHIFT_TRUNCATION_MASK}.
3838 @xref{TARGET_SHIFT_TRUNCATION_MASK}.
3839
3840 @cindex @code{ashr@var{m}3} instruction pattern
3841 @cindex @code{lshr@var{m}3} instruction pattern
3842 @cindex @code{rotl@var{m}3} instruction pattern
3843 @cindex @code{rotr@var{m}3} instruction pattern
3844 @item @samp{ashr@var{m}3}, @samp{lshr@var{m}3}, @samp{rotl@var{m}3}, @samp{rotr@var{m}3}
3845 Other shift and rotate instructions, analogous to the
3846 @code{ashl@var{m}3} instructions.
3847
3848 @cindex @code{neg@var{m}2} instruction pattern
3849 @item @samp{neg@var{m}2}
3850 Negate operand 1 and store the result in operand 0.
3851
3852 @cindex @code{abs@var{m}2} instruction pattern
3853 @item @samp{abs@var{m}2}
3854 Store the absolute value of operand 1 into operand 0.
3855
3856 @cindex @code{sqrt@var{m}2} instruction pattern
3857 @item @samp{sqrt@var{m}2}
3858 Store the square root of operand 1 into operand 0.
3859
3860 The @code{sqrt} built-in function of C always uses the mode which
3861 corresponds to the C data type @code{double} and the @code{sqrtf}
3862 built-in function uses the mode which corresponds to the C data
3863 type @code{float}.
3864
3865 @cindex @code{fmod@var{m}3} instruction pattern
3866 @item @samp{fmod@var{m}3}
3867 Store the remainder of dividing operand 1 by operand 2 into
3868 operand 0, rounded towards zero to an integer.
3869
3870 The @code{fmod} built-in function of C always uses the mode which
3871 corresponds to the C data type @code{double} and the @code{fmodf}
3872 built-in function uses the mode which corresponds to the C data
3873 type @code{float}.
3874
3875 @cindex @code{remainder@var{m}3} instruction pattern
3876 @item @samp{remainder@var{m}3}
3877 Store the remainder of dividing operand 1 by operand 2 into
3878 operand 0, rounded to the nearest integer.
3879
3880 The @code{remainder} built-in function of C always uses the mode
3881 which corresponds to the C data type @code{double} and the
3882 @code{remainderf} built-in function uses the mode which corresponds
3883 to the C data type @code{float}.
3884
3885 @cindex @code{cos@var{m}2} instruction pattern
3886 @item @samp{cos@var{m}2}
3887 Store the cosine of operand 1 into operand 0.
3888
3889 The @code{cos} built-in function of C always uses the mode which
3890 corresponds to the C data type @code{double} and the @code{cosf}
3891 built-in function uses the mode which corresponds to the C data
3892 type @code{float}.
3893
3894 @cindex @code{sin@var{m}2} instruction pattern
3895 @item @samp{sin@var{m}2}
3896 Store the sine of operand 1 into operand 0.
3897
3898 The @code{sin} built-in function of C always uses the mode which
3899 corresponds to the C data type @code{double} and the @code{sinf}
3900 built-in function uses the mode which corresponds to the C data
3901 type @code{float}.
3902
3903 @cindex @code{exp@var{m}2} instruction pattern
3904 @item @samp{exp@var{m}2}
3905 Store the exponential of operand 1 into operand 0.
3906
3907 The @code{exp} built-in function of C always uses the mode which
3908 corresponds to the C data type @code{double} and the @code{expf}
3909 built-in function uses the mode which corresponds to the C data
3910 type @code{float}.
3911
3912 @cindex @code{log@var{m}2} instruction pattern
3913 @item @samp{log@var{m}2}
3914 Store the natural logarithm of operand 1 into operand 0.
3915
3916 The @code{log} built-in function of C always uses the mode which
3917 corresponds to the C data type @code{double} and the @code{logf}
3918 built-in function uses the mode which corresponds to the C data
3919 type @code{float}.
3920
3921 @cindex @code{pow@var{m}3} instruction pattern
3922 @item @samp{pow@var{m}3}
3923 Store the value of operand 1 raised to the exponent operand 2
3924 into operand 0.
3925
3926 The @code{pow} built-in function of C always uses the mode which
3927 corresponds to the C data type @code{double} and the @code{powf}
3928 built-in function uses the mode which corresponds to the C data
3929 type @code{float}.
3930
3931 @cindex @code{atan2@var{m}3} instruction pattern
3932 @item @samp{atan2@var{m}3}
3933 Store the arc tangent (inverse tangent) of operand 1 divided by
3934 operand 2 into operand 0, using the signs of both arguments to
3935 determine the quadrant of the result.
3936
3937 The @code{atan2} built-in function of C always uses the mode which
3938 corresponds to the C data type @code{double} and the @code{atan2f}
3939 built-in function uses the mode which corresponds to the C data
3940 type @code{float}.
3941
3942 @cindex @code{floor@var{m}2} instruction pattern
3943 @item @samp{floor@var{m}2}
3944 Store the largest integral value not greater than argument.
3945
3946 The @code{floor} built-in function of C always uses the mode which
3947 corresponds to the C data type @code{double} and the @code{floorf}
3948 built-in function uses the mode which corresponds to the C data
3949 type @code{float}.
3950
3951 @cindex @code{btrunc@var{m}2} instruction pattern
3952 @item @samp{btrunc@var{m}2}
3953 Store the argument rounded to integer towards zero.
3954
3955 The @code{trunc} built-in function of C always uses the mode which
3956 corresponds to the C data type @code{double} and the @code{truncf}
3957 built-in function uses the mode which corresponds to the C data
3958 type @code{float}.
3959
3960 @cindex @code{round@var{m}2} instruction pattern
3961 @item @samp{round@var{m}2}
3962 Store the argument rounded to integer away from zero.
3963
3964 The @code{round} built-in function of C always uses the mode which
3965 corresponds to the C data type @code{double} and the @code{roundf}
3966 built-in function uses the mode which corresponds to the C data
3967 type @code{float}.
3968
3969 @cindex @code{ceil@var{m}2} instruction pattern
3970 @item @samp{ceil@var{m}2}
3971 Store the argument rounded to integer away from zero.
3972
3973 The @code{ceil} built-in function of C always uses the mode which
3974 corresponds to the C data type @code{double} and the @code{ceilf}
3975 built-in function uses the mode which corresponds to the C data
3976 type @code{float}.
3977
3978 @cindex @code{nearbyint@var{m}2} instruction pattern
3979 @item @samp{nearbyint@var{m}2}
3980 Store the argument rounded according to the default rounding mode
3981
3982 The @code{nearbyint} built-in function of C always uses the mode which
3983 corresponds to the C data type @code{double} and the @code{nearbyintf}
3984 built-in function uses the mode which corresponds to the C data
3985 type @code{float}.
3986
3987 @cindex @code{rint@var{m}2} instruction pattern
3988 @item @samp{rint@var{m}2}
3989 Store the argument rounded according to the default rounding mode and
3990 raise the inexact exception when the result differs in value from
3991 the argument
3992
3993 The @code{rint} built-in function of C always uses the mode which
3994 corresponds to the C data type @code{double} and the @code{rintf}
3995 built-in function uses the mode which corresponds to the C data
3996 type @code{float}.
3997
3998 @cindex @code{lrint@var{m}@var{n}2}
3999 @item @samp{lrint@var{m}@var{n}2}
4000 Convert operand 1 (valid for floating point mode @var{m}) to fixed
4001 point mode @var{n} as a signed number according to the current
4002 rounding mode and store in operand 0 (which has mode @var{n}).
4003
4004 @cindex @code{lround@var{m}@var{n}2}
4005 @item @samp{lround@var{m}2}
4006 Convert operand 1 (valid for floating point mode @var{m}) to fixed
4007 point mode @var{n} as a signed number rounding to nearest and away
4008 from zero and store in operand 0 (which has mode @var{n}).
4009
4010 @cindex @code{lfloor@var{m}@var{n}2}
4011 @item @samp{lfloor@var{m}2}
4012 Convert operand 1 (valid for floating point mode @var{m}) to fixed
4013 point mode @var{n} as a signed number rounding down and store in
4014 operand 0 (which has mode @var{n}).
4015
4016 @cindex @code{lceil@var{m}@var{n}2}
4017 @item @samp{lceil@var{m}2}
4018 Convert operand 1 (valid for floating point mode @var{m}) to fixed
4019 point mode @var{n} as a signed number rounding up and store in
4020 operand 0 (which has mode @var{n}).
4021
4022 @cindex @code{copysign@var{m}3} instruction pattern
4023 @item @samp{copysign@var{m}3}
4024 Store a value with the magnitude of operand 1 and the sign of operand
4025 2 into operand 0.
4026
4027 The @code{copysign} built-in function of C always uses the mode which
4028 corresponds to the C data type @code{double} and the @code{copysignf}
4029 built-in function uses the mode which corresponds to the C data
4030 type @code{float}.
4031
4032 @cindex @code{ffs@var{m}2} instruction pattern
4033 @item @samp{ffs@var{m}2}
4034 Store into operand 0 one plus the index of the least significant 1-bit
4035 of operand 1.  If operand 1 is zero, store zero.  @var{m} is the mode
4036 of operand 0; operand 1's mode is specified by the instruction
4037 pattern, and the compiler will convert the operand to that mode before
4038 generating the instruction.
4039
4040 The @code{ffs} built-in function of C always uses the mode which
4041 corresponds to the C data type @code{int}.
4042
4043 @cindex @code{clz@var{m}2} instruction pattern
4044 @item @samp{clz@var{m}2}
4045 Store into operand 0 the number of leading 0-bits in @var{x}, starting
4046 at the most significant bit position.  If @var{x} is 0, the result is
4047 undefined.  @var{m} is the mode of operand 0; operand 1's mode is
4048 specified by the instruction pattern, and the compiler will convert the
4049 operand to that mode before generating the instruction.
4050
4051 @cindex @code{ctz@var{m}2} instruction pattern
4052 @item @samp{ctz@var{m}2}
4053 Store into operand 0 the number of trailing 0-bits in @var{x}, starting
4054 at the least significant bit position.  If @var{x} is 0, the result is
4055 undefined.  @var{m} is the mode of operand 0; operand 1's mode is
4056 specified by the instruction pattern, and the compiler will convert the
4057 operand to that mode before generating the instruction.
4058
4059 @cindex @code{popcount@var{m}2} instruction pattern
4060 @item @samp{popcount@var{m}2}
4061 Store into operand 0 the number of 1-bits in @var{x}.  @var{m} is the
4062 mode of operand 0; operand 1's mode is specified by the instruction
4063 pattern, and the compiler will convert the operand to that mode before
4064 generating the instruction.
4065
4066 @cindex @code{parity@var{m}2} instruction pattern
4067 @item @samp{parity@var{m}2}
4068 Store into operand 0 the parity of @var{x}, i.e.@: the number of 1-bits
4069 in @var{x} modulo 2.  @var{m} is the mode of operand 0; operand 1's mode
4070 is specified by the instruction pattern, and the compiler will convert
4071 the operand to that mode before generating the instruction.
4072
4073 @cindex @code{one_cmpl@var{m}2} instruction pattern
4074 @item @samp{one_cmpl@var{m}2}
4075 Store the bitwise-complement of operand 1 into operand 0.
4076
4077 @cindex @code{cmp@var{m}} instruction pattern
4078 @item @samp{cmp@var{m}}
4079 Compare operand 0 and operand 1, and set the condition codes.
4080 The RTL pattern should look like this:
4081
4082 @smallexample
4083 (set (cc0) (compare (match_operand:@var{m} 0 @dots{})
4084                     (match_operand:@var{m} 1 @dots{})))
4085 @end smallexample
4086
4087 @cindex @code{tst@var{m}} instruction pattern
4088 @item @samp{tst@var{m}}
4089 Compare operand 0 against zero, and set the condition codes.
4090 The RTL pattern should look like this:
4091
4092 @smallexample
4093 (set (cc0) (match_operand:@var{m} 0 @dots{}))
4094 @end smallexample
4095
4096 @samp{tst@var{m}} patterns should not be defined for machines that do
4097 not use @code{(cc0)}.  Doing so would confuse the optimizer since it
4098 would no longer be clear which @code{set} operations were comparisons.
4099 The @samp{cmp@var{m}} patterns should be used instead.
4100
4101 @cindex @code{movmem@var{m}} instruction pattern
4102 @item @samp{movmem@var{m}}
4103 Block move instruction.  The destination and source blocks of memory
4104 are the first two operands, and both are @code{mem:BLK}s with an
4105 address in mode @code{Pmode}.
4106
4107 The number of bytes to move is the third operand, in mode @var{m}.
4108 Usually, you specify @code{word_mode} for @var{m}.  However, if you can
4109 generate better code knowing the range of valid lengths is smaller than
4110 those representable in a full word, you should provide a pattern with a
4111 mode corresponding to the range of values you can handle efficiently
4112 (e.g., @code{QImode} for values in the range 0--127; note we avoid numbers
4113 that appear negative) and also a pattern with @code{word_mode}.
4114
4115 The fourth operand is the known shared alignment of the source and
4116 destination, in the form of a @code{const_int} rtx.  Thus, if the
4117 compiler knows that both source and destination are word-aligned,
4118 it may provide the value 4 for this operand.
4119
4120 Optional operands 5 and 6 specify expected alignment and size of block
4121 respectively.  The expected alignment differs from alignment in operand 4
4122 in a way that the blocks are not required to be aligned according to it in
4123 all cases. Expected size, when unknown, is set to @code{(const_int -1)}.
4124
4125 Descriptions of multiple @code{movmem@var{m}} patterns can only be
4126 beneficial if the patterns for smaller modes have fewer restrictions
4127 on their first, second and fourth operands.  Note that the mode @var{m}
4128 in @code{movmem@var{m}} does not impose any restriction on the mode of
4129 individually moved data units in the block.
4130
4131 These patterns need not give special consideration to the possibility
4132 that the source and destination strings might overlap.
4133
4134 @cindex @code{movstr} instruction pattern
4135 @item @samp{movstr}
4136 String copy instruction, with @code{stpcpy} semantics.  Operand 0 is
4137 an output operand in mode @code{Pmode}.  The addresses of the
4138 destination and source strings are operands 1 and 2, and both are
4139 @code{mem:BLK}s with addresses in mode @code{Pmode}.  The execution of
4140 the expansion of this pattern should store in operand 0 the address in
4141 which the @code{NUL} terminator was stored in the destination string.
4142
4143 @cindex @code{setmem@var{m}} instruction pattern
4144 @item @samp{setmem@var{m}}
4145 Block set instruction.  The destination string is the first operand,
4146 given as a @code{mem:BLK} whose address is in mode @code{Pmode}.  The
4147 number of bytes to set is the second operand, in mode @var{m}.  The value to
4148 initialize the memory with is the third operand. Targets that only support the
4149 clearing of memory should reject any value that is not the constant 0.  See
4150 @samp{movmem@var{m}} for a discussion of the choice of mode.
4151
4152 The fourth operand is the known alignment of the destination, in the form
4153 of a @code{const_int} rtx.  Thus, if the compiler knows that the
4154 destination is word-aligned, it may provide the value 4 for this
4155 operand.
4156
4157 Optional operands 5 and 6 specify expected alignment and size of block
4158 respectively.  The expected alignment differs from alignment in operand 4
4159 in a way that the blocks are not required to be aligned according to it in
4160 all cases. Expected size, when unknown, is set to @code{(const_int -1)}.
4161
4162 The use for multiple @code{setmem@var{m}} is as for @code{movmem@var{m}}.
4163
4164 @cindex @code{cmpstrn@var{m}} instruction pattern
4165 @item @samp{cmpstrn@var{m}}
4166 String compare instruction, with five operands.  Operand 0 is the output;
4167 it has mode @var{m}.  The remaining four operands are like the operands
4168 of @samp{movmem@var{m}}.  The two memory blocks specified are compared
4169 byte by byte in lexicographic order starting at the beginning of each
4170 string.  The instruction is not allowed to prefetch more than one byte
4171 at a time since either string may end in the first byte and reading past
4172 that may access an invalid page or segment and cause a fault.  The
4173 effect of the instruction is to store a value in operand 0 whose sign
4174 indicates the result of the comparison.
4175
4176 @cindex @code{cmpstr@var{m}} instruction pattern
4177 @item @samp{cmpstr@var{m}}
4178 String compare instruction, without known maximum length.  Operand 0 is the
4179 output; it has mode @var{m}.  The second and third operand are the blocks of
4180 memory to be compared; both are @code{mem:BLK} with an address in mode
4181 @code{Pmode}.
4182
4183 The fourth operand is the known shared alignment of the source and
4184 destination, in the form of a @code{const_int} rtx.  Thus, if the
4185 compiler knows that both source and destination are word-aligned,
4186 it may provide the value 4 for this operand.
4187
4188 The two memory blocks specified are compared byte by byte in lexicographic
4189 order starting at the beginning of each string.  The instruction is not allowed
4190 to prefetch more than one byte at a time since either string may end in the
4191 first byte and reading past that may access an invalid page or segment and
4192 cause a fault.  The effect of the instruction is to store a value in operand 0
4193 whose sign indicates the result of the comparison.
4194
4195 @cindex @code{cmpmem@var{m}} instruction pattern
4196 @item @samp{cmpmem@var{m}}
4197 Block compare instruction, with five operands like the operands
4198 of @samp{cmpstr@var{m}}.  The two memory blocks specified are compared
4199 byte by byte in lexicographic order starting at the beginning of each
4200 block.  Unlike @samp{cmpstr@var{m}} the instruction can prefetch
4201 any bytes in the two memory blocks.  The effect of the instruction is
4202 to store a value in operand 0 whose sign indicates the result of the
4203 comparison.
4204
4205 @cindex @code{strlen@var{m}} instruction pattern
4206 @item @samp{strlen@var{m}}
4207 Compute the length of a string, with three operands.
4208 Operand 0 is the result (of mode @var{m}), operand 1 is
4209 a @code{mem} referring to the first character of the string,
4210 operand 2 is the character to search for (normally zero),
4211 and operand 3 is a constant describing the known alignment
4212 of the beginning of the string.
4213
4214 @cindex @code{float@var{mn}2} instruction pattern
4215 @item @samp{float@var{m}@var{n}2}
4216 Convert signed integer operand 1 (valid for fixed point mode @var{m}) to
4217 floating point mode @var{n} and store in operand 0 (which has mode
4218 @var{n}).
4219
4220 @cindex @code{floatuns@var{mn}2} instruction pattern
4221 @item @samp{floatuns@var{m}@var{n}2}
4222 Convert unsigned integer operand 1 (valid for fixed point mode @var{m})
4223 to floating point mode @var{n} and store in operand 0 (which has mode
4224 @var{n}).
4225
4226 @cindex @code{fix@var{mn}2} instruction pattern
4227 @item @samp{fix@var{m}@var{n}2}
4228 Convert operand 1 (valid for floating point mode @var{m}) to fixed
4229 point mode @var{n} as a signed number and store in operand 0 (which
4230 has mode @var{n}).  This instruction's result is defined only when
4231 the value of operand 1 is an integer.
4232
4233 If the machine description defines this pattern, it also needs to
4234 define the @code{ftrunc} pattern.
4235
4236 @cindex @code{fixuns@var{mn}2} instruction pattern
4237 @item @samp{fixuns@var{m}@var{n}2}
4238 Convert operand 1 (valid for floating point mode @var{m}) to fixed
4239 point mode @var{n} as an unsigned number and store in operand 0 (which
4240 has mode @var{n}).  This instruction's result is defined only when the
4241 value of operand 1 is an integer.
4242
4243 @cindex @code{ftrunc@var{m}2} instruction pattern
4244 @item @samp{ftrunc@var{m}2}
4245 Convert operand 1 (valid for floating point mode @var{m}) to an
4246 integer value, still represented in floating point mode @var{m}, and
4247 store it in operand 0 (valid for floating point mode @var{m}).
4248
4249 @cindex @code{fix_trunc@var{mn}2} instruction pattern
4250 @item @samp{fix_trunc@var{m}@var{n}2}
4251 Like @samp{fix@var{m}@var{n}2} but works for any floating point value
4252 of mode @var{m} by converting the value to an integer.
4253
4254 @cindex @code{fixuns_trunc@var{mn}2} instruction pattern
4255 @item @samp{fixuns_trunc@var{m}@var{n}2}
4256 Like @samp{fixuns@var{m}@var{n}2} but works for any floating point
4257 value of mode @var{m} by converting the value to an integer.
4258
4259 @cindex @code{trunc@var{mn}2} instruction pattern
4260 @item @samp{trunc@var{m}@var{n}2}
4261 Truncate operand 1 (valid for mode @var{m}) to mode @var{n} and
4262 store in operand 0 (which has mode @var{n}).  Both modes must be fixed
4263 point or both floating point.
4264
4265 @cindex @code{extend@var{mn}2} instruction pattern
4266 @item @samp{extend@var{m}@var{n}2}
4267 Sign-extend operand 1 (valid for mode @var{m}) to mode @var{n} and
4268 store in operand 0 (which has mode @var{n}).  Both modes must be fixed
4269 point or both floating point.
4270
4271 @cindex @code{zero_extend@var{mn}2} instruction pattern
4272 @item @samp{zero_extend@var{m}@var{n}2}
4273 Zero-extend operand 1 (valid for mode @var{m}) to mode @var{n} and
4274 store in operand 0 (which has mode @var{n}).  Both modes must be fixed
4275 point.
4276
4277 @cindex @code{extv} instruction pattern
4278 @item @samp{extv}
4279 Extract a bit-field from operand 1 (a register or memory operand), where
4280 operand 2 specifies the width in bits and operand 3 the starting bit,
4281 and store it in operand 0.  Operand 0 must have mode @code{word_mode}.
4282 Operand 1 may have mode @code{byte_mode} or @code{word_mode}; often
4283 @code{word_mode} is allowed only for registers.  Operands 2 and 3 must
4284 be valid for @code{word_mode}.
4285
4286 The RTL generation pass generates this instruction only with constants
4287 for operands 2 and 3 and the constant is never zero for operand 2.
4288
4289 The bit-field value is sign-extended to a full word integer
4290 before it is stored in operand 0.
4291
4292 @cindex @code{extzv} instruction pattern
4293 @item @samp{extzv}
4294 Like @samp{extv} except that the bit-field value is zero-extended.
4295
4296 @cindex @code{insv} instruction pattern
4297 @item @samp{insv}
4298 Store operand 3 (which must be valid for @code{word_mode}) into a
4299 bit-field in operand 0, where operand 1 specifies the width in bits and
4300 operand 2 the starting bit.  Operand 0 may have mode @code{byte_mode} or
4301 @code{word_mode}; often @code{word_mode} is allowed only for registers.
4302 Operands 1 and 2 must be valid for @code{word_mode}.
4303
4304 The RTL generation pass generates this instruction only with constants
4305 for operands 1 and 2 and the constant is never zero for operand 1.
4306
4307 @cindex @code{mov@var{mode}cc} instruction pattern
4308 @item @samp{mov@var{mode}cc}
4309 Conditionally move operand 2 or operand 3 into operand 0 according to the
4310 comparison in operand 1.  If the comparison is true, operand 2 is moved
4311 into operand 0, otherwise operand 3 is moved.
4312
4313 The mode of the operands being compared need not be the same as the operands
4314 being moved.  Some machines, sparc64 for example, have instructions that
4315 conditionally move an integer value based on the floating point condition
4316 codes and vice versa.
4317
4318 If the machine does not have conditional move instructions, do not
4319 define these patterns.
4320
4321 @cindex @code{add@var{mode}cc} instruction pattern
4322 @item @samp{add@var{mode}cc}
4323 Similar to @samp{mov@var{mode}cc} but for conditional addition.  Conditionally
4324 move operand 2 or (operands 2 + operand 3) into operand 0 according to the
4325 comparison in operand 1.  If the comparison is true, operand 2 is moved into
4326 operand 0, otherwise (operand 2 + operand 3) is moved.
4327
4328 @cindex @code{s@var{cond}} instruction pattern
4329 @item @samp{s@var{cond}}
4330 Store zero or nonzero in the operand according to the condition codes.
4331 Value stored is nonzero iff the condition @var{cond} is true.
4332 @var{cond} is the name of a comparison operation expression code, such
4333 as @code{eq}, @code{lt} or @code{leu}.
4334
4335 You specify the mode that the operand must have when you write the
4336 @code{match_operand} expression.  The compiler automatically sees
4337 which mode you have used and supplies an operand of that mode.
4338
4339 The value stored for a true condition must have 1 as its low bit, or
4340 else must be negative.  Otherwise the instruction is not suitable and
4341 you should omit it from the machine description.  You describe to the
4342 compiler exactly which value is stored by defining the macro
4343 @code{STORE_FLAG_VALUE} (@pxref{Misc}).  If a description cannot be
4344 found that can be used for all the @samp{s@var{cond}} patterns, you
4345 should omit those operations from the machine description.
4346
4347 These operations may fail, but should do so only in relatively
4348 uncommon cases; if they would fail for common cases involving
4349 integer comparisons, it is best to omit these patterns.
4350
4351 If these operations are omitted, the compiler will usually generate code
4352 that copies the constant one to the target and branches around an
4353 assignment of zero to the target.  If this code is more efficient than
4354 the potential instructions used for the @samp{s@var{cond}} pattern
4355 followed by those required to convert the result into a 1 or a zero in
4356 @code{SImode}, you should omit the @samp{s@var{cond}} operations from
4357 the machine description.
4358
4359 @cindex @code{b@var{cond}} instruction pattern
4360 @item @samp{b@var{cond}}
4361 Conditional branch instruction.  Operand 0 is a @code{label_ref} that
4362 refers to the label to jump to.  Jump if the condition codes meet
4363 condition @var{cond}.
4364
4365 Some machines do not follow the model assumed here where a comparison
4366 instruction is followed by a conditional branch instruction.  In that
4367 case, the @samp{cmp@var{m}} (and @samp{tst@var{m}}) patterns should
4368 simply store the operands away and generate all the required insns in a
4369 @code{define_expand} (@pxref{Expander Definitions}) for the conditional
4370 branch operations.  All calls to expand @samp{b@var{cond}} patterns are
4371 immediately preceded by calls to expand either a @samp{cmp@var{m}}
4372 pattern or a @samp{tst@var{m}} pattern.
4373
4374 Machines that use a pseudo register for the condition code value, or
4375 where the mode used for the comparison depends on the condition being
4376 tested, should also use the above mechanism.  @xref{Jump Patterns}.
4377
4378 The above discussion also applies to the @samp{mov@var{mode}cc} and
4379 @samp{s@var{cond}} patterns.
4380
4381 @cindex @code{cbranch@var{mode}4} instruction pattern
4382 @item @samp{cbranch@var{mode}4}
4383 Conditional branch instruction combined with a compare instruction.
4384 Operand 0 is a comparison operator.  Operand 1 and operand 2 are the
4385 first and second operands of the comparison, respectively.  Operand 3
4386 is a @code{label_ref} that refers to the label to jump to.
4387
4388 @cindex @code{jump} instruction pattern
4389 @item @samp{jump}
4390 A jump inside a function; an unconditional branch.  Operand 0 is the
4391 @code{label_ref} of the label to jump to.  This pattern name is mandatory
4392 on all machines.
4393
4394 @cindex @code{call} instruction pattern
4395 @item @samp{call}
4396 Subroutine call instruction returning no value.  Operand 0 is the
4397 function to call; operand 1 is the number of bytes of arguments pushed
4398 as a @code{const_int}; operand 2 is the number of registers used as
4399 operands.
4400
4401 On most machines, operand 2 is not actually stored into the RTL
4402 pattern.  It is supplied for the sake of some RISC machines which need
4403 to put this information into the assembler code; they can put it in
4404 the RTL instead of operand 1.
4405
4406 Operand 0 should be a @code{mem} RTX whose address is the address of the
4407 function.  Note, however, that this address can be a @code{symbol_ref}
4408 expression even if it would not be a legitimate memory address on the
4409 target machine.  If it is also not a valid argument for a call
4410 instruction, the pattern for this operation should be a
4411 @code{define_expand} (@pxref{Expander Definitions}) that places the
4412 address into a register and uses that register in the call instruction.
4413
4414 @cindex @code{call_value} instruction pattern
4415 @item @samp{call_value}
4416 Subroutine call instruction returning a value.  Operand 0 is the hard
4417 register in which the value is returned.  There are three more
4418 operands, the same as the three operands of the @samp{call}
4419 instruction (but with numbers increased by one).
4420
4421 Subroutines that return @code{BLKmode} objects use the @samp{call}
4422 insn.
4423
4424 @cindex @code{call_pop} instruction pattern
4425 @cindex @code{call_value_pop} instruction pattern
4426 @item @samp{call_pop}, @samp{call_value_pop}
4427 Similar to @samp{call} and @samp{call_value}, except used if defined and
4428 if @code{RETURN_POPS_ARGS} is nonzero.  They should emit a @code{parallel}
4429 that contains both the function call and a @code{set} to indicate the
4430 adjustment made to the frame pointer.
4431
4432 For machines where @code{RETURN_POPS_ARGS} can be nonzero, the use of these
4433 patterns increases the number of functions for which the frame pointer
4434 can be eliminated, if desired.
4435
4436 @cindex @code{untyped_call} instruction pattern
4437 @item @samp{untyped_call}
4438 Subroutine call instruction returning a value of any type.  Operand 0 is
4439 the function to call; operand 1 is a memory location where the result of
4440 calling the function is to be stored; operand 2 is a @code{parallel}
4441 expression where each element is a @code{set} expression that indicates
4442 the saving of a function return value into the result block.
4443
4444 This instruction pattern should be defined to support
4445 @code{__builtin_apply} on machines where special instructions are needed
4446 to call a subroutine with arbitrary arguments or to save the value
4447 returned.  This instruction pattern is required on machines that have
4448 multiple registers that can hold a return value
4449 (i.e.@: @code{FUNCTION_VALUE_REGNO_P} is true for more than one register).
4450
4451 @cindex @code{return} instruction pattern
4452 @item @samp{return}
4453 Subroutine return instruction.  This instruction pattern name should be
4454 defined only if a single instruction can do all the work of returning
4455 from a function.
4456
4457 Like the @samp{mov@var{m}} patterns, this pattern is also used after the
4458 RTL generation phase.  In this case it is to support machines where
4459 multiple instructions are usually needed to return from a function, but
4460 some class of functions only requires one instruction to implement a
4461 return.  Normally, the applicable functions are those which do not need
4462 to save any registers or allocate stack space.
4463
4464 @findex reload_completed
4465 @findex leaf_function_p
4466 For such machines, the condition specified in this pattern should only
4467 be true when @code{reload_completed} is nonzero and the function's
4468 epilogue would only be a single instruction.  For machines with register
4469 windows, the routine @code{leaf_function_p} may be used to determine if
4470 a register window push is required.
4471
4472 Machines that have conditional return instructions should define patterns
4473 such as
4474
4475 @smallexample
4476 (define_insn ""
4477   [(set (pc)
4478         (if_then_else (match_operator
4479                          0 "comparison_operator"
4480                          [(cc0) (const_int 0)])
4481                       (return)
4482                       (pc)))]
4483   "@var{condition}"
4484   "@dots{}")
4485 @end smallexample
4486
4487 where @var{condition} would normally be the same condition specified on the
4488 named @samp{return} pattern.
4489
4490 @cindex @code{untyped_return} instruction pattern
4491 @item @samp{untyped_return}
4492 Untyped subroutine return instruction.  This instruction pattern should
4493 be defined to support @code{__builtin_return} on machines where special
4494 instructions are needed to return a value of any type.
4495
4496 Operand 0 is a memory location where the result of calling a function
4497 with @code{__builtin_apply} is stored; operand 1 is a @code{parallel}
4498 expression where each element is a @code{set} expression that indicates
4499 the restoring of a function return value from the result block.
4500
4501 @cindex @code{nop} instruction pattern
4502 @item @samp{nop}
4503 No-op instruction.  This instruction pattern name should always be defined
4504 to output a no-op in assembler code.  @code{(const_int 0)} will do as an
4505 RTL pattern.
4506
4507 @cindex @code{indirect_jump} instruction pattern
4508 @item @samp{indirect_jump}
4509 An instruction to jump to an address which is operand zero.
4510 This pattern name is mandatory on all machines.
4511
4512 @cindex @code{casesi} instruction pattern
4513 @item @samp{casesi}
4514 Instruction to jump through a dispatch table, including bounds checking.
4515 This instruction takes five operands:
4516
4517 @enumerate
4518 @item
4519 The index to dispatch on, which has mode @code{SImode}.
4520
4521 @item
4522 The lower bound for indices in the table, an integer constant.
4523
4524 @item
4525 The total range of indices in the table---the largest index
4526 minus the smallest one (both inclusive).
4527
4528 @item
4529 A label that precedes the table itself.
4530
4531 @item
4532 A label to jump to if the index has a value outside the bounds.
4533 @end enumerate
4534
4535 The table is a @code{addr_vec} or @code{addr_diff_vec} inside of a
4536 @code{jump_insn}.  The number of elements in the table is one plus the
4537 difference between the upper bound and the lower bound.
4538
4539 @cindex @code{tablejump} instruction pattern
4540 @item @samp{tablejump}
4541 Instruction to jump to a variable address.  This is a low-level
4542 capability which can be used to implement a dispatch table when there
4543 is no @samp{casesi} pattern.
4544
4545 This pattern requires two operands: the address or offset, and a label
4546 which should immediately precede the jump table.  If the macro
4547 @code{CASE_VECTOR_PC_RELATIVE} evaluates to a nonzero value then the first
4548 operand is an offset which counts from the address of the table; otherwise,
4549 it is an absolute address to jump to.  In either case, the first operand has
4550 mode @code{Pmode}.
4551
4552 The @samp{tablejump} insn is always the last insn before the jump
4553 table it uses.  Its assembler code normally has no need to use the
4554 second operand, but you should incorporate it in the RTL pattern so
4555 that the jump optimizer will not delete the table as unreachable code.
4556
4557
4558 @cindex @code{decrement_and_branch_until_zero} instruction pattern
4559 @item @samp{decrement_and_branch_until_zero}
4560 Conditional branch instruction that decrements a register and
4561 jumps if the register is nonzero.  Operand 0 is the register to
4562 decrement and test; operand 1 is the label to jump to if the
4563 register is nonzero.  @xref{Looping Patterns}.
4564
4565 This optional instruction pattern is only used by the combiner,
4566 typically for loops reversed by the loop optimizer when strength
4567 reduction is enabled.
4568
4569 @cindex @code{doloop_end} instruction pattern
4570 @item @samp{doloop_end}
4571 Conditional branch instruction that decrements a register and jumps if
4572 the register is nonzero.  This instruction takes five operands: Operand
4573 0 is the register to decrement and test; operand 1 is the number of loop
4574 iterations as a @code{const_int} or @code{const0_rtx} if this cannot be
4575 determined until run-time; operand 2 is the actual or estimated maximum
4576 number of iterations as a @code{const_int}; operand 3 is the number of
4577 enclosed loops as a @code{const_int} (an innermost loop has a value of
4578 1); operand 4 is the label to jump to if the register is nonzero.
4579 @xref{Looping Patterns}.
4580
4581 This optional instruction pattern should be defined for machines with
4582 low-overhead looping instructions as the loop optimizer will try to
4583 modify suitable loops to utilize it.  If nested low-overhead looping is
4584 not supported, use a @code{define_expand} (@pxref{Expander Definitions})
4585 and make the pattern fail if operand 3 is not @code{const1_rtx}.
4586 Similarly, if the actual or estimated maximum number of iterations is
4587 too large for this instruction, make it fail.
4588
4589 @cindex @code{doloop_begin} instruction pattern
4590 @item @samp{doloop_begin}
4591 Companion instruction to @code{doloop_end} required for machines that
4592 need to perform some initialization, such as loading special registers
4593 used by a low-overhead looping instruction.  If initialization insns do
4594 not always need to be emitted, use a @code{define_expand}
4595 (@pxref{Expander Definitions}) and make it fail.
4596
4597
4598 @cindex @code{canonicalize_funcptr_for_compare} instruction pattern
4599 @item @samp{canonicalize_funcptr_for_compare}
4600 Canonicalize the function pointer in operand 1 and store the result
4601 into operand 0.
4602
4603 Operand 0 is always a @code{reg} and has mode @code{Pmode}; operand 1
4604 may be a @code{reg}, @code{mem}, @code{symbol_ref}, @code{const_int}, etc
4605 and also has mode @code{Pmode}.
4606
4607 Canonicalization of a function pointer usually involves computing
4608 the address of the function which would be called if the function
4609 pointer were used in an indirect call.
4610
4611 Only define this pattern if function pointers on the target machine
4612 can have different values but still call the same function when
4613 used in an indirect call.
4614
4615 @cindex @code{save_stack_block} instruction pattern
4616 @cindex @code{save_stack_function} instruction pattern
4617 @cindex @code{save_stack_nonlocal} instruction pattern
4618 @cindex @code{restore_stack_block} instruction pattern
4619 @cindex @code{restore_stack_function} instruction pattern
4620 @cindex @code{restore_stack_nonlocal} instruction pattern
4621 @item @samp{save_stack_block}
4622 @itemx @samp{save_stack_function}
4623 @itemx @samp{save_stack_nonlocal}
4624 @itemx @samp{restore_stack_block}
4625 @itemx @samp{restore_stack_function}
4626 @itemx @samp{restore_stack_nonlocal}
4627 Most machines save and restore the stack pointer by copying it to or
4628 from an object of mode @code{Pmode}.  Do not define these patterns on
4629 such machines.
4630
4631 Some machines require special handling for stack pointer saves and
4632 restores.  On those machines, define the patterns corresponding to the
4633 non-standard cases by using a @code{define_expand} (@pxref{Expander
4634 Definitions}) that produces the required insns.  The three types of
4635 saves and restores are:
4636
4637 @enumerate
4638 @item
4639 @samp{save_stack_block} saves the stack pointer at the start of a block
4640 that allocates a variable-sized object, and @samp{restore_stack_block}
4641 restores the stack pointer when the block is exited.
4642
4643 @item
4644 @samp{save_stack_function} and @samp{restore_stack_function} do a
4645 similar job for the outermost block of a function and are used when the
4646 function allocates variable-sized objects or calls @code{alloca}.  Only
4647 the epilogue uses the restored stack pointer, allowing a simpler save or
4648 restore sequence on some machines.
4649
4650 @item
4651 @samp{save_stack_nonlocal} is used in functions that contain labels
4652 branched to by nested functions.  It saves the stack pointer in such a
4653 way that the inner function can use @samp{restore_stack_nonlocal} to
4654 restore the stack pointer.  The compiler generates code to restore the
4655 frame and argument pointer registers, but some machines require saving
4656 and restoring additional data such as register window information or
4657 stack backchains.  Place insns in these patterns to save and restore any
4658 such required data.
4659 @end enumerate
4660
4661 When saving the stack pointer, operand 0 is the save area and operand 1
4662 is the stack pointer.  The mode used to allocate the save area defaults
4663 to @code{Pmode} but you can override that choice by defining the
4664 @code{STACK_SAVEAREA_MODE} macro (@pxref{Storage Layout}).  You must
4665 specify an integral mode, or @code{VOIDmode} if no save area is needed
4666 for a particular type of save (either because no save is needed or
4667 because a machine-specific save area can be used).  Operand 0 is the
4668 stack pointer and operand 1 is the save area for restore operations.  If
4669 @samp{save_stack_block} is defined, operand 0 must not be
4670 @code{VOIDmode} since these saves can be arbitrarily nested.
4671
4672 A save area is a @code{mem} that is at a constant offset from
4673 @code{virtual_stack_vars_rtx} when the stack pointer is saved for use by
4674 nonlocal gotos and a @code{reg} in the other two cases.
4675
4676 @cindex @code{allocate_stack} instruction pattern
4677 @item @samp{allocate_stack}
4678 Subtract (or add if @code{STACK_GROWS_DOWNWARD} is undefined) operand 1 from
4679 the stack pointer to create space for dynamically allocated data.
4680
4681 Store the resultant pointer to this space into operand 0.  If you
4682 are allocating space from the main stack, do this by emitting a
4683 move insn to copy @code{virtual_stack_dynamic_rtx} to operand 0.
4684 If you are allocating the space elsewhere, generate code to copy the
4685 location of the space to operand 0.  In the latter case, you must
4686 ensure this space gets freed when the corresponding space on the main
4687 stack is free.
4688
4689 Do not define this pattern if all that must be done is the subtraction.
4690 Some machines require other operations such as stack probes or
4691 maintaining the back chain.  Define this pattern to emit those
4692 operations in addition to updating the stack pointer.
4693
4694 @cindex @code{check_stack} instruction pattern
4695 @item @samp{check_stack}
4696 If stack checking cannot be done on your system by probing the stack with
4697 a load or store instruction (@pxref{Stack Checking}), define this pattern
4698 to perform the needed check and signaling an error if the stack
4699 has overflowed.  The single operand is the location in the stack furthest
4700 from the current stack pointer that you need to validate.  Normally,
4701 on machines where this pattern is needed, you would obtain the stack
4702 limit from a global or thread-specific variable or register.
4703
4704 @cindex @code{nonlocal_goto} instruction pattern
4705 @item @samp{nonlocal_goto}
4706 Emit code to generate a non-local goto, e.g., a jump from one function
4707 to a label in an outer function.  This pattern has four arguments,
4708 each representing a value to be used in the jump.  The first
4709 argument is to be loaded into the frame pointer, the second is
4710 the address to branch to (code to dispatch to the actual label),
4711 the third is the address of a location where the stack is saved,
4712 and the last is the address of the label, to be placed in the
4713 location for the incoming static chain.
4714
4715 On most machines you need not define this pattern, since GCC will
4716 already generate the correct code, which is to load the frame pointer
4717 and static chain, restore the stack (using the
4718 @samp{restore_stack_nonlocal} pattern, if defined), and jump indirectly
4719 to the dispatcher.  You need only define this pattern if this code will
4720 not work on your machine.
4721
4722 @cindex @code{nonlocal_goto_receiver} instruction pattern
4723 @item @samp{nonlocal_goto_receiver}
4724 This pattern, if defined, contains code needed at the target of a
4725 nonlocal goto after the code already generated by GCC@.  You will not
4726 normally need to define this pattern.  A typical reason why you might
4727 need this pattern is if some value, such as a pointer to a global table,
4728 must be restored when the frame pointer is restored.  Note that a nonlocal
4729 goto only occurs within a unit-of-translation, so a global table pointer
4730 that is shared by all functions of a given module need not be restored.
4731 There are no arguments.
4732
4733 @cindex @code{exception_receiver} instruction pattern
4734 @item @samp{exception_receiver}
4735 This pattern, if defined, contains code needed at the site of an
4736 exception handler that isn't needed at the site of a nonlocal goto.  You
4737 will not normally need to define this pattern.  A typical reason why you
4738 might need this pattern is if some value, such as a pointer to a global
4739 table, must be restored after control flow is branched to the handler of
4740 an exception.  There are no arguments.
4741
4742 @cindex @code{builtin_setjmp_setup} instruction pattern
4743 @item @samp{builtin_setjmp_setup}
4744 This pattern, if defined, contains additional code needed to initialize
4745 the @code{jmp_buf}.  You will not normally need to define this pattern.
4746 A typical reason why you might need this pattern is if some value, such
4747 as a pointer to a global table, must be restored.  Though it is
4748 preferred that the pointer value be recalculated if possible (given the
4749 address of a label for instance).  The single argument is a pointer to
4750 the @code{jmp_buf}.  Note that the buffer is five words long and that
4751 the first three are normally used by the generic mechanism.
4752
4753 @cindex @code{builtin_setjmp_receiver} instruction pattern
4754 @item @samp{builtin_setjmp_receiver}
4755 This pattern, if defined, contains code needed at the site of an
4756 built-in setjmp that isn't needed at the site of a nonlocal goto.  You
4757 will not normally need to define this pattern.  A typical reason why you
4758 might need this pattern is if some value, such as a pointer to a global
4759 table, must be restored.  It takes one argument, which is the label
4760 to which builtin_longjmp transfered control; this pattern may be emitted
4761 at a small offset from that label.
4762
4763 @cindex @code{builtin_longjmp} instruction pattern
4764 @item @samp{builtin_longjmp}
4765 This pattern, if defined, performs the entire action of the longjmp.
4766 You will not normally need to define this pattern unless you also define
4767 @code{builtin_setjmp_setup}.  The single argument is a pointer to the
4768 @code{jmp_buf}.
4769
4770 @cindex @code{eh_return} instruction pattern
4771 @item @samp{eh_return}
4772 This pattern, if defined, affects the way @code{__builtin_eh_return},
4773 and thence the call frame exception handling library routines, are
4774 built.  It is intended to handle non-trivial actions needed along
4775 the abnormal return path.
4776
4777 The address of the exception handler to which the function should return
4778 is passed as operand to this pattern.  It will normally need to copied by
4779 the pattern to some special register or memory location.
4780 If the pattern needs to determine the location of the target call
4781 frame in order to do so, it may use @code{EH_RETURN_STACKADJ_RTX},
4782 if defined; it will have already been assigned.
4783
4784 If this pattern is not defined, the default action will be to simply
4785 copy the return address to @code{EH_RETURN_HANDLER_RTX}.  Either
4786 that macro or this pattern needs to be defined if call frame exception
4787 handling is to be used.
4788
4789 @cindex @code{prologue} instruction pattern
4790 @anchor{prologue instruction pattern}
4791 @item @samp{prologue}
4792 This pattern, if defined, emits RTL for entry to a function.  The function
4793 entry is responsible for setting up the stack frame, initializing the frame
4794 pointer register, saving callee saved registers, etc.
4795
4796 Using a prologue pattern is generally preferred over defining
4797 @code{TARGET_ASM_FUNCTION_PROLOGUE} to emit assembly code for the prologue.
4798
4799 The @code{prologue} pattern is particularly useful for targets which perform
4800 instruction scheduling.
4801
4802 @cindex @code{epilogue} instruction pattern
4803 @anchor{epilogue instruction pattern}
4804 @item @samp{epilogue}
4805 This pattern emits RTL for exit from a function.  The function
4806 exit is responsible for deallocating the stack frame, restoring callee saved
4807 registers and emitting the return instruction.
4808
4809 Using an epilogue pattern is generally preferred over defining
4810 @code{TARGET_ASM_FUNCTION_EPILOGUE} to emit assembly code for the epilogue.
4811
4812 The @code{epilogue} pattern is particularly useful for targets which perform
4813 instruction scheduling or which have delay slots for their return instruction.
4814
4815 @cindex @code{sibcall_epilogue} instruction pattern
4816 @item @samp{sibcall_epilogue}
4817 This pattern, if defined, emits RTL for exit from a function without the final
4818 branch back to the calling function.  This pattern will be emitted before any
4819 sibling call (aka tail call) sites.
4820
4821 The @code{sibcall_epilogue} pattern must not clobber any arguments used for
4822 parameter passing or any stack slots for arguments passed to the current
4823 function.
4824
4825 @cindex @code{trap} instruction pattern
4826 @item @samp{trap}
4827 This pattern, if defined, signals an error, typically by causing some
4828 kind of signal to be raised.  Among other places, it is used by the Java
4829 front end to signal `invalid array index' exceptions.
4830
4831 @cindex @code{conditional_trap} instruction pattern
4832 @item @samp{conditional_trap}
4833 Conditional trap instruction.  Operand 0 is a piece of RTL which
4834 performs a comparison.  Operand 1 is the trap code, an integer.
4835
4836 A typical @code{conditional_trap} pattern looks like
4837
4838 @smallexample
4839 (define_insn "conditional_trap"
4840   [(trap_if (match_operator 0 "trap_operator"
4841              [(cc0) (const_int 0)])
4842             (match_operand 1 "const_int_operand" "i"))]
4843   ""
4844   "@dots{}")
4845 @end smallexample
4846
4847 @cindex @code{prefetch} instruction pattern
4848 @item @samp{prefetch}
4849
4850 This pattern, if defined, emits code for a non-faulting data prefetch
4851 instruction.  Operand 0 is the address of the memory to prefetch.  Operand 1
4852 is a constant 1 if the prefetch is preparing for a write to the memory
4853 address, or a constant 0 otherwise.  Operand 2 is the expected degree of
4854 temporal locality of the data and is a value between 0 and 3, inclusive; 0
4855 means that the data has no temporal locality, so it need not be left in the
4856 cache after the access; 3 means that the data has a high degree of temporal
4857 locality and should be left in all levels of cache possible;  1 and 2 mean,
4858 respectively, a low or moderate degree of temporal locality.
4859
4860 Targets that do not support write prefetches or locality hints can ignore
4861 the values of operands 1 and 2.
4862
4863 @cindex @code{blockage} instruction pattern
4864 @item @samp{blockage}
4865
4866 This pattern defines a pseudo insn that prevents the instruction
4867 scheduler from moving instructions across the boundary defined by the
4868 blockage insn.  Normally an UNSPEC_VOLATILE pattern.
4869
4870 @cindex @code{memory_barrier} instruction pattern
4871 @item @samp{memory_barrier}
4872
4873 If the target memory model is not fully synchronous, then this pattern
4874 should be defined to an instruction that orders both loads and stores
4875 before the instruction with respect to loads and stores after the instruction.
4876 This pattern has no operands.
4877
4878 @cindex @code{sync_compare_and_swap@var{mode}} instruction pattern
4879 @item @samp{sync_compare_and_swap@var{mode}}
4880
4881 This pattern, if defined, emits code for an atomic compare-and-swap
4882 operation.  Operand 1 is the memory on which the atomic operation is
4883 performed.  Operand 2 is the ``old'' value to be compared against the
4884 current contents of the memory location.  Operand 3 is the ``new'' value
4885 to store in the memory if the compare succeeds.  Operand 0 is the result
4886 of the operation; it should contain the contents of the memory
4887 before the operation.  If the compare succeeds, this should obviously be
4888 a copy of operand 2.
4889
4890 This pattern must show that both operand 0 and operand 1 are modified.
4891
4892 This pattern must issue any memory barrier instructions such that all
4893 memory operations before the atomic operation occur before the atomic
4894 operation and all memory operations after the atomic operation occur
4895 after the atomic operation.
4896
4897 @cindex @code{sync_compare_and_swap_cc@var{mode}} instruction pattern
4898 @item @samp{sync_compare_and_swap_cc@var{mode}}
4899
4900 This pattern is just like @code{sync_compare_and_swap@var{mode}}, except
4901 it should act as if compare part of the compare-and-swap were issued via
4902 @code{cmp@var{m}}.  This comparison will only be used with @code{EQ} and
4903 @code{NE} branches and @code{setcc} operations.
4904
4905 Some targets do expose the success or failure of the compare-and-swap
4906 operation via the status flags.  Ideally we wouldn't need a separate
4907 named pattern in order to take advantage of this, but the combine pass
4908 does not handle patterns with multiple sets, which is required by
4909 definition for @code{sync_compare_and_swap@var{mode}}.
4910
4911 @cindex @code{sync_add@var{mode}} instruction pattern
4912 @cindex @code{sync_sub@var{mode}} instruction pattern
4913 @cindex @code{sync_ior@var{mode}} instruction pattern
4914 @cindex @code{sync_and@var{mode}} instruction pattern
4915 @cindex @code{sync_xor@var{mode}} instruction pattern
4916 @cindex @code{sync_nand@var{mode}} instruction pattern
4917 @item @samp{sync_add@var{mode}}, @samp{sync_sub@var{mode}}
4918 @itemx @samp{sync_ior@var{mode}}, @samp{sync_and@var{mode}}
4919 @itemx @samp{sync_xor@var{mode}}, @samp{sync_nand@var{mode}}
4920
4921 These patterns emit code for an atomic operation on memory.
4922 Operand 0 is the memory on which the atomic operation is performed.
4923 Operand 1 is the second operand to the binary operator.
4924
4925 The ``nand'' operation is @code{~op0 & op1}.
4926
4927 This pattern must issue any memory barrier instructions such that all
4928 memory operations before the atomic operation occur before the atomic
4929 operation and all memory operations after the atomic operation occur
4930 after the atomic operation.
4931
4932 If these patterns are not defined, the operation will be constructed
4933 from a compare-and-swap operation, if defined.
4934
4935 @cindex @code{sync_old_add@var{mode}} instruction pattern
4936 @cindex @code{sync_old_sub@var{mode}} instruction pattern
4937 @cindex @code{sync_old_ior@var{mode}} instruction pattern
4938 @cindex @code{sync_old_and@var{mode}} instruction pattern
4939 @cindex @code{sync_old_xor@var{mode}} instruction pattern
4940 @cindex @code{sync_old_nand@var{mode}} instruction pattern
4941 @item @samp{sync_old_add@var{mode}}, @samp{sync_old_sub@var{mode}}
4942 @itemx @samp{sync_old_ior@var{mode}}, @samp{sync_old_and@var{mode}}
4943 @itemx @samp{sync_old_xor@var{mode}}, @samp{sync_old_nand@var{mode}}
4944
4945 These patterns are emit code for an atomic operation on memory,
4946 and return the value that the memory contained before the operation.
4947 Operand 0 is the result value, operand 1 is the memory on which the
4948 atomic operation is performed, and operand 2 is the second operand
4949 to the binary operator.
4950
4951 This pattern must issue any memory barrier instructions such that all
4952 memory operations before the atomic operation occur before the atomic
4953 operation and all memory operations after the atomic operation occur
4954 after the atomic operation.
4955
4956 If these patterns are not defined, the operation will be constructed
4957 from a compare-and-swap operation, if defined.
4958
4959 @cindex @code{sync_new_add@var{mode}} instruction pattern
4960 @cindex @code{sync_new_sub@var{mode}} instruction pattern
4961 @cindex @code{sync_new_ior@var{mode}} instruction pattern
4962 @cindex @code{sync_new_and@var{mode}} instruction pattern
4963 @cindex @code{sync_new_xor@var{mode}} instruction pattern
4964 @cindex @code{sync_new_nand@var{mode}} instruction pattern
4965 @item @samp{sync_new_add@var{mode}}, @samp{sync_new_sub@var{mode}}
4966 @itemx @samp{sync_new_ior@var{mode}}, @samp{sync_new_and@var{mode}}
4967 @itemx @samp{sync_new_xor@var{mode}}, @samp{sync_new_nand@var{mode}}
4968
4969 These patterns are like their @code{sync_old_@var{op}} counterparts,
4970 except that they return the value that exists in the memory location
4971 after the operation, rather than before the operation.
4972
4973 @cindex @code{sync_lock_test_and_set@var{mode}} instruction pattern
4974 @item @samp{sync_lock_test_and_set@var{mode}}
4975
4976 This pattern takes two forms, based on the capabilities of the target.
4977 In either case, operand 0 is the result of the operand, operand 1 is
4978 the memory on which the atomic operation is performed, and operand 2
4979 is the value to set in the lock.
4980
4981 In the ideal case, this operation is an atomic exchange operation, in
4982 which the previous value in memory operand is copied into the result
4983 operand, and the value operand is stored in the memory operand.
4984
4985 For less capable targets, any value operand that is not the constant 1
4986 should be rejected with @code{FAIL}.  In this case the target may use
4987 an atomic test-and-set bit operation.  The result operand should contain
4988 1 if the bit was previously set and 0 if the bit was previously clear.
4989 The true contents of the memory operand are implementation defined.
4990
4991 This pattern must issue any memory barrier instructions such that the
4992 pattern as a whole acts as an acquire barrier, that is all memory
4993 operations after the pattern do not occur until the lock is acquired.
4994
4995 If this pattern is not defined, the operation will be constructed from
4996 a compare-and-swap operation, if defined.
4997
4998 @cindex @code{sync_lock_release@var{mode}} instruction pattern
4999 @item @samp{sync_lock_release@var{mode}}
5000
5001 This pattern, if defined, releases a lock set by
5002 @code{sync_lock_test_and_set@var{mode}}.  Operand 0 is the memory
5003 that contains the lock; operand 1 is the value to store in the lock.
5004
5005 If the target doesn't implement full semantics for
5006 @code{sync_lock_test_and_set@var{mode}}, any value operand which is not
5007 the constant 0 should be rejected with @code{FAIL}, and the true contents
5008 of the memory operand are implementation defined.
5009
5010 This pattern must issue any memory barrier instructions such that the
5011 pattern as a whole acts as a release barrier, that is the lock is
5012 released only after all previous memory operations have completed.
5013
5014 If this pattern is not defined, then a @code{memory_barrier} pattern
5015 will be emitted, followed by a store of the value to the memory operand.
5016
5017 @cindex @code{stack_protect_set} instruction pattern
5018 @item @samp{stack_protect_set}
5019
5020 This pattern, if defined, moves a @code{Pmode} value from the memory
5021 in operand 1 to the memory in operand 0 without leaving the value in
5022 a register afterward.  This is to avoid leaking the value some place
5023 that an attacker might use to rewrite the stack guard slot after
5024 having clobbered it.
5025
5026 If this pattern is not defined, then a plain move pattern is generated.
5027
5028 @cindex @code{stack_protect_test} instruction pattern
5029 @item @samp{stack_protect_test}
5030
5031 This pattern, if defined, compares a @code{Pmode} value from the
5032 memory in operand 1 with the memory in operand 0 without leaving the
5033 value in a register afterward and branches to operand 2 if the values
5034 weren't equal.
5035
5036 If this pattern is not defined, then a plain compare pattern and
5037 conditional branch pattern is used.
5038
5039 @end table
5040
5041 @end ifset
5042 @c Each of the following nodes are wrapped in separate
5043 @c "@ifset INTERNALS" to work around memory limits for the default
5044 @c configuration in older tetex distributions.  Known to not work:
5045 @c tetex-1.0.7, known to work: tetex-2.0.2.
5046 @ifset INTERNALS
5047 @node Pattern Ordering
5048 @section When the Order of Patterns Matters
5049 @cindex Pattern Ordering
5050 @cindex Ordering of Patterns
5051
5052 Sometimes an insn can match more than one instruction pattern.  Then the
5053 pattern that appears first in the machine description is the one used.
5054 Therefore, more specific patterns (patterns that will match fewer things)
5055 and faster instructions (those that will produce better code when they
5056 do match) should usually go first in the description.
5057
5058 In some cases the effect of ordering the patterns can be used to hide
5059 a pattern when it is not valid.  For example, the 68000 has an
5060 instruction for converting a fullword to floating point and another
5061 for converting a byte to floating point.  An instruction converting
5062 an integer to floating point could match either one.  We put the
5063 pattern to convert the fullword first to make sure that one will
5064 be used rather than the other.  (Otherwise a large integer might
5065 be generated as a single-byte immediate quantity, which would not work.)
5066 Instead of using this pattern ordering it would be possible to make the
5067 pattern for convert-a-byte smart enough to deal properly with any
5068 constant value.
5069
5070 @end ifset
5071 @ifset INTERNALS
5072 @node Dependent Patterns
5073 @section Interdependence of Patterns
5074 @cindex Dependent Patterns
5075 @cindex Interdependence of Patterns
5076
5077 Every machine description must have a named pattern for each of the
5078 conditional branch names @samp{b@var{cond}}.  The recognition template
5079 must always have the form
5080
5081 @smallexample
5082 (set (pc)
5083      (if_then_else (@var{cond} (cc0) (const_int 0))
5084                    (label_ref (match_operand 0 "" ""))
5085                    (pc)))
5086 @end smallexample
5087
5088 @noindent
5089 In addition, every machine description must have an anonymous pattern
5090 for each of the possible reverse-conditional branches.  Their templates
5091 look like
5092
5093 @smallexample
5094 (set (pc)
5095      (if_then_else (@var{cond} (cc0) (const_int 0))
5096                    (pc)
5097                    (label_ref (match_operand 0 "" ""))))
5098 @end smallexample
5099
5100 @noindent
5101 They are necessary because jump optimization can turn direct-conditional
5102 branches into reverse-conditional branches.
5103
5104 It is often convenient to use the @code{match_operator} construct to
5105 reduce the number of patterns that must be specified for branches.  For
5106 example,
5107
5108 @smallexample
5109 (define_insn ""
5110   [(set (pc)
5111         (if_then_else (match_operator 0 "comparison_operator"
5112                                       [(cc0) (const_int 0)])
5113                       (pc)
5114                       (label_ref (match_operand 1 "" ""))))]
5115   "@var{condition}"
5116   "@dots{}")
5117 @end smallexample
5118
5119 In some cases machines support instructions identical except for the
5120 machine mode of one or more operands.  For example, there may be
5121 ``sign-extend halfword'' and ``sign-extend byte'' instructions whose
5122 patterns are
5123
5124 @smallexample
5125 (set (match_operand:SI 0 @dots{})
5126      (extend:SI (match_operand:HI 1 @dots{})))
5127
5128 (set (match_operand:SI 0 @dots{})
5129      (extend:SI (match_operand:QI 1 @dots{})))
5130 @end smallexample
5131
5132 @noindent
5133 Constant integers do not specify a machine mode, so an instruction to
5134 extend a constant value could match either pattern.  The pattern it
5135 actually will match is the one that appears first in the file.  For correct
5136 results, this must be the one for the widest possible mode (@code{HImode},
5137 here).  If the pattern matches the @code{QImode} instruction, the results
5138 will be incorrect if the constant value does not actually fit that mode.
5139
5140 Such instructions to extend constants are rarely generated because they are
5141 optimized away, but they do occasionally happen in nonoptimized
5142 compilations.
5143
5144 If a constraint in a pattern allows a constant, the reload pass may
5145 replace a register with a constant permitted by the constraint in some
5146 cases.  Similarly for memory references.  Because of this substitution,
5147 you should not provide separate patterns for increment and decrement
5148 instructions.  Instead, they should be generated from the same pattern
5149 that supports register-register add insns by examining the operands and
5150 generating the appropriate machine instruction.
5151
5152 @end ifset
5153 @ifset INTERNALS
5154 @node Jump Patterns
5155 @section Defining Jump Instruction Patterns
5156 @cindex jump instruction patterns
5157 @cindex defining jump instruction patterns
5158
5159 For most machines, GCC assumes that the machine has a condition code.
5160 A comparison insn sets the condition code, recording the results of both
5161 signed and unsigned comparison of the given operands.  A separate branch
5162 insn tests the condition code and branches or not according its value.
5163 The branch insns come in distinct signed and unsigned flavors.  Many
5164 common machines, such as the VAX, the 68000 and the 32000, work this
5165 way.
5166
5167 Some machines have distinct signed and unsigned compare instructions, and
5168 only one set of conditional branch instructions.  The easiest way to handle
5169 these machines is to treat them just like the others until the final stage
5170 where assembly code is written.  At this time, when outputting code for the
5171 compare instruction, peek ahead at the following branch using
5172 @code{next_cc0_user (insn)}.  (The variable @code{insn} refers to the insn
5173 being output, in the output-writing code in an instruction pattern.)  If
5174 the RTL says that is an unsigned branch, output an unsigned compare;
5175 otherwise output a signed compare.  When the branch itself is output, you
5176 can treat signed and unsigned branches identically.
5177
5178 The reason you can do this is that GCC always generates a pair of
5179 consecutive RTL insns, possibly separated by @code{note} insns, one to
5180 set the condition code and one to test it, and keeps the pair inviolate
5181 until the end.
5182
5183 To go with this technique, you must define the machine-description macro
5184 @code{NOTICE_UPDATE_CC} to do @code{CC_STATUS_INIT}; in other words, no
5185 compare instruction is superfluous.
5186
5187 Some machines have compare-and-branch instructions and no condition code.
5188 A similar technique works for them.  When it is time to ``output'' a
5189 compare instruction, record its operands in two static variables.  When
5190 outputting the branch-on-condition-code instruction that follows, actually
5191 output a compare-and-branch instruction that uses the remembered operands.
5192
5193 It also works to define patterns for compare-and-branch instructions.
5194 In optimizing compilation, the pair of compare and branch instructions
5195 will be combined according to these patterns.  But this does not happen
5196 if optimization is not requested.  So you must use one of the solutions
5197 above in addition to any special patterns you define.
5198
5199 In many RISC machines, most instructions do not affect the condition
5200 code and there may not even be a separate condition code register.  On
5201 these machines, the restriction that the definition and use of the
5202 condition code be adjacent insns is not necessary and can prevent
5203 important optimizations.  For example, on the IBM RS/6000, there is a
5204 delay for taken branches unless the condition code register is set three
5205 instructions earlier than the conditional branch.  The instruction
5206 scheduler cannot perform this optimization if it is not permitted to
5207 separate the definition and use of the condition code register.
5208
5209 On these machines, do not use @code{(cc0)}, but instead use a register
5210 to represent the condition code.  If there is a specific condition code
5211 register in the machine, use a hard register.  If the condition code or
5212 comparison result can be placed in any general register, or if there are
5213 multiple condition registers, use a pseudo register.
5214
5215 @findex prev_cc0_setter
5216 @findex next_cc0_user
5217 On some machines, the type of branch instruction generated may depend on
5218 the way the condition code was produced; for example, on the 68k and
5219 SPARC, setting the condition code directly from an add or subtract
5220 instruction does not clear the overflow bit the way that a test
5221 instruction does, so a different branch instruction must be used for
5222 some conditional branches.  For machines that use @code{(cc0)}, the set
5223 and use of the condition code must be adjacent (separated only by
5224 @code{note} insns) allowing flags in @code{cc_status} to be used.
5225 (@xref{Condition Code}.)  Also, the comparison and branch insns can be
5226 located from each other by using the functions @code{prev_cc0_setter}
5227 and @code{next_cc0_user}.
5228
5229 However, this is not true on machines that do not use @code{(cc0)}.  On
5230 those machines, no assumptions can be made about the adjacency of the
5231 compare and branch insns and the above methods cannot be used.  Instead,
5232 we use the machine mode of the condition code register to record
5233 different formats of the condition code register.
5234
5235 Registers used to store the condition code value should have a mode that
5236 is in class @code{MODE_CC}.  Normally, it will be @code{CCmode}.  If
5237 additional modes are required (as for the add example mentioned above in
5238 the SPARC), define them in @file{@var{machine}-modes.def}
5239 (@pxref{Condition Code}).  Also define @code{SELECT_CC_MODE} to choose
5240 a mode given an operand of a compare.
5241
5242 If it is known during RTL generation that a different mode will be
5243 required (for example, if the machine has separate compare instructions
5244 for signed and unsigned quantities, like most IBM processors), they can
5245 be specified at that time.
5246
5247 If the cases that require different modes would be made by instruction
5248 combination, the macro @code{SELECT_CC_MODE} determines which machine
5249 mode should be used for the comparison result.  The patterns should be
5250 written using that mode.  To support the case of the add on the SPARC
5251 discussed above, we have the pattern
5252
5253 @smallexample
5254 (define_insn ""
5255   [(set (reg:CC_NOOV 0)
5256         (compare:CC_NOOV
5257           (plus:SI (match_operand:SI 0 "register_operand" "%r")
5258                    (match_operand:SI 1 "arith_operand" "rI"))
5259           (const_int 0)))]
5260   ""
5261   "@dots{}")
5262 @end smallexample
5263
5264 The @code{SELECT_CC_MODE} macro on the SPARC returns @code{CC_NOOVmode}
5265 for comparisons whose argument is a @code{plus}.
5266
5267 @end ifset
5268 @ifset INTERNALS
5269 @node Looping Patterns
5270 @section Defining Looping Instruction Patterns
5271 @cindex looping instruction patterns
5272 @cindex defining looping instruction patterns
5273
5274 Some machines have special jump instructions that can be utilized to
5275 make loops more efficient.  A common example is the 68000 @samp{dbra}
5276 instruction which performs a decrement of a register and a branch if the
5277 result was greater than zero.  Other machines, in particular digital
5278 signal processors (DSPs), have special block repeat instructions to
5279 provide low-overhead loop support.  For example, the TI TMS320C3x/C4x
5280 DSPs have a block repeat instruction that loads special registers to
5281 mark the top and end of a loop and to count the number of loop
5282 iterations.  This avoids the need for fetching and executing a
5283 @samp{dbra}-like instruction and avoids pipeline stalls associated with
5284 the jump.
5285
5286 GCC has three special named patterns to support low overhead looping.
5287 They are @samp{decrement_and_branch_until_zero}, @samp{doloop_begin},
5288 and @samp{doloop_end}.  The first pattern,
5289 @samp{decrement_and_branch_until_zero}, is not emitted during RTL
5290 generation but may be emitted during the instruction combination phase.
5291 This requires the assistance of the loop optimizer, using information
5292 collected during strength reduction, to reverse a loop to count down to
5293 zero.  Some targets also require the loop optimizer to add a
5294 @code{REG_NONNEG} note to indicate that the iteration count is always
5295 positive.  This is needed if the target performs a signed loop
5296 termination test.  For example, the 68000 uses a pattern similar to the
5297 following for its @code{dbra} instruction:
5298
5299 @smallexample
5300 @group
5301 (define_insn "decrement_and_branch_until_zero"
5302   [(set (pc)
5303         (if_then_else
5304           (ge (plus:SI (match_operand:SI 0 "general_operand" "+d*am")
5305                        (const_int -1))
5306               (const_int 0))
5307           (label_ref (match_operand 1 "" ""))
5308           (pc)))
5309    (set (match_dup 0)
5310         (plus:SI (match_dup 0)
5311                  (const_int -1)))]
5312   "find_reg_note (insn, REG_NONNEG, 0)"
5313   "@dots{}")
5314 @end group
5315 @end smallexample
5316
5317 Note that since the insn is both a jump insn and has an output, it must
5318 deal with its own reloads, hence the `m' constraints.  Also note that
5319 since this insn is generated by the instruction combination phase
5320 combining two sequential insns together into an implicit parallel insn,
5321 the iteration counter needs to be biased by the same amount as the
5322 decrement operation, in this case @minus{}1.  Note that the following similar
5323 pattern will not be matched by the combiner.
5324
5325 @smallexample
5326 @group
5327 (define_insn "decrement_and_branch_until_zero"
5328   [(set (pc)
5329         (if_then_else
5330           (ge (match_operand:SI 0 "general_operand" "+d*am")
5331               (const_int 1))
5332           (label_ref (match_operand 1 "" ""))
5333           (pc)))
5334    (set (match_dup 0)
5335         (plus:SI (match_dup 0)
5336                  (const_int -1)))]
5337   "find_reg_note (insn, REG_NONNEG, 0)"
5338   "@dots{}")
5339 @end group
5340 @end smallexample
5341
5342 The other two special looping patterns, @samp{doloop_begin} and
5343 @samp{doloop_end}, are emitted by the loop optimizer for certain
5344 well-behaved loops with a finite number of loop iterations using
5345 information collected during strength reduction.
5346
5347 The @samp{doloop_end} pattern describes the actual looping instruction
5348 (or the implicit looping operation) and the @samp{doloop_begin} pattern
5349 is an optional companion pattern that can be used for initialization
5350 needed for some low-overhead looping instructions.
5351
5352 Note that some machines require the actual looping instruction to be
5353 emitted at the top of the loop (e.g., the TMS320C3x/C4x DSPs).  Emitting
5354 the true RTL for a looping instruction at the top of the loop can cause
5355 problems with flow analysis.  So instead, a dummy @code{doloop} insn is
5356 emitted at the end of the loop.  The machine dependent reorg pass checks
5357 for the presence of this @code{doloop} insn and then searches back to
5358 the top of the loop, where it inserts the true looping insn (provided
5359 there are no instructions in the loop which would cause problems).  Any
5360 additional labels can be emitted at this point.  In addition, if the
5361 desired special iteration counter register was not allocated, this
5362 machine dependent reorg pass could emit a traditional compare and jump
5363 instruction pair.
5364
5365 The essential difference between the
5366 @samp{decrement_and_branch_until_zero} and the @samp{doloop_end}
5367 patterns is that the loop optimizer allocates an additional pseudo
5368 register for the latter as an iteration counter.  This pseudo register
5369 cannot be used within the loop (i.e., general induction variables cannot
5370 be derived from it), however, in many cases the loop induction variable
5371 may become redundant and removed by the flow pass.
5372
5373
5374 @end ifset
5375 @ifset INTERNALS
5376 @node Insn Canonicalizations
5377 @section Canonicalization of Instructions
5378 @cindex canonicalization of instructions
5379 @cindex insn canonicalization
5380
5381 There are often cases where multiple RTL expressions could represent an
5382 operation performed by a single machine instruction.  This situation is
5383 most commonly encountered with logical, branch, and multiply-accumulate
5384 instructions.  In such cases, the compiler attempts to convert these
5385 multiple RTL expressions into a single canonical form to reduce the
5386 number of insn patterns required.
5387
5388 In addition to algebraic simplifications, following canonicalizations
5389 are performed:
5390
5391 @itemize @bullet
5392 @item
5393 For commutative and comparison operators, a constant is always made the
5394 second operand.  If a machine only supports a constant as the second
5395 operand, only patterns that match a constant in the second operand need
5396 be supplied.
5397
5398 @item
5399 For associative operators, a sequence of operators will always chain
5400 to the left; for instance, only the left operand of an integer @code{plus}
5401 can itself be a @code{plus}.  @code{and}, @code{ior}, @code{xor},
5402 @code{plus}, @code{mult}, @code{smin}, @code{smax}, @code{umin}, and
5403 @code{umax} are associative when applied to integers, and sometimes to
5404 floating-point.
5405
5406 @item
5407 @cindex @code{neg}, canonicalization of
5408 @cindex @code{not}, canonicalization of
5409 @cindex @code{mult}, canonicalization of
5410 @cindex @code{plus}, canonicalization of
5411 @cindex @code{minus}, canonicalization of
5412 For these operators, if only one operand is a @code{neg}, @code{not},
5413 @code{mult}, @code{plus}, or @code{minus} expression, it will be the
5414 first operand.
5415
5416 @item
5417 In combinations of @code{neg}, @code{mult}, @code{plus}, and
5418 @code{minus}, the @code{neg} operations (if any) will be moved inside
5419 the operations as far as possible.  For instance,
5420 @code{(neg (mult A B))} is canonicalized as @code{(mult (neg A) B)}, but
5421 @code{(plus (mult (neg A) B) C)} is canonicalized as
5422 @code{(minus A (mult B C))}.
5423
5424 @cindex @code{compare}, canonicalization of
5425 @item
5426 For the @code{compare} operator, a constant is always the second operand
5427 on machines where @code{cc0} is used (@pxref{Jump Patterns}).  On other
5428 machines, there are rare cases where the compiler might want to construct
5429 a @code{compare} with a constant as the first operand.  However, these
5430 cases are not common enough for it to be worthwhile to provide a pattern
5431 matching a constant as the first operand unless the machine actually has
5432 such an instruction.
5433
5434 An operand of @code{neg}, @code{not}, @code{mult}, @code{plus}, or
5435 @code{minus} is made the first operand under the same conditions as
5436 above.
5437
5438 @item
5439 @code{(minus @var{x} (const_int @var{n}))} is converted to
5440 @code{(plus @var{x} (const_int @var{-n}))}.
5441
5442 @item
5443 Within address computations (i.e., inside @code{mem}), a left shift is
5444 converted into the appropriate multiplication by a power of two.
5445
5446 @cindex @code{ior}, canonicalization of
5447 @cindex @code{and}, canonicalization of
5448 @cindex De Morgan's law
5449 @item
5450 De Morgan's Law is used to move bitwise negation inside a bitwise
5451 logical-and or logical-or operation.  If this results in only one
5452 operand being a @code{not} expression, it will be the first one.
5453
5454 A machine that has an instruction that performs a bitwise logical-and of one
5455 operand with the bitwise negation of the other should specify the pattern
5456 for that instruction as
5457
5458 @smallexample
5459 (define_insn ""
5460   [(set (match_operand:@var{m} 0 @dots{})
5461         (and:@var{m} (not:@var{m} (match_operand:@var{m} 1 @dots{}))
5462                      (match_operand:@var{m} 2 @dots{})))]
5463   "@dots{}"
5464   "@dots{}")
5465 @end smallexample
5466
5467 @noindent
5468 Similarly, a pattern for a ``NAND'' instruction should be written
5469
5470 @smallexample
5471 (define_insn ""
5472   [(set (match_operand:@var{m} 0 @dots{})
5473         (ior:@var{m} (not:@var{m} (match_operand:@var{m} 1 @dots{}))
5474                      (not:@var{m} (match_operand:@var{m} 2 @dots{}))))]
5475   "@dots{}"
5476   "@dots{}")
5477 @end smallexample
5478
5479 In both cases, it is not necessary to include patterns for the many
5480 logically equivalent RTL expressions.
5481
5482 @cindex @code{xor}, canonicalization of
5483 @item
5484 The only possible RTL expressions involving both bitwise exclusive-or
5485 and bitwise negation are @code{(xor:@var{m} @var{x} @var{y})}
5486 and @code{(not:@var{m} (xor:@var{m} @var{x} @var{y}))}.
5487
5488 @item
5489 The sum of three items, one of which is a constant, will only appear in
5490 the form
5491
5492 @smallexample
5493 (plus:@var{m} (plus:@var{m} @var{x} @var{y}) @var{constant})
5494 @end smallexample
5495
5496 @item
5497 On machines that do not use @code{cc0},
5498 @code{(compare @var{x} (const_int 0))} will be converted to
5499 @var{x}.
5500
5501 @cindex @code{zero_extract}, canonicalization of
5502 @cindex @code{sign_extract}, canonicalization of
5503 @item
5504 Equality comparisons of a group of bits (usually a single bit) with zero
5505 will be written using @code{zero_extract} rather than the equivalent
5506 @code{and} or @code{sign_extract} operations.
5507
5508 @end itemize
5509
5510 Further canonicalization rules are defined in the function
5511 @code{commutative_operand_precedence} in @file{gcc/rtlanal.c}.
5512
5513 @end ifset
5514 @ifset INTERNALS
5515 @node Expander Definitions
5516 @section Defining RTL Sequences for Code Generation
5517 @cindex expander definitions
5518 @cindex code generation RTL sequences
5519 @cindex defining RTL sequences for code generation
5520
5521 On some target machines, some standard pattern names for RTL generation
5522 cannot be handled with single insn, but a sequence of RTL insns can
5523 represent them.  For these target machines, you can write a
5524 @code{define_expand} to specify how to generate the sequence of RTL@.
5525
5526 @findex define_expand
5527 A @code{define_expand} is an RTL expression that looks almost like a
5528 @code{define_insn}; but, unlike the latter, a @code{define_expand} is used
5529 only for RTL generation and it can produce more than one RTL insn.
5530
5531 A @code{define_expand} RTX has four operands:
5532
5533 @itemize @bullet
5534 @item
5535 The name.  Each @code{define_expand} must have a name, since the only
5536 use for it is to refer to it by name.
5537
5538 @item
5539 The RTL template.  This is a vector of RTL expressions representing
5540 a sequence of separate instructions.  Unlike @code{define_insn}, there
5541 is no implicit surrounding @code{PARALLEL}.
5542
5543 @item
5544 The condition, a string containing a C expression.  This expression is
5545 used to express how the availability of this pattern depends on
5546 subclasses of target machine, selected by command-line options when GCC
5547 is run.  This is just like the condition of a @code{define_insn} that
5548 has a standard name.  Therefore, the condition (if present) may not
5549 depend on the data in the insn being matched, but only the
5550 target-machine-type flags.  The compiler needs to test these conditions
5551 during initialization in order to learn exactly which named instructions
5552 are available in a particular run.
5553
5554 @item
5555 The preparation statements, a string containing zero or more C
5556 statements which are to be executed before RTL code is generated from
5557 the RTL template.
5558
5559 Usually these statements prepare temporary registers for use as
5560 internal operands in the RTL template, but they can also generate RTL
5561 insns directly by calling routines such as @code{emit_insn}, etc.
5562 Any such insns precede the ones that come from the RTL template.
5563 @end itemize
5564
5565 Every RTL insn emitted by a @code{define_expand} must match some
5566 @code{define_insn} in the machine description.  Otherwise, the compiler
5567 will crash when trying to generate code for the insn or trying to optimize
5568 it.
5569
5570 The RTL template, in addition to controlling generation of RTL insns,
5571 also describes the operands that need to be specified when this pattern
5572 is used.  In particular, it gives a predicate for each operand.
5573
5574 A true operand, which needs to be specified in order to generate RTL from
5575 the pattern, should be described with a @code{match_operand} in its first
5576 occurrence in the RTL template.  This enters information on the operand's
5577 predicate into the tables that record such things.  GCC uses the
5578 information to preload the operand into a register if that is required for
5579 valid RTL code.  If the operand is referred to more than once, subsequent
5580 references should use @code{match_dup}.
5581
5582 The RTL template may also refer to internal ``operands'' which are
5583 temporary registers or labels used only within the sequence made by the
5584 @code{define_expand}.  Internal operands are substituted into the RTL
5585 template with @code{match_dup}, never with @code{match_operand}.  The
5586 values of the internal operands are not passed in as arguments by the
5587 compiler when it requests use of this pattern.  Instead, they are computed
5588 within the pattern, in the preparation statements.  These statements
5589 compute the values and store them into the appropriate elements of
5590 @code{operands} so that @code{match_dup} can find them.
5591
5592 There are two special macros defined for use in the preparation statements:
5593 @code{DONE} and @code{FAIL}.  Use them with a following semicolon,
5594 as a statement.
5595
5596 @table @code
5597
5598 @findex DONE
5599 @item DONE
5600 Use the @code{DONE} macro to end RTL generation for the pattern.  The
5601 only RTL insns resulting from the pattern on this occasion will be
5602 those already emitted by explicit calls to @code{emit_insn} within the
5603 preparation statements; the RTL template will not be generated.
5604
5605 @findex FAIL
5606 @item FAIL
5607 Make the pattern fail on this occasion.  When a pattern fails, it means
5608 that the pattern was not truly available.  The calling routines in the
5609 compiler will try other strategies for code generation using other patterns.
5610
5611 Failure is currently supported only for binary (addition, multiplication,
5612 shifting, etc.) and bit-field (@code{extv}, @code{extzv}, and @code{insv})
5613 operations.
5614 @end table
5615
5616 If the preparation falls through (invokes neither @code{DONE} nor
5617 @code{FAIL}), then the @code{define_expand} acts like a
5618 @code{define_insn} in that the RTL template is used to generate the
5619 insn.
5620
5621 The RTL template is not used for matching, only for generating the
5622 initial insn list.  If the preparation statement always invokes
5623 @code{DONE} or @code{FAIL}, the RTL template may be reduced to a simple
5624 list of operands, such as this example:
5625
5626 @smallexample
5627 @group
5628 (define_expand "addsi3"
5629   [(match_operand:SI 0 "register_operand" "")
5630    (match_operand:SI 1 "register_operand" "")
5631    (match_operand:SI 2 "register_operand" "")]
5632 @end group
5633 @group
5634   ""
5635   "
5636 @{
5637   handle_add (operands[0], operands[1], operands[2]);
5638   DONE;
5639 @}")
5640 @end group
5641 @end smallexample
5642
5643 Here is an example, the definition of left-shift for the SPUR chip:
5644
5645 @smallexample
5646 @group
5647 (define_expand "ashlsi3"
5648   [(set (match_operand:SI 0 "register_operand" "")
5649         (ashift:SI
5650 @end group
5651 @group
5652           (match_operand:SI 1 "register_operand" "")
5653           (match_operand:SI 2 "nonmemory_operand" "")))]
5654   ""
5655   "
5656 @end group
5657 @end smallexample
5658
5659 @smallexample
5660 @group
5661 @{
5662   if (GET_CODE (operands[2]) != CONST_INT
5663       || (unsigned) INTVAL (operands[2]) > 3)
5664     FAIL;
5665 @}")
5666 @end group
5667 @end smallexample
5668
5669 @noindent
5670 This example uses @code{define_expand} so that it can generate an RTL insn
5671 for shifting when the shift-count is in the supported range of 0 to 3 but
5672 fail in other cases where machine insns aren't available.  When it fails,
5673 the compiler tries another strategy using different patterns (such as, a
5674 library call).
5675
5676 If the compiler were able to handle nontrivial condition-strings in
5677 patterns with names, then it would be possible to use a
5678 @code{define_insn} in that case.  Here is another case (zero-extension
5679 on the 68000) which makes more use of the power of @code{define_expand}:
5680
5681 @smallexample
5682 (define_expand "zero_extendhisi2"
5683   [(set (match_operand:SI 0 "general_operand" "")
5684         (const_int 0))
5685    (set (strict_low_part
5686           (subreg:HI
5687             (match_dup 0)
5688             0))
5689         (match_operand:HI 1 "general_operand" ""))]
5690   ""
5691   "operands[1] = make_safe_from (operands[1], operands[0]);")
5692 @end smallexample
5693
5694 @noindent
5695 @findex make_safe_from
5696 Here two RTL insns are generated, one to clear the entire output operand
5697 and the other to copy the input operand into its low half.  This sequence
5698 is incorrect if the input operand refers to [the old value of] the output
5699 operand, so the preparation statement makes sure this isn't so.  The
5700 function @code{make_safe_from} copies the @code{operands[1]} into a
5701 temporary register if it refers to @code{operands[0]}.  It does this
5702 by emitting another RTL insn.
5703
5704 Finally, a third example shows the use of an internal operand.
5705 Zero-extension on the SPUR chip is done by @code{and}-ing the result
5706 against a halfword mask.  But this mask cannot be represented by a
5707 @code{const_int} because the constant value is too large to be legitimate
5708 on this machine.  So it must be copied into a register with
5709 @code{force_reg} and then the register used in the @code{and}.
5710
5711 @smallexample
5712 (define_expand "zero_extendhisi2"
5713   [(set (match_operand:SI 0 "register_operand" "")
5714         (and:SI (subreg:SI
5715                   (match_operand:HI 1 "register_operand" "")
5716                   0)
5717                 (match_dup 2)))]
5718   ""
5719   "operands[2]
5720      = force_reg (SImode, GEN_INT (65535)); ")
5721 @end smallexample
5722
5723 @emph{Note:} If the @code{define_expand} is used to serve a
5724 standard binary or unary arithmetic operation or a bit-field operation,
5725 then the last insn it generates must not be a @code{code_label},
5726 @code{barrier} or @code{note}.  It must be an @code{insn},
5727 @code{jump_insn} or @code{call_insn}.  If you don't need a real insn
5728 at the end, emit an insn to copy the result of the operation into
5729 itself.  Such an insn will generate no code, but it can avoid problems
5730 in the compiler.
5731
5732 @end ifset
5733 @ifset INTERNALS
5734 @node Insn Splitting
5735 @section Defining How to Split Instructions
5736 @cindex insn splitting
5737 @cindex instruction splitting
5738 @cindex splitting instructions
5739
5740 There are two cases where you should specify how to split a pattern
5741 into multiple insns.  On machines that have instructions requiring
5742 delay slots (@pxref{Delay Slots}) or that have instructions whose
5743 output is not available for multiple cycles (@pxref{Processor pipeline
5744 description}), the compiler phases that optimize these cases need to
5745 be able to move insns into one-instruction delay slots.  However, some
5746 insns may generate more than one machine instruction.  These insns
5747 cannot be placed into a delay slot.
5748
5749 Often you can rewrite the single insn as a list of individual insns,
5750 each corresponding to one machine instruction.  The disadvantage of
5751 doing so is that it will cause the compilation to be slower and require
5752 more space.  If the resulting insns are too complex, it may also
5753 suppress some optimizations.  The compiler splits the insn if there is a
5754 reason to believe that it might improve instruction or delay slot
5755 scheduling.
5756
5757 The insn combiner phase also splits putative insns.  If three insns are
5758 merged into one insn with a complex expression that cannot be matched by
5759 some @code{define_insn} pattern, the combiner phase attempts to split
5760 the complex pattern into two insns that are recognized.  Usually it can
5761 break the complex pattern into two patterns by splitting out some
5762 subexpression.  However, in some other cases, such as performing an
5763 addition of a large constant in two insns on a RISC machine, the way to
5764 split the addition into two insns is machine-dependent.
5765
5766 @findex define_split
5767 The @code{define_split} definition tells the compiler how to split a
5768 complex insn into several simpler insns.  It looks like this:
5769
5770 @smallexample
5771 (define_split
5772   [@var{insn-pattern}]
5773   "@var{condition}"
5774   [@var{new-insn-pattern-1}
5775    @var{new-insn-pattern-2}
5776    @dots{}]
5777   "@var{preparation-statements}")
5778 @end smallexample
5779
5780 @var{insn-pattern} is a pattern that needs to be split and
5781 @var{condition} is the final condition to be tested, as in a
5782 @code{define_insn}.  When an insn matching @var{insn-pattern} and
5783 satisfying @var{condition} is found, it is replaced in the insn list
5784 with the insns given by @var{new-insn-pattern-1},
5785 @var{new-insn-pattern-2}, etc.
5786
5787 The @var{preparation-statements} are similar to those statements that
5788 are specified for @code{define_expand} (@pxref{Expander Definitions})
5789 and are executed before the new RTL is generated to prepare for the
5790 generated code or emit some insns whose pattern is not fixed.  Unlike
5791 those in @code{define_expand}, however, these statements must not
5792 generate any new pseudo-registers.  Once reload has completed, they also
5793 must not allocate any space in the stack frame.
5794
5795 Patterns are matched against @var{insn-pattern} in two different
5796 circumstances.  If an insn needs to be split for delay slot scheduling
5797 or insn scheduling, the insn is already known to be valid, which means
5798 that it must have been matched by some @code{define_insn} and, if
5799 @code{reload_completed} is nonzero, is known to satisfy the constraints
5800 of that @code{define_insn}.  In that case, the new insn patterns must
5801 also be insns that are matched by some @code{define_insn} and, if
5802 @code{reload_completed} is nonzero, must also satisfy the constraints
5803 of those definitions.
5804
5805 As an example of this usage of @code{define_split}, consider the following
5806 example from @file{a29k.md}, which splits a @code{sign_extend} from
5807 @code{HImode} to @code{SImode} into a pair of shift insns:
5808
5809 @smallexample
5810 (define_split
5811   [(set (match_operand:SI 0 "gen_reg_operand" "")
5812         (sign_extend:SI (match_operand:HI 1 "gen_reg_operand" "")))]
5813   ""
5814   [(set (match_dup 0)
5815         (ashift:SI (match_dup 1)
5816                    (const_int 16)))
5817    (set (match_dup 0)
5818         (ashiftrt:SI (match_dup 0)
5819                      (const_int 16)))]
5820   "
5821 @{ operands[1] = gen_lowpart (SImode, operands[1]); @}")
5822 @end smallexample
5823
5824 When the combiner phase tries to split an insn pattern, it is always the
5825 case that the pattern is @emph{not} matched by any @code{define_insn}.
5826 The combiner pass first tries to split a single @code{set} expression
5827 and then the same @code{set} expression inside a @code{parallel}, but
5828 followed by a @code{clobber} of a pseudo-reg to use as a scratch
5829 register.  In these cases, the combiner expects exactly two new insn
5830 patterns to be generated.  It will verify that these patterns match some
5831 @code{define_insn} definitions, so you need not do this test in the
5832 @code{define_split} (of course, there is no point in writing a
5833 @code{define_split} that will never produce insns that match).
5834
5835 Here is an example of this use of @code{define_split}, taken from
5836 @file{rs6000.md}:
5837
5838 @smallexample
5839 (define_split
5840   [(set (match_operand:SI 0 "gen_reg_operand" "")
5841         (plus:SI (match_operand:SI 1 "gen_reg_operand" "")
5842                  (match_operand:SI 2 "non_add_cint_operand" "")))]
5843   ""
5844   [(set (match_dup 0) (plus:SI (match_dup 1) (match_dup 3)))
5845    (set (match_dup 0) (plus:SI (match_dup 0) (match_dup 4)))]
5846 "
5847 @{
5848   int low = INTVAL (operands[2]) & 0xffff;
5849   int high = (unsigned) INTVAL (operands[2]) >> 16;
5850
5851   if (low & 0x8000)
5852     high++, low |= 0xffff0000;
5853
5854   operands[3] = GEN_INT (high << 16);
5855   operands[4] = GEN_INT (low);
5856 @}")
5857 @end smallexample
5858
5859 Here the predicate @code{non_add_cint_operand} matches any
5860 @code{const_int} that is @emph{not} a valid operand of a single add
5861 insn.  The add with the smaller displacement is written so that it
5862 can be substituted into the address of a subsequent operation.
5863
5864 An example that uses a scratch register, from the same file, generates
5865 an equality comparison of a register and a large constant:
5866
5867 @smallexample
5868 (define_split
5869   [(set (match_operand:CC 0 "cc_reg_operand" "")
5870         (compare:CC (match_operand:SI 1 "gen_reg_operand" "")
5871                     (match_operand:SI 2 "non_short_cint_operand" "")))
5872    (clobber (match_operand:SI 3 "gen_reg_operand" ""))]
5873   "find_single_use (operands[0], insn, 0)
5874    && (GET_CODE (*find_single_use (operands[0], insn, 0)) == EQ
5875        || GET_CODE (*find_single_use (operands[0], insn, 0)) == NE)"
5876   [(set (match_dup 3) (xor:SI (match_dup 1) (match_dup 4)))
5877    (set (match_dup 0) (compare:CC (match_dup 3) (match_dup 5)))]
5878   "
5879 @{
5880   /* @r{Get the constant we are comparing against, C, and see what it
5881      looks like sign-extended to 16 bits.  Then see what constant
5882      could be XOR'ed with C to get the sign-extended value.}  */
5883
5884   int c = INTVAL (operands[2]);
5885   int sextc = (c << 16) >> 16;
5886   int xorv = c ^ sextc;
5887
5888   operands[4] = GEN_INT (xorv);
5889   operands[5] = GEN_INT (sextc);
5890 @}")
5891 @end smallexample
5892
5893 To avoid confusion, don't write a single @code{define_split} that
5894 accepts some insns that match some @code{define_insn} as well as some
5895 insns that don't.  Instead, write two separate @code{define_split}
5896 definitions, one for the insns that are valid and one for the insns that
5897 are not valid.
5898
5899 The splitter is allowed to split jump instructions into sequence of
5900 jumps or create new jumps in while splitting non-jump instructions.  As
5901 the central flowgraph and branch prediction information needs to be updated,
5902 several restriction apply.
5903
5904 Splitting of jump instruction into sequence that over by another jump
5905 instruction is always valid, as compiler expect identical behavior of new
5906 jump.  When new sequence contains multiple jump instructions or new labels,
5907 more assistance is needed.  Splitter is required to create only unconditional
5908 jumps, or simple conditional jump instructions.  Additionally it must attach a
5909 @code{REG_BR_PROB} note to each conditional jump.  A global variable
5910 @code{split_branch_probability} holds the probability of the original branch in case
5911 it was an simple conditional jump, @minus{}1 otherwise.  To simplify
5912 recomputing of edge frequencies, the new sequence is required to have only
5913 forward jumps to the newly created labels.
5914
5915 @findex define_insn_and_split
5916 For the common case where the pattern of a define_split exactly matches the
5917 pattern of a define_insn, use @code{define_insn_and_split}.  It looks like
5918 this:
5919
5920 @smallexample
5921 (define_insn_and_split
5922   [@var{insn-pattern}]
5923   "@var{condition}"
5924   "@var{output-template}"
5925   "@var{split-condition}"
5926   [@var{new-insn-pattern-1}
5927    @var{new-insn-pattern-2}
5928    @dots{}]
5929   "@var{preparation-statements}"
5930   [@var{insn-attributes}])
5931
5932 @end smallexample
5933
5934 @var{insn-pattern}, @var{condition}, @var{output-template}, and
5935 @var{insn-attributes} are used as in @code{define_insn}.  The
5936 @var{new-insn-pattern} vector and the @var{preparation-statements} are used as
5937 in a @code{define_split}.  The @var{split-condition} is also used as in
5938 @code{define_split}, with the additional behavior that if the condition starts
5939 with @samp{&&}, the condition used for the split will be the constructed as a
5940 logical ``and'' of the split condition with the insn condition.  For example,
5941 from i386.md:
5942
5943 @smallexample
5944 (define_insn_and_split "zero_extendhisi2_and"
5945   [(set (match_operand:SI 0 "register_operand" "=r")
5946      (zero_extend:SI (match_operand:HI 1 "register_operand" "0")))
5947    (clobber (reg:CC 17))]
5948   "TARGET_ZERO_EXTEND_WITH_AND && !optimize_size"
5949   "#"
5950   "&& reload_completed"
5951   [(parallel [(set (match_dup 0)
5952                    (and:SI (match_dup 0) (const_int 65535)))
5953               (clobber (reg:CC 17))])]
5954   ""
5955   [(set_attr "type" "alu1")])
5956
5957 @end smallexample
5958
5959 In this case, the actual split condition will be
5960 @samp{TARGET_ZERO_EXTEND_WITH_AND && !optimize_size && reload_completed}.
5961
5962 The @code{define_insn_and_split} construction provides exactly the same
5963 functionality as two separate @code{define_insn} and @code{define_split}
5964 patterns.  It exists for compactness, and as a maintenance tool to prevent
5965 having to ensure the two patterns' templates match.
5966
5967 @end ifset
5968 @ifset INTERNALS
5969 @node Including Patterns
5970 @section Including Patterns in Machine Descriptions.
5971 @cindex insn includes
5972
5973 @findex include
5974 The @code{include} pattern tells the compiler tools where to
5975 look for patterns that are in files other than in the file
5976 @file{.md}.  This is used only at build time and there is no preprocessing allowed.
5977
5978 It looks like:
5979
5980 @smallexample
5981
5982 (include
5983   @var{pathname})
5984 @end smallexample
5985
5986 For example:
5987
5988 @smallexample
5989
5990 (include "filestuff")
5991
5992 @end smallexample
5993
5994 Where @var{pathname} is a string that specifies the location of the file,
5995 specifies the include file to be in @file{gcc/config/target/filestuff}.  The
5996 directory @file{gcc/config/target} is regarded as the default directory.
5997
5998
5999 Machine descriptions may be split up into smaller more manageable subsections
6000 and placed into subdirectories.
6001
6002 By specifying:
6003
6004 @smallexample
6005
6006 (include "BOGUS/filestuff")
6007
6008 @end smallexample
6009
6010 the include file is specified to be in @file{gcc/config/@var{target}/BOGUS/filestuff}.
6011
6012 Specifying an absolute path for the include file such as;
6013 @smallexample
6014
6015 (include "/u2/BOGUS/filestuff")
6016
6017 @end smallexample
6018 is permitted but is not encouraged.
6019
6020 @subsection RTL Generation Tool Options for Directory Search
6021 @cindex directory options .md
6022 @cindex options, directory search
6023 @cindex search options
6024
6025 The @option{-I@var{dir}} option specifies directories to search for machine descriptions.
6026 For example:
6027
6028 @smallexample
6029
6030 genrecog -I/p1/abc/proc1 -I/p2/abcd/pro2 target.md
6031
6032 @end smallexample
6033
6034
6035 Add the directory @var{dir} to the head of the list of directories to be
6036 searched for header files.  This can be used to override a system machine definition
6037 file, substituting your own version, since these directories are
6038 searched before the default machine description file directories.  If you use more than
6039 one @option{-I} option, the directories are scanned in left-to-right
6040 order; the standard default directory come after.
6041
6042
6043 @end ifset
6044 @ifset INTERNALS
6045 @node Peephole Definitions
6046 @section Machine-Specific Peephole Optimizers
6047 @cindex peephole optimizer definitions
6048 @cindex defining peephole optimizers
6049
6050 In addition to instruction patterns the @file{md} file may contain
6051 definitions of machine-specific peephole optimizations.
6052
6053 The combiner does not notice certain peephole optimizations when the data
6054 flow in the program does not suggest that it should try them.  For example,
6055 sometimes two consecutive insns related in purpose can be combined even
6056 though the second one does not appear to use a register computed in the
6057 first one.  A machine-specific peephole optimizer can detect such
6058 opportunities.
6059
6060 There are two forms of peephole definitions that may be used.  The
6061 original @code{define_peephole} is run at assembly output time to
6062 match insns and substitute assembly text.  Use of @code{define_peephole}
6063 is deprecated.
6064
6065 A newer @code{define_peephole2} matches insns and substitutes new
6066 insns.  The @code{peephole2} pass is run after register allocation
6067 but before scheduling, which may result in much better code for
6068 targets that do scheduling.
6069
6070 @menu
6071 * define_peephole::     RTL to Text Peephole Optimizers
6072 * define_peephole2::    RTL to RTL Peephole Optimizers
6073 @end menu
6074
6075 @end ifset
6076 @ifset INTERNALS
6077 @node define_peephole
6078 @subsection RTL to Text Peephole Optimizers
6079 @findex define_peephole
6080
6081 @need 1000
6082 A definition looks like this:
6083
6084 @smallexample
6085 (define_peephole
6086   [@var{insn-pattern-1}
6087    @var{insn-pattern-2}
6088    @dots{}]
6089   "@var{condition}"
6090   "@var{template}"
6091   "@var{optional-insn-attributes}")
6092 @end smallexample
6093
6094 @noindent
6095 The last string operand may be omitted if you are not using any
6096 machine-specific information in this machine description.  If present,
6097 it must obey the same rules as in a @code{define_insn}.
6098
6099 In this skeleton, @var{insn-pattern-1} and so on are patterns to match
6100 consecutive insns.  The optimization applies to a sequence of insns when
6101 @var{insn-pattern-1} matches the first one, @var{insn-pattern-2} matches
6102 the next, and so on.
6103
6104 Each of the insns matched by a peephole must also match a
6105 @code{define_insn}.  Peepholes are checked only at the last stage just
6106 before code generation, and only optionally.  Therefore, any insn which
6107 would match a peephole but no @code{define_insn} will cause a crash in code
6108 generation in an unoptimized compilation, or at various optimization
6109 stages.
6110
6111 The operands of the insns are matched with @code{match_operands},
6112 @code{match_operator}, and @code{match_dup}, as usual.  What is not
6113 usual is that the operand numbers apply to all the insn patterns in the
6114 definition.  So, you can check for identical operands in two insns by
6115 using @code{match_operand} in one insn and @code{match_dup} in the
6116 other.
6117
6118 The operand constraints used in @code{match_operand} patterns do not have
6119 any direct effect on the applicability of the peephole, but they will
6120 be validated afterward, so make sure your constraints are general enough
6121 to apply whenever the peephole matches.  If the peephole matches
6122 but the constraints are not satisfied, the compiler will crash.
6123
6124 It is safe to omit constraints in all the operands of the peephole; or
6125 you can write constraints which serve as a double-check on the criteria
6126 previously tested.
6127
6128 Once a sequence of insns matches the patterns, the @var{condition} is
6129 checked.  This is a C expression which makes the final decision whether to
6130 perform the optimization (we do so if the expression is nonzero).  If
6131 @var{condition} is omitted (in other words, the string is empty) then the
6132 optimization is applied to every sequence of insns that matches the
6133 patterns.
6134
6135 The defined peephole optimizations are applied after register allocation
6136 is complete.  Therefore, the peephole definition can check which
6137 operands have ended up in which kinds of registers, just by looking at
6138 the operands.
6139
6140 @findex prev_active_insn
6141 The way to refer to the operands in @var{condition} is to write
6142 @code{operands[@var{i}]} for operand number @var{i} (as matched by
6143 @code{(match_operand @var{i} @dots{})}).  Use the variable @code{insn}
6144 to refer to the last of the insns being matched; use
6145 @code{prev_active_insn} to find the preceding insns.
6146
6147 @findex dead_or_set_p
6148 When optimizing computations with intermediate results, you can use
6149 @var{condition} to match only when the intermediate results are not used
6150 elsewhere.  Use the C expression @code{dead_or_set_p (@var{insn},
6151 @var{op})}, where @var{insn} is the insn in which you expect the value
6152 to be used for the last time (from the value of @code{insn}, together
6153 with use of @code{prev_nonnote_insn}), and @var{op} is the intermediate
6154 value (from @code{operands[@var{i}]}).
6155
6156 Applying the optimization means replacing the sequence of insns with one
6157 new insn.  The @var{template} controls ultimate output of assembler code
6158 for this combined insn.  It works exactly like the template of a
6159 @code{define_insn}.  Operand numbers in this template are the same ones
6160 used in matching the original sequence of insns.
6161
6162 The result of a defined peephole optimizer does not need to match any of
6163 the insn patterns in the machine description; it does not even have an
6164 opportunity to match them.  The peephole optimizer definition itself serves
6165 as the insn pattern to control how the insn is output.
6166
6167 Defined peephole optimizers are run as assembler code is being output,
6168 so the insns they produce are never combined or rearranged in any way.
6169
6170 Here is an example, taken from the 68000 machine description:
6171
6172 @smallexample
6173 (define_peephole
6174   [(set (reg:SI 15) (plus:SI (reg:SI 15) (const_int 4)))
6175    (set (match_operand:DF 0 "register_operand" "=f")
6176         (match_operand:DF 1 "register_operand" "ad"))]
6177   "FP_REG_P (operands[0]) && ! FP_REG_P (operands[1])"
6178 @{
6179   rtx xoperands[2];
6180   xoperands[1] = gen_rtx_REG (SImode, REGNO (operands[1]) + 1);
6181 #ifdef MOTOROLA
6182   output_asm_insn ("move.l %1,(sp)", xoperands);
6183   output_asm_insn ("move.l %1,-(sp)", operands);
6184   return "fmove.d (sp)+,%0";
6185 #else
6186   output_asm_insn ("movel %1,sp@@", xoperands);
6187   output_asm_insn ("movel %1,sp@@-", operands);
6188   return "fmoved sp@@+,%0";
6189 #endif
6190 @})
6191 @end smallexample
6192
6193 @need 1000
6194 The effect of this optimization is to change
6195
6196 @smallexample
6197 @group
6198 jbsr _foobar
6199 addql #4,sp
6200 movel d1,sp@@-
6201 movel d0,sp@@-
6202 fmoved sp@@+,fp0
6203 @end group
6204 @end smallexample
6205
6206 @noindent
6207 into
6208
6209 @smallexample
6210 @group
6211 jbsr _foobar
6212 movel d1,sp@@
6213 movel d0,sp@@-
6214 fmoved sp@@+,fp0
6215 @end group
6216 @end smallexample
6217
6218 @ignore
6219 @findex CC_REVERSED
6220 If a peephole matches a sequence including one or more jump insns, you must
6221 take account of the flags such as @code{CC_REVERSED} which specify that the
6222 condition codes are represented in an unusual manner.  The compiler
6223 automatically alters any ordinary conditional jumps which occur in such
6224 situations, but the compiler cannot alter jumps which have been replaced by
6225 peephole optimizations.  So it is up to you to alter the assembler code
6226 that the peephole produces.  Supply C code to write the assembler output,
6227 and in this C code check the condition code status flags and change the
6228 assembler code as appropriate.
6229 @end ignore
6230
6231 @var{insn-pattern-1} and so on look @emph{almost} like the second
6232 operand of @code{define_insn}.  There is one important difference: the
6233 second operand of @code{define_insn} consists of one or more RTX's
6234 enclosed in square brackets.  Usually, there is only one: then the same
6235 action can be written as an element of a @code{define_peephole}.  But
6236 when there are multiple actions in a @code{define_insn}, they are
6237 implicitly enclosed in a @code{parallel}.  Then you must explicitly
6238 write the @code{parallel}, and the square brackets within it, in the
6239 @code{define_peephole}.  Thus, if an insn pattern looks like this,
6240
6241 @smallexample
6242 (define_insn "divmodsi4"
6243   [(set (match_operand:SI 0 "general_operand" "=d")
6244         (div:SI (match_operand:SI 1 "general_operand" "0")
6245                 (match_operand:SI 2 "general_operand" "dmsK")))
6246    (set (match_operand:SI 3 "general_operand" "=d")
6247         (mod:SI (match_dup 1) (match_dup 2)))]
6248   "TARGET_68020"
6249   "divsl%.l %2,%3:%0")
6250 @end smallexample
6251
6252 @noindent
6253 then the way to mention this insn in a peephole is as follows:
6254
6255 @smallexample
6256 (define_peephole
6257   [@dots{}
6258    (parallel
6259     [(set (match_operand:SI 0 "general_operand" "=d")
6260           (div:SI (match_operand:SI 1 "general_operand" "0")
6261                   (match_operand:SI 2 "general_operand" "dmsK")))
6262      (set (match_operand:SI 3 "general_operand" "=d")
6263           (mod:SI (match_dup 1) (match_dup 2)))])
6264    @dots{}]
6265   @dots{})
6266 @end smallexample
6267
6268 @end ifset
6269 @ifset INTERNALS
6270 @node define_peephole2
6271 @subsection RTL to RTL Peephole Optimizers
6272 @findex define_peephole2
6273
6274 The @code{define_peephole2} definition tells the compiler how to
6275 substitute one sequence of instructions for another sequence,
6276 what additional scratch registers may be needed and what their
6277 lifetimes must be.
6278
6279 @smallexample
6280 (define_peephole2
6281   [@var{insn-pattern-1}
6282    @var{insn-pattern-2}
6283    @dots{}]
6284   "@var{condition}"
6285   [@var{new-insn-pattern-1}
6286    @var{new-insn-pattern-2}
6287    @dots{}]
6288   "@var{preparation-statements}")
6289 @end smallexample
6290
6291 The definition is almost identical to @code{define_split}
6292 (@pxref{Insn Splitting}) except that the pattern to match is not a
6293 single instruction, but a sequence of instructions.
6294
6295 It is possible to request additional scratch registers for use in the
6296 output template.  If appropriate registers are not free, the pattern
6297 will simply not match.
6298
6299 @findex match_scratch
6300 @findex match_dup
6301 Scratch registers are requested with a @code{match_scratch} pattern at
6302 the top level of the input pattern.  The allocated register (initially) will
6303 be dead at the point requested within the original sequence.  If the scratch
6304 is used at more than a single point, a @code{match_dup} pattern at the
6305 top level of the input pattern marks the last position in the input sequence
6306 at which the register must be available.
6307
6308 Here is an example from the IA-32 machine description:
6309
6310 @smallexample
6311 (define_peephole2
6312   [(match_scratch:SI 2 "r")
6313    (parallel [(set (match_operand:SI 0 "register_operand" "")
6314                    (match_operator:SI 3 "arith_or_logical_operator"
6315                      [(match_dup 0)
6316                       (match_operand:SI 1 "memory_operand" "")]))
6317               (clobber (reg:CC 17))])]
6318   "! optimize_size && ! TARGET_READ_MODIFY"
6319   [(set (match_dup 2) (match_dup 1))
6320    (parallel [(set (match_dup 0)
6321                    (match_op_dup 3 [(match_dup 0) (match_dup 2)]))
6322               (clobber (reg:CC 17))])]
6323   "")
6324 @end smallexample
6325
6326 @noindent
6327 This pattern tries to split a load from its use in the hopes that we'll be
6328 able to schedule around the memory load latency.  It allocates a single
6329 @code{SImode} register of class @code{GENERAL_REGS} (@code{"r"}) that needs
6330 to be live only at the point just before the arithmetic.
6331
6332 A real example requiring extended scratch lifetimes is harder to come by,
6333 so here's a silly made-up example:
6334
6335 @smallexample
6336 (define_peephole2
6337   [(match_scratch:SI 4 "r")
6338    (set (match_operand:SI 0 "" "") (match_operand:SI 1 "" ""))
6339    (set (match_operand:SI 2 "" "") (match_dup 1))
6340    (match_dup 4)
6341    (set (match_operand:SI 3 "" "") (match_dup 1))]
6342   "/* @r{determine 1 does not overlap 0 and 2} */"
6343   [(set (match_dup 4) (match_dup 1))
6344    (set (match_dup 0) (match_dup 4))
6345    (set (match_dup 2) (match_dup 4))]
6346    (set (match_dup 3) (match_dup 4))]
6347   "")
6348 @end smallexample
6349
6350 @noindent
6351 If we had not added the @code{(match_dup 4)} in the middle of the input
6352 sequence, it might have been the case that the register we chose at the
6353 beginning of the sequence is killed by the first or second @code{set}.
6354
6355 @end ifset
6356 @ifset INTERNALS
6357 @node Insn Attributes
6358 @section Instruction Attributes
6359 @cindex insn attributes
6360 @cindex instruction attributes
6361
6362 In addition to describing the instruction supported by the target machine,
6363 the @file{md} file also defines a group of @dfn{attributes} and a set of
6364 values for each.  Every generated insn is assigned a value for each attribute.
6365 One possible attribute would be the effect that the insn has on the machine's
6366 condition code.  This attribute can then be used by @code{NOTICE_UPDATE_CC}
6367 to track the condition codes.
6368
6369 @menu
6370 * Defining Attributes:: Specifying attributes and their values.
6371 * Expressions::         Valid expressions for attribute values.
6372 * Tagging Insns::       Assigning attribute values to insns.
6373 * Attr Example::        An example of assigning attributes.
6374 * Insn Lengths::        Computing the length of insns.
6375 * Constant Attributes:: Defining attributes that are constant.
6376 * Delay Slots::         Defining delay slots required for a machine.
6377 * Processor pipeline description:: Specifying information for insn scheduling.
6378 @end menu
6379
6380 @end ifset
6381 @ifset INTERNALS
6382 @node Defining Attributes
6383 @subsection Defining Attributes and their Values
6384 @cindex defining attributes and their values
6385 @cindex attributes, defining
6386
6387 @findex define_attr
6388 The @code{define_attr} expression is used to define each attribute required
6389 by the target machine.  It looks like:
6390
6391 @smallexample
6392 (define_attr @var{name} @var{list-of-values} @var{default})
6393 @end smallexample
6394
6395 @var{name} is a string specifying the name of the attribute being defined.
6396
6397 @var{list-of-values} is either a string that specifies a comma-separated
6398 list of values that can be assigned to the attribute, or a null string to
6399 indicate that the attribute takes numeric values.
6400
6401 @var{default} is an attribute expression that gives the value of this
6402 attribute for insns that match patterns whose definition does not include
6403 an explicit value for this attribute.  @xref{Attr Example}, for more
6404 information on the handling of defaults.  @xref{Constant Attributes},
6405 for information on attributes that do not depend on any particular insn.
6406
6407 @findex insn-attr.h
6408 For each defined attribute, a number of definitions are written to the
6409 @file{insn-attr.h} file.  For cases where an explicit set of values is
6410 specified for an attribute, the following are defined:
6411
6412 @itemize @bullet
6413 @item
6414 A @samp{#define} is written for the symbol @samp{HAVE_ATTR_@var{name}}.
6415
6416 @item
6417 An enumerated class is defined for @samp{attr_@var{name}} with
6418 elements of the form @samp{@var{upper-name}_@var{upper-value}} where
6419 the attribute name and value are first converted to uppercase.
6420
6421 @item
6422 A function @samp{get_attr_@var{name}} is defined that is passed an insn and
6423 returns the attribute value for that insn.
6424 @end itemize
6425
6426 For example, if the following is present in the @file{md} file:
6427
6428 @smallexample
6429 (define_attr "type" "branch,fp,load,store,arith" @dots{})
6430 @end smallexample
6431
6432 @noindent
6433 the following lines will be written to the file @file{insn-attr.h}.
6434
6435 @smallexample
6436 #define HAVE_ATTR_type
6437 enum attr_type @{TYPE_BRANCH, TYPE_FP, TYPE_LOAD,
6438                  TYPE_STORE, TYPE_ARITH@};
6439 extern enum attr_type get_attr_type ();
6440 @end smallexample
6441
6442 If the attribute takes numeric values, no @code{enum} type will be
6443 defined and the function to obtain the attribute's value will return
6444 @code{int}.
6445
6446 @end ifset
6447 @ifset INTERNALS
6448 @node Expressions
6449 @subsection Attribute Expressions
6450 @cindex attribute expressions
6451
6452 RTL expressions used to define attributes use the codes described above
6453 plus a few specific to attribute definitions, to be discussed below.
6454 Attribute value expressions must have one of the following forms:
6455
6456 @table @code
6457 @cindex @code{const_int} and attributes
6458 @item (const_int @var{i})
6459 The integer @var{i} specifies the value of a numeric attribute.  @var{i}
6460 must be non-negative.
6461
6462 The value of a numeric attribute can be specified either with a
6463 @code{const_int}, or as an integer represented as a string in
6464 @code{const_string}, @code{eq_attr} (see below), @code{attr},
6465 @code{symbol_ref}, simple arithmetic expressions, and @code{set_attr}
6466 overrides on specific instructions (@pxref{Tagging Insns}).
6467
6468 @cindex @code{const_string} and attributes
6469 @item (const_string @var{value})
6470 The string @var{value} specifies a constant attribute value.
6471 If @var{value} is specified as @samp{"*"}, it means that the default value of
6472 the attribute is to be used for the insn containing this expression.
6473 @samp{"*"} obviously cannot be used in the @var{default} expression
6474 of a @code{define_attr}.
6475
6476 If the attribute whose value is being specified is numeric, @var{value}
6477 must be a string containing a non-negative integer (normally
6478 @code{const_int} would be used in this case).  Otherwise, it must
6479 contain one of the valid values for the attribute.
6480
6481 @cindex @code{if_then_else} and attributes
6482 @item (if_then_else @var{test} @var{true-value} @var{false-value})
6483 @var{test} specifies an attribute test, whose format is defined below.
6484 The value of this expression is @var{true-value} if @var{test} is true,
6485 otherwise it is @var{false-value}.
6486
6487 @cindex @code{cond} and attributes
6488 @item (cond [@var{test1} @var{value1} @dots{}] @var{default})
6489 The first operand of this expression is a vector containing an even
6490 number of expressions and consisting of pairs of @var{test} and @var{value}
6491 expressions.  The value of the @code{cond} expression is that of the
6492 @var{value} corresponding to the first true @var{test} expression.  If
6493 none of the @var{test} expressions are true, the value of the @code{cond}
6494 expression is that of the @var{default} expression.
6495 @end table
6496
6497 @var{test} expressions can have one of the following forms:
6498
6499 @table @code
6500 @cindex @code{const_int} and attribute tests
6501 @item (const_int @var{i})
6502 This test is true if @var{i} is nonzero and false otherwise.
6503
6504 @cindex @code{not} and attributes
6505 @cindex @code{ior} and attributes
6506 @cindex @code{and} and attributes
6507 @item (not @var{test})
6508 @itemx (ior @var{test1} @var{test2})
6509 @itemx (and @var{test1} @var{test2})
6510 These tests are true if the indicated logical function is true.
6511
6512 @cindex @code{match_operand} and attributes
6513 @item (match_operand:@var{m} @var{n} @var{pred} @var{constraints})
6514 This test is true if operand @var{n} of the insn whose attribute value
6515 is being determined has mode @var{m} (this part of the test is ignored
6516 if @var{m} is @code{VOIDmode}) and the function specified by the string
6517 @var{pred} returns a nonzero value when passed operand @var{n} and mode
6518 @var{m} (this part of the test is ignored if @var{pred} is the null
6519 string).
6520
6521 The @var{constraints} operand is ignored and should be the null string.
6522
6523 @cindex @code{le} and attributes
6524 @cindex @code{leu} and attributes
6525 @cindex @code{lt} and attributes
6526 @cindex @code{gt} and attributes
6527 @cindex @code{gtu} and attributes
6528 @cindex @code{ge} and attributes
6529 @cindex @code{geu} and attributes
6530 @cindex @code{ne} and attributes
6531 @cindex @code{eq} and attributes
6532 @cindex @code{plus} and attributes
6533 @cindex @code{minus} and attributes
6534 @cindex @code{mult} and attributes
6535 @cindex @code{div} and attributes
6536 @cindex @code{mod} and attributes
6537 @cindex @code{abs} and attributes
6538 @cindex @code{neg} and attributes
6539 @cindex @code{ashift} and attributes
6540 @cindex @code{lshiftrt} and attributes
6541 @cindex @code{ashiftrt} and attributes
6542 @item (le @var{arith1} @var{arith2})
6543 @itemx (leu @var{arith1} @var{arith2})
6544 @itemx (lt @var{arith1} @var{arith2})
6545 @itemx (ltu @var{arith1} @var{arith2})
6546 @itemx (gt @var{arith1} @var{arith2})
6547 @itemx (gtu @var{arith1} @var{arith2})
6548 @itemx (ge @var{arith1} @var{arith2})
6549 @itemx (geu @var{arith1} @var{arith2})
6550 @itemx (ne @var{arith1} @var{arith2})
6551 @itemx (eq @var{arith1} @var{arith2})
6552 These tests are true if the indicated comparison of the two arithmetic
6553 expressions is true.  Arithmetic expressions are formed with
6554 @code{plus}, @code{minus}, @code{mult}, @code{div}, @code{mod},
6555 @code{abs}, @code{neg}, @code{and}, @code{ior}, @code{xor}, @code{not},
6556 @code{ashift}, @code{lshiftrt}, and @code{ashiftrt} expressions.
6557
6558 @findex get_attr
6559 @code{const_int} and @code{symbol_ref} are always valid terms (@pxref{Insn
6560 Lengths},for additional forms).  @code{symbol_ref} is a string
6561 denoting a C expression that yields an @code{int} when evaluated by the
6562 @samp{get_attr_@dots{}} routine.  It should normally be a global
6563 variable.
6564
6565 @findex eq_attr
6566 @item (eq_attr @var{name} @var{value})
6567 @var{name} is a string specifying the name of an attribute.
6568
6569 @var{value} is a string that is either a valid value for attribute
6570 @var{name}, a comma-separated list of values, or @samp{!} followed by a
6571 value or list.  If @var{value} does not begin with a @samp{!}, this
6572 test is true if the value of the @var{name} attribute of the current
6573 insn is in the list specified by @var{value}.  If @var{value} begins
6574 with a @samp{!}, this test is true if the attribute's value is
6575 @emph{not} in the specified list.
6576
6577 For example,
6578
6579 @smallexample
6580 (eq_attr "type" "load,store")
6581 @end smallexample
6582
6583 @noindent
6584 is equivalent to
6585
6586 @smallexample
6587 (ior (eq_attr "type" "load") (eq_attr "type" "store"))
6588 @end smallexample
6589
6590 If @var{name} specifies an attribute of @samp{alternative}, it refers to the
6591 value of the compiler variable @code{which_alternative}
6592 (@pxref{Output Statement}) and the values must be small integers.  For
6593 example,
6594
6595 @smallexample
6596 (eq_attr "alternative" "2,3")
6597 @end smallexample
6598
6599 @noindent
6600 is equivalent to
6601
6602 @smallexample
6603 (ior (eq (symbol_ref "which_alternative") (const_int 2))
6604      (eq (symbol_ref "which_alternative") (const_int 3)))
6605 @end smallexample
6606
6607 Note that, for most attributes, an @code{eq_attr} test is simplified in cases
6608 where the value of the attribute being tested is known for all insns matching
6609 a particular pattern.  This is by far the most common case.
6610
6611 @findex attr_flag
6612 @item (attr_flag @var{name})
6613 The value of an @code{attr_flag} expression is true if the flag
6614 specified by @var{name} is true for the @code{insn} currently being
6615 scheduled.
6616
6617 @var{name} is a string specifying one of a fixed set of flags to test.
6618 Test the flags @code{forward} and @code{backward} to determine the
6619 direction of a conditional branch.  Test the flags @code{very_likely},
6620 @code{likely}, @code{very_unlikely}, and @code{unlikely} to determine
6621 if a conditional branch is expected to be taken.
6622
6623 If the @code{very_likely} flag is true, then the @code{likely} flag is also
6624 true.  Likewise for the @code{very_unlikely} and @code{unlikely} flags.
6625
6626 This example describes a conditional branch delay slot which
6627 can be nullified for forward branches that are taken (annul-true) or
6628 for backward branches which are not taken (annul-false).
6629
6630 @smallexample
6631 (define_delay (eq_attr "type" "cbranch")
6632   [(eq_attr "in_branch_delay" "true")
6633    (and (eq_attr "in_branch_delay" "true")
6634         (attr_flag "forward"))
6635    (and (eq_attr "in_branch_delay" "true")
6636         (attr_flag "backward"))])
6637 @end smallexample
6638
6639 The @code{forward} and @code{backward} flags are false if the current
6640 @code{insn} being scheduled is not a conditional branch.
6641
6642 The @code{very_likely} and @code{likely} flags are true if the
6643 @code{insn} being scheduled is not a conditional branch.
6644 The @code{very_unlikely} and @code{unlikely} flags are false if the
6645 @code{insn} being scheduled is not a conditional branch.
6646
6647 @code{attr_flag} is only used during delay slot scheduling and has no
6648 meaning to other passes of the compiler.
6649
6650 @findex attr
6651 @item (attr @var{name})
6652 The value of another attribute is returned.  This is most useful
6653 for numeric attributes, as @code{eq_attr} and @code{attr_flag}
6654 produce more efficient code for non-numeric attributes.
6655 @end table
6656
6657 @end ifset
6658 @ifset INTERNALS
6659 @node Tagging Insns
6660 @subsection Assigning Attribute Values to Insns
6661 @cindex tagging insns
6662 @cindex assigning attribute values to insns
6663
6664 The value assigned to an attribute of an insn is primarily determined by
6665 which pattern is matched by that insn (or which @code{define_peephole}
6666 generated it).  Every @code{define_insn} and @code{define_peephole} can
6667 have an optional last argument to specify the values of attributes for
6668 matching insns.  The value of any attribute not specified in a particular
6669 insn is set to the default value for that attribute, as specified in its
6670 @code{define_attr}.  Extensive use of default values for attributes
6671 permits the specification of the values for only one or two attributes
6672 in the definition of most insn patterns, as seen in the example in the
6673 next section.
6674
6675 The optional last argument of @code{define_insn} and
6676 @code{define_peephole} is a vector of expressions, each of which defines
6677 the value for a single attribute.  The most general way of assigning an
6678 attribute's value is to use a @code{set} expression whose first operand is an
6679 @code{attr} expression giving the name of the attribute being set.  The
6680 second operand of the @code{set} is an attribute expression
6681 (@pxref{Expressions}) giving the value of the attribute.
6682
6683 When the attribute value depends on the @samp{alternative} attribute
6684 (i.e., which is the applicable alternative in the constraint of the
6685 insn), the @code{set_attr_alternative} expression can be used.  It
6686 allows the specification of a vector of attribute expressions, one for
6687 each alternative.
6688
6689 @findex set_attr
6690 When the generality of arbitrary attribute expressions is not required,
6691 the simpler @code{set_attr} expression can be used, which allows
6692 specifying a string giving either a single attribute value or a list
6693 of attribute values, one for each alternative.
6694
6695 The form of each of the above specifications is shown below.  In each case,
6696 @var{name} is a string specifying the attribute to be set.
6697
6698 @table @code
6699 @item (set_attr @var{name} @var{value-string})
6700 @var{value-string} is either a string giving the desired attribute value,
6701 or a string containing a comma-separated list giving the values for
6702 succeeding alternatives.  The number of elements must match the number
6703 of alternatives in the constraint of the insn pattern.
6704
6705 Note that it may be useful to specify @samp{*} for some alternative, in
6706 which case the attribute will assume its default value for insns matching
6707 that alternative.
6708
6709 @findex set_attr_alternative
6710 @item (set_attr_alternative @var{name} [@var{value1} @var{value2} @dots{}])
6711 Depending on the alternative of the insn, the value will be one of the
6712 specified values.  This is a shorthand for using a @code{cond} with
6713 tests on the @samp{alternative} attribute.
6714
6715 @findex attr
6716 @item (set (attr @var{name}) @var{value})
6717 The first operand of this @code{set} must be the special RTL expression
6718 @code{attr}, whose sole operand is a string giving the name of the
6719 attribute being set.  @var{value} is the value of the attribute.
6720 @end table
6721
6722 The following shows three different ways of representing the same
6723 attribute value specification:
6724
6725 @smallexample
6726 (set_attr "type" "load,store,arith")
6727
6728 (set_attr_alternative "type"
6729                       [(const_string "load") (const_string "store")
6730                        (const_string "arith")])
6731
6732 (set (attr "type")
6733      (cond [(eq_attr "alternative" "1") (const_string "load")
6734             (eq_attr "alternative" "2") (const_string "store")]
6735            (const_string "arith")))
6736 @end smallexample
6737
6738 @need 1000
6739 @findex define_asm_attributes
6740 The @code{define_asm_attributes} expression provides a mechanism to
6741 specify the attributes assigned to insns produced from an @code{asm}
6742 statement.  It has the form:
6743
6744 @smallexample
6745 (define_asm_attributes [@var{attr-sets}])
6746 @end smallexample
6747
6748 @noindent
6749 where @var{attr-sets} is specified the same as for both the
6750 @code{define_insn} and the @code{define_peephole} expressions.
6751
6752 These values will typically be the ``worst case'' attribute values.  For
6753 example, they might indicate that the condition code will be clobbered.
6754
6755 A specification for a @code{length} attribute is handled specially.  The
6756 way to compute the length of an @code{asm} insn is to multiply the
6757 length specified in the expression @code{define_asm_attributes} by the
6758 number of machine instructions specified in the @code{asm} statement,
6759 determined by counting the number of semicolons and newlines in the
6760 string.  Therefore, the value of the @code{length} attribute specified
6761 in a @code{define_asm_attributes} should be the maximum possible length
6762 of a single machine instruction.
6763
6764 @end ifset
6765 @ifset INTERNALS
6766 @node Attr Example
6767 @subsection Example of Attribute Specifications
6768 @cindex attribute specifications example
6769 @cindex attribute specifications
6770
6771 The judicious use of defaulting is important in the efficient use of
6772 insn attributes.  Typically, insns are divided into @dfn{types} and an
6773 attribute, customarily called @code{type}, is used to represent this
6774 value.  This attribute is normally used only to define the default value
6775 for other attributes.  An example will clarify this usage.
6776
6777 Assume we have a RISC machine with a condition code and in which only
6778 full-word operations are performed in registers.  Let us assume that we
6779 can divide all insns into loads, stores, (integer) arithmetic
6780 operations, floating point operations, and branches.
6781
6782 Here we will concern ourselves with determining the effect of an insn on
6783 the condition code and will limit ourselves to the following possible
6784 effects:  The condition code can be set unpredictably (clobbered), not
6785 be changed, be set to agree with the results of the operation, or only
6786 changed if the item previously set into the condition code has been
6787 modified.
6788
6789 Here is part of a sample @file{md} file for such a machine:
6790
6791 @smallexample
6792 (define_attr "type" "load,store,arith,fp,branch" (const_string "arith"))
6793
6794 (define_attr "cc" "clobber,unchanged,set,change0"
6795              (cond [(eq_attr "type" "load")
6796                         (const_string "change0")
6797                     (eq_attr "type" "store,branch")
6798                         (const_string "unchanged")
6799                     (eq_attr "type" "arith")
6800                         (if_then_else (match_operand:SI 0 "" "")
6801                                       (const_string "set")
6802                                       (const_string "clobber"))]
6803                    (const_string "clobber")))
6804
6805 (define_insn ""
6806   [(set (match_operand:SI 0 "general_operand" "=r,r,m")
6807         (match_operand:SI 1 "general_operand" "r,m,r"))]
6808   ""
6809   "@@
6810    move %0,%1
6811    load %0,%1
6812    store %0,%1"
6813   [(set_attr "type" "arith,load,store")])
6814 @end smallexample
6815
6816 Note that we assume in the above example that arithmetic operations
6817 performed on quantities smaller than a machine word clobber the condition
6818 code since they will set the condition code to a value corresponding to the
6819 full-word result.
6820
6821 @end ifset
6822 @ifset INTERNALS
6823 @node Insn Lengths
6824 @subsection Computing the Length of an Insn
6825 @cindex insn lengths, computing
6826 @cindex computing the length of an insn
6827
6828 For many machines, multiple types of branch instructions are provided, each
6829 for different length branch displacements.  In most cases, the assembler
6830 will choose the correct instruction to use.  However, when the assembler
6831 cannot do so, GCC can when a special attribute, the @code{length}
6832 attribute, is defined.  This attribute must be defined to have numeric
6833 values by specifying a null string in its @code{define_attr}.
6834
6835 In the case of the @code{length} attribute, two additional forms of
6836 arithmetic terms are allowed in test expressions:
6837
6838 @table @code
6839 @cindex @code{match_dup} and attributes
6840 @item (match_dup @var{n})
6841 This refers to the address of operand @var{n} of the current insn, which
6842 must be a @code{label_ref}.
6843
6844 @cindex @code{pc} and attributes
6845 @item (pc)
6846 This refers to the address of the @emph{current} insn.  It might have
6847 been more consistent with other usage to make this the address of the
6848 @emph{next} insn but this would be confusing because the length of the
6849 current insn is to be computed.
6850 @end table
6851
6852 @cindex @code{addr_vec}, length of
6853 @cindex @code{addr_diff_vec}, length of
6854 For normal insns, the length will be determined by value of the
6855 @code{length} attribute.  In the case of @code{addr_vec} and
6856 @code{addr_diff_vec} insn patterns, the length is computed as
6857 the number of vectors multiplied by the size of each vector.
6858
6859 Lengths are measured in addressable storage units (bytes).
6860
6861 The following macros can be used to refine the length computation:
6862
6863 @table @code
6864 @findex ADJUST_INSN_LENGTH
6865 @item ADJUST_INSN_LENGTH (@var{insn}, @var{length})
6866 If defined, modifies the length assigned to instruction @var{insn} as a
6867 function of the context in which it is used.  @var{length} is an lvalue
6868 that contains the initially computed length of the insn and should be
6869 updated with the correct length of the insn.
6870
6871 This macro will normally not be required.  A case in which it is
6872 required is the ROMP@.  On this machine, the size of an @code{addr_vec}
6873 insn must be increased by two to compensate for the fact that alignment
6874 may be required.
6875 @end table
6876
6877 @findex get_attr_length
6878 The routine that returns @code{get_attr_length} (the value of the
6879 @code{length} attribute) can be used by the output routine to
6880 determine the form of the branch instruction to be written, as the
6881 example below illustrates.
6882
6883 As an example of the specification of variable-length branches, consider
6884 the IBM 360.  If we adopt the convention that a register will be set to
6885 the starting address of a function, we can jump to labels within 4k of
6886 the start using a four-byte instruction.  Otherwise, we need a six-byte
6887 sequence to load the address from memory and then branch to it.
6888
6889 On such a machine, a pattern for a branch instruction might be specified
6890 as follows:
6891
6892 @smallexample
6893 (define_insn "jump"
6894   [(set (pc)
6895         (label_ref (match_operand 0 "" "")))]
6896   ""
6897 @{
6898    return (get_attr_length (insn) == 4
6899            ? "b %l0" : "l r15,=a(%l0); br r15");
6900 @}
6901   [(set (attr "length")
6902         (if_then_else (lt (match_dup 0) (const_int 4096))
6903                       (const_int 4)
6904                       (const_int 6)))])
6905 @end smallexample
6906
6907 @end ifset
6908 @ifset INTERNALS
6909 @node Constant Attributes
6910 @subsection Constant Attributes
6911 @cindex constant attributes
6912
6913 A special form of @code{define_attr}, where the expression for the
6914 default value is a @code{const} expression, indicates an attribute that
6915 is constant for a given run of the compiler.  Constant attributes may be
6916 used to specify which variety of processor is used.  For example,
6917
6918 @smallexample
6919 (define_attr "cpu" "m88100,m88110,m88000"
6920  (const
6921   (cond [(symbol_ref "TARGET_88100") (const_string "m88100")
6922          (symbol_ref "TARGET_88110") (const_string "m88110")]
6923         (const_string "m88000"))))
6924
6925 (define_attr "memory" "fast,slow"
6926  (const
6927   (if_then_else (symbol_ref "TARGET_FAST_MEM")
6928                 (const_string "fast")
6929                 (const_string "slow"))))
6930 @end smallexample
6931
6932 The routine generated for constant attributes has no parameters as it
6933 does not depend on any particular insn.  RTL expressions used to define
6934 the value of a constant attribute may use the @code{symbol_ref} form,
6935 but may not use either the @code{match_operand} form or @code{eq_attr}
6936 forms involving insn attributes.
6937
6938 @end ifset
6939 @ifset INTERNALS
6940 @node Delay Slots
6941 @subsection Delay Slot Scheduling
6942 @cindex delay slots, defining
6943
6944 The insn attribute mechanism can be used to specify the requirements for
6945 delay slots, if any, on a target machine.  An instruction is said to
6946 require a @dfn{delay slot} if some instructions that are physically
6947 after the instruction are executed as if they were located before it.
6948 Classic examples are branch and call instructions, which often execute
6949 the following instruction before the branch or call is performed.
6950
6951 On some machines, conditional branch instructions can optionally
6952 @dfn{annul} instructions in the delay slot.  This means that the
6953 instruction will not be executed for certain branch outcomes.  Both
6954 instructions that annul if the branch is true and instructions that
6955 annul if the branch is false are supported.
6956
6957 Delay slot scheduling differs from instruction scheduling in that
6958 determining whether an instruction needs a delay slot is dependent only
6959 on the type of instruction being generated, not on data flow between the
6960 instructions.  See the next section for a discussion of data-dependent
6961 instruction scheduling.
6962
6963 @findex define_delay
6964 The requirement of an insn needing one or more delay slots is indicated
6965 via the @code{define_delay} expression.  It has the following form:
6966
6967 @smallexample
6968 (define_delay @var{test}
6969               [@var{delay-1} @var{annul-true-1} @var{annul-false-1}
6970                @var{delay-2} @var{annul-true-2} @var{annul-false-2}
6971                @dots{}])
6972 @end smallexample
6973
6974 @var{test} is an attribute test that indicates whether this
6975 @code{define_delay} applies to a particular insn.  If so, the number of
6976 required delay slots is determined by the length of the vector specified
6977 as the second argument.  An insn placed in delay slot @var{n} must
6978 satisfy attribute test @var{delay-n}.  @var{annul-true-n} is an
6979 attribute test that specifies which insns may be annulled if the branch
6980 is true.  Similarly, @var{annul-false-n} specifies which insns in the
6981 delay slot may be annulled if the branch is false.  If annulling is not
6982 supported for that delay slot, @code{(nil)} should be coded.
6983
6984 For example, in the common case where branch and call insns require
6985 a single delay slot, which may contain any insn other than a branch or
6986 call, the following would be placed in the @file{md} file:
6987
6988 @smallexample
6989 (define_delay (eq_attr "type" "branch,call")
6990               [(eq_attr "type" "!branch,call") (nil) (nil)])
6991 @end smallexample
6992
6993 Multiple @code{define_delay} expressions may be specified.  In this
6994 case, each such expression specifies different delay slot requirements
6995 and there must be no insn for which tests in two @code{define_delay}
6996 expressions are both true.
6997
6998 For example, if we have a machine that requires one delay slot for branches
6999 but two for calls,  no delay slot can contain a branch or call insn,
7000 and any valid insn in the delay slot for the branch can be annulled if the
7001 branch is true, we might represent this as follows:
7002
7003 @smallexample
7004 (define_delay (eq_attr "type" "branch")
7005    [(eq_attr "type" "!branch,call")
7006     (eq_attr "type" "!branch,call")
7007     (nil)])
7008
7009 (define_delay (eq_attr "type" "call")
7010               [(eq_attr "type" "!branch,call") (nil) (nil)
7011                (eq_attr "type" "!branch,call") (nil) (nil)])
7012 @end smallexample
7013 @c the above is *still* too long.  --mew 4feb93
7014
7015 @end ifset
7016 @ifset INTERNALS
7017 @node Processor pipeline description
7018 @subsection Specifying processor pipeline description
7019 @cindex processor pipeline description
7020 @cindex processor functional units
7021 @cindex instruction latency time
7022 @cindex interlock delays
7023 @cindex data dependence delays
7024 @cindex reservation delays
7025 @cindex pipeline hazard recognizer
7026 @cindex automaton based pipeline description
7027 @cindex regular expressions
7028 @cindex deterministic finite state automaton
7029 @cindex automaton based scheduler
7030 @cindex RISC
7031 @cindex VLIW
7032
7033 To achieve better performance, most modern processors
7034 (super-pipelined, superscalar @acronym{RISC}, and @acronym{VLIW}
7035 processors) have many @dfn{functional units} on which several
7036 instructions can be executed simultaneously.  An instruction starts
7037 execution if its issue conditions are satisfied.  If not, the
7038 instruction is stalled until its conditions are satisfied.  Such
7039 @dfn{interlock (pipeline) delay} causes interruption of the fetching
7040 of successor instructions (or demands nop instructions, e.g.@: for some
7041 MIPS processors).
7042
7043 There are two major kinds of interlock delays in modern processors.
7044 The first one is a data dependence delay determining @dfn{instruction
7045 latency time}.  The instruction execution is not started until all
7046 source data have been evaluated by prior instructions (there are more
7047 complex cases when the instruction execution starts even when the data
7048 are not available but will be ready in given time after the
7049 instruction execution start).  Taking the data dependence delays into
7050 account is simple.  The data dependence (true, output, and
7051 anti-dependence) delay between two instructions is given by a
7052 constant.  In most cases this approach is adequate.  The second kind
7053 of interlock delays is a reservation delay.  The reservation delay
7054 means that two instructions under execution will be in need of shared
7055 processors resources, i.e.@: buses, internal registers, and/or
7056 functional units, which are reserved for some time.  Taking this kind
7057 of delay into account is complex especially for modern @acronym{RISC}
7058 processors.
7059
7060 The task of exploiting more processor parallelism is solved by an
7061 instruction scheduler.  For a better solution to this problem, the
7062 instruction scheduler has to have an adequate description of the
7063 processor parallelism (or @dfn{pipeline description}).  GCC
7064 machine descriptions describe processor parallelism and functional
7065 unit reservations for groups of instructions with the aid of
7066 @dfn{regular expressions}.
7067
7068 The GCC instruction scheduler uses a @dfn{pipeline hazard recognizer} to
7069 figure out the possibility of the instruction issue by the processor
7070 on a given simulated processor cycle.  The pipeline hazard recognizer is
7071 automatically generated from the processor pipeline description.  The
7072 pipeline hazard recognizer generated from the machine description
7073 is based on a deterministic finite state automaton (@acronym{DFA}):
7074 the instruction issue is possible if there is a transition from one
7075 automaton state to another one.  This algorithm is very fast, and
7076 furthermore, its speed is not dependent on processor
7077 complexity@footnote{However, the size of the automaton depends on
7078   processor complexity.  To limit this effect, machine descriptions
7079   can split orthogonal parts of the machine description among several
7080   automata: but then, since each of these must be stepped independently,
7081   this does cause a small decrease in the algorithm's performance.}.
7082
7083 @cindex automaton based pipeline description
7084 The rest of this section describes the directives that constitute
7085 an automaton-based processor pipeline description.  The order of
7086 these constructions within the machine description file is not
7087 important.
7088
7089 @findex define_automaton
7090 @cindex pipeline hazard recognizer
7091 The following optional construction describes names of automata
7092 generated and used for the pipeline hazards recognition.  Sometimes
7093 the generated finite state automaton used by the pipeline hazard
7094 recognizer is large.  If we use more than one automaton and bind functional
7095 units to the automata, the total size of the automata is usually
7096 less than the size of the single automaton.  If there is no one such
7097 construction, only one finite state automaton is generated.
7098
7099 @smallexample
7100 (define_automaton @var{automata-names})
7101 @end smallexample
7102
7103 @var{automata-names} is a string giving names of the automata.  The
7104 names are separated by commas.  All the automata should have unique names.
7105 The automaton name is used in the constructions @code{define_cpu_unit} and
7106 @code{define_query_cpu_unit}.
7107
7108 @findex define_cpu_unit
7109 @cindex processor functional units
7110 Each processor functional unit used in the description of instruction
7111 reservations should be described by the following construction.
7112
7113 @smallexample
7114 (define_cpu_unit @var{unit-names} [@var{automaton-name}])
7115 @end smallexample
7116
7117 @var{unit-names} is a string giving the names of the functional units
7118 separated by commas.  Don't use name @samp{nothing}, it is reserved
7119 for other goals.
7120
7121 @var{automaton-name} is a string giving the name of the automaton with
7122 which the unit is bound.  The automaton should be described in
7123 construction @code{define_automaton}.  You should give
7124 @dfn{automaton-name}, if there is a defined automaton.
7125
7126 The assignment of units to automata are constrained by the uses of the
7127 units in insn reservations.  The most important constraint is: if a
7128 unit reservation is present on a particular cycle of an alternative
7129 for an insn reservation, then some unit from the same automaton must
7130 be present on the same cycle for the other alternatives of the insn
7131 reservation.  The rest of the constraints are mentioned in the
7132 description of the subsequent constructions.
7133
7134 @findex define_query_cpu_unit
7135 @cindex querying function unit reservations
7136 The following construction describes CPU functional units analogously
7137 to @code{define_cpu_unit}.  The reservation of such units can be
7138 queried for an automaton state.  The instruction scheduler never
7139 queries reservation of functional units for given automaton state.  So
7140 as a rule, you don't need this construction.  This construction could
7141 be used for future code generation goals (e.g.@: to generate
7142 @acronym{VLIW} insn templates).
7143
7144 @smallexample
7145 (define_query_cpu_unit @var{unit-names} [@var{automaton-name}])
7146 @end smallexample
7147
7148 @var{unit-names} is a string giving names of the functional units
7149 separated by commas.
7150
7151 @var{automaton-name} is a string giving the name of the automaton with
7152 which the unit is bound.
7153
7154 @findex define_insn_reservation
7155 @cindex instruction latency time
7156 @cindex regular expressions
7157 @cindex data bypass
7158 The following construction is the major one to describe pipeline
7159 characteristics of an instruction.
7160
7161 @smallexample
7162 (define_insn_reservation @var{insn-name} @var{default_latency}
7163                          @var{condition} @var{regexp})
7164 @end smallexample
7165
7166 @var{default_latency} is a number giving latency time of the
7167 instruction.  There is an important difference between the old
7168 description and the automaton based pipeline description.  The latency
7169 time is used for all dependencies when we use the old description.  In
7170 the automaton based pipeline description, the given latency time is only
7171 used for true dependencies.  The cost of anti-dependencies is always
7172 zero and the cost of output dependencies is the difference between
7173 latency times of the producing and consuming insns (if the difference
7174 is negative, the cost is considered to be zero).  You can always
7175 change the default costs for any description by using the target hook
7176 @code{TARGET_SCHED_ADJUST_COST} (@pxref{Scheduling}).
7177
7178 @var{insn-name} is a string giving the internal name of the insn.  The
7179 internal names are used in constructions @code{define_bypass} and in
7180 the automaton description file generated for debugging.  The internal
7181 name has nothing in common with the names in @code{define_insn}.  It is a
7182 good practice to use insn classes described in the processor manual.
7183
7184 @var{condition} defines what RTL insns are described by this
7185 construction.  You should remember that you will be in trouble if
7186 @var{condition} for two or more different
7187 @code{define_insn_reservation} constructions is TRUE for an insn.  In
7188 this case what reservation will be used for the insn is not defined.
7189 Such cases are not checked during generation of the pipeline hazards
7190 recognizer because in general recognizing that two conditions may have
7191 the same value is quite difficult (especially if the conditions
7192 contain @code{symbol_ref}).  It is also not checked during the
7193 pipeline hazard recognizer work because it would slow down the
7194 recognizer considerably.
7195
7196 @var{regexp} is a string describing the reservation of the cpu's functional
7197 units by the instruction.  The reservations are described by a regular
7198 expression according to the following syntax:
7199
7200 @smallexample
7201        regexp = regexp "," oneof
7202               | oneof
7203
7204        oneof = oneof "|" allof
7205              | allof
7206
7207        allof = allof "+" repeat
7208              | repeat
7209
7210        repeat = element "*" number
7211               | element
7212
7213        element = cpu_function_unit_name
7214                | reservation_name
7215                | result_name
7216                | "nothing"
7217                | "(" regexp ")"
7218 @end smallexample
7219
7220 @itemize @bullet
7221 @item
7222 @samp{,} is used for describing the start of the next cycle in
7223 the reservation.
7224
7225 @item
7226 @samp{|} is used for describing a reservation described by the first
7227 regular expression @strong{or} a reservation described by the second
7228 regular expression @strong{or} etc.
7229
7230 @item
7231 @samp{+} is used for describing a reservation described by the first
7232 regular expression @strong{and} a reservation described by the
7233 second regular expression @strong{and} etc.
7234
7235 @item
7236 @samp{*} is used for convenience and simply means a sequence in which
7237 the regular expression are repeated @var{number} times with cycle
7238 advancing (see @samp{,}).
7239
7240 @item
7241 @samp{cpu_function_unit_name} denotes reservation of the named
7242 functional unit.
7243
7244 @item
7245 @samp{reservation_name} --- see description of construction
7246 @samp{define_reservation}.
7247
7248 @item
7249 @samp{nothing} denotes no unit reservations.
7250 @end itemize
7251
7252 @findex define_reservation
7253 Sometimes unit reservations for different insns contain common parts.
7254 In such case, you can simplify the pipeline description by describing
7255 the common part by the following construction
7256
7257 @smallexample
7258 (define_reservation @var{reservation-name} @var{regexp})
7259 @end smallexample
7260
7261 @var{reservation-name} is a string giving name of @var{regexp}.
7262 Functional unit names and reservation names are in the same name
7263 space.  So the reservation names should be different from the
7264 functional unit names and can not be the reserved name @samp{nothing}.
7265
7266 @findex define_bypass
7267 @cindex instruction latency time
7268 @cindex data bypass
7269 The following construction is used to describe exceptions in the
7270 latency time for given instruction pair.  This is so called bypasses.
7271
7272 @smallexample
7273 (define_bypass @var{number} @var{out_insn_names} @var{in_insn_names}
7274                [@var{guard}])
7275 @end smallexample
7276
7277 @var{number} defines when the result generated by the instructions
7278 given in string @var{out_insn_names} will be ready for the
7279 instructions given in string @var{in_insn_names}.  The instructions in
7280 the string are separated by commas.
7281
7282 @var{guard} is an optional string giving the name of a C function which
7283 defines an additional guard for the bypass.  The function will get the
7284 two insns as parameters.  If the function returns zero the bypass will
7285 be ignored for this case.  The additional guard is necessary to
7286 recognize complicated bypasses, e.g.@: when the consumer is only an address
7287 of insn @samp{store} (not a stored value).
7288
7289 @findex exclusion_set
7290 @findex presence_set
7291 @findex final_presence_set
7292 @findex absence_set
7293 @findex final_absence_set
7294 @cindex VLIW
7295 @cindex RISC
7296 The following five constructions are usually used to describe
7297 @acronym{VLIW} processors, or more precisely, to describe a placement
7298 of small instructions into @acronym{VLIW} instruction slots.  They
7299 can be used for @acronym{RISC} processors, too.
7300
7301 @smallexample
7302 (exclusion_set @var{unit-names} @var{unit-names})
7303 (presence_set @var{unit-names} @var{patterns})
7304 (final_presence_set @var{unit-names} @var{patterns})
7305 (absence_set @var{unit-names} @var{patterns})
7306 (final_absence_set @var{unit-names} @var{patterns})
7307 @end smallexample
7308
7309 @var{unit-names} is a string giving names of functional units
7310 separated by commas.
7311
7312 @var{patterns} is a string giving patterns of functional units
7313 separated by comma.  Currently pattern is one unit or units
7314 separated by white-spaces.
7315
7316 The first construction (@samp{exclusion_set}) means that each
7317 functional unit in the first string can not be reserved simultaneously
7318 with a unit whose name is in the second string and vice versa.  For
7319 example, the construction is useful for describing processors
7320 (e.g.@: some SPARC processors) with a fully pipelined floating point
7321 functional unit which can execute simultaneously only single floating
7322 point insns or only double floating point insns.
7323
7324 The second construction (@samp{presence_set}) means that each
7325 functional unit in the first string can not be reserved unless at
7326 least one of pattern of units whose names are in the second string is
7327 reserved.  This is an asymmetric relation.  For example, it is useful
7328 for description that @acronym{VLIW} @samp{slot1} is reserved after
7329 @samp{slot0} reservation.  We could describe it by the following
7330 construction
7331
7332 @smallexample
7333 (presence_set "slot1" "slot0")
7334 @end smallexample
7335
7336 Or @samp{slot1} is reserved only after @samp{slot0} and unit @samp{b0}
7337 reservation.  In this case we could write
7338
7339 @smallexample
7340 (presence_set "slot1" "slot0 b0")
7341 @end smallexample
7342
7343 The third construction (@samp{final_presence_set}) is analogous to
7344 @samp{presence_set}.  The difference between them is when checking is
7345 done.  When an instruction is issued in given automaton state
7346 reflecting all current and planned unit reservations, the automaton
7347 state is changed.  The first state is a source state, the second one
7348 is a result state.  Checking for @samp{presence_set} is done on the
7349 source state reservation, checking for @samp{final_presence_set} is
7350 done on the result reservation.  This construction is useful to
7351 describe a reservation which is actually two subsequent reservations.
7352 For example, if we use
7353
7354 @smallexample
7355 (presence_set "slot1" "slot0")
7356 @end smallexample
7357
7358 the following insn will be never issued (because @samp{slot1} requires
7359 @samp{slot0} which is absent in the source state).
7360
7361 @smallexample
7362 (define_reservation "insn_and_nop" "slot0 + slot1")
7363 @end smallexample
7364
7365 but it can be issued if we use analogous @samp{final_presence_set}.
7366
7367 The forth construction (@samp{absence_set}) means that each functional
7368 unit in the first string can be reserved only if each pattern of units
7369 whose names are in the second string is not reserved.  This is an
7370 asymmetric relation (actually @samp{exclusion_set} is analogous to
7371 this one but it is symmetric).  For example it might be useful in a
7372 @acronym{VLIW} description to say that @samp{slot0} cannot be reserved
7373 after either @samp{slot1} or @samp{slot2} have been reserved.  This
7374 can be described as:
7375
7376 @smallexample
7377 (absence_set "slot0" "slot1, slot2")
7378 @end smallexample
7379
7380 Or @samp{slot2} can not be reserved if @samp{slot0} and unit @samp{b0}
7381 are reserved or @samp{slot1} and unit @samp{b1} are reserved.  In
7382 this case we could write
7383
7384 @smallexample
7385 (absence_set "slot2" "slot0 b0, slot1 b1")
7386 @end smallexample
7387
7388 All functional units mentioned in a set should belong to the same
7389 automaton.
7390
7391 The last construction (@samp{final_absence_set}) is analogous to
7392 @samp{absence_set} but checking is done on the result (state)
7393 reservation.  See comments for @samp{final_presence_set}.
7394
7395 @findex automata_option
7396 @cindex deterministic finite state automaton
7397 @cindex nondeterministic finite state automaton
7398 @cindex finite state automaton minimization
7399 You can control the generator of the pipeline hazard recognizer with
7400 the following construction.
7401
7402 @smallexample
7403 (automata_option @var{options})
7404 @end smallexample
7405
7406 @var{options} is a string giving options which affect the generated
7407 code.  Currently there are the following options:
7408
7409 @itemize @bullet
7410 @item
7411 @dfn{no-minimization} makes no minimization of the automaton.  This is
7412 only worth to do when we are debugging the description and need to
7413 look more accurately at reservations of states.
7414
7415 @item
7416 @dfn{time} means printing time statistics about the generation of
7417 automata.
7418
7419 @item
7420 @dfn{stats} means printing statistics about the generated automata
7421 such as the number of DFA states, NDFA states and arcs.
7422
7423 @item
7424 @dfn{v} means a generation of the file describing the result automata.
7425 The file has suffix @samp{.dfa} and can be used for the description
7426 verification and debugging.
7427
7428 @item
7429 @dfn{w} means a generation of warning instead of error for
7430 non-critical errors.
7431
7432 @item
7433 @dfn{ndfa} makes nondeterministic finite state automata.  This affects
7434 the treatment of operator @samp{|} in the regular expressions.  The
7435 usual treatment of the operator is to try the first alternative and,
7436 if the reservation is not possible, the second alternative.  The
7437 nondeterministic treatment means trying all alternatives, some of them
7438 may be rejected by reservations in the subsequent insns.
7439
7440 @item
7441 @dfn{progress} means output of a progress bar showing how many states
7442 were generated so far for automaton being processed.  This is useful
7443 during debugging a @acronym{DFA} description.  If you see too many
7444 generated states, you could interrupt the generator of the pipeline
7445 hazard recognizer and try to figure out a reason for generation of the
7446 huge automaton.
7447 @end itemize
7448
7449 As an example, consider a superscalar @acronym{RISC} machine which can
7450 issue three insns (two integer insns and one floating point insn) on
7451 the cycle but can finish only two insns.  To describe this, we define
7452 the following functional units.
7453
7454 @smallexample
7455 (define_cpu_unit "i0_pipeline, i1_pipeline, f_pipeline")
7456 (define_cpu_unit "port0, port1")
7457 @end smallexample
7458
7459 All simple integer insns can be executed in any integer pipeline and
7460 their result is ready in two cycles.  The simple integer insns are
7461 issued into the first pipeline unless it is reserved, otherwise they
7462 are issued into the second pipeline.  Integer division and
7463 multiplication insns can be executed only in the second integer
7464 pipeline and their results are ready correspondingly in 8 and 4
7465 cycles.  The integer division is not pipelined, i.e.@: the subsequent
7466 integer division insn can not be issued until the current division
7467 insn finished.  Floating point insns are fully pipelined and their
7468 results are ready in 3 cycles.  Where the result of a floating point
7469 insn is used by an integer insn, an additional delay of one cycle is
7470 incurred.  To describe all of this we could specify
7471
7472 @smallexample
7473 (define_cpu_unit "div")
7474
7475 (define_insn_reservation "simple" 2 (eq_attr "type" "int")
7476                          "(i0_pipeline | i1_pipeline), (port0 | port1)")
7477
7478 (define_insn_reservation "mult" 4 (eq_attr "type" "mult")
7479                          "i1_pipeline, nothing*2, (port0 | port1)")
7480
7481 (define_insn_reservation "div" 8 (eq_attr "type" "div")
7482                          "i1_pipeline, div*7, div + (port0 | port1)")
7483
7484 (define_insn_reservation "float" 3 (eq_attr "type" "float")
7485                          "f_pipeline, nothing, (port0 | port1))
7486
7487 (define_bypass 4 "float" "simple,mult,div")
7488 @end smallexample
7489
7490 To simplify the description we could describe the following reservation
7491
7492 @smallexample
7493 (define_reservation "finish" "port0|port1")
7494 @end smallexample
7495
7496 and use it in all @code{define_insn_reservation} as in the following
7497 construction
7498
7499 @smallexample
7500 (define_insn_reservation "simple" 2 (eq_attr "type" "int")
7501                          "(i0_pipeline | i1_pipeline), finish")
7502 @end smallexample
7503
7504
7505 @end ifset
7506 @ifset INTERNALS
7507 @node Conditional Execution
7508 @section Conditional Execution
7509 @cindex conditional execution
7510 @cindex predication
7511
7512 A number of architectures provide for some form of conditional
7513 execution, or predication.  The hallmark of this feature is the
7514 ability to nullify most of the instructions in the instruction set.
7515 When the instruction set is large and not entirely symmetric, it
7516 can be quite tedious to describe these forms directly in the
7517 @file{.md} file.  An alternative is the @code{define_cond_exec} template.
7518
7519 @findex define_cond_exec
7520 @smallexample
7521 (define_cond_exec
7522   [@var{predicate-pattern}]
7523   "@var{condition}"
7524   "@var{output-template}")
7525 @end smallexample
7526
7527 @var{predicate-pattern} is the condition that must be true for the
7528 insn to be executed at runtime and should match a relational operator.
7529 One can use @code{match_operator} to match several relational operators
7530 at once.  Any @code{match_operand} operands must have no more than one
7531 alternative.
7532
7533 @var{condition} is a C expression that must be true for the generated
7534 pattern to match.
7535
7536 @findex current_insn_predicate
7537 @var{output-template} is a string similar to the @code{define_insn}
7538 output template (@pxref{Output Template}), except that the @samp{*}
7539 and @samp{@@} special cases do not apply.  This is only useful if the
7540 assembly text for the predicate is a simple prefix to the main insn.
7541 In order to handle the general case, there is a global variable
7542 @code{current_insn_predicate} that will contain the entire predicate
7543 if the current insn is predicated, and will otherwise be @code{NULL}.
7544
7545 When @code{define_cond_exec} is used, an implicit reference to
7546 the @code{predicable} instruction attribute is made.
7547 @xref{Insn Attributes}.  This attribute must be boolean (i.e.@: have
7548 exactly two elements in its @var{list-of-values}).  Further, it must
7549 not be used with complex expressions.  That is, the default and all
7550 uses in the insns must be a simple constant, not dependent on the
7551 alternative or anything else.
7552
7553 For each @code{define_insn} for which the @code{predicable}
7554 attribute is true, a new @code{define_insn} pattern will be
7555 generated that matches a predicated version of the instruction.
7556 For example,
7557
7558 @smallexample
7559 (define_insn "addsi"
7560   [(set (match_operand:SI 0 "register_operand" "r")
7561         (plus:SI (match_operand:SI 1 "register_operand" "r")
7562                  (match_operand:SI 2 "register_operand" "r")))]
7563   "@var{test1}"
7564   "add %2,%1,%0")
7565
7566 (define_cond_exec
7567   [(ne (match_operand:CC 0 "register_operand" "c")
7568        (const_int 0))]
7569   "@var{test2}"
7570   "(%0)")
7571 @end smallexample
7572
7573 @noindent
7574 generates a new pattern
7575
7576 @smallexample
7577 (define_insn ""
7578   [(cond_exec
7579      (ne (match_operand:CC 3 "register_operand" "c") (const_int 0))
7580      (set (match_operand:SI 0 "register_operand" "r")
7581           (plus:SI (match_operand:SI 1 "register_operand" "r")
7582                    (match_operand:SI 2 "register_operand" "r"))))]
7583   "(@var{test2}) && (@var{test1})"
7584   "(%3) add %2,%1,%0")
7585 @end smallexample
7586
7587 @end ifset
7588 @ifset INTERNALS
7589 @node Constant Definitions
7590 @section Constant Definitions
7591 @cindex constant definitions
7592 @findex define_constants
7593
7594 Using literal constants inside instruction patterns reduces legibility and
7595 can be a maintenance problem.
7596
7597 To overcome this problem, you may use the @code{define_constants}
7598 expression.  It contains a vector of name-value pairs.  From that
7599 point on, wherever any of the names appears in the MD file, it is as
7600 if the corresponding value had been written instead.  You may use
7601 @code{define_constants} multiple times; each appearance adds more
7602 constants to the table.  It is an error to redefine a constant with
7603 a different value.
7604
7605 To come back to the a29k load multiple example, instead of
7606
7607 @smallexample
7608 (define_insn ""
7609   [(match_parallel 0 "load_multiple_operation"
7610      [(set (match_operand:SI 1 "gpc_reg_operand" "=r")
7611            (match_operand:SI 2 "memory_operand" "m"))
7612       (use (reg:SI 179))
7613       (clobber (reg:SI 179))])]
7614   ""
7615   "loadm 0,0,%1,%2")
7616 @end smallexample
7617
7618 You could write:
7619
7620 @smallexample
7621 (define_constants [
7622     (R_BP 177)
7623     (R_FC 178)
7624     (R_CR 179)
7625     (R_Q  180)
7626 ])
7627
7628 (define_insn ""
7629   [(match_parallel 0 "load_multiple_operation"
7630      [(set (match_operand:SI 1 "gpc_reg_operand" "=r")
7631            (match_operand:SI 2 "memory_operand" "m"))
7632       (use (reg:SI R_CR))
7633       (clobber (reg:SI R_CR))])]
7634   ""
7635   "loadm 0,0,%1,%2")
7636 @end smallexample
7637
7638 The constants that are defined with a define_constant are also output
7639 in the insn-codes.h header file as #defines.
7640 @end ifset
7641 @ifset INTERNALS
7642 @node Macros
7643 @section Macros
7644 @cindex macros in @file{.md} files
7645
7646 Ports often need to define similar patterns for more than one machine
7647 mode or for more than one rtx code.  GCC provides some simple macro
7648 facilities to make this process easier.
7649
7650 @menu
7651 * Mode Macros::         Generating variations of patterns for different modes.
7652 * Code Macros::         Doing the same for codes.
7653 @end menu
7654
7655 @node Mode Macros
7656 @subsection Mode Macros
7657 @cindex mode macros in @file{.md} files
7658
7659 Ports often need to define similar patterns for two or more different modes.
7660 For example:
7661
7662 @itemize @bullet
7663 @item
7664 If a processor has hardware support for both single and double
7665 floating-point arithmetic, the @code{SFmode} patterns tend to be
7666 very similar to the @code{DFmode} ones.
7667
7668 @item
7669 If a port uses @code{SImode} pointers in one configuration and
7670 @code{DImode} pointers in another, it will usually have very similar
7671 @code{SImode} and @code{DImode} patterns for manipulating pointers.
7672 @end itemize
7673
7674 Mode macros allow several patterns to be instantiated from one
7675 @file{.md} file template.  They can be used with any type of
7676 rtx-based construct, such as a @code{define_insn},
7677 @code{define_split}, or @code{define_peephole2}.
7678
7679 @menu
7680 * Defining Mode Macros:: Defining a new mode macro.
7681 * Substitutions::        Combining mode macros with substitutions
7682 * Examples::             Examples
7683 @end menu
7684
7685 @node Defining Mode Macros
7686 @subsubsection Defining Mode Macros
7687 @findex define_mode_macro
7688
7689 The syntax for defining a mode macro is:
7690
7691 @smallexample
7692 (define_mode_macro @var{name} [(@var{mode1} "@var{cond1}") ... (@var{moden} "@var{condn}")])
7693 @end smallexample
7694
7695 This allows subsequent @file{.md} file constructs to use the mode suffix
7696 @code{:@var{name}}.  Every construct that does so will be expanded
7697 @var{n} times, once with every use of @code{:@var{name}} replaced by
7698 @code{:@var{mode1}}, once with every use replaced by @code{:@var{mode2}},
7699 and so on.  In the expansion for a particular @var{modei}, every
7700 C condition will also require that @var{condi} be true.
7701
7702 For example:
7703
7704 @smallexample
7705 (define_mode_macro P [(SI "Pmode == SImode") (DI "Pmode == DImode")])
7706 @end smallexample
7707
7708 defines a new mode suffix @code{:P}.  Every construct that uses
7709 @code{:P} will be expanded twice, once with every @code{:P} replaced
7710 by @code{:SI} and once with every @code{:P} replaced by @code{:DI}.
7711 The @code{:SI} version will only apply if @code{Pmode == SImode} and
7712 the @code{:DI} version will only apply if @code{Pmode == DImode}.
7713
7714 As with other @file{.md} conditions, an empty string is treated
7715 as ``always true''.  @code{(@var{mode} "")} can also be abbreviated
7716 to @code{@var{mode}}.  For example:
7717
7718 @smallexample
7719 (define_mode_macro GPR [SI (DI "TARGET_64BIT")])
7720 @end smallexample
7721
7722 means that the @code{:DI} expansion only applies if @code{TARGET_64BIT}
7723 but that the @code{:SI} expansion has no such constraint.
7724
7725 Macros are applied in the order they are defined.  This can be
7726 significant if two macros are used in a construct that requires
7727 substitutions.  @xref{Substitutions}.
7728
7729 @node Substitutions
7730 @subsubsection Substitution in Mode Macros
7731 @findex define_mode_attr
7732
7733 If an @file{.md} file construct uses mode macros, each version of the
7734 construct will often need slightly different strings or modes.  For
7735 example:
7736
7737 @itemize @bullet
7738 @item
7739 When a @code{define_expand} defines several @code{add@var{m}3} patterns
7740 (@pxref{Standard Names}), each expander will need to use the
7741 appropriate mode name for @var{m}.
7742
7743 @item
7744 When a @code{define_insn} defines several instruction patterns,
7745 each instruction will often use a different assembler mnemonic.
7746
7747 @item
7748 When a @code{define_insn} requires operands with different modes,
7749 using a macro for one of the operand modes usually requires a specific
7750 mode for the other operand(s).
7751 @end itemize
7752
7753 GCC supports such variations through a system of ``mode attributes''.
7754 There are two standard attributes: @code{mode}, which is the name of
7755 the mode in lower case, and @code{MODE}, which is the same thing in
7756 upper case.  You can define other attributes using:
7757
7758 @smallexample
7759 (define_mode_attr @var{name} [(@var{mode1} "@var{value1}") ... (@var{moden} "@var{valuen}")])
7760 @end smallexample
7761
7762 where @var{name} is the name of the attribute and @var{valuei}
7763 is the value associated with @var{modei}.
7764
7765 When GCC replaces some @var{:macro} with @var{:mode}, it will scan
7766 each string and mode in the pattern for sequences of the form
7767 @code{<@var{macro}:@var{attr}>}, where @var{attr} is the name of a
7768 mode attribute.  If the attribute is defined for @var{mode}, the whole
7769 @code{<...>} sequence will be replaced by the appropriate attribute
7770 value.
7771
7772 For example, suppose an @file{.md} file has:
7773
7774 @smallexample
7775 (define_mode_macro P [(SI "Pmode == SImode") (DI "Pmode == DImode")])
7776 (define_mode_attr load [(SI "lw") (DI "ld")])
7777 @end smallexample
7778
7779 If one of the patterns that uses @code{:P} contains the string
7780 @code{"<P:load>\t%0,%1"}, the @code{SI} version of that pattern
7781 will use @code{"lw\t%0,%1"} and the @code{DI} version will use
7782 @code{"ld\t%0,%1"}.
7783
7784 Here is an example of using an attribute for a mode:
7785
7786 @smallexample
7787 (define_mode_macro LONG [SI DI])
7788 (define_mode_attr SHORT [(SI "HI") (DI "SI")])
7789 (define_insn ...
7790   (sign_extend:LONG (match_operand:<LONG:SHORT> ...)) ...)
7791 @end smallexample
7792
7793 The @code{@var{macro}:} prefix may be omitted, in which case the
7794 substitution will be attempted for every macro expansion.
7795
7796 @node Examples
7797 @subsubsection Mode Macro Examples
7798
7799 Here is an example from the MIPS port.  It defines the following
7800 modes and attributes (among others):
7801
7802 @smallexample
7803 (define_mode_macro GPR [SI (DI "TARGET_64BIT")])
7804 (define_mode_attr d [(SI "") (DI "d")])
7805 @end smallexample
7806
7807 and uses the following template to define both @code{subsi3}
7808 and @code{subdi3}:
7809
7810 @smallexample
7811 (define_insn "sub<mode>3"
7812   [(set (match_operand:GPR 0 "register_operand" "=d")
7813         (minus:GPR (match_operand:GPR 1 "register_operand" "d")
7814                    (match_operand:GPR 2 "register_operand" "d")))]
7815   ""
7816   "<d>subu\t%0,%1,%2"
7817   [(set_attr "type" "arith")
7818    (set_attr "mode" "<MODE>")])
7819 @end smallexample
7820
7821 This is exactly equivalent to:
7822
7823 @smallexample
7824 (define_insn "subsi3"
7825   [(set (match_operand:SI 0 "register_operand" "=d")
7826         (minus:SI (match_operand:SI 1 "register_operand" "d")
7827                   (match_operand:SI 2 "register_operand" "d")))]
7828   ""
7829   "subu\t%0,%1,%2"
7830   [(set_attr "type" "arith")
7831    (set_attr "mode" "SI")])
7832
7833 (define_insn "subdi3"
7834   [(set (match_operand:DI 0 "register_operand" "=d")
7835         (minus:DI (match_operand:DI 1 "register_operand" "d")
7836                   (match_operand:DI 2 "register_operand" "d")))]
7837   ""
7838   "dsubu\t%0,%1,%2"
7839   [(set_attr "type" "arith")
7840    (set_attr "mode" "DI")])
7841 @end smallexample
7842
7843 @node Code Macros
7844 @subsection Code Macros
7845 @cindex code macros in @file{.md} files
7846 @findex define_code_macro
7847 @findex define_code_attr
7848
7849 Code macros operate in a similar way to mode macros.  @xref{Mode Macros}.
7850
7851 The construct:
7852
7853 @smallexample
7854 (define_code_macro @var{name} [(@var{code1} "@var{cond1}") ... (@var{coden} "@var{condn}")])
7855 @end smallexample
7856
7857 defines a pseudo rtx code @var{name} that can be instantiated as
7858 @var{codei} if condition @var{condi} is true.  Each @var{codei}
7859 must have the same rtx format.  @xref{RTL Classes}.
7860
7861 As with mode macros, each pattern that uses @var{name} will be
7862 expanded @var{n} times, once with all uses of @var{name} replaced by
7863 @var{code1}, once with all uses replaced by @var{code2}, and so on.
7864 @xref{Defining Mode Macros}.
7865
7866 It is possible to define attributes for codes as well as for modes.
7867 There are two standard code attributes: @code{code}, the name of the
7868 code in lower case, and @code{CODE}, the name of the code in upper case.
7869 Other attributes are defined using:
7870
7871 @smallexample
7872 (define_code_attr @var{name} [(@var{code1} "@var{value1}") ... (@var{coden} "@var{valuen}")])
7873 @end smallexample
7874
7875 Here's an example of code macros in action, taken from the MIPS port:
7876
7877 @smallexample
7878 (define_code_macro any_cond [unordered ordered unlt unge uneq ltgt unle ungt
7879                              eq ne gt ge lt le gtu geu ltu leu])
7880
7881 (define_expand "b<code>"
7882   [(set (pc)
7883         (if_then_else (any_cond:CC (cc0)
7884                                    (const_int 0))
7885                       (label_ref (match_operand 0 ""))
7886                       (pc)))]
7887   ""
7888 @{
7889   gen_conditional_branch (operands, <CODE>);
7890   DONE;
7891 @})
7892 @end smallexample
7893
7894 This is equivalent to:
7895
7896 @smallexample
7897 (define_expand "bunordered"
7898   [(set (pc)
7899         (if_then_else (unordered:CC (cc0)
7900                                     (const_int 0))
7901                       (label_ref (match_operand 0 ""))
7902                       (pc)))]
7903   ""
7904 @{
7905   gen_conditional_branch (operands, UNORDERED);
7906   DONE;
7907 @})
7908
7909 (define_expand "bordered"
7910   [(set (pc)
7911         (if_then_else (ordered:CC (cc0)
7912                                   (const_int 0))
7913                       (label_ref (match_operand 0 ""))
7914                       (pc)))]
7915   ""
7916 @{
7917   gen_conditional_branch (operands, ORDERED);
7918   DONE;
7919 @})
7920
7921 ...
7922 @end smallexample
7923
7924 @end ifset