Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

Chapter 2. Lexical Structure

Programs

An M program consists of one or more source files, known formally as compilation units. A compilation unit file is an ordered sequence of Unicode characters. Compilation units typically have a one-to-one correspondence with files in a file system, but this correspondence is not required. For maximal portability, it is recommended that files in a file system be encoded with the UTF-8 encoding.

Conceptually speaking, a program is compiled using four steps:

Lexical analysis, which translates a stream of Unicode input characters into a stream of tokens. Lexical analysis evaluates and executes pre-processing directives.
Syntactic analysis, which translates the stream of tokens into an abstract syntax tree.
Semantic analysis, which resolves all symbols in the abstract syntax tree, type checks the structure, and generates a semantic graph.
Code generation, which generates an image from the semantic graph. An image is a list of executable instructions for some target runtime, for example, SQL Server.

Further tools may link images and load them into a runtime.

Grammars

This specification presents the syntax of the M programming language using two grammars. The lexical grammar defines how Unicode characters are combined to form line terminators, white space, comments, tokens, and pre-processing directives. The syntactic grammar defines how the tokens resulting from the lexical grammar are combined to form M programs.

Grammar Notation

The lexical and syntactic grammars are presented using grammar productions. Each grammar production defines a non-terminal symbol and the possible expansions of that non-terminal symbol into sequences of non-terminal or terminal symbols. In grammar productions, non-terminal symbols are shown in italic type, and terminal symbols are shown in a fixed-width font.

The first line of a grammar production is the name of the non-terminal symbol being defined, followed by a colon. Each successive indented line contains a possible expansion of the non-terminal given as a sequence of non-terminal or terminal symbols. For example, the production:

IdentifierVerbatim: [ IdentifierVerbatimCharacters ]

defines an IdentifierVerbatim to consist of the token “[”, followed by IdentifierVerbatimCharacters, followed by the token “]”.

When there is more than one possible expansion of a non-terminal symbol, the alternatives are listed on separate lines. For example, the production:

DecimalDigits: DecimalDigit DecimalDigits DecimalDigit

defines DecimalDigits to either consist of a DecimalDigit or consist of DecimalDigits followed by a DecimalDigit. In other words, the definition is recursive and specifies that decimal digits list consists of one or more decimal digits.

A subscripted suffix “opt” is used to indicate an optional symbol. The production:

DecimalLiteral: IntegerLiteral . DecimalDigit DecimalDigits_opt

is shorthand for:

DecimalLiteral: IntegerLiteral . DecimalDigit IntegerLiteral . DecimalDigit DecimalDigits

and defines an DecimalLiteral to consist of an IntegerLiteral followed by a “.” a DecimalDigit and by optional DecimalDigits.

Alternatives are normally listed on separate lines, though in cases where there are many alternatives, the phrase “one of” may precede a list of expansions given on a single line. This is simply shorthand for listing each of the alternatives on a separate line. For example, the production:

Sign: one of
  +    -

is shorthand for:

Sign:
  +
  -

Conversely, exclusions are designated with the phrase “none of.” For example, the production

TextSimple: none of
"

NewLineCharacter

permits all characters except ‘"’, ‘’, and new line characters.

Lexical Grammar

The lexical grammar of M is presented in 2.3. The terminal symbols of the lexical grammar are the characters of the Unicode character set, and the lexical grammar specifies how characters are combined to form tokens, white space, and comments (Section 2.3.2).

Every source file in an M program must conform to the Input production of the lexical grammar.

Syntactic Grammar

The syntactic grammar of M is presented in the chapters that follow this chapter. The terminal symbols of the syntactic grammar are the tokens defined by the lexical grammar, and the syntactic grammar specifies how tokens are combined to form M programs.

Every source file in an M program must conform to the CompilationUnit production of the syntactic grammar.

Lexical Analysis

The Input production defines the lexical structure of an M source file. Each source file in an M program must conform to this lexical grammar production.

Input: InputSection_optInputSection: InputSectionPart InputSection InputSectionPartInputSectionPart: InputElements_opt NewLineInputElements: InputElement InputElements InputElementInputElement: Whitespace Comment Token

Four basic elements make up the lexical structure of an M source file: line terminators, white space, comments, and tokens. Of these basic elements, only tokens are significant in the syntactic grammar of an M program.

The lexical processing of an M source file consists of reducing the file into a sequence of tokens that becomes the input to the syntactic analysis. Line terminators, white space, and comments can serve to separate tokens, but otherwise these lexical elements have no impact on the syntactic structure of an M program.

When several lexical grammar productions match a sequence of characters in a source file, the lexical processing always forms the longest possible lexical element. For example, the character sequence // is processed as the beginning of a single-line comment because that lexical element is longer than a single / token.

Line Terminators

Line terminators divide the characters of an M source file into lines.

NewLine:
  NewLineCharacter
  U+000D U+000A
NewLineCharacter:
  U+000A  // Line Feed
  U+000D  // Carriage Return
  U+0085  // Next Line
  U+2028  // Line Separator
  U+2029  // Paragraph Separator

For compatibility with source code editing tools that add end-of-file markers, and to enable a source file to be viewed as a sequence of properly terminated lines, the following transformations are applied, in order, to every compilation unit:

If the last character of the source file is a Control-Z character (U+001A), this character is deleted.
A carriage-return character (U+000D) is added to the end of the source file if that source file is nonempty and if the last character of the source file is not a carriage return (U+000D), a line feed (U+000A), a line separator (U+2028), or a paragraph separator (U+2029).

Comments

Two forms of comments are supported: single-line comments and delimited comments. Single-line comments start with the characters // and extend to the end of the source line. Delimited comments start with the characters /* and end with the characters */. Delimited comments may span multiple lines.

Comment:
  CommentDelimited
  CommentLine
CommentDelimited:
  /* CommentDelimitedContents_opt */
CommentDelimitedContent:
  * none of /
CommentDelimitedContents:
  CommentDelimitedContent
  CommentDelimitedContents  CommentDelimitedContent
CommentLine:
  // CommentLineContents_opt
CommentLineContent: none of
  NewLineCharacter
CommentLineContents:
  CommentLineContent
  CommentLineContents  CommentLineContent

Comments do not nest. The character sequences /* and */ have no special meaning within a // comment, and the character sequences // and /* have no special meaning within a delimited comment.

Comments are not processed within Text literals.

The example

// This defines a
// Person entity
//
type Person = {
    Name : Text;
    Age : Number;
}

shows three single-line comments.

The example

/* This defines a
   Person entity
*/
type Person = {
    Name : Text;
    Age : Number;
}

includes one delimited comment.

Whitespace

Whitespace is defined as any character with Unicode class Zs (which includes the space character) as well as the horizontal tab character, the vertical tab character, and the form feed character.

Whitespace:
  WhitespaceCharacters
WhitespaceCharacter:
  U+0009  // Horizontal Tab
  U+000B  // Vertical Tab
  U+000C  // Form Feed
  U+0020  // Space
  NewLineCharacter
WhitespaceCharacters:
  WhitespaceCharacter
  WhitespaceCharacters  WhitespaceCharacter

Tokens

There are several kinds of tokens: identifiers, keywords, literals, operators, and punctuators. White space and comments are not tokens, though they act as separators for tokens.

Token:
	Identifier
	Keyword
	Literal
	OperatorOrPunctuator

Identifiers

A regular identifier begins with a letter or underscore and then any sequence of letter, underscore, dollar sign, or digit. An escaped identifier is enclosed in square brackets. It contains any sequence of Text literal characters.

Identifier:
  IdentifierBegin  IdentifierCharacters_opt
  IdentifierVerbatim
IdentifierBegin:
  _
  Letter
IdentifierCharacter:
  IdentifierBegin
  $
  DecimalDigit
IdentifierCharacters:
  IdentifierCharacter
  IdentifierCharacters  IdentifierCharacter
IdentifierVerbatim:
  [ IdentifierVerbatimCharacters ]
IdentifierVerbatimCharacter:
  none of ]
  IdentifierVerbatimEscape
IdentifierVerbatimCharacters:
  IdentifierVerbatimCharacter
  IdentifierVerbatimCharacters  IdentifierVerbatimCharacter
IdentifierVerbatimEscape:
  \
  ]
Letter:
  a..z
  A..Z
DecimalDigit:
  0..9
DecimalDigits:
  DecimalDigit
  DecimalDigits  DecimalDigit

Keywords

A keyword is an identifier-like sequence of characters that is reserved and cannot be used as an identifier except when escaped with square brackets [].

Keyword:
     accumulate
     by
     equals
     export
     from
     group
     identity
     import
     in
     into
     item
     join
     let
     module
     null
     select
     this
     type
     unique
     value
     where

Literals

A literal is a source code representation of a value.

Literal:
	DecimalLiteral
	IntegerLiteral
	ScientificLiteral
	DateTimeLiteral
	TimeLiteral
	CharacterLiteral
	TextLiteral
	BinaryLiteral
	GuidLiteral
	LogicalLiteral
	NullLiteral

Literals may be ascribed with a type to override the default type ascription.

Decimal Literals

Decimal literals are used to write fixed-point or exact number values.

DecimalLiteral: IntegerLiteral . DecimalDigit DecimalDigits_opt

Decimal literals default to the smallest standard library type that can contain the value. Examples of decimal literal follow:

99.999
0.1
1.0

Integer Literals

Integer literals are used to write integral values.

IntegerLiteral:
	DecimalDigits

Integer literals default to the smallest precision type that can contain the value, starting with Integer32.

Examples of integer literal follow:

0
123
999999999999999999999999999999

Scientific Literals

Scientific literals are used to write values floating-point or inexact numbers.

ScientificLiteral:
  DecimalLiteral  e Sign_opt DecimalDigit  DecimalDigits_opt
  DecimalLiteral  E Sign_opt DecimalDigit  DecimalDigits_opt
Sign: one of
  +   -

Scientific literals default to the smallest precision type that can contain the value, starting with Double.

Examples of scientific literal follow:

.31416e+1
9.9999e-1
0.0E0

Date Literals

Date literals are used to write a date independent of a specific time of day.

DateLiteral:
  Sign_opt DateYear  - DateMonth  - DateDay

The tokens of a DateLiteral must not have white space.

DateDay: one of
  01  02  03  04  05  06  07  08  09  10  11  12  13  14  15  16  17
  18  19  20  21  22  23  24  25  26  27  28  29  30  31
DateMonth: one of
  01  02  03  04  05  06  07  08  09  10  11  12
DateYear:
  DecimalDigit  DecimalDigit  DecimalDigit  DecimalDigit

The type of a DateLiteral is Date.

0001-01-01 is the representation of January1^st, 1 AD.
There is no year 0, therefore ‘0000’ is not a valid Date Time.
-0001 is the representation of January1^st, 1 BC.

Examples of date literal follow:

0001-01-01
2008-08-14
-1184-03-01

DateTime Literals

DateTime literals are used to write a time of day on a specific date independent of time zone.

DateTimeLiteral:
  DateLiteral T TimeLiteral

The type of a DateTime literal is DateTime.

Example of date time literal follow:

2008-08-14T13:13:00
0001-01-01T00:00:00
2005-05-19T20:05:00

Time Literals

TimeLiteral:
  TimeHourMinute : TimeSecond
TimeHourMinute:
  TimeHour : TimeMinute
TimeHour: one of
  00  01  02  03  04  05  06  07  08  09  10  11
  12  13  14  15  16  17  18  19  20  21  22  23
TimeMinute:
  0  DecimalDigit
  1  DecimalDigit
  2  DecimalDigit
  3  DecimalDigit
  4  DecimalDigit
  5  DecimalDigit
TimeSecond:
  0 DecimalDigit  TimeSecondDecimalPart_opt
  1 DecimalDigit  TimeSecondDecimalPart_opt
  2 DecimalDigit  TimeSecondDecimalPart_opt
  3 DecimalDigit  TimeSecondDecimalPart_opt
  4 DecimalDigit  TimeSecondDecimalPart_opt
  5 DecimalDigit  TimeSecondDecimalPart_opt
  60 TimeSecondDecimalPart_opt
TimeSecondDecimalPart:
  . DecimalDigits

Examples of time literal follow:

11:30:00
01:01:01.111
13:13:00

Character Literals

A character literal represents a single character, for example ‘a’.

CharacterLiteral:
  ' Character '
Character:
  CharacterSimple
  CharacterEscapeHex
  CharacterEscapeSimple
  CharacterEscapeUnicode
Characters:
  Character
  Characters Character
CharacterEscapeHex:
  CharacterEscapeHexPrefix  HexDigit
  CharacterEscapeHexPrefix  HexDigit HexDigit
  CharacterEscapeHexPrefix  HexDigit  HexDigit  HexDigit
  CharacterEscapeHexPrefix  HexDigit  HexDigit  HexDigit  HexDigit
  CharacterEscapeHexPrefix: one of
  x  X
CharacterEscapeSimple:
   CharacterEscapeSimpleCharacter
CharacterEscapeSimpleCharacter: one of
  '  "    0  a  b  f  n  r  t  v
CharacterEscapeUnicode:
  u  HexDigit  HexDigit  HexDigit  HexDigit
  U  HexDigit  HexDigit  HexDigit  HexDigit HexDigit  HexDigit  HexDigit  HexDigit
CharacterSimple: none of
  U+0027  // Single Quote
  U+005C  // Backslash
  NewLineCharacter

A hexadecimal escape sequence represents a single Unicode character, with the value formed by the hexadecimal number following the prefix.

If the value represented by a character literal is greater than U+FFFF, a compile-time error occurs.

A Unicode character escape sequence in a character literal must be in the range U+0000 to U+FFFF.

A simple escape sequence represents a Unicode character encoding, as described in the following table.

Escape Sequence	Character Name	Unicode Encoding
`'`	Single quote	`0x0027`
`"`	Double quote	`0x0022`
`\`	Backslash	`0x005C`
	Null	`0x0000`
`a`	Alert	`0x0007`
	Backspace	`0x0008`
`f`	Form feed	`0x000C`
	New line	`0x000A`
	Carriage return	`0x000D`
	Horizontal tab	`0x0009`
`v`	Vertical tab	`0x000B`

Since M uses a 16-bit encoding of Unicode code points in Character and Text values, a Unicode character in the range U+10000 to U+10FFFF is not permitted in a Character literal and is represented using a Unicode surrogate pair in a Text literal. Unicode characters with code points above 0x10FFFF are not supported.

Multiple translations are not performed. For instance, the Text literal u005Cu005C is equivalent to u005C rather than . The Unicode value U+005C is the character .

The type of a Character literal is Character.

Examples of a character literal follow:

'a'
'u2323'
'x2323'

Text Literals

M supports two forms of Text literals: regular Text literals and verbatim Text literals.

A regular Text literal consists of zero or more characters enclosed in double quotes, as in "hello" and may include both simple escape sequences (such as for the tab character), and hexadecimal and Unicode escape sequences.

A verbatim Text literal consists of an @ character followed by a double-quote character, zero or more characters, and a closing double-quote character. A simple example is @"hello". In a verbatim Text literal, the characters between the delimiters are interpreted exactly as they occur in the compilation unit, the only exception being a QuoteEscapeSequence. In particular, simple escape sequences, and hexadecimal and Unicode escape sequences are not processed in verbatim Text literals. A verbatim Text literal may span multiple lines.

TextLiteral:
  " TextCharacters_opt  "
  @ " TextVerbatimCharacters_opt "
TextCharacter:
  TextSimple
  CharacterEscapeHex
  CharacterEscapeSimple
  CharacterEscapeUnicode
TextCharacters:
  TextCharacter
  TextCharacters  TextCharacter
TextSimple: none of
  "
  
  NewLineCharacter
TextVerbatimCharacter:
  none of "
  TextVerbatimCharacterEscape
TextVerbatimCharacterEscape:
  " "
TextVerbatimCharacters:
  TextVerbatimCharacter+;
  TextVerbatimCharacters  TextVerbatimCharacter

The type of a Text literal is Text.

Examples of text literal follow:

"Hello World"
@"""Hello World"""
"u2323"

Logical Literals

Logical literals are used to write logical values.

LogicalLiteral: one of
  true  false

The type of a Logical literal is Logical.

Examples of logical literal:

true
false

Binary and Byte Literals

Binary literals are used to write binary and byte values.

BinaryLiteral:
  0x HexDigits_opt
  0X HexDigits_opt
HexDigit: one of
  0  1  2  3  4  5  6  7  8  9  0  a  b  c  d  e  f  A  B  C  M  E  F
HexDigits:
  HexDigit  HexDigit
  HexDigits  HexDigit HexDigit

The type of a Binary literal with two digits defaults to Binary. Binary literals with two digits default to Byte.

Examples of byte literal follow:

0x00
0XFF
0x01

Examples of binary literal follow:

0x
0x0000000000000000000000000000000000000000000000
0x1234

Null Literal

The null literal is equal to no other value.

NullLiteral:
	`null`

The type of a null literal is Null.

An example of the null literal follows:

null

Guid Literals

GuidLiteral:
  #[ X X X X X X X X - X X X X - X X X X - X X X X - X X X X X X X X X X X X ]
X:
  HexDigit

The type of a Guid literal is Guid.

Examples of Guid literal follows:

#[a0ee7e0f-c6ac-4c63-b57f-816a5259595a]
#[7fbc28ba-8205-45ca-983e-ece117f7a776]
#[a05e63ca-25de-43a6-bf70-0bc04d40a000]

Operators and Punctuators

There are several kinds of operators and punctuators. Operators are used in expressions to describe operations involving one or more operands. For example, the expression a + b uses the + operator to add the two operands a and b. Punctuators are for grouping and separating.

OperatorOrPunctuator: one of
  [ ] ( ) . , : ; ? = < > <= >= == != + - * / % & | ! && || ~ << >> { } # .. @ ' " ??

Pre-processing Directives

Pre-processing directives provide the ability to conditionally skip sections of source files, to report error and warning conditions, and to delineate distinct regions of source code as a separate pre-processing step.

PPDirective:
	PPDeclaration
	PPConditional
	PPDiagnostic
	PPRegion

The following pre-processing directives are available:

#define and #undef, which are used to define and undefine, respectively, conditional compilation symbols.
#if, #else, and #endif, which are used to conditionally skip sections of source code.

A pre-processing directive always occupies a separate line of source code and always begins with a # character and a pre-processing directive name. White space may occur before the # character and between the # character and the directive name.

A source line containing a #define, #undef, #if, #else, or #endif directive may end with a single-line comment. Delimited comments (the /* */ style of comments) are not permitted on source lines containing pre-processing directives.

Pre-processing directives are neither tokens nor part of the syntactic grammar of M. However, pre-processing directives can be used to include or exclude sequences of tokens and can in that way affect the meaning of an M program. For example, after pre-processing the source text:

#define A
#undef B
type C
{
#if A
    F {}
#else
    G {}
#endif
#if B
    H {}
#else
    I {}
#endif
}

results in the exact same sequence of tokens as the source text:

type C
{
    F {}
    I {}
}

Thus, whereas lexically, the two programs are quite different, syntactically, they are identical.

Conditional Compilation Symbols

The conditional compilation functionality provided by the #if, #else, and #endif directives is controlled through pre-processing expressions and conditional compilation symbols.

ConditionalSymbol:
    Any IdentifierOrKeyword except true or false

A conditional compilation symbol has two possible states: defined or undefined. At the beginning of the lexical processing of a source file, a conditional compilation symbol is undefined unless it has been explicitly defined by an external mechanism (such as a command-line compiler option). When a #define directive is processed, the conditional compilation symbol named in that directive becomes defined in that source file. The symbol remains defined until an #undef directive for that same symbol is processed, or until the end of the source file is reached. An implication of this is that #define and #undef directives in one source file have no effect on other source files in the same program.

When referenced in a pre-processing expression, a defined conditional compilation symbol has the Logical value true, and an undefined conditional compilation symbol has the Logical value false. There is no requirement that conditional compilation symbols be explicitly declared before they are referenced in pre-processing expressions. Instead, undeclared symbols are simply undefined and thus have the value false.

Conditional compilation symbols can only be referenced in #define and #undef directives and in pre-processing expressions.

Pre-processing Expressions

Pre-processing expressions can occur in #if directives. The operators !, ==, !=, &&, and || are permitted in pre-processing expressions, and parentheses may be used for grouping.

PPExpression:
  Whitespace_opt PPOrExpression   Whitespace_opt
OrExpression:
  PPAndExpression
  PPOrExpression   Whitespace_opt || Whitespace_opt  PPAndExpression
PPAndExpression:
  PPEqualityExpression
  PPAndExpression Whitespace_opt && Whitespace_opt   PPEqualityExpression
PPEqualityExpression:
  PPUnaryExpression
  PPEqualityExpression   Whitespace_opt == Whitespace_opt   PPUnaryExpression
  PPEqualityExpression   Whitespace_opt != Whitespace_opt  PPUnaryExpression
PPUnaryExpression:
  PPPrimaryExpression
  ! Whitespace_opt  PPUnaryExpression
PPPrimaryExpression:
  true
  false
  ConditionalSymbol
  ( Whitespace_opt  PPExpression Whitespace_opt )

When referenced in a pre-processing expression, a defined conditional compilation symbol has the Logical value true, and an undefined conditional compilation symbol has the Logical value false.

Evaluation of a pre-processing expression always yields a Logical value. The rules of evaluation for a pre-processing expression are the same as those for a constant expression, except that the only user-defined entities that can be referenced are conditional compilation symbols.

Declaration Directives

The declaration directives are used to define or undefine conditional compilation symbols.

PPDeclaration:
  Whitespace_opt  #  Whitespace_opt define Whitespace   ConditionalSymbol  PPNewLine
  Whitespace_opt  #  Whitespace_opt undef Whitespace   ConditionalSymbol  PPNewLine
PPNewLine:
  Whitespace_opt  SingleLineComment_opt  NewLine

The processing of a #define directive causes the given conditional compilation symbol to become defined, starting with the source line that follows the directive. Likewise, the processing of an #undef directive causes the given conditional compilation symbol to become undefined, starting with the source line that follows the directive.

A #define may define a conditional compilation symbol that is already defined, without there being any intervening #undef for that symbol. The following example defines a conditional compilation symbol A and then defines it again.

#define A
#define A

A #undef may “undefine” a conditional compilation symbol that is not defined. The example below defines a conditional compilation symbol A and then undefines it twice; although the second #undef has no effect, it is still valid.

#define A
#undef A
#undef A

Conditional Compilation Directives

The conditional compilation directives are used to conditionally include or exclude portions of a source file.

PPConditional:
  PPIfSection   PPElseSection_opt   PPEndif
PPIfSection:
  Whitespace_opt # Whitespace_opt if Whitespace   PPExpression   PPNewLine   ConditionalSection_opt
PPElseSection:
  Whitespace_opt # Whitespace_opt else PPNewLine   ConditionalSection_opt
PPEndif:
  Whitespace_opt # Whitespace_opt endif PPNewLine
ConditionalSection:
  InputSection
  SkippedSection
SkippedSection:
  SkippedSectionPart
  SkippedSection   SkippedSectionPart
SkippedSectionPart:
  SkippedCharacters_opt NewLine
  PPDirective
SkippedCharacters:
  Whitespace_opt NotNumberSign  InputCharacters_opt
NotNumberSign:
     Any   InputCharacter   except  #

As indicated by the syntax, conditional compilation directives must be written as sets consisting of, in order, an #if directive, zero or one #else directive, and an #endif directive. Between the directives are conditional sections of source code. Each section is controlled by the immediately preceding directive. A conditional section may itself contain nested conditional compilation directives provided these directives form complete sets.

A PPConditional selects at most one of the contained ConditionalSections for normal lexical processing:

The PPExpressions of the #if directives are evaluated in order until one yields true. If an expression yields true, the ConditionalSection of the corresponding directive is selected.
If all PPExpressions yield false, and if an #else directive is present, the ConditionalSection of the #else directive is selected.
Otherwise, no ConditionalSection is selected.

The selected ConditionalSection, if any, is processed as a normal InputSection: The source code contained in the section must adhere to the lexical grammar; tokens are generated from the source code in the section; and pre-processing directives in the section have the prescribed effects.

The remaining ConditionalSections, if any, are processed as SkippedSections: Except for pre-processing directives, the source code in the section need not adhere to the lexical grammar; no tokens are generated from the source code in the section; and pre-processing directives in the section must be lexically correct but are not otherwise processed. Within a ConditionalSection that is being processed as a SkippedSection, any nested ConditionalSections (contained in nested #if...#endif and #region...#endregion constructs) are also processed as SkippedSections.

Except for pre-processing directives, skipped source code is not subject to lexical analysis. For example, the following is valid despite the unterminated comment in the #else section:

#define Debug        // Debugging on
type Purchase
{
    ExtendedPrice {
#if Debug
        Price * Quantity;
#else
        /* Unterminated comment!
#endif
    }
}

Note that pre-processing directives are required to be lexically correct even in skipped sections of source code.

Pre-processing directives are not processed when they appear inside multiline input elements. For example, the program:

type Hello
{
    World =  @"hello,
#if Debug
        world
#else
        Nebraska
#endif
        ";
    }
}

assigns the world field the value:

hello,
#if Debug
        world
#else
        Nebraska
#endif

In peculiar cases, the set of pre-processing directives that is processed might depend on the evaluation of the PPExpression. The example:

#if X
    /*
#else
    /* */ type Q { }
#endif

always produces the same token stream (type Q { }), regardless of whether X is defined. If X is defined, the only processed directives are #if and #endif, due to the multiline comment. If X is undefined, then three directives (#if, #else, #endif) are part of the directive set.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 2. Lexical Structure

Create new playlist

Sign In

Sign Up