Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Back End: Code Generation Intermediate representations Optimization Actual target machine code generation
Interpreter: Creates a virtual machine on the target system that understands your language directly.
Compiled code runs faster than interpreted code, since direct execution of the target language is faster than the indirect emulation of the virtual machine interpreter.
Three Phases of Analysis Lexical Analysis (Scanning) read characters and group them into lexemes in the language; categorize the lexemes into tokens
Syntactical Analysis (Parsing) read tokens and check the nested, heirarchical ordering of the tokens recognize/build trees of tokens that belong to the language grammar
Semantic Analysis given a grammatically correct parse tree, what does each part mean?
The smallest element/unit/word recognized in the compiled or interpreted language is called a lexeme e.g. getc } { abc 1.2 * ++ ,
Each lexeme is recognized by the Scanner to be in some category; that category is returned as the token associated with the lexeme e.g. ( IDENTIFIER, getc )
The Parser recognizes/builds a hierarchical parse tree of tokens; the parse tree expresses the grammar of the language
Lexical Analysis: The Scanner The scanner is a tokenizer: it reads your program and recognizes lexemes It categorizes the lexemes by token type and returns a (token,lexeme) pair to the syntax analyser e.g. 12.57 : the token is number, the lexeme is the value 12.57 e.g. My_Var : the token is identifier, the lexeme is the string My_Var The scanner is self-contained: it doesnt know anything about the syntax or grammar of the language The analysis is usually non-recursive