Source files and tokenization
Tokens
A token is a series of contiguous characters
that the compiler treats as a unit.
Translation phase 3 partitions a source file
into a sequence of tokens.
Tokens fall into seven classes:
- 
Identifiers
 
- 
Keywords
 
- 
Numeric Constants
 
- 
Character Constants
 
- 
String literals
 
- 
Operators
 
- 
Other separators and punctuators
Next topic: 
Identifiers
Previous topic: 
Phases of translation
© 2004 The SCO Group, Inc.  All rights reserved.
UnixWare 7 Release 7.1.4 - 27 April 2004