Форум на CrossPlatform.RU -> Ответ в ошибка при генерации в ANTLRWorks 1.3.1

Последние 10 сообщений [ в обратном порядке ]

Балалайка	Дата 20.12.2010, 17:00
	Ну language = C; я ставлю. Дело в том, что с Java я знаком также второй день. И так, вот что я делал: 1) Я установил себе java-машину - sun jvm 2) Установил себе jdk 3) Скачал antlrworks-1.4.2.jar 4) в настройках компиляции antlrworks выбрал javac и указал путь до папки jdk1.6.0_18 (есть ещё папка jre6, но там нет файла javac) У меня был тестовый проект antl, где всё работает (в antlrworks): лексер, парсер и вызов сем. подпрограмм написанных на java. Так что вроде бы всё работает. После этого я создал лексер, парсер, и выбрал язык java. Жму debug (в antlrworks всё это) и всё нормально, дерево разбора создаётся, подключённый java файл генерирует файл (просто для теста). Далее изменяю язык на Си: options { language = C;}, удаляю папку output и жму ещё раз degug и вылазеет ошибка и в консоле текст: [19:54:53] warning(200): Pascal.g:71:4: Decision can match input such as "'else'" using multiple alternatives: 1, 2 Цитата As a result, alternative(s) 2 were disabled for that input [19:54:53] warning(200): D:\TR2_lab#7\Pascal.g:71:4: Decision can match input such as "'else'" using multiple alternatives: 1, 2 As a result, alternative(s) 2 were disabled for that input [19:54:54] javac: file not found: D:\TR2_lab#7\output\PascalParser.java [19:54:54] Usage: javac <options> <source files> [19:54:54] use -help for a list of possible options При этом в папке output создаётся парсер, лексер - всё на языке си. Понять не могу, зачем требует antlrworks файл PascalParser.java? Также не могу понять, зачем в меню Run пункт Edit *.g Test Rig for C И кстати при компиляции на java создаётся файл __Test__.java, а при указании языка Си , аналогичный файл (__Test__.с) не создаётся
Iron Bug	Дата 20.12.2010, 14:39
	точно сейчас не скажу, но примерно так: 1. в коде должно быть: `options { ... language = C; }` 2. в переменной окружения CLASSPATH для жабы должны быть пути до пакетов ANTLR. система должна находить либо пакетник, который чего-то-там-full называется, либо развёрнутый пакет должен быть установлен в стандартные каталоги жабы. об этом написано в википедии ANTLR на их сайте (например, тут: http://www.antlr.org/wiki/pages/viewpage.action?pageId=728). плюс вроде бы для Си ему нужен ещё Java JDK (не только JRE). 3. <имя файла>.g должно быть таким же, как имя парсера (не знаю, зачем такая дурь, но вот требуется ему это). рекомендую поставить ANTLRWorks, если ещё не поставил. в нём проще, он диагностирует всякие неполадки и выводит более-менее читабельные сообщения. к тому же в последнем ANTLRWorks дополнено некое подобие визарда для простейших проектиков.
Балалайка	Дата 20.12.2010, 14:10
	граждане, только 2ой день с ANTLR знаком. Не пойму как компилятором сделать Си. На яве всё хорошо идёт, но только пишу, language = С, парсер и лексер переводятся в си, но вылазиит ошибка, что javac не видит файла __Test__.java. Сишный подобный файл не генерируется. В настройках компиляции стоит javac. Я не пойму, нужно выбрать какойнибудь сишный компилятор?
Iron Bug	Дата 24.3.2010, 13:37
	не знаю. не задумывалась над этим. читай доки, смотри реализацию libantlr3c.
mannyz	Дата 23.3.2010, 13:45
	IronBug, скажи, пожалуйста, можно ли в варианте для Си (я так понял, что под Java можно) указать или поменять тип значения, возвращаемого парсером. Под Java они используют опцию superClass = имя_класса, таким образом делая, чтобы сам генерируемый класс парсера был расширен от superClass. Но в Си это не прокатывает. Можно ли указать тип возвращаемого значения парсера, структуру или еще чего?
mannyz	Дата 19.3.2010, 8:38
	ясненько. спасибо )
Iron Bug	Дата 18.3.2010, 23:03
	пишут, что разве что так. `rule { int c = 0; } : ( subrule { c++; } )* { if (c < N \|\| c > M) { diediedie(...) // типа, тут прерываем обработку } } ;` (взято с http://www.antlr.org/pipermail/antlr-inter...rch/011591.html - там весь раздув, правда, дико устаревшая переписка) вроде в третьем собирались это реализовать какими-то встроенными средствами, но я пока нифига не нашла на этот счёт.
mannyz	Дата 18.3.2010, 13:20
	кстати, в том примере для Си, который я выше привел, последний мой "раскрывающийся текст", есть пояснения как все-таки можно обеспечить поддержку C++. надо только еще добавить cpp-файл с описанием класса для примера: Раскрывающийся текст #ifndef _SYMBOLPP_H #define _SYMBOLPP_H // A simple demonstration class purely to show that the C target // output is C++ compilable. Note that this is the output files that // can be compiled as C++, not the runtime library code itself, which // is purest C, though it has very few things that are incompatible. There is // however no need for the C runtime library to be compiled as C++, as it // just links in trivially as a library on any system, which is more than // can be said for C++ ;_). // // Later the C output may be extended to give some nice convenience classes // for C++ people. For now though, use C for interactions and instantiate your // own classes and so on. // #ifdef __cplusplus class symbolpp { public: symbolpp(int aline, pANTLR3_STRING aname) { line = aline; name = aname; printf("Created a symbolpp class\n"); } private: pANTLR3_STRING name; int line; public: inline int getLine() { return line; } inline const char * getName() { return (const char *)(name->chars); } }; #endif #endif оказывается, надо просто все, что касается с++ #ifdef/#endif. скажи пожалуйста, может быть, ты случайно знаешь как можно задавать в ANTLR цифрами количество раз повторений подряд того или иного токена/подправила? Например, как в других средствах описание регулярных выражений: `rule: subrule1{min,max} subrule2{,max}; subrule1 : ...; subrule2 : ...;` Таким образом, subrule-ы должны встретиться от min до max количества раз подряд. Подскажешь?
mannyz	Дата 17.3.2010, 23:33
	Цитата мдя... я не юзала хэш-таблицы в ANTLR-овской реализации, но решила проверить, почему этот код не работает. пришлось быть шаманом и прикончить весь запас пива в холодильнике, прежде чем великий дух программирования сжалился и дал мне просветление, чтобы просечь, как это сделать пиво вредно )). спасибо за подробные разъяснения. попробовал твои текста - заработало: все скомпилилось и ошибка не появлялась. здорово! но самое странное, что попробовал свои вчерашние текста (из своего сообщения, с которым ошибка памяти возникала) - тоже все прошло нормально. не понятно тогда, в чем вчера дело было ((. Цитата вот только хрен знает почему нельзя просто так в глобальном пространстве объявлять эту переменную. суть этой проблемы пока мне не ясна. да, получается, урезание в правах какое-то: ведь в ANTLR-книге приводятся примеры, где можно объявлять списки List или еще что-то в глобальном пространстве и потом просто так юзать их где попало и как попало. этоn пример, кстати, в главе про scope есть. проще показать: Цитата @members { String methodName; } method: type ID {methodName=$ID.text;} body ; body: '{' statement+ '}' ; statement : decl {...methodName...} ';' // ref value set in method \| ... ; Видимо, это такая особенность этой хэш-таблицы или всего scope. если посмотришь ниже, в примере для Си, в области memebers рядом с пояснением к SCOPE_SIZE и т.п., они говорят что-то про стэк и все такое. только я окончательно ничего не понимать. В общем, если есть желание разобраться в этом, то вот код, а так ты уже и так помогла очень )). Раскрывающийся текст /** ANSI C ANTLR v3 grammar Adapted for C output target by Jim Idle - April 2007. Translated from Jutta Degener's 1995 ANSI C yacc grammar by Terence Parr July 2006. The lexical rules were taken from the Java grammar. Jutta says: "In 1985, Jeff Lee published his Yacc grammar (which is accompanied by a matching Lex specification) for the April 30, 1985 draft version of the ANSI C standard. Tom Stockfisch reposted it to net.sources in 1987; that original, as mentioned in the answer to question 17.25 of the comp.lang.c FAQ, can be ftp'ed from ftp.uu.net, file usenet/net.sources/ansi.c.grammar.Z. I intend to keep this version as close to the current C Standard grammar as possible; please let me know if you discover discrepancies. Jutta Degener, 1995" Generally speaking, you need symbol table info to parse C; typedefs define types and then IDENTIFIERS are either types or plain IDs. I'm doing the min necessary here tracking only type names. This is a good example of the global scope (called Symbols). Every rule that declares its usage of Symbols pushes a new copy on the stack effectively creating a new symbol scope. Also note rule declaration declares a rule scope that lets any invoked rule see isTypedef boolean. It's much easier than passing that info down as parameters. Very clean. Rule direct_declarator can then easily determine whether the IDENTIFIER should be declared as a type name. I have only tested this on a single file, though it is 3500 lines. This grammar requires ANTLR v3 (3.0b8 or higher) Terence Parr July 2006 / grammar C; options { backtrack = true; memoize = true; k = 2; language = C; } scope Symbols { // Only track types in order to get parser working. The Java example // used the java.util.Set to keep track of these. The ANTLR3 runtime // has a number of useful 'objects' we can use that act very much like // the Java hashtables, Lists and Vectors. You have finer control over these // than the Java programmer, but they are sometimes a little more 'raw'. // Here, for each scope level, we want a set of symbols, so we can use // a ANTLR3 runtime provided hash table, and then later we will see if // a symbols is stored in at any level by using the symbol as the // key to the hashtable and seeing if the table contains that key. // pANTLR3_HASH_TABLE types; } // While you can implement your own character streams and so on, they // normally call things like LA() via function pointers. In general you will // be using one of the pre-supplied input streams and you can instruct the // generated code to access the input pointers directly. // // For 8 bit inputs : #define ANTLR3_INLINE_INPUT_ASCII // For 16 bit UTF16/UCS2 inputs : #define ANTLR3_INLINE_INPUT_UTF16 // // If your compiled recognizer might be given inputs from either of the sources // or you have written your own character input stream, then do not define // either of these. // @lexer::header { #define ANTLR3_INLINE_INPUT_ASCII } @parser::includes { // Include our noddy C++ example class // #include <cpp_symbolpp.h> } // The @header specifier is valid in the C target, but in this case there // is nothing to add over and above the generated code. Here you would // add #defines perhaps that you have made your code reliant upon. // // Use @preincludes for things you want to appear in the output file // before #include <antlr3.h> // @includes to come after #include <antlr3.h> // @header for things that should follow on after all the includes. // // Hence, this java oriented @header is commented out. // // @header { // import java.util.Set; // import java.util.HashSet; // } // @members inserts functions in C output file (parser without other // qualification. @lexer::members inserts functions in the lexer. // // In general do not use this too much (put in the odd tiny function perhaps), // but include the generated header files in your own header and use this in // separate translation units that contain support functions. // @members { // This is a function that is small enough to be kept in the // generated parser code (@lexer::members puts code in the lexer. // // Note a few useful MACROS in use here: // // SCOPE_SIZE returns the number of levels on the stack (1 to n) // for the named scope. // SCOPE_INSTANCE returns a pointer to Scope instance at the // specified level. // SCOPE_TYPE makes it easy to declare and cast the pointer to // the structure typedef that the code generator declares. // // All functions (that need anything to do with the runtime, should // receive a parameter called ctx as the first parameter. ctx is a pointer // to the instance of the parser that is running and ensures thread safety // as well as easy access to all the parser elements etc. All MACROS assume // the presence of this parameter. This would be a pCLexer pointer if this // were a function to be called by the lexer (in which case this would be in // @lexer::members. // ANTLR3_BOOLEAN isTypeName(pCParser ctx, pANTLR3_UINT8 name) { int i; for (i = (int)SCOPE_SIZE(Symbols)-1 ; i >= 0; i--) { pANTLR3_HASH_TABLE symtab; pANTLR3_STRING symbol; SCOPE_TYPE(Symbols) symScope; // Aids in declaring the scope pointers // Pick up the pointer to the scope structure at the current level // We are descending from the inner most scope as that is how C type // scoping works. // symScope = (SCOPE_TYPE(Symbols))SCOPE_INSTANCE(Symbols, i); // The pointer we have is an instance of the dynamic global scope // called Symbols. Within there, as declared above, we have a pointer // to an ANTLR3_HASH_TABLE. We should really check for NULL etc, like all good C code // should. But, this is example code... // symtab = (pANTLR3_HASH_TABLE) symScope->types; // The following call shows that you cannot add a NULL pointer as the entry for // the hash table. You can always just add the pointer to the key and ignore it, but // when you return from this call, you want to test for a NULL pointer, which means // the entry was not found in the table. // symbol = (pANTLR3_STRING) (symtab->get(symtab, (void )name)); // Did we find the symbol in the type lists? // This is generally used for semantic predicates, hence ANTLR3_TRUE or ANTLR3_FALSE // for the return // if (symbol != NULL) { return ANTLR3_TRUE; } } // We did not find the requested symbol in any of the scopes // that are currently in force. // return ANTLR3_FALSE; } // Because our dynamic scope contains an ANTLR3_HASH_TABLE, we need to free // it when it goes out of scope. When we create a new scope, we just set the // free pointer for the scope to point to this embedded function, which will be // called with a pointer to the scope instance that is to be freed, // from whence we take the table pointer, which we can then close :-) // void ANTLR3_CDECL freeTable(SCOPE_TYPE(Symbols) symtab) { // If we supplied an entry in the table with a free pointer, // then calling the table free function will call the free function // for each entry as it deletes it from the table. In this case however // we only stored things that were manufactured by internal factories, which // will be released anyway when the parser/lexer/etc are freed. // symtab->types->free(symtab->types); } } translation_unit scope Symbols; // The entire translation_unit (file) is a scope @init { // The code in @init is executed before the rule starts. Note that this // is C, hence we cannot guarantee to be able to both declare and initialize // variables at the same time. If you need to declare variables as local // to a rule, use the @declarations section and then initialize locals // separately in this @init section. // $Symbols::types = antlr3HashTableNew(11); // parameter is a rough hint for hash alg. as to size SCOPE_TOP(Symbols)->free = freeTable; // This is called when the scope is popped } : external_declaration+ ; /** Either a function definition or any other kind of C decl/def. * The LL() analysis algorithm fails to deal with this due to recursion in the declarator rules. I'm putting in a * manual predicate here so that we don't backtrack over * the entire function. Further, you get a better error * as errors within the function itself don't make it fail * to predict that it's a function. Weird errors previously. * Remember: the goal is to avoid backtrack like the plague * because it makes debugging, actions, and errors harder. * * Note that k=1 results in a much smaller predictor for the * fixed lookahead; k=2 made a few extra thousand lines. * I'll have to optimize that in the future. / external_declaration options { k=1; } : ( declaration_specifiers? declarator declaration '{' )=> function_definition \| declaration ; function_definition scope Symbols; // put parameters and locals into same scope for now @init { $Symbols::types = antlr3HashTableNew(11); SCOPE_TOP(Symbols)->free = freeTable; // This is called when the scope is popped } : declaration_specifiers? declarator ( declaration+ compound_statement // K&R style \| compound_statement // ANSI style ) ; declaration scope { ANTLR3_BOOLEAN isTypedef; } @init { $declaration::isTypedef = ANTLR3_FALSE; } : 'typedef' declaration_specifiers? { $declaration::isTypedef=ANTLR3_TRUE; } init_declarator_list ';' // special case, looking for typedef \| declaration_specifiers init_declarator_list? ';' ; declaration_specifiers : ( storage_class_specifier \| type_specifier \| type_qualifier )+ ; init_declarator_list : init_declarator (',' init_declarator)* ; init_declarator : declarator ('=' initializer)? ; storage_class_specifier : 'extern' \| 'static' \| 'auto' \| 'register' ; type_specifier : 'void' \| 'char' \| 'short' \| 'int' \| 'long' \| 'float' \| 'double' \| 'signed' \| 'unsigned' \| struct_or_union_specifier \| enum_specifier \| type_id ; type_id : {isTypeName(ctx, LT(1)->getText(LT(1))->chars) }? // Note how we reference using C directly IDENTIFIER { // In Java you can just use $xxx.text, which is of type String. // In C, .text returns an ANTLR3 'object' of type pANTLR3_STRING. // the pointer to the actual characters is contained in ->chars and // the object has lots of methods to help you with strings, such as append and // insert etc. pANTLR3_STRING is also auto managed by a string factory, which // will be released when you ->free() the parser. // // printf("'\%s' is a type", $IDENTIFIER.text->chars); } ; struct_or_union_specifier options { k=3; } scope Symbols; // structs are scopes @init { $Symbols::types = antlr3HashTableNew(11); SCOPE_TOP(Symbols)->free = freeTable; // This is called when the scope is popped } : struct_or_union IDENTIFIER? '{' struct_declaration_list '}' \| struct_or_union IDENTIFIER ; struct_or_union : 'struct' \| 'union' ; struct_declaration_list : struct_declaration+ ; struct_declaration : specifier_qualifier_list struct_declarator_list ';' ; specifier_qualifier_list : ( type_qualifier \| type_specifier )+ ; struct_declarator_list : struct_declarator (',' struct_declarator)* ; struct_declarator : declarator (':' constant_expression)? \| ':' constant_expression ; enum_specifier options { k=3; } : 'enum' '{' enumerator_list '}' \| 'enum' IDENTIFIER '{' enumerator_list '}' \| 'enum' IDENTIFIER ; enumerator_list : enumerator (',' enumerator)* ; enumerator : IDENTIFIER ('=' constant_expression)? ; type_qualifier : 'const' \| 'volatile' ; declarator : pointer? direct_declarator \| pointer ; direct_declarator : ( IDENTIFIER { if (SCOPE_TOP(declaration) != NULL && $declaration::isTypedef) { pANTLR3_STRING idText; // When adding an element to a pANTLR3_HASH_TABLE, the first argument // is the hash table itself, the second is the entry key, the third is // a (void ) pointer to a structure or element of your choice (cannot // be NULL) and the 4th is a pointer to a function that knows how to free // your entry structure (if this is needed, give NULL if not) when the table // is destroyed, or the entry is deleted from the table. // idText = $IDENTIFIER.text; $Symbols::types->put($Symbols::types, idText->chars, idText, NULL); #ifdef __cplusplus // If you compile the generated C source as C++, then you can embed // C++ code in your actions. The runtime is still C based and everything // is tagged properly for linking and so on, but because you are using the // C++ compiler, it will happilly accept classes and so on for things like // scopes. This class is defined entirely in the header file C.h, if "compile // as C++ is set for CParser.c and CLexer.c" It is just a silly example // of course and I don't do anythign with this class, just create and delete it. // symbolpp mySymClass; mySymClass = new symbolpp($IDENTIFIER.line, idText); delete mySymClass; #endif // Note that we must escape the percent sign here from ANTLR expression // parsing. It is not seen in the generated C code. // printf("define type \%s\n", $IDENTIFIER.text->chars); } } \| '(' declarator ')' ) declarator_suffix* ; declarator_suffix : '[' constant_expression ']' \| '[' ']' \| '(' parameter_type_list ')' \| '(' identifier_list ')' \| '(' ')' ; pointer : '' type_qualifier+ pointer? \| '' pointer \| '' ; parameter_type_list : parameter_list (',' '...')? ; parameter_list : parameter_declaration (',' parameter_declaration) ; parameter_declaration : declaration_specifiers (declarator \| abstract_declarator)* ; identifier_list : IDENTIFIER (',' IDENTIFIER)* ; type_name : specifier_qualifier_list abstract_declarator? ; abstract_declarator : pointer direct_abstract_declarator? \| direct_abstract_declarator ; direct_abstract_declarator : ( '(' abstract_declarator ')' \| abstract_declarator_suffix ) abstract_declarator_suffix* ; abstract_declarator_suffix : '[' ']' \| '[' constant_expression ']' \| '(' ')' \| '(' parameter_type_list ')' ; initializer : assignment_expression \| '{' initializer_list ','? '}' ; initializer_list : initializer (',' initializer)* ; // E x p r e s s i o n s argument_expression_list : assignment_expression (',' assignment_expression)* ; additive_expression : (multiplicative_expression) ('+' multiplicative_expression \| '-' multiplicative_expression)* ; multiplicative_expression : (cast_expression) ('' cast_expression \| '/' cast_expression \| '%' cast_expression) ; cast_expression : '(' type_name ')' cast_expression \| unary_expression ; unary_expression : postfix_expression \| '++' unary_expression \| '--' unary_expression \| unary_operator cast_expression \| 'sizeof' unary_expression \| 'sizeof' '(' type_name ')' ; postfix_expression : primary_expression ( '[' expression ']' \| '(' ')' \| '(' argument_expression_list ')' \| '.' IDENTIFIER \| '' IDENTIFIER \| '->' IDENTIFIER \| '++' \| '--' ) ; unary_operator : '&' \| '' \| '+' \| '-' \| '~' \| '!' ; primary_expression : IDENTIFIER \| constant \| '(' expression ')' ; constant : HEX_LITERAL \| OCTAL_LITERAL \| DECIMAL_LITERAL \| CHARACTER_LITERAL \| STRING_LITERAL \| FLOATING_POINT_LITERAL ; ///// expression : assignment_expression (',' assignment_expression) ; constant_expression : conditional_expression ; assignment_expression : lvalue assignment_operator assignment_expression \| conditional_expression ; lvalue : unary_expression ; assignment_operator : '=' \| '=' \| '/=' \| '%=' \| '+=' \| '-=' \| '<<=' \| '>>=' \| '&=' \| '^=' \| '\|=' ; conditional_expression : logical_or_expression ('?' expression ':' conditional_expression)? ; logical_or_expression : logical_and_expression ('\|\|' logical_and_expression) ; logical_and_expression : inclusive_or_expression ('&&' inclusive_or_expression)* ; inclusive_or_expression : exclusive_or_expression ('\|' exclusive_or_expression)* ; exclusive_or_expression : and_expression ('^' and_expression)* ; and_expression : equality_expression ('&' equality_expression)* ; equality_expression : relational_expression (('=='\|'!=') relational_expression)* ; relational_expression : shift_expression (('<'\|'>'\|'<='\|'>=') shift_expression)* ; shift_expression : additive_expression (('<<'\|'>>') additive_expression)* ; // S t a t e m e n t s statement : labeled_statement \| compound_statement \| expression_statement \| selection_statement \| iteration_statement \| jump_statement ; labeled_statement : IDENTIFIER ':' statement \| 'case' constant_expression ':' statement \| 'default' ':' statement ; compound_statement scope Symbols; // blocks have a scope of symbols @init { $Symbols::types = antlr3HashTableNew(11); SCOPE_TOP(Symbols)->free = freeTable; // This is called when the scope is popped } : '{' declaration* statement_list? '}' ; statement_list : statement+ ; expression_statement : ';' \| expression ';' ; selection_statement : 'if' '(' expression ')' statement (('else')=> 'else' statement)? \| 'switch' '(' expression ')' statement ; iteration_statement : 'while' '(' expression ')' statement \| 'do' statement 'while' '(' expression ')' ';' \| 'for' '(' expression_statement expression_statement expression? ')' statement ; jump_statement : 'goto' IDENTIFIER ';' \| 'continue' ';' \| 'break' ';' \| 'return' ';' \| 'return' expression ';' ; IDENTIFIER : LETTER (LETTER\|'0'..'9')* ; fragment LETTER : '$' \| 'A'..'Z' \| 'a'..'z' \| '_' ; CHARACTER_LITERAL : '\'' ( EscapeSequence \| ~('\''\|'\\') ) '\'' ; STRING_LITERAL : '"' STRING_GUTS '"' ; fragment STRING_GUTS : ( EscapeSequence \| ~('\\'\|'"') )* ; HEX_LITERAL : '0' ('x'\|'X') HexDigit+ IntegerTypeSuffix? ; DECIMAL_LITERAL : ('0' \| '1'..'9' '0'..'9') IntegerTypeSuffix? ; OCTAL_LITERAL : '0' ('0'..'7')+ IntegerTypeSuffix? ; fragment HexDigit : ('0'..'9'\|'a'..'f'\|'A'..'F') ; fragment IntegerTypeSuffix : ('l'\|'L') \| ('u'\|'U') ('l'\|'L')? ; FLOATING_POINT_LITERAL : ('0'..'9')+ '.' ('0'..'9') Exponent? FloatTypeSuffix? \| '.' ('0'..'9')+ Exponent? FloatTypeSuffix? \| ('0'..'9')+ ( Exponent FloatTypeSuffix? \| FloatTypeSuffix) ; fragment Exponent : ('e'\|'E') ('+'\|'-')? ('0'..'9')+ ; fragment FloatTypeSuffix : ('f'\|'F'\|'d'\|'D') ; fragment EscapeSequence : '\\' ('b'\|'t'\|'n'\|'f'\|'r'\|'\"'\|'\''\|'\\') \| OctalEscape ; fragment OctalEscape : '\\' ('0'..'3') ('0'..'7') ('0'..'7') \| '\\' ('0'..'7') ('0'..'7') \| '\\' ('0'..'7') ; fragment UnicodeEscape : '\\' 'u' HexDigit HexDigit HexDigit HexDigit ; WS : (' '\|'\r'\|'\t'\|'\u000C'\|'\n') {$channel=HIDDEN;} ; COMMENT : '/' ( options {greedy=false;} : . ) '/' {$channel=HIDDEN;} ; LINE_COMMENT : '//' ~('\n'\|'\r') '\r'? '\n' {$channel=HIDDEN;} ; // ignore #line info for now LINE_COMMAND : '#' (' ' \| '\t')* ( 'include' (' ' \| '\t')+ '"' file = STRING_GUTS '"' (' ' \| '\t')* '\r'? '\n' { pANTLR3_STRING fName; pANTLR3_INPUT_STREAM in; // Create an initial string, then take a substring // We can do this by messing with the start and end // pointers of tokens and so on. This shows a reasonable way to // manipulate strings. // fName = $file.text; printf("Including file '\%s'\n", fName->chars); // Create a new input stream and take advantage of built in stream stacking // in C target runtime. // in = antlr3AsciiFileStreamNew(fName->chars); PUSHSTREAM(in); // Note that the input stream is not closed when it EOFs, I don't bother // to do it here (hence this is leaked at the program end), // but it is up to you to track streams created like this // and destroy them when the whole parse session is complete. Remember that you // don't want to do this until all tokens have been manipulated all the way through // your tree parsers etc as the token does not store the text it just refers // back to the input stream and trying to get the text for it will abort if you // close the input stream too early. // } \| (('0'..'9')=>('0'..'9'))+ ~('\n'\|'\r')* '\r'? '\n' ) {$channel=HIDDEN;} ;
Iron Bug	Дата 17.3.2010, 22:19
	мдя... я не юзала хэш-таблицы в ANTLR-овской реализации, но решила проверить, почему этот код не работает. пришлось быть шаманом и прикончить весь запас пива в холодильнике, прежде чем великий дух программирования сжалился и дал мне просветление, чтобы просечь, как это сделать вот: Раскрывающийся текст grammar Expr; options { language=C; } @header { #include <stdlib.h> } @members { pANTLR3_HASH_TABLE types; } prog @init { types = antlr3HashTableNew(11); } :state+ ; state : expr NEWLINE { printf("\%s\n",$expr.text->chars); } \| ID '=' expr NEWLINE \| NEWLINE ; expr : multExpr ( '+' multExpr \| '-' multExpr )* ; multExpr : atom ('' atom ) ; atom returns [int value] : INT { $value = atoi($INT.text->chars); printf("<\%s=\%d>",$INT.text->chars,$value); } \| ID \| '(' expr ')' ; ID : ('a'..'z'\|'A'..'Z')+ ; INT : '0'..'9'+ ; NEWLINE:'\r'? '\n' ; WS : (' '\|'\t')+ { SKIP(); }; вот только хрен знает почему нельзя просто так в глобальном пространстве объявлять эту переменную. суть этой проблемы пока мне не ясна. либо можно сделать так: объявлять переменную во внешнем файле, а в парсере только использовать, указав extern: Раскрывающийся текст grammar Expr; options { language=C; } @header { #include <stdlib.h> } @members { extern pANTLR3_HASH_TABLE types; } prog @init { types = antlr3HashTableNew(11); } :state+ ; state : expr NEWLINE { printf("\%s\n",$expr.text->chars); } \| ID '=' expr NEWLINE \| NEWLINE ; expr : multExpr ( '+' multExpr \| '-' multExpr )* ; multExpr : atom ('' atom ) ; atom returns [int value] : INT { $value = atoi($INT.text->chars); printf("<\%s=\%d>",$INT.text->chars,$value); } \| ID \| '(' expr ')' ; ID : ('a'..'z'\|'A'..'Z')+ ; INT : '0'..'9'+ ; NEWLINE:'\r'? '\n' ; WS : (' '\|'\t')+ { SKIP(); }; на самом деле, инициализировать её тоже можно где угодно. главное, чтобы до использования. и вот ещё вариант с областями видимости: Раскрывающийся текст grammar Expr; options { language=C; } scope SomeScopeName { pANTLR3_HASH_TABLE types; } @header { #include <stdlib.h> } prog scope SomeScopeName; @init { $SomeScopeName::types= antlr3HashTableNew(11); } :statex+ ; statex : expr NEWLINE { printf("\%s\n",$expr.text->chars); } \| ID '=' expr NEWLINE \| NEWLINE ; expr : multExpr ( '+' multExpr \| '-' multExpr )* ; multExpr : atom ('' atom ) ; atom returns [int value] : INT { $value = atoi($INT.text->chars); printf("<\%s=\%d>",$INT.text->chars,$value); } \| ID \| '(' expr ')' ; ID : ('a'..'z'\|'A'..'Z')+ ; INT : '0'..'9'+ ; NEWLINE:'\r'? '\n' ; WS : (' '\|'\t')+ { SKIP(); }; это типа сделано, чтобы различать области видимости и "разрешать" некоторым правилам доступ к конкретным внешним переменным. ну, что больше подходит - то и юзай.
Просмотр темы полностью (откроется в новом окне)