Changeset 148849 in webkit

Timestamp:

Apr 21, 2013 4:26:56 PM

Author:

oliver@apple.com

Message:

JS Lexer and Parser should be more informative when they encounter errors
https://bugs.webkit.org/show_bug.cgi?id=114924

Reviewed by Filip Pizlo.

Source/JavaScriptCore:

Add new tokens to represent the various ways that parsing and lexing have failed.
This gives us the ability to produce better error messages in some cases,
and to indicate whether or not the failure was due to invalid source, or simply
early termination.

The jsc prompt now makes use of this so that you can write functions that
are more than one line long.
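
The multi-line prompt behaviour described above can be illustrated with a minimal, standalone sketch. It is not the jsc implementation: `SyntaxStatus`, `toyCheckSyntax`, and `readStatement` are hypothetical names, and the toy checker only counts brackets, whereas the real code asks the parser whether the failure is recoverable.

```cpp
#include <cassert>
#include <iostream>
#include <sstream>
#include <string>

// Hypothetical three-way result, loosely modelled on the idea of
// "invalid source" versus "input simply ended too early".
enum class SyntaxStatus { Ok, Recoverable, Irrecoverable };

// Toy checker (an assumption, not JavaScriptCore's checkSyntax): it only
// tracks bracket depth and ignores strings and comments entirely.
SyntaxStatus toyCheckSyntax(const std::string& source)
{
    int depth = 0;
    for (char c : source) {
        if (c == '{' || c == '(' || c == '[')
            ++depth;
        else if (c == '}' || c == ')' || c == ']') {
            if (--depth < 0)
                return SyntaxStatus::Irrecoverable; // more input cannot fix a stray closer
        }
    }
    assert(depth >= 0);
    return depth > 0 ? SyntaxStatus::Recoverable : SyntaxStatus::Ok;
}

// Keeps reading lines while the accumulated source is merely incomplete.
// Returns false only when the input ends before any line was read.
bool readStatement(std::istream& in, std::ostream& out, std::string& source, SyntaxStatus& status)
{
    source.clear();
    status = SyntaxStatus::Ok;
    std::string line;
    do {
        out << (source.empty() ? ">>> " : "... ");
        if (!std::getline(in, line))
            return !source.empty();
        source += line;
        source += '\n';
        status = toyCheckSyntax(source);
    } while (status == SyntaxStatus::Recoverable);
    return true;
}
```

Calling `readStatement(std::cin, std::cout, source, status)` in a loop would give a prompt that switches from `>>> ` to `... ` until the brackets balance, which is the user-visible effect the message describes.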

bytecompiler/BytecodeGenerator.cpp:

(JSC::BytecodeGenerator::generate):

jsc.cpp:

(stringFromUTF):
(jscSource):
(runInteractive):

parser/Lexer.cpp:

(JSC::::parseFourDigitUnicodeHex):
(JSC::::parseIdentifierSlowCase):
(JSC::::parseString):
(JSC::::parseStringSlowCase):
(JSC::::lex):

parser/Lexer.h:

(UnicodeHexValue):
(JSC::Lexer::UnicodeHexValue::UnicodeHexValue):
(JSC::Lexer::UnicodeHexValue::valueType):
(JSC::Lexer::UnicodeHexValue::isValid):
(JSC::Lexer::UnicodeHexValue::value):
(Lexer):

parser/Parser.h:

(JSC::Parser::getTokenName):
(JSC::Parser::updateErrorMessageSpecialCase):
(JSC::::parse):

parser/ParserError.h:

(ParserError):
(JSC::ParserError::ParserError):

parser/ParserTokens.h:
runtime/Completion.cpp:

(JSC):
(JSC::checkSyntax):

runtime/Completion.h:

(JSC):

LayoutTests:

Update test results to cover improved error messages.

fast/js/kde/parse-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T1-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T2-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T3-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T4-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T5-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T1-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T2-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T3-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T4-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T1-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T10-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T2-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T3-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T4-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T5-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T6-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T7-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T8-expected.txt:
sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T9-expected.txt:
sputnik/Conformance/13_Function_Definition/S13_A7_T3-expected.txt:

Location:

trunk

Files:

32 edited

LayoutTests/ChangeLog (modified) (1 diff)
LayoutTests/fast/js/kde/parse-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T1-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T2-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T3-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T4-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T5-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T1-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T2-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T3-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T4-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T1-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T10-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T2-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T3-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T4-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T5-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T6-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T7-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T8-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T9-expected.txt (modified) (1 diff)
LayoutTests/sputnik/Conformance/13_Function_Definition/S13_A7_T3-expected.txt (modified) (1 diff)
Source/JavaScriptCore/ChangeLog (modified) (1 diff)
Source/JavaScriptCore/bytecompiler/BytecodeGenerator.cpp (modified) (1 diff)
Source/JavaScriptCore/jsc.cpp (modified) (4 diffs)
Source/JavaScriptCore/parser/Lexer.cpp (modified) (21 diffs)
Source/JavaScriptCore/parser/Lexer.h (modified) (2 diffs)
Source/JavaScriptCore/parser/Parser.h (modified) (3 diffs)
Source/JavaScriptCore/parser/ParserError.h (modified) (4 diffs)
Source/JavaScriptCore/parser/ParserTokens.h (modified) (3 diffs)
Source/JavaScriptCore/runtime/Completion.cpp (modified) (1 diff)
Source/JavaScriptCore/runtime/Completion.h (modified) (1 diff)

trunk/LayoutTests/ChangeLog

-                      r148847
+                      r148849
+2013-04-21  Oliver Hunt  <oliver@apple.com>
+        JS Lexer and Parser should be more informative when they encounter errors
+        https://bugs.webkit.org/show_bug.cgi?id=114924
+        Reviewed by Filip Pizlo.
+        Update test results to cover improved error messages.
+        * fast/js/kde/parse-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T1-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T2-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T3-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T4-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T5-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T1-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T2-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T3-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T4-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T1-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T10-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T2-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T3-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T4-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T5-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T6-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T7-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T8-expected.txt:
+        * sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T9-expected.txt:
+        * sputnik/Conformance/13_Function_Definition/S13_A7_T3-expected.txt:
 2013-04-21  Christophe Dumez  <ch.dumez@sisa.samsung.com>

trunk/LayoutTests/fast/js/kde/parse-expected.txt

-                      r90535
+                      r148849
 PASS var f\u0030 = 103; f0 is 103
 PASS var \u00E9\u0100\u02AF\u0388\u18A8 = 104; \u00E9\u0100\u02AF\u0388\u18A8; is 104
-PASS var f\u00F7; threw exception SyntaxError: Unrecognized token 'f\u00F7'.
-PASS var \u0030; threw exception SyntaxError: Unrecognized token '\u0030'.
-PASS var test = { }; test.i= 0; test.i\u002b= 1; test.i; threw exception SyntaxError: Unrecognized token 'i\u002b'.
+PASS var f\u00F7; threw exception SyntaxError: Invalid unicode escape in identifier: 'f\u00F7'.
+PASS var \u0030; threw exception SyntaxError: Invalid unicode escape in identifier: '\u0030'.
+PASS var test = { }; test.i= 0; test.i\u002b= 1; test.i; threw exception SyntaxError: Invalid unicode escape in identifier: 'i\u002b'.
 PASS var test = { }; test.i= 0; test.i+= 1; test.i; is 1
 PASS successfullyParsed is true

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T1-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u0009'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u0009'
2	2	S7.2_A5_T1
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T2-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u000B'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u000B'
2	2	S7.2_A5_T2
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T3-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u000C'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u000C'
2	2	S7.2_A5_T3
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T4-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u0020'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u0020'
2	2	S7.2_A5_T4
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.2_White_Space/S7.2_A5_T5-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u00A0'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u00A0'
2	2	S7.2_A5_T5
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T1-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u000A'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u000A'
2	2	S7.3_A6_T1
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T2-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u000D'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u000D'
2	2	S7.3_A6_T2
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T3-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u2028'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u2028'
2	2	S7.3_A6_T3
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.3_Line_Terminators/S7.3_A6_T4-expected.txt

r90535	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u2029'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u2029'
2	2	S7.3_A6_T4
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T1-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u007B'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u007B'
2	2	S7.7_A2_T1
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T10-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u002F'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u002F'
2	2	S7.7_A2_T10
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T2-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u0028'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u0028'
2	2	S7.7_A2_T2
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T3-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u005B'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u005B'
2	2	S7.7_A2_T3
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T4-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u003B'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u003B'
2	2	S7.7_A2_T4
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T5-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 77: SyntaxError: ~~Unrecognized token~~ '\u002E'
	1	CONSOLE MESSAGE: line 77: SyntaxError: Invalid unicode escape in identifier: '\u002E'
2	2	S7.7_A2_T5
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T6-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u002C'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u002C'
2	2	S7.7_A2_T6
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T7-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u002B'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u002B'
2	2	S7.7_A2_T7
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T8-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u002D'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u002D'
2	2	S7.7_A2_T8
3	3

trunk/LayoutTests/sputnik/Conformance/07_Lexical_Conventions/7.7_Punctuators/S7.7_A2_T9-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\u002A'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid unicode escape in identifier: '\u002A'
2	2	S7.7_A2_T9
3	3

trunk/LayoutTests/sputnik/Conformance/13_Function_Definition/S13_A7_T3-expected.txt

r89257	r148849
1		CONSOLE MESSAGE: line 76: SyntaxError: ~~Unrecognized token~~ '\'
	1	CONSOLE MESSAGE: line 76: SyntaxError: Invalid escape in identifier: '\'
2	2	S13_A7_T3
3	3

trunk/Source/JavaScriptCore/ChangeLog

-                      r148820
+                      r148849
+2013-04-21  Oliver Hunt  <oliver@apple.com>
+        JS Lexer and Parser should be more informative when they encounter errors
+        https://bugs.webkit.org/show_bug.cgi?id=114924
+        Reviewed by Filip Pizlo.
+        Add new tokens to represent the various ways that parsing and lexing have failed.
+        This gives us the ability to produce better error messages in some cases,
+        and to indicate whether or not the failure was due to invalid source, or simply
+        early termination.
+        The jsc prompt now makes use of this so that you can write functions that
+        are more than one line long.
+        * bytecompiler/BytecodeGenerator.cpp:
+        (JSC::BytecodeGenerator::generate):
+        * jsc.cpp:
+        (stringFromUTF):
+        (jscSource):
+        (runInteractive):
+        * parser/Lexer.cpp:
+        (JSC::::parseFourDigitUnicodeHex):
+        (JSC::::parseIdentifierSlowCase):
+        (JSC::::parseString):
+        (JSC::::parseStringSlowCase):
+        (JSC::::lex):
+        * parser/Lexer.h:
+        (UnicodeHexValue):
+        (JSC::Lexer::UnicodeHexValue::UnicodeHexValue):
+        (JSC::Lexer::UnicodeHexValue::valueType):
+        (JSC::Lexer::UnicodeHexValue::isValid):
+        (JSC::Lexer::UnicodeHexValue::value):
+        (Lexer):
+        * parser/Parser.h:
+        (JSC::Parser::getTokenName):
+        (JSC::Parser::updateErrorMessageSpecialCase):
+        (JSC::::parse):
+        * parser/ParserError.h:
+        (ParserError):
+        (JSC::ParserError::ParserError):
+        * parser/ParserTokens.h:
+        * runtime/Completion.cpp:
+        (JSC):
+        (JSC::checkSyntax):
+        * runtime/Completion.h:
+        (JSC):
 2013-04-21  Mark Lam  <mark.lam@apple.com>

trunk/Source/JavaScriptCore/bytecompiler/BytecodeGenerator.cpp

-                      r148696
+                      r148849
     if (m_expressionTooDeep)
-        return ParserError::OutOfMemory;
-    return ParserError::ErrorNone;
+        return ParserError(ParserError::OutOfMemory);
+    return ParserError(ParserError::ErrorNone);
 }

trunk/Source/JavaScriptCore/jsc.cpp

-                      r148696
+                      r148849
 };
-static const char interactivePrompt[] = "> ";
+static const char interactivePrompt[] = ">>> ";
 class StopWatch {
 …
 }
-static inline SourceCode jscSource(const char* utf8, const String& filename)
+static inline String stringFromUTF(const char* utf8)
 {
     // Find the the first non-ascii character, or nul.
 …
         pos++;
     size_t asciiLength = pos - utf8;
     // Fast case - string is all ascii.
     if (!*pos)
-        return makeSource(String(utf8, asciiLength), filename);
+        return String(utf8, asciiLength);
     // Slow case - contains non-ascii characters, use fromUTF8WithLatin1Fallback.
     ASSERT(*pos < 0);
     ASSERT(strlen(utf8) == asciiLength + strlen(pos));
-    String source = String::fromUTF8WithLatin1Fallback(utf8, asciiLength + strlen(pos));
-    return makeSource(source.impl(), filename);
+    return String::fromUTF8WithLatin1Fallback(utf8, asciiLength + strlen(pos));
 }
+static inline SourceCode jscSource(const char* utf8, const String& filename)
+{
+    String str = stringFromUTF(utf8);
+    return makeSource(str, filename);
+}
 …
 {
     String interpreterName("Interpreter");
-    while (true) {
+    bool shouldQuit = false;
+    while (!shouldQuit) {
 #if HAVE(READLINE) && !RUNNING_FROM_XCODE
-        char* line = readline(interactivePrompt);
-        if (!line)
-            break;
-        if (line[0])
-            add_history(line);
+        ParserError error;
+        String source;
+        do {
+            error = ParserError();
+            char* line = readline(source.isEmpty() ? interactivePrompt : "... ");
+            source = source + line;
+            source = source + '\n';
+            checkSyntax(globalObject->globalExec(), makeSource(source, interpreterName), error);
+            shouldQuit = !line;
+            if (!line || !line[0])
+                break;
+            if (line[0])
+                add_history(line);
+        } while (error.m_syntaxErrorType == ParserError::SyntaxErrorRecoverable);
+        if (error.m_type != ParserError::ErrorNone) {
+            printf("%s:%d\n", error.m_message.utf8().data(), error.m_line);
+            continue;
+        }
         JSValue evaluationException;
-        JSValue returnValue = evaluate(globalObject->globalExec(), jscSource(line, interpreterName), JSValue(), &evaluationException);
-        free(line);
+        JSValue returnValue = evaluate(globalObject->globalExec(), makeSource(source, interpreterName), JSValue(), &evaluationException);
 #else
         printf("%s", interactivePrompt);

trunk/Source/JavaScriptCore/parser/Lexer.cpp

-                      r148696
+                      r148849
 template <typename T>
-int Lexer<T>::parseFourDigitUnicodeHex()
+typename Lexer<T>::UnicodeHexValue Lexer<T>::parseFourDigitUnicodeHex()
 {
     T char1 = peek(1);
 …
     if (UNLIKELY(!isASCIIHexDigit(m_current) || !isASCIIHexDigit(char1) || !isASCIIHexDigit(char2) || !isASCIIHexDigit(char3)))
-        return -1;
+        return UnicodeHexValue((m_code + 4) >= m_codeEnd ? UnicodeHexValue::IncompleteHex : UnicodeHexValue::InvalidHex);
     int result = convertUnicode(m_current, char1, char2, char3);
 …
     shift();
     shift();
-    return result;
+    return UnicodeHexValue(result);
 }
 …
         shift();
         if (UNLIKELY(m_current != 'u'))
-            return ERRORTOK;
-        shift();
-        int character = parseFourDigitUnicodeHex();
-        if (UNLIKELY(character == -1))
-            return ERRORTOK;
-        UChar ucharacter = static_cast<UChar>(character);
+            return atEnd() ? UNTERMINATED_IDENTIFIER_ESCAPE_ERRORTOK : INVALID_IDENTIFIER_ESCAPE_ERRORTOK;
+        shift();
+        UnicodeHexValue character = parseFourDigitUnicodeHex();
+        if (UNLIKELY(!character.isValid()))
+            return character.valueType() == UnicodeHexValue::IncompleteHex ? UNTERMINATED_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK : INVALID_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK;
+        UChar ucharacter = static_cast<UChar>(character.value());
         if (UNLIKELY(m_buffer16.size() ? !isIdentPart(ucharacter) : !isIdentStart(ucharacter)))
-            return ERRORTOK;
+            return INVALID_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK;
         if (shouldCreateIdentifier)
             record16(ucharacter);
 …
 template <typename T>
-template <bool shouldBuildStrings> ALWAYS_INLINE bool Lexer<T>::parseString(JSTokenData* tokenData, bool strictMode)
+template <bool shouldBuildStrings> ALWAYS_INLINE typename Lexer<T>::StringParseResult Lexer<T>::parseString(JSTokenData* tokenData, bool strictMode)
 {
     int startingOffset = currentOffset();
 …
                 if (!isASCIIHexDigit(m_current) || !isASCIIHexDigit(peek(1))) {
                     m_lexErrorMessage = "\\x can only be followed by a hex character sequence";
-                    return false;
+                    return (atEnd() || (isASCIIHexDigit(m_current) && (m_code + 1 == m_codeEnd))) ? StringUnterminated : StringCannotBeParsed;
                 }
                 T prev = m_current;
 …
         tokenData->ident = 0;
-    return true;
-}
-template <typename T>
-template <bool shouldBuildStrings> bool Lexer<T>::parseStringSlowCase(JSTokenData* tokenData, bool strictMode)
+    return StringParsedSuccessfully;
+}
+template <typename T>
+template <bool shouldBuildStrings> typename Lexer<T>::StringParseResult Lexer<T>::parseStringSlowCase(JSTokenData* tokenData, bool strictMode)
 {
     T stringQuoteCharacter = m_current;
 …
                 if (!isASCIIHexDigit(m_current) || !isASCIIHexDigit(peek(1))) {
                     m_lexErrorMessage = "\\x can only be followed by a hex character sequence";
-                    return false;
+                    return StringCannotBeParsed;
                 }
                 T prev = m_current;
 …
             } else if (m_current == 'u') {
                 shift();
-                int character = parseFourDigitUnicodeHex();
-                if (character != -1) {
+                UnicodeHexValue character = parseFourDigitUnicodeHex();
+                if (character.isValid()) {
                     if (shouldBuildStrings)
-                        record16(character);
+                        record16(character.value());
                 } else if (m_current == stringQuoteCharacter) {
                     if (shouldBuildStrings)
 …
                 } else {
                     m_lexErrorMessage = "\\u can only be followed by a Unicode character sequence";
-                    return false;
+                    return character.valueType() == UnicodeHexValue::IncompleteHex ? StringUnterminated : StringCannotBeParsed;
                 }
             } else if (strictMode && isASCIIDigit(m_current)) {
 …
                 if (character1 != '0' || isASCIIDigit(m_current)) {
                     m_lexErrorMessage = "The only valid numeric escape in strict mode is '\\0'";
-                    return false;
+                    return StringCannotBeParsed;
                 }
                 if (shouldBuildStrings)
 …
             } else {
                 m_lexErrorMessage = "Unterminated string constant";
-                return false;
+                return StringUnterminated;
             }
 …
             if (atEnd() || isLineTerminator(m_current)) {
                 m_lexErrorMessage = "Unexpected EOF";
-                return false;
+                return atEnd() ? StringUnterminated : StringCannotBeParsed;
             }
             // Anything else is just a normal character
 …
     m_buffer16.resize(0);
-    return true;
+    return StringParsedSuccessfully;
 }
 …
                 goto start;
             m_lexErrorMessage = "Multiline comment was not closed properly";
+            token = UNTERMINATED_MULTILINE_COMMENT_ERRORTOK;
             goto returnError;
         }
 …
                     if (strictMode) {
                         m_lexErrorMessage = "Octal escapes are forbidden in strict mode";
+                        token = INVALID_OCTAL_NUMBER_ERRORTOK;
                         goto returnError;
                     }
 …
                     if (!parseNumberAfterExponentIndicator()) {
                         m_lexErrorMessage = "Non-number found after exponent indicator";
+                        token = atEnd() ? UNTERMINATED_NUMERIC_LITERAL_ERRORTOK : INVALID_NUMERIC_LITERAL_ERRORTOK;
                         goto returnError;
                     }
 …
         if (UNLIKELY(isIdentStart(m_current))) {
             m_lexErrorMessage = "At least one digit must occur after a decimal point";
+            token = atEnd() ? UNTERMINATED_NUMERIC_LITERAL_ERRORTOK : INVALID_NUMERIC_LITERAL_ERRORTOK;
             goto returnError;
         }
 …
     case CharacterQuote:
         if (lexerFlags & LexerFlagsDontBuildStrings) {
-            if (UNLIKELY(!parseString<false>(tokenData, strictMode)))
+            StringParseResult result = parseString<false>(tokenData, strictMode);
+            if (UNLIKELY(result != StringParsedSuccessfully)) {
+                token = result == StringUnterminated ? UNTERMINATED_STRING_LITERAL_ERRORTOK : INVALID_STRING_LITERAL_ERRORTOK;
                 goto returnError;
+            }
         } else {
-            if (UNLIKELY(!parseString<true>(tokenData, strictMode)))
+            StringParseResult result = parseString<true>(tokenData, strictMode);
+            if (UNLIKELY(result != StringParsedSuccessfully)) {
+                token = result == StringUnterminated ? UNTERMINATED_STRING_LITERAL_ERRORTOK : INVALID_STRING_LITERAL_ERRORTOK;
                 goto returnError;
+            }
         }
         shift();
 …
     case CharacterInvalid:
         m_lexErrorMessage = invalidCharacterMessage();
+        token = ERRORTOK;
         goto returnError;
     default:
         RELEASE_ASSERT_NOT_REACHED();
         m_lexErrorMessage = "Internal Error";
+        token = ERRORTOK;
         goto returnError;
     }
 …
     tokenLocation->line = m_lineNumber;
     tokenLocation->endOffset = currentOffset();
-    return ERRORTOK;
+    RELEASE_ASSERT(token & ErrorTokenFlag);
+    return token;
 }
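
The string-parsing changes above replace a boolean result with a three-way one, so that "input ended mid-literal" can be told apart from "the literal is malformed". A minimal standalone sketch of that distinction follows. It borrows the `StringParseResult` names from the diff, but `scanStringLiteral` is a hypothetical helper that skips escapes instead of decoding them, and it is not the Lexer code.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

enum StringParseResult { StringParsedSuccessfully, StringUnterminated, StringCannotBeParsed };

// Hypothetical scanner: text[0] must be the opening quote character.
StringParseResult scanStringLiteral(const std::string& text)
{
    assert(!text.empty() && (text[0] == '"' || text[0] == '\''));
    const char quote = text[0];
    for (std::size_t i = 1; i < text.size(); ++i) {
        const char c = text[i];
        if (c == quote)
            return StringParsedSuccessfully;
        if (c == '\n' || c == '\r')
            return StringCannotBeParsed; // a raw line break cannot be repaired by more input
        if (c == '\\') {
            if (i + 1 >= text.size())
                return StringUnterminated; // input ended in the middle of an escape
            ++i; // skip the escaped character without decoding it
        }
    }
    return StringUnterminated; // ran out of input before the closing quote
}
```

Only the `StringUnterminated` outcome is a candidate for prompting the user for another line; the other failure is reported immediately.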

trunk/Source/JavaScriptCore/parser/Lexer.h

-                      r148696
+                      r148849
     ALWAYS_INLINE bool atEnd() const;
     ALWAYS_INLINE T peek(int offset) const;
-    int parseFourDigitUnicodeHex();
+    struct UnicodeHexValue {
+        enum ValueType { ValidHex, IncompleteHex, InvalidHex };
+        explicit UnicodeHexValue(int value)
+            : m_value(value)
+        {
+        }
+        explicit UnicodeHexValue(ValueType type)
+            : m_value(type == IncompleteHex ? -2 : -1)
+        {
+        }
+        ValueType valueType() const
+        {
+            if (m_value >= 0)
+                return ValidHex;
+            return m_value == -2 ? IncompleteHex : InvalidHex;
+        }
+        bool isValid() const { return m_value >= 0; }
+        int value() const
+        {
+            ASSERT(m_value >= 0);
+            return m_value;
+        }
+    private:
+        int m_value;
+    };
+    UnicodeHexValue parseFourDigitUnicodeHex();
     void shiftLineTerminator();
 …
     template <bool shouldBuildIdentifiers> ALWAYS_INLINE JSTokenType parseIdentifier(JSTokenData*, unsigned lexerFlags, bool strictMode);
     template <bool shouldBuildIdentifiers> NEVER_INLINE JSTokenType parseIdentifierSlowCase(JSTokenData*, unsigned lexerFlags, bool strictMode);
-    template <bool shouldBuildStrings> ALWAYS_INLINE bool parseString(JSTokenData*, bool strictMode);
-    template <bool shouldBuildStrings> NEVER_INLINE bool parseStringSlowCase(JSTokenData*, bool strictMode);
+    enum StringParseResult {
+        StringParsedSuccessfully,
+        StringUnterminated,
+        StringCannotBeParsed
+    };
+    template <bool shouldBuildStrings> ALWAYS_INLINE StringParseResult parseString(JSTokenData*, bool strictMode);
+    template <bool shouldBuildStrings> NEVER_INLINE StringParseResult parseStringSlowCase(JSTokenData*, bool strictMode);
     ALWAYS_INLINE void parseHex(double& returnValue);
     ALWAYS_INLINE bool parseOctal(double& returnValue);
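
As a rough illustration of how a result type like `UnicodeHexValue` can be consumed, here is a standalone sketch. `HexValue` re-creates the sentinel encoding shown in the hunk above (-1 for invalid, -2 for incomplete). `parseFourHexDigits` is a hypothetical helper working on a `std::string`, and its end-of-input check is simpler than the `m_code + 4 >= m_codeEnd` test in Lexer.cpp.

```cpp
#include <cassert>
#include <cctype>
#include <cstddef>
#include <string>

// Standalone re-creation of the tri-state value from the diff.
class HexValue {
public:
    enum ValueType { ValidHex, IncompleteHex, InvalidHex };
    explicit HexValue(int value) : m_value(value) {}
    explicit HexValue(ValueType type) : m_value(type == IncompleteHex ? -2 : -1) {}
    ValueType valueType() const
    {
        if (m_value >= 0)
            return ValidHex;
        return m_value == -2 ? IncompleteHex : InvalidHex;
    }
    bool isValid() const { return m_value >= 0; }
    int value() const
    {
        assert(m_value >= 0);
        return m_value;
    }
private:
    int m_value;
};

// Hypothetical helper: reads four hex digits of `text` starting at `pos`.
HexValue parseFourHexDigits(const std::string& text, std::size_t pos)
{
    int result = 0;
    for (std::size_t i = 0; i < 4; ++i) {
        if (pos + i >= text.size())
            return HexValue(HexValue::IncompleteHex); // input ended early
        const unsigned char c = static_cast<unsigned char>(text[pos + i]);
        if (!std::isxdigit(c))
            return HexValue(HexValue::InvalidHex); // a non-hex character was found
        const int digit = std::isdigit(c) ? c - '0' : std::tolower(c) - 'a' + 10;
        result = result * 16 + digit;
    }
    return HexValue(result);
}
```

Packing the failure kind into negative sentinels keeps the result the size of one `int`, while `valueType()` lets callers choose between an "unterminated" and an "invalid" error token.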

trunk/Source/JavaScriptCore/parser/Parser.h

-                      r148696
+                      r148849
         case NUMBER:
         case IDENT:
-        case STRING:
+        case STRING:
+        case UNTERMINATED_IDENTIFIER_ESCAPE_ERRORTOK:
+        case UNTERMINATED_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK:
+        case UNTERMINATED_MULTILINE_COMMENT_ERRORTOK:
+        case UNTERMINATED_NUMERIC_LITERAL_ERRORTOK:
+        case UNTERMINATED_STRING_LITERAL_ERRORTOK:
+        case INVALID_IDENTIFIER_ESCAPE_ERRORTOK:
+        case INVALID_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK:
+        case INVALID_NUMERIC_LITERAL_ERRORTOK:
+        case INVALID_OCTAL_NUMBER_ERRORTOK:
+        case INVALID_STRING_LITERAL_ERRORTOK:
         case ERRORTOK:
-        case EOFTOK:
+        case EOFTOK:
             return 0;
         case LastUntaggedToken:
 …
             m_errorMessage = "Unexpected string " + getToken();
             return;
-        case ERRORTOK:
+        case UNTERMINATED_IDENTIFIER_ESCAPE_ERRORTOK:
+        case UNTERMINATED_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK:
+            m_errorMessage = "Incomplete unicode escape in identifier: '" + getToken() + '\'';
+            return;
+        case UNTERMINATED_MULTILINE_COMMENT_ERRORTOK:
+            m_errorMessage = "Unterminated multiline comment";
+            return;
+        case UNTERMINATED_NUMERIC_LITERAL_ERRORTOK:
+            m_errorMessage = "Unterminated numeric literal '" + getToken() + '\'';
+            return;
+        case UNTERMINATED_STRING_LITERAL_ERRORTOK:
+            m_errorMessage = "Unterminated string literal '" + getToken() + '\'';
+            return;
+        case INVALID_IDENTIFIER_ESCAPE_ERRORTOK:
+            m_errorMessage = "Invalid escape in identifier: '" + getToken() + '\'';
+            return;
+        case INVALID_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK:
+            m_errorMessage = "Invalid unicode escape in identifier: '" + getToken() + '\'';
+            return;
+        case INVALID_NUMERIC_LITERAL_ERRORTOK:
+            m_errorMessage = "Invalid numeric literal: '" + getToken() + '\'';
+            return;
+        case INVALID_OCTAL_NUMBER_ERRORTOK:
+            m_errorMessage = "Invalid use of octal: '" + getToken() + '\'';
+            return;
+        case INVALID_STRING_LITERAL_ERRORTOK:
+            m_errorMessage = "Invalid string literal: '" + getToken() + '\'';
+            return;
+        case ERRORTOK:
             m_errorMessage = "Unrecognized token '" + getToken() + '\'';
             return;
 …
         // likely, and we are currently unable to distinguish between the two cases.
         if (isFunctionBodyNode(static_cast<ParsedNode*>(0)) || m_hasStackOverflow)
-            error = ParserError::StackOverflow;
-        else if (isEvalNode<ParsedNode>())
-            error = ParserError(ParserError::EvalError, errMsg, errLine);
-        else
-            error = ParserError(ParserError::SyntaxError, errMsg, errLine);
+            error = ParserError(ParserError::StackOverflow, ParserError::SyntaxErrorNone, m_token);
+        else {
+            ParserError::SyntaxErrorType errorType = ParserError::SyntaxErrorIrrecoverable;
+            if (m_token.m_type == EOFTOK)
+                errorType = ParserError::SyntaxErrorRecoverable;
+            else if (m_token.m_type & UnterminatedErrorTokenFlag)
+                errorType = ParserError::SyntaxErrorUnterminatedLiteral;
+            if (isEvalNode<ParsedNode>())
+                error = ParserError(ParserError::EvalError, errorType, m_token, errMsg, errLine);
+            else
+                error = ParserError(ParserError::SyntaxError, errorType, m_token, errMsg, errLine);
+        }
+    }
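
The new branch in parse() picks a SyntaxErrorType from the token that stopped the parser. The following is a minimal standalone sketch of that decision order, not the JavaScriptCore code itself: the constants kEOFToken and kUnterminatedFlag and the functions classify() and demo() are hypothetical stand-ins with assumed values.

```cpp
#include <cassert>

// Mirrors the SyntaxErrorType enum added to ParserError.h in this changeset.
enum SyntaxErrorType {
    SyntaxErrorNone,
    SyntaxErrorIrrecoverable,
    SyntaxErrorUnterminatedLiteral,
    SyntaxErrorRecoverable
};

// Assumed values for illustration only; the real ones live in ParserTokens.h.
const unsigned kEOFToken = 1;
const unsigned kUnterminatedFlag = 1u << 20;

// Same decision order as the new branch in parse(): end of input first,
// then the unterminated flag, otherwise irrecoverable.
SyntaxErrorType classify(unsigned tokenType)
{
    if (tokenType == kEOFToken)
        return SyntaxErrorRecoverable;
    if (tokenType & kUnterminatedFlag)
        return SyntaxErrorUnterminatedLiteral;
    return SyntaxErrorIrrecoverable;
}

// Small usage example exercising each outcome once.
void demo()
{
    assert(classify(kEOFToken) == SyntaxErrorRecoverable);
    assert(classify(8 | kUnterminatedFlag) == SyntaxErrorUnterminatedLiteral);
    assert(classify(9) == SyntaxErrorIrrecoverable);
}
```

Checking for end of input before the flag test means a source that simply stops early is reported as recoverable, which is the case an interactive prompt cares about.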

trunk/Source/JavaScriptCore/parser/ParserError.h

-                      r143147
+                      r148849
 #include "Error.h"
 #include "ExceptionHelpers.h"
+#include "ParserTokens.h"
 #include <wtf/text/WTFString.h>
 …
 struct ParserError {
-    enum ErrorType { ErrorNone, StackOverflow, SyntaxError, EvalError, OutOfMemory } m_type;
+    enum SyntaxErrorType {
+        SyntaxErrorNone,
+        SyntaxErrorIrrecoverable,
+        SyntaxErrorUnterminatedLiteral,
+        SyntaxErrorRecoverable
+    };
+    enum ErrorType {
+        ErrorNone,
+        StackOverflow,
+        EvalError,
+        OutOfMemory,
+        SyntaxError
+    };
+    ErrorType m_type;
+    SyntaxErrorType m_syntaxErrorType;
+    JSToken m_token;
     String m_message;
     int m_line;
 …
+    {
+    }
-    ParserError(ErrorType type)
+    explicit ParserError(ErrorType type)
         : m_type(type)
+        , m_syntaxErrorType(SyntaxErrorNone)
         , m_line(-1)
+    {
+    }
-    ParserError(ErrorType type, String msg, int line)
+    ParserError(ErrorType type, SyntaxErrorType syntaxError, JSToken token)
         : m_type(type)
+        , m_syntaxErrorType(syntaxError)
+        , m_token(token)
+        , m_line(-1)
+    {
+    }
+    ParserError(ErrorType type, SyntaxErrorType syntaxError, JSToken token, String msg, int line)
+        : m_type(type)
+        , m_syntaxErrorType(syntaxError)
+        , m_token(token)
         , m_message(msg)
         , m_line(line)
 …
         return createOutOfMemoryError(globalObject); // Appease Qt bot
+    }
+#undef GET_ERROR_CODE
 };

trunk/Source/JavaScriptCore/parser/ParserTokens.h

-                      r146318
+                      r148849
     BinaryOpTokenAllowsInPrecedenceAdditionalShift = 4,
     BinaryOpTokenPrecedenceMask = 15 << BinaryOpTokenPrecedenceShift,
+    ErrorTokenFlag = 1 << (BinaryOpTokenAllowsInPrecedenceAdditionalShift + BinaryOpTokenPrecedenceShift + 7),
+    UnterminatedErrorTokenFlag = ErrorTokenFlag << 1
 };
 …
     COLON,
     DOT,
-    ERRORTOK,
     EOFTOK,
     EQUAL,
 …
     TIMES = 20 | BINARY_OP_PRECEDENCE(10),
     DIVIDE = 21 | BINARY_OP_PRECEDENCE(10),
-    MOD = 22 | BINARY_OP_PRECEDENCE(10)
+    MOD = 22 | BINARY_OP_PRECEDENCE(10),
+    ERRORTOK = 0 | ErrorTokenFlag,
+    UNTERMINATED_IDENTIFIER_ESCAPE_ERRORTOK = 0 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
+    INVALID_IDENTIFIER_ESCAPE_ERRORTOK = 1 | ErrorTokenFlag,
+    UNTERMINATED_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK = 2 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
+    INVALID_IDENTIFIER_UNICODE_ESCAPE_ERRORTOK = 3 | ErrorTokenFlag,
+    UNTERMINATED_MULTILINE_COMMENT_ERRORTOK = 4 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
+    UNTERMINATED_NUMERIC_LITERAL_ERRORTOK = 5 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
+    INVALID_OCTAL_NUMBER_ERRORTOK = 6 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
+    INVALID_NUMERIC_LITERAL_ERRORTOK = 7 | ErrorTokenFlag,
+    UNTERMINATED_STRING_LITERAL_ERRORTOK = 8 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
+    INVALID_STRING_LITERAL_ERRORTOK = 9 | ErrorTokenFlag,
 };
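
The error tokens above pack a small index together with one or two flag bits, so a single mask test can tell whether a token is an error and whether it is the unterminated kind. A rough sketch of that encoding follows. It assumes PrecedenceShift is 8 (the diff does not show the value of BinaryOpTokenPrecedenceShift), uses a placeholder value for EOFTOK, and the helpers isErrorToken(), isUnterminated() and demo() are hypothetical names.

```cpp
#include <cassert>

enum : unsigned {
    PrecedenceShift = 8, // assumed value of BinaryOpTokenPrecedenceShift
    AdditionalShift = 4, // BinaryOpTokenAllowsInPrecedenceAdditionalShift
    ErrorTokenFlag = 1u << (AdditionalShift + PrecedenceShift + 7),
    UnterminatedErrorTokenFlag = ErrorTokenFlag << 1
};

// A subset of the token values from the diff; EOFTOK is a placeholder.
enum TokenType : unsigned {
    EOFTOK = 1,
    ERRORTOK = 0 | ErrorTokenFlag,
    UNTERMINATED_STRING_LITERAL_ERRORTOK = 8 | ErrorTokenFlag | UnterminatedErrorTokenFlag,
    INVALID_STRING_LITERAL_ERRORTOK = 9 | ErrorTokenFlag
};

// True for every token carrying the error bit.
bool isErrorToken(unsigned type) { return (type & ErrorTokenFlag) != 0; }

// True only for error tokens that also carry the unterminated bit.
bool isUnterminated(unsigned type) { return (type & UnterminatedErrorTokenFlag) != 0; }

// Small usage example for the two predicates.
void demo()
{
    assert(isErrorToken(ERRORTOK) && !isUnterminated(ERRORTOK));
    assert(isUnterminated(UNTERMINATED_STRING_LITERAL_ERRORTOK));
    assert(isErrorToken(INVALID_STRING_LITERAL_ERRORTOK));
    assert(!isErrorToken(EOFTOK));
}
```

Because the flags sit above the low index bits, ERRORTOK and UNTERMINATED_IDENTIFIER_ESCAPE_ERRORTOK can both use index 0 and still have distinct values.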

trunk/Source/JavaScriptCore/runtime/Completion.cpp

-                      r148696
+                      r148849
     return true;
+}
+bool checkSyntax(ExecState* exec, const SourceCode& source, ParserError& error)
+{
+    JSLockHolder lock(exec);
+    RELEASE_ASSERT(exec->vm().identifierTable == wtfThreadData().currentIdentifierTable());
+    VM* vm = &exec->vm();
+    RefPtr<ProgramNode> programNode = parse<ProgramNode>(vm, source, 0, Identifier(), JSParseNormal, JSParseProgramCode, error);
+    return programNode;
+}
 JSValue evaluate(ExecState* exec, const SourceCode& source, JSValue thisValue, JSValue* returnedException)

trunk/Source/JavaScriptCore/runtime/Completion.h

-                      r140718
+                      r148849
 namespace JSC {
+    struct ParserError;
     class ExecState;
     class JSScope;
     class SourceCode;
+    JS_EXPORT_PRIVATE bool checkSyntax(ExecState*, const SourceCode&, ParserError&);
     JS_EXPORT_PRIVATE bool checkSyntax(ExecState*, const SourceCode&, JSValue* exception = 0);
     JS_EXPORT_PRIVATE JSValue evaluate(ExecState*, const SourceCode&, JSValue thisValue = JSValue(), JSValue* exception = 0);
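
The commit message says the jsc prompt uses the new error information to accept functions that span several lines. One way such a prompt could work is sketched below. This is not the jsc.cpp implementation: checkSyntaxStub() is a hypothetical stand-in that only counts braces instead of calling the new checkSyntax overload, and SyntaxStatus, collectStatements() and demo() are likewise invented for the example.

```cpp
#include <cassert>
#include <string>
#include <vector>

enum class SyntaxStatus { Ok, Recoverable, Irrecoverable };

// Stand-in for a real syntax check. It tracks brace depth only:
// unclosed braces mean "incomplete", a stray '}' means "broken".
SyntaxStatus checkSyntaxStub(const std::string& source)
{
    int depth = 0;
    for (char c : source) {
        if (c == '{')
            ++depth;
        else if (c == '}' && --depth < 0)
            return SyntaxStatus::Irrecoverable;
    }
    return depth == 0 ? SyntaxStatus::Ok : SyntaxStatus::Recoverable;
}

// Feeds lines one at a time, buffering while the input is merely
// incomplete. Returns the chunks that would be handed to evaluation.
std::vector<std::string> collectStatements(const std::vector<std::string>& lines)
{
    std::vector<std::string> ready;
    std::string buffer;
    for (const std::string& line : lines) {
        buffer += line;
        buffer += '\n';
        SyntaxStatus status = checkSyntaxStub(buffer);
        if (status == SyntaxStatus::Recoverable)
            continue; // keep reading, as a prompt would
        if (status == SyntaxStatus::Ok)
            ready.push_back(buffer);
        buffer.clear(); // irrecoverable input is dropped
    }
    return ready;
}

// Small usage example: a three-line function followed by a call.
void demo()
{
    std::vector<std::string> out =
        collectStatements({"function f() {", "return 1;", "}", "f();"});
    assert(out.size() == 2);
    assert(out[1] == "f();\n");
}
```

The key design point is that only the recoverable outcome keeps the buffer alive; any other failure resets the prompt so one bad line cannot poison later input.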
