ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2025-10-16 04:59:23 +00:00

Author	SHA1	Message	Date
Timothy Flynn	62d85dd90a	LibJS: Port RegExp flags and patterns to UTF-16	2025-08-13 09:56:13 -04:00
Timothy Flynn	b955c9b2a9	LibJS: Port the Identifier AST (and related) nodes to UTF-16 This eliminates quite a lot of UTF-8 / UTF-16 churn.	2025-08-13 09:56:13 -04:00
Timothy Flynn	00182a2405	LibJS: Port the JS lexer and parser to UTF-16 This ports the lexer to UTF-16 and deals with the immediate fallout up to the AST. The AST will be dealt with in upcoming commits. The lexer will still accept UTF-8 strings as input, and will transcode them to UTF-16 for lexing. This doesn't actually incur a new allocation, as we were already converting the input StringView to a ByteString for each lexer. One immediate logical benefit here is that we do not need to know off- hand how many UTF-8 bytes some special code points occupy. They all happen to be a single UTF-16 code unit. So instead of advancing the lexer by 3 positions in some cases, we can just always advance by 1.	2025-08-13 09:56:13 -04:00
Timothy Flynn	eb74781a2d	LibJS: Keep the lookahead lexer alive after parsing its next token Currently, the lexer holds a ByteString, which is always heap-allocated. When we create a copy of the lexer for the lookahead token, that token will outlive the lexer copy. The token holds a couple of string views into the lexer's source string. This is fine for now, because the source string will be kept alive by the original lexer. But if the lexer were to hold a String or Utf16String, short strings will be stored on the stack due to SSO. Thus the token will hold views into released stack data. We need to keep the lookahead lexer alive to prevent UAF on views into its source string.	2025-08-13 09:56:13 -04:00
ayeteadoe	2e2484257d	LibJS: Enable EXPLICIT_SYMBOL_EXPORT and annotate minimum symbol set	2025-07-22 11:51:29 -04:00
ayeteadoe	539a675802	LibJS: Revert Enable EXPLICIT_SYMBOL_EXPORT This reverts commit `c14173f651`. We should only annotate the minimum number of symbols that external consumers actually use, so I am starting from scratch to do that	2025-07-22 11:51:29 -04:00
Timothy Flynn	66006d3812	AK+LibJS: Extract some UTF-16 helpers for use in an outside class An upcoming Utf16String will need access to these helpers. Let's make them publicly available.	2025-07-03 09:51:56 -04:00
ayeteadoe	c14173f651	LibJS: Enable EXPLICIT_SYMBOL_EXPORT	2025-06-30 10:50:36 -06:00
Luke Wilde	f12b6b258f	LibJS: Don't use presence of function params to identify function scope Instead, we can just use the scope type to determine if a scope is a function scope. This fixes using `this` for parameter default values in arrow functions crashing. This happened by `uses_this_from_environment` was not set in `set_uses_this`, as it didn't think it was in a function scope whilst parsing parameters. Fixes closing modal dialogs causing a crash on https://www.ikea.com/ No test262 diff. Reverts the functional part of `08cfd5f`, because it was a workaround for this issue.	2025-06-17 20:48:45 +02:00
Viktor Szépe	19f88f96dc	Everywhere: Fix typos - act III	2025-06-16 14:20:48 +01:00
Aliaksandr Kalenik	dcfc515cd0	LibJS: Fix arrow function parsing bug In the following example: ```js const f = (i) => ({ obj: { a: { x: i }, b: { x: i } }, g: () => {}, }); ``` The body of function `f` is initially parsed as an arrow function. As a result, what is actually an object expression is interpreted as a formal parameter with a binding pattern. Since duplicate identifiers are not allowed in this context (`i` in the example), the parser generates an error, causing the entire script to fail parsing. This change ignores the "Duplicate parameter names in bindings" error during arrow function parameter parsing, allowing the parser to continue and recognize the object expression of the outer arrow function with an implicit return. Fixes error on https://chat.openai.com/	2025-05-26 12:44:21 +03:00
Aliaksandr Kalenik	db480b1f0c	LibJS: Preserve information about local variables declaration kind This is required for upcoming change where we want to emit ThrowIfTDZ for assignment expressions only for lexical declarations.	2025-05-06 12:06:23 +02:00
Andreas Kling	bf1b754e91	LibJS: Optimize reading known-to-be-initialized `var` bindings `var` bindings are never in the temporal dead zone (TDZ), and so we know accessing them will not throw. We now take advantage of this by having a specialized environment binding value getter that doesn't check for exceptional cases. 1.08x speedup on JetStream.	2025-05-04 02:31:18 +02:00
Timothy Flynn	3867a192a1	LibJS: Update spec steps / links for the import-assertions proposal This proposal has reached stage 4 and been merged into the main ECMA-262 spec. See: `4e3450e`	2025-04-29 07:33:08 -04:00
Aliaksandr Kalenik	2d732b2251	LibJS: Skip allocating locals for arguments that allowed to be local This allows us to get rid of instructions that move arguments to locals and allocate smaller JS::Value vector in ExecutionContext by reusing slots that were already allocated for arguments. With this change for following function: ```js function f(x, y) { return x + y; } ``` we now produce following bytecode: ``` [ 0] 0: Add dst:reg6, lhs:arg0, rhs:arg1 [ 10] Return value:reg6 ``` instead of: ``` [ 0] 0: GetArgument 0, dst:x~1 [ 10] GetArgument 1, dst:y~0 [ 20] Add dst:reg6, lhs:x~1, rhs:y~0 [ 30] Return value:reg6 ```	2025-04-26 11:02:29 +02:00
Aliaksandr Kalenik	81a3bfd492	LibJS: Allow using locals if `arguments` is used in strict mode Previously we blocked using locals for function arguments whenever `arguments` was mentioned in function body, however, this is not necessary in strict mode, where mutations to the arguments object are not reflected in the function arguments and vice versa.	2025-04-25 21:08:24 +02:00
Aliaksandr Kalenik	7932091e02	LibJS: Allow using local variable for catch parameters Local variables are faster to access and if all catch parameters are locals we can skip lexical environment allocation.	2025-04-22 21:57:25 +02:00
Aliaksandr Kalenik	0f14c70252	LibJS: Use Identifier to represent CatchClause parameter names By doing that we consistently use Identifier node for identifiers and also enable mechanism that registers identifiers in a corresponding ScopePusher for catch parameters, which is necessary for work in the upcoming changes.	2025-04-22 21:57:25 +02:00
Andrew Kaster	c471faee10	LibJS: Launder const in the parser where required with strict RefPtrs These places should be updated to not require this hackery, but pulling on this thread involves touching almost every method in the parser.	2025-04-16 10:41:44 -06:00
Andreas Kling	ef4e7b7945	LibJS: Make JS parser emit accurate `this` insights for constructors This way we don't have to handle it when instantiating the constructor.	2025-04-08 18:52:35 +02:00
devgianlu	08cfd5ff1b	LibJS: Set empty function parameters on ClassStaticInit scope This prevents the variables declared inside a class static initializer to escape to the nearest containing function causing all sorts of memory corruptions.	2025-04-05 18:20:36 +01:00
devgianlu	6aea459e00	LibJS: Wrap `static_init_block_scope` call in its own scope	2025-04-05 18:20:36 +01:00
R-Goc	28d5d982ce	Everywhere: Remove unused private fields This commit removes the -Wno-unusued-private-field flag, thus reenabling the warning. Unused field were either removed or marked [[maybe_unused]] when unsure.	2025-04-04 12:40:07 +02:00
Andreas Kling	6c70dc5f09	LibJS: Create FunctionParameters earlier in the parser This avoids making multiple copies of the Vector<FunctionParameter> in the parser.	2025-03-27 19:50:13 +00:00
Andreas Kling	7477002e46	LibJS: Keep parsed function parameters in a shared data structure Instead of making a copy of the Vector<FunctionParameter> from the AST every time we instantiate an ECMAScriptFunctionObject, we now keep the parameters in a ref-counted FunctionParameters object. This reduces memory usage, and also allows us to cache the bytecode executables for default parameter expressions without recompiling them for every instantiation. :^)	2025-03-27 15:00:43 +00:00
Andreas Kling	46a5710238	LibJS: Use FlyString in PropertyKey instead of DeprecatedFlyString This required dealing with substantial fallout.	2025-03-24 22:27:17 +00:00
Timothy Flynn	b64a355a30	LibJS: Remove support for the "assert" keyword for import attributes This was removed from the spec some time ago. See: `14286bb`	2025-01-21 14:58:32 +01:00
Timothy Flynn	47ba231a9b	LibJS: Do not consume "with" tokens in import statements as identifiers The "with" statement is its own token (TokenType::With), and thus would fail to parse as an identifier. We've already asserted that the token we are parsing is "with" or "assert", so just consume it.	2025-01-21 14:58:32 +01:00
Timothy Flynn	7d420bbd3d	LibJS: Update the noted grammar for ImportDeclaration	2025-01-21 14:58:32 +01:00
Luke Wilde	5f33383a7b	LibJS: Propagate direct eval presence if the current scope is screwed Previously it only deoptimized the parent scope if the current scope contains direct eval, which is incorrect because code ran in direct eval mode has access to the entire scope chain it was executed in. The fix is to also propagate direct eval's presence if the current scope is marked as being screwed by direct eval. This fixes Google's botguard failing to complete on Google sign in, as it tried to access local variables outside of a direct parent function with eval, causing it throw "unhandled" exceptions. Unhandled is in quotes because their bytecode VM _technically_ caught it, but it was considered an unhandled exception. This was determined by removing get optimizations and then adding debug output for every get operation. Using this, I noticed that for these errors, it would access the 'message' and 'stack' properties. This is because their error handler function noticed this was not a synthesised error, which is never expected to happen. That was determined by using Chrome Devtools 'pause on handled exception' feature, and noticing it never threw a '[var] is not defined' exception, but only synthesized error objects which contained a sentinel value to let it know it was synthesized. I added debug output to eval to print out what was being eval'd because it makes heavy use of eval. This revealed that the exceptions only came from eval. I then dumped every generated executable and noticed the variables it was trying to access were generated as local variables in the top scope. This led to checking what makes a variable considered local or not, which then lead to this block of code in ~ScopePusher that propagates eval presence only to the immediate parent scope. This variable directly controls whether to create all variables properly with variable environments and bindings or allow them to be stored as local registers tied to that function's executable. Since this now lets botguard run to completion, it no longer considers us to be an insecure/potential bot browser when signing in, now allowing us to be able to sign in to Google.	2025-01-17 14:36:03 +01:00
Timothy Flynn	ada36e5c0a	LibJS: Allow async functions named "async" as function properties For example, https://locals.com/site/discover has a script with an object of the form: var f = { parser: { sync() {}, async async() {}, } }; We were previously throwing a syntax error on the async function, as we specifically did not allow using "async" as a function name here.	2024-12-26 17:23:10 +01:00
Timothy Flynn	a5455ac121	LibJS: Ensure a function follows an `async` identifier in objects	2024-12-26 17:23:10 +01:00
Luke Wilde	bd4c29322c	LibJS: Allow division after IdentifierNames in optional chain The following syntax is valid: ```js e?.example / 1.2 ``` Previously, the `/` would be treated as a unterminated regex literal, because it was calling the regular `consume` instead of `consume_and_allow_division`. This is what is done when parsing IdentifierNames in parse_secondary_expression when a period is encountered. Allows us to parse clients-main-[hash].js on https://ubereats.com/	2024-11-11 20:19:26 +01:00
Timothy Flynn	93712b24bf	Everywhere: Hoist the Libraries folder to the top-level	2024-11-10 12:50:45 +01:00
Andreas Kling	13d7c09125	Libraries: Move to Userland/Libraries/	2021-01-12 12:17:46 +01:00
Andreas Kling	db790dda62	LibJS: Remove hand-rolled type information in JS AST in favor of RTTI	2021-01-01 19:34:07 +01:00
AnotherTest	8ca0e8325a	LibJS: Don't save rule start positions along with the parser state This fixes #4617. Also fixes the small problem where some save states would be leaked.	2020-12-29 17:39:42 +01:00
AnotherTest	d0363bca01	LibJS: `save_state()' before creating a RulePosition Fixes #4617.	2020-12-29 10:51:33 +01:00
AnotherTest	b34b681811	LibJS: Track source positions all the way down to exceptions This makes exceptions have a trace of source positions too, which could probably be helpful in making fancier error tracebacks.	2020-12-29 00:58:43 +01:00
Stephan Unverwerth	be9c2feff0	LibJS: Fix parsing of numeric object keys Numeric keys were interpreted as their source text, leading to something like {0x10:true} to end up as {"0x10":true} instead of {16:true}	2020-12-27 23:04:09 +01:00
Linus Groh	5eb1f752ab	LibJS: Use new format functions everywhere This changes the remaining uses of the following functions across LibJS: - String::format() => String::formatted() - dbg() => dbgln() - printf() => out(), outln() - fprintf() => warnln() I also removed the relevant 'LogStream& operator<<' overloads as they're not needed anymore.	2020-12-06 18:52:52 +01:00
Linus Groh	3ac7fb9f6c	LibJS: Disallow 'with' statement in strict mode	2020-11-28 20:33:41 +01:00
Andreas Kling	d617120499	LibJS: Parse "with" statements :^)	2020-11-28 17:16:48 +01:00
Linus Groh	39a1c9d827	LibJS: Implement 'new.target' This adds a new MetaProperty AST node which will be used for 'new.target' and 'import.meta' meta properties. The parser now distinguishes between "in function context" and "in arrow function context" (which is required for this). When encountering TokenType::New we will attempt to parse it as meta property and resort to regular new expression parsing if that fails, much like the parsing of labelled statements.	2020-11-02 22:40:59 +01:00
Linus Groh	e07a39c816	LibJS: Replace 'size_t line, size_t column' with 'Optional<Position>' This is a bit nicer for two reasons: - The absence of line number/column information isn't based on 'values are zero' anymore but on Optional's value - When reporting syntax errors with position information other than the current token's position we had to store line and column ourselves, like this: auto foo_start_line = m_parser_state.m_current_token.line_number(); auto foo_start_column = m_parser_state.m_current_token.line_column(); ... syntax_error("...", foo_start_line, foo_start_column); Which now becomes: auto foo_start= position(); ... syntax_error("...", foo_start); This makes it easier to report correct positions for syntax errors that only emerge a few tokens later :^)	2020-11-02 22:40:59 +01:00
Linus Groh	9e80c67608	LibJS: Fix "use strict" directive false positives By having the "is this a use strict directive?" logic in parse_string_literal() we would apply it to any string literal, which is incorrect and would lead to false positives - e.g.: "use strict" + 1 `"use strict"` "\123"; ({"use strict": ...}) Relevant part from the spec which is now implemented properly: [...] and where each ExpressionStatement in the sequence consists entirely of a StringLiteral token [...] I also got rid of UseStrictDirectiveState which is not needed anymore. Fixes #3903.	2020-11-02 13:13:54 +01:00
Linus Groh	a598a2c19d	LibJS: Function declarations in if statement clauses https://tc39.es/ecma262/#sec-functiondeclarations-in-ifstatement-statement-clauses B.3.4 FunctionDeclarations in IfStatement Statement Clauses The following augments the IfStatement production in 13.6: IfStatement[Yield, Await, Return] : if ( Expression[+In, ?Yield, ?Await] ) FunctionDeclaration[?Yield, ?Await, ~Default] else Statement[?Yield, ?Await, ?Return] if ( Expression[+In, ?Yield, ?Await] ) Statement[?Yield, ?Await, ?Return] else FunctionDeclaration[?Yield, ?Await, ~Default] if ( Expression[+In, ?Yield, ?Await] ) FunctionDeclaration[?Yield, ?Await, ~Default] else FunctionDeclaration[?Yield, ?Await, ~Default] if ( Expression[+In, ?Yield, ?Await] ) FunctionDeclaration[?Yield, ?Await, ~Default] This production only applies when parsing non-strict code. Code matching this production is processed as if each matching occurrence of FunctionDeclaration[?Yield, ?Await, ~Default] was the sole StatementListItem of a BlockStatement occupying that position in the source code. The semantics of such a synthetic BlockStatement includes the web legacy compatibility semantics specified in B.3.3.	2020-10-31 15:25:12 +01:00
Linus Groh	563d3c8055	LibJS: Require initializer for 'const' variable declaration	2020-10-30 23:43:38 +01:00
Linus Groh	b4e51249e9	LibJS: Always insert semicolon after do-while statement if missing https://tc39.es/ecma262/#sec-additions-and-changes-that-introduce-incompatibilities-with-prior-editions 11.9.1: In ECMAScript 2015, Automatic Semicolon Insertion adds a semicolon at the end of a do-while statement if the semicolon is missing. This change aligns the specification with the actual behaviour of most existing implementations.	2020-10-28 21:11:32 +01:00
Linus Groh	7112031bfb	LibJS: Use message from invalid token in syntax error	2020-10-26 21:38:34 +01:00

203 commits