ladybird

mirror of https://github.com/LadybirdBrowser/ladybird synced 2026-05-11 17:37:33 +02:00

Author	SHA1	Message	Date
Andreas Kling	95fec309cd	Tests/LibJS: Add bytecode test for nested try-finally continue Add a test that exercises break/continue trampolines through nested try-finally blocks, ensuring exception handler ranges are correct.	2026-03-01 21:20:54 +01:00
Andreas Kling	d0b9905de1	LibJS/Rust: Use GetLengthWithThis for super.length property access The C++ pipeline has an optimization that uses the GetLengthWithThis instruction instead of GetByIdWithThis when accessing the "length" property. Add the same optimization to the Rust pipeline by introducing an emit_get_by_id_with_this helper that checks for the "length" property name and emits the optimized instruction. Also update emit_get_by_value_with_this to use GetLengthWithThis when the computed property is a constant "length" string.	2026-03-01 21:20:54 +01:00
Andreas Kling	56603319b4	LibJS/Rust: Fix evaluation order in delete super[key] Per spec, the property key expression should be evaluated before calling ResolveSuperBase. Fix the Rust codegen to match the C++ pipeline's correct evaluation order.	2026-03-01 21:20:54 +01:00
Andreas Kling	176a618fce	LibJS: Don't emit dead code after Throw for invalid LHS expressions When the left-hand side of an assignment, update, or for-in loop is invalid (e.g. `foo() = "bar"`), the bytecode generator emits a Throw instruction. Previously, it would also create a dead basic block after the Throw, resulting in unreachable instructions in the output. Fix this by returning early from the relevant codegen paths after emitting the Throw, and by guarding for-in/for-of body generation with an is_current_block_terminated() check.	2026-03-01 21:20:54 +01:00
Andreas Kling	18c40a1328	LibJS/Rust: Fix has_parameter_expressions and TDZ checks for arguments Fix two bugs in the Rust bytecode codegen: 1. has_parameter_expressions incorrectly treated any destructuring parameter as a "parameter expression", when it should only do so for patterns that contain expressions (defaults or computed keys). This caused an unnecessary CreateLexicalEnvironment for simple destructuring like `function f({a, b}) {}`. The same bug existed in both codegen.rs and lib.rs (SFD metadata computation). 2. emit_set_variable used is_local_lexically_declared(index) for argument locals, but that function indexes into the local_variables array using the argument's index, checking the wrong variable. This caused spurious ThrowIfTDZ instructions when assigning to function arguments that happened to share an index with an uninitialized let/const variable.	2026-03-01 21:20:54 +01:00
Andreas Kling	6cdfbd01a6	LibJS: Add alternative source-to-bytecode pipeline in Rust Implement a complete Rust reimplementation of the LibJS frontend: lexer, parser, AST, scope collector, and bytecode code generator. The Rust pipeline is built via Corrosion (CMake-Cargo bridge) and linked into LibJS as a static library. It is gated behind a build flag (ENABLE_RUST, on by default except on Windows) and two runtime environment variables: - LIBJS_CPP: Use the C++ pipeline instead of Rust - LIBJS_COMPARE_PIPELINES=1: Run both pipelines in lockstep, aborting on any difference in AST or bytecode generated. The C++ side communicates with Rust through a C FFI layer (RustIntegration.cpp/h) that passes source text to Rust and receives a populated Executable back via a BytecodeFactory interface.	2026-02-24 09:39:42 +01:00
Andreas Kling	f3b675fb37	Tests/LibJS: Import various tests developed alongside Rust pipeline	2026-02-24 09:39:42 +01:00
Andreas Kling	234203ed9b	LibJS: Ensure deterministic ordering in scope analysis and codegen The scope collector uses HashMaps for identifier groups and variables, which means their iteration order is non-deterministic. This causes local variable indices and function declaration instantiation (FDI) bytecode to vary between runs. Fix this by sorting identifier group keys alphabetically before assigning local variable indices, and sorting vars_to_initialize by name before emitting FDI bytecode. Also make register allocation deterministic by always picking the lowest-numbered free register instead of whichever one happens to be at the end of the free list. This is preparation for bringing in a new source->bytecode pipeline written in Rust. Checking for regressions is significantly easier if we can expect identical output from both pipelines.	2026-02-24 09:39:42 +01:00
Andreas Kling	d4f222e442	LibJS: Don't reset switch case completion value for empty results When a statement in a switch case body doesn't produce a result (e.g. a variable declaration), we were incorrectly resetting the completion value to undefined. This caused the completion value of preceding expression statements to be lost.	2026-02-19 12:02:50 +01:00
Andreas Kling	b0b0275e9e	LibJS: Add bytecode test for switch statement completion values The completion value of a switch case is incorrectly reset to undefined when a statement without a result (like a variable declaration) follows an expression statement. This will be fixed in the next commit.	2026-02-19 12:02:50 +01:00
Andreas Kling	afae23e270	LibJS: Don't optimize body vars to locals when referenced in defaults When a function has parameter expressions (default values), body var declarations that shadow a name referenced in a default parameter expression must not be optimized to local variables. The default expression needs to resolve the name from the outer scope via the environment chain, not read the uninitialized local. We now mark identifiers referenced during formal parameter parsing with an IsReferencedInFormalParameters flag, and skip local variable optimization for body vars that carry both this flag and IsVar (but not IsForbiddenLexical, which indicates parameter names themselves).	2026-02-19 02:45:37 +01:00
Andreas Kling	cd2576c031	LibJS: Mark block-scoped function declaration locals as initialized When emitting block declaration instantiation, we were not calling set_local_initialized() after writing block-scoped function declarations to local variables via Mov. This caused unnecessary ThrowIfTDZ checks to be emitted when those locals were later read. Block-scoped function declarations are always initialized at block entry (via NewFunction + Mov), so TDZ checks for them are redundant.	2026-02-19 02:45:37 +01:00
Andreas Kling	47e552e8fd	LibJS: Consolidate TDZ check emission into Generator helper Move the duplicated ThrowIfTDZ emission logic from three places in ASTCodegen.cpp into a single Generator::emit_tdz_check_if_needed() helper. This handles both argument TDZ (which requires a Mov to empty first) and lexically-declared variable TDZ uniformly. This avoids emitting some unnecessary ThrowIfTDZ instructions.	2026-02-17 20:44:57 +01:00
Andreas Kling	9923745d34	LibJS: Remove unused bytecode register allocation in array destructuring	2026-02-17 20:44:57 +01:00
Andreas Kling	fdd7809bd1	Tests/LibJS: Add a big pile of AST, bytecode, and runtime tests Created these while experimenting with LibJS. Might as well bring them into the tree and increase our coverage.	2026-02-17 20:44:57 +01:00
Andreas Kling	2dca137d9e	LibJS: Handle ThisExpression in expression_identifier() Add ThisExpression handling to the expression_identifier() helper used for base_identifier in bytecode instructions. This makes PutById and GetById emit base_identifier:this when the base is a this expression.	2026-02-15 23:21:46 +01:00
Andreas Kling	4724b08a26	LibJS: Make MemberExpression::to_string_approximation() recursive Previously, the function only handled a single level of member access, producing strings like "<object>.isWall" for chained expressions like "graphSet[j][k].isWall". Now it recurses through nested member expressions, identifiers, string/numeric literals, and `this`.	2026-02-15 23:21:46 +01:00
Andreas Kling	49f2f1e7cd	LibJS: Skip unnecessary Mov in emit_load_from_reference for reads When MemberExpression::generate_bytecode calls emit_load_from_reference, it only uses the loaded_value and discards the reference operands. For computed member expressions (e.g. a[0]), this was generating an unnecessary Mov to save the property register for potential store-back. Add a ReferenceMode parameter to emit_load_from_reference. When LoadOnly is passed, the computed property path skips the register save and Mov.	2026-02-15 23:21:46 +01:00
Andreas Kling	c0f38c82d8	LibJS: Fix evaluation order in array destructuring assignment Per AssignmentRestElement and AssignmentElement in the specification, the DestructuringAssignmentTarget reference must be evaluated before iterating or stepping the iterator. We were doing it in the wrong order, which caused observable differences when the target evaluation has side effects, and could lead to infinite loops when the iterator never completes. Add Generator::emit_evaluate_reference() to evaluate a member expression's base and property into ReferenceOperands without performing a load or store, then use the pre-evaluated reference for the store after iteration completes.	2026-02-15 23:21:46 +01:00
Andreas Kling	ecadf3ce21	LibJS: Inline AsyncIteratorClose with proper Await in for-await-of The AsyncIteratorClose bytecode op calls async_iterator_close() which uses synchronous await() internally. This spins the event loop while execution contexts are on the stack, violating the microtask checkpoint assertion in LibWeb. Replace AsyncIteratorClose op emissions in for-await-of close handlers with inline bytecode that uses the proper Await op, allowing the async function to yield and resume naturally through the event loop. For the non-throw path (break/return/continue-to-outer): emit GetMethod, Call, Await, and ThrowIfNotObject inline. For the throw path: wrap the close steps in an exception handler so that any error from GetMethod/Call/Await is discarded and the original exception is rethrown, per spec step 5.	2026-02-12 11:37:43 +01:00
Andreas Kling	9a7f836d27	LibJS: Call IteratorClose on abrupt completion in for-of loops When a for-of or for-await-of loop exits via break, return, throw, or continue-to-outer-loop, we now correctly call IteratorClose (or AsyncIteratorClose) to give the iterator a chance to clean up resources. This uses a synthetic FinallyContext that wraps the LHS assignment and loop body, reusing the existing try/finally completion record machinery. The ReturnToFinally boundary is placed between Break and Continue so that continue-to-same-loop bypasses the close (zero overhead on normal iteration) while all other abrupt exits route through the iterator close dispatch chain. for-in (enumerate) does not require iterator close per spec.	2026-02-12 11:37:43 +01:00
Andreas Kling	94cef3228f	LibJS: Make IteratorClose/AsyncIteratorClose take Operand for value Change the completion_value field from Optional<Value> to Operand in both IteratorClose and AsyncIteratorClose bytecode instructions. This allows passing a dynamic value from a register, which is needed for iterator close on abrupt completion where the exception value is not known at codegen time.	2026-02-12 11:37:43 +01:00
Andreas Kling	ec2f4e4a7b	LibJS: Wire NewClass to ClassBlueprint Replace the ClassExpression const& reference in the NewClass instruction with a u32 class_blueprint_index. The interpreter now reads from the ClassBlueprint stored on the Executable and calls construct_class() instead of the AST-based create_class_constructor(). Literal field initializers (numbers, booleans, null, strings, negated numbers) are used directly in construct_class() without creating an ECMAScriptFunctionObject, avoiding function creation overhead for common field patterns like `x = 0` or `name = "hello"`. Set class_field_initializer_name on SharedFunctionInstanceData at codegen time for statically-known field keys (identifiers, private identifiers, string literals, and numeric literals). For computed keys, the name is set at runtime in construct_class(). ClassExpression AST nodes are no longer referenced from bytecode.	2026-02-11 23:57:41 +01:00
Andreas Kling	6b0003b057	LibJS: Pre-create SharedFunctionInstanceData in NewFunction Replace the FunctionNode const& stored on the NewFunction bytecode instruction with an index into a table of pre-created SharedFunctionInstanceData objects on the Executable. During bytecode compilation, we now eagerly create SharedFunctionInstanceData for each function that will be instantiated by NewFunction, and store it on both the FunctionNode (for caching) and the Executable (for GC tracing). At runtime, NewFunction simply looks up the SharedFunctionInstanceData by index and calls create_from_function_data() directly, bypassing the AST entirely. This removes one of the main reasons the AST had to stay alive after compilation. The instantiate_ordinary_function_expression() helper in Interpreter.cpp is removed as its non-trivial code path (creating a scope for named function expressions) was dead code -- it was only called when !has_name(), so the has_own_name branch never executed.	2026-02-11 23:57:41 +01:00
Andreas Kling	32389c5e96	Tests/LibJS: Add tests for scope analysis and destructuring Add bytecode tests verifying identifier resolution produces correct register-backed locals, global lookups, argument indices, and environment lookups for eval/with/captured cases. Add runtime tests for destructuring assignment patterns with expression defaults: class expressions (named/anonymous), function expressions, arrow functions, nested destructuring, eval in defaults, MemberExpression targets with setter functions, and class name scoping.	2026-02-10 02:05:20 +01:00
Andreas Kling	029708b2cf	Tests: Rebaseline bytecode tests after handler/finalizer collapse The exception handler dump format changed from showing both handler and finalizer offsets to showing only a single handler offset.	2026-02-09 16:35:39 +01:00
Andreas Kling	4e1d6faf0e	Tests: Rebaseline bytecode tests after unwind context removal EnterUnwindContext is now Jump and LeaveUnwindContext is gone, which shifts bytecode offsets in try/catch/finally tests.	2026-02-09 16:35:39 +01:00
Andreas Kling	7f89158d20	LibJS: Replace implicit environment stack with explicit registers Replace the saved_lexical_environments stack in ExecutionContextRareData with explicit register-based environment tracking. Environments are now stored in registers and restored via SetLexicalEnvironment, making the environment flow visible in bytecode. Key changes: - Add GetLexicalEnvironment and SetLexicalEnvironment opcodes - CreateLexicalEnvironment takes explicit parent and dst operands - EnterObjectEnvironment stores new environment in a dst register - NewClass takes an explicit class_environment operand - Remove LeaveLexicalEnvironment opcode (instead: SetLexicalEnvironment) - Remove saved_lexical_environments from ExecutionContextRareData - Use a reserved register for the saved lexical environment to avoid dominance issues with lazily-emitted GetLexicalEnvironment	2026-02-09 16:35:39 +01:00
Andreas Kling	a439dc8490	LibJS: Use explicit completion records for try/finally dispatch Each finally scope gets two registers (completion_type and completion_value) that form an explicit completion record. Every path into the finally body sets these before jumping, and a dispatch chain after the finally body routes to the correct continuation. This replaces the old implicit protocol that relied on the exception register, a saved_return_value register, and a scheduled_jump field on ExecutionContext, allowing us to remove: - 5 opcodes (ContinuePendingUnwind, ScheduleJump, LeaveFinally, RestoreScheduledJump, PrepareYield) - 1 reserved register (saved_return_value) - 2 ExecutionContext fields (scheduled_jump, previously_scheduled_jumps)	2026-02-09 08:51:12 +01:00
dosisod	dab739771f	LibJS: Reduce number of template literal op codes There is no need to concat empty string literals when building template literals. Now strings will only be concatenated if they need to be. To handle the edge case where the first segment is not a string literal, a new `ToString` op code has been added to ensure the value is a string concatenating more strings. In addition, basic const folding is now supported for template literal constants (templates with no interpolated values), which is commonly used for multi-line string constants.	2026-01-31 18:24:02 +01:00
dosisod	2c3077b878	LibJS: Dead code elimination for always truthy/falsey conditions This improves and expands the ability to do dead code elimination on conditions which are always truthy or falsey. The following cases are now optimized: * `if (true){}` -> Only emit `if` block, ignore `else` * `if (false){}` -> Only emit `else if`/`else` block * `while (false){}` -> Ignore `while` loop entirely * `for (x;false;){}` -> Only emit `x` (if it exists), skip `for` block * Ternary -> Directly return left/right hand side if condition is const	2026-01-31 18:22:40 +01:00
Andreas Kling	5238841da2	LibJS: Mark named function expression identifiers at individual level Previously, when parsing a named function expression like `Oops = function Oops() { Oops }`, the parser set a group-level flag `might_be_variable_in_lexical_scope_in_named_function_assignment` that propagated to the parent scope. This incorrectly prevented ALL `Oops` identifiers from being marked as global, including those outside the function expression. Fix this by marking identifiers individually using `set_is_inside_scope_with_eval()` only for identifiers inside the function scope. This allows identifiers outside the function expression to correctly use GetGlobal/SetGlobal while identifiers inside still use GetBinding (since they may refer to the function's name binding).	2026-01-27 10:58:39 +01:00
Andreas Kling	871d93355b	LibJS: Stop propagating is_inside_scope_with_eval across functions Previously, when a nested function contained eval(), the parser would mark all identifiers in parent functions as "inside scope with eval". This prevented those identifiers from being marked as global, forcing them to use GetBinding instead of GetGlobal. However, eval() can only inject variables into its containing function's scope, not into parent function scopes. So a parent function's reference to a global like `Number` should still be able to use GetGlobal even if a nested function contains eval(). This change adds a new flag `m_eval_in_current_function` that propagates through block scopes within the same function but stops at function boundaries. This flag is used for marking identifiers, while the existing `m_screwed_by_eval_in_scope_chain` continues to propagate across functions for local variable deoptimization (since eval can access closure variables). Before: `new Number(42)` in outer() with eval in inner() -> GetBinding After: `new Number(42)` in outer() with eval in inner() -> GetGlobal	2026-01-27 10:58:39 +01:00
Jelle Raaijmakers	e3faa9b5ad	LibJS: Move tests to `/Tests/LibJS`	2026-01-22 07:46:48 -05:00

34 Commits