ladybird

mirror of https://github.com/LadybirdBrowser/ladybird synced 2026-04-26 01:35:08 +02:00

Author	SHA1	Message	Date
Andreas Kling	3e18136a8c	LibJS: Add a String.fromCharCode builtin opcode Specialize only the fixed unary case in the bytecode generator and let all other argument counts keep using the generic Call instruction. This keeps the builtin bytecode simple while still covering the common fast path. The asm interpreter handles int32 inputs directly, applies the ToUint16 mask in-place, and reuses the VM's cached ASCII single-character strings when the result is 7-bit representable. Non-ASCII single code unit results stay on the dedicated builtin path via a small helper, and the dedicated slow path still handles the generic cases.	2026-04-12 19:15:50 +02:00
Andreas Kling	7bc40bd54a	LibJS: Add a charAt builtin bytecode fast path Tag String.prototype.charAt as a builtin and emit a dedicated bytecode instruction for non-computed calls. The asm interpreter can then stay on the fast path when the receiver is a primitive string with resident UTF-16 data and the selected code unit is ASCII. In that case we can return the VM's cached empty or single-character ASCII string directly.	2026-04-12 19:15:50 +02:00
Andreas Kling	879ac36e45	LibJS: Cache stable for-in iteration at bytecode sites Cache the flattened enumerable key snapshot for each `for..in` site and reuse a `PropertyNameIterator` when the receiver shape, dictionary generation, indexed storage kind and length, prototype chain validity, and magical-length state still match. Handle packed indexed receivers as well as plain named-property objects. Teach `ObjectPropertyIteratorNext` in `asmint.asm` to return cached property values directly and to fall back to the slow iterator logic when any guard fails. Treat arrays' hidden non-enumerable `length` property as a visited name for for-in shadowing, and include the receiver's magical-length state in the cache key so arrays and plain objects do not share snapshots. Add `test-js` and `test-js-bytecode` coverage for mixed numeric and named keys, packed receiver transitions, re-entry, iterator reuse, GC retention, array length shadowing, and same-site cache reuse.	2026-04-10 15:12:53 +02:00
Andreas Kling	bb0acb54ae	LibJS: Optimize x >> 0 to ToInt32 in bytecode codegen x >> 0 is a common JS idiom equivalent to ToInt32(x). We already had this optimization for x \| 0, now do it for right shift by zero as well. This allows the asmint handler for ToInt32 to run instead of the more expensive RightShift handler, which wastes time loading and checking the rhs operand and performing a shift by zero.	2026-03-20 00:51:23 -05:00
Andreas Kling	02b0746676	LibJS: Deduplicate double constants in bytecode generator Add a deduplication cache for double constants, matching the existing approach for int32 and string constants. Multiple references to the same floating-point value now share a single constant table entry.	2026-03-20 00:51:23 -05:00
Andreas Kling	b4185f0ecd	LibJS: Split packed and holey asm indexed fast paths Use dedicated Packed branches in GetByValue and PutByValue so in-bounds indexed accesses can skip hole checks and slot reloads. Keep Holey writes on the guarded arm, and keep append writes on the C++ slow path so PutByValue still respects non-extensible indexed objects and arrays with a non-writable length. Add a bytecode regression that exercises both append failure cases through the real js binary path.	2026-03-17 22:28:35 -05:00
Andreas Kling	31606fddd3	LibJS: Add Mov2/Mov3 instructions to reduce dispatch overhead Add Mov2 and Mov3 bytecode instructions that perform 2 or 3 register moves in a single dispatch. A peephole optimization pass during bytecode assembly merges consecutive Mov instructions within each basic block into these combined instructions. When merging, identical Movs are deduplicated (e.g. two identical Movs become a single Mov, not a Mov2). This optimization is implemented in both the C++ and Rust codegen pipelines. The goal is to reduce the per-instruction dispatch overhead, which is significant compared to the actual cost of moving a value. This isn't fancy or elegant, but provides a real speed-up on many workloads. As an example, Kraken/imaging-desaturate.js improves by ~1.07x on my laptop.	2026-03-11 17:04:32 +01:00
Andreas Kling	ac35ef465b	LibJS: Emit ThrowIfTDZ before simple assignment to let variables The Rust bytecode codegen was missing a TDZ check before assigning to local let/const variables in simple assignment expressions (a = expr). The C++ pipeline correctly emits ThrowIfTDZ before the store to ensure temporal dead zone semantics are enforced. Add an emit_tdz_check_if_needed helper matching the C++ equivalent, and call it in the simple assignment path.	2026-03-04 18:53:12 +01:00
Andreas Kling	fb61294df7	LibJS: Add UsingDeclaration to needs_block_declaration_instantiation Blocks containing non-local using declarations need a lexical environment, just like let/const declarations. Add the missing UsingDeclaration case to match C++ behavior.	2026-03-04 12:17:59 +01:00
Andreas Kling	bd7fc2b1b1	LibJS: Fix ResolveThisBinding/ResolveSuperBase emission order Emit ResolveThisBinding before ResolveSuperBase in both emit_evaluate_member_reference and emit_store_to_reference, matching the C++ pipeline's evaluation order for super property references. Also restructure emit_evaluate_member_reference to move non-super base evaluation into the else branch, since the super path now handles base evaluation differently (explicit ResolveSuperBase instead of going through generate_expression on Super).	2026-03-04 12:17:59 +01:00
Andreas Kling	4120765497	LibJS: Keep arg_holders alive in generate_arguments_array Keep the arg_holders vector alive through the spread arguments loop, matching the C++ pipeline where the args Vector keeps registers held through the loop. This ensures consistent register allocation.	2026-03-04 12:17:59 +01:00
Andreas Kling	7ceba6d2cb	LibJS: Fix register order in private logical assignment Move the destination register allocation after RHS evaluation in private identifier logical assignment (&&=, \|\|=, ??=), matching the C++ pipeline's register allocation order.	2026-03-04 12:17:59 +01:00
Andreas Kling	fa72fd9f95	LibJS: Optimize constant string computed properties to MemberId When a computed member expression uses a constant string (e.g. super["minutes"] or obj["key"]), optimize it to use the MemberId or SuperMemberId reference form instead of the value-based form, matching the C++ pipeline optimization.	2026-03-04 12:17:59 +01:00
Andreas Kling	33a6b90ccf	LibJS: Clear pending_lhs_name for named class expressions Named class expressions don't use pending_lhs_name, but we must still clear it to prevent it from leaking through to nested anonymous functions inside the class body.	2026-03-04 12:17:59 +01:00
Andreas Kling	722a897b28	LibJS: Remove redundant ThrowIfTDZ from Rust emit_set_variable The caller is responsible for emitting ThrowIfTDZ before calling emit_set_variable(), matching the C++ pipeline behavior. Remove the redundant TDZ checks from both the const and non-const local paths.	2026-03-04 12:17:59 +01:00
Andreas Kling	d88374e119	Tests/LibJS: Add bytecode test for for-of with conditional in RHS This adds a test case where the for-of iterable is a sequence expression containing a conditional expression. The C++ pipeline creates loop blocks before evaluating the iterable, giving them lower block numbers, while the Rust pipeline evaluates the iterable first.	2026-03-01 21:20:54 +01:00
Andreas Kling	aadfe0f02a	Tests/LibJS: Add test for compound assignment after destructuring This adds a test case for compound assignment to a variable that was initialized via a let destructuring pattern. The Rust pipeline emits a redundant ThrowIfTDZ after the compound assignment because the variable is not tracked as initialized after destructuring.	2026-03-01 21:20:54 +01:00
Andreas Kling	86191ce229	Tests/LibJS: Add bytecode test for destructuring assignment in && This adds a test case for array destructuring assignment inside a logical AND expression, e.g. `t && ([a, b] = t(e))`. The C++ pipeline allocates a separate register for the RHS and copies it to the result register after destructuring, while the Rust pipeline evaluates the RHS directly into the preferred destination, omitting the copy.	2026-03-01 21:20:54 +01:00
Andreas Kling	f0c34d54a1	Tests/LibJS: Add bytecode test for for-of continue with block scope Test that continue inside a for-of loop body properly restores the lexical environment when the for-of creates a per-iteration scope for the loop variable.	2026-03-01 21:20:54 +01:00
Andreas Kling	fa9c1a6885	Tests/LibJS: Add bytecode test for return from switch with block scope Test that returning from inside a switch statement that has a lexical environment (for const/let declarations) properly emits SetLexicalEnvironment to restore the parent environment before each Return instruction.	2026-03-01 21:20:54 +01:00
Andreas Kling	6432754251	Tests/LibJS: Add bytecode test for postfix increment on private member	2026-03-01 21:20:54 +01:00
Andreas Kling	ffa380f15b	Tests/LibJS: Add bytecode test for async await in try-catch with scope Test that the await continuation's throw path properly unwinds the lexical environment when inside a with statement within a try-catch.	2026-03-01 21:20:54 +01:00
Andreas Kling	dc464ba270	Tests/LibJS: Add bytecode test for var environment capacity Add a test that exercises CreateVariableEnvironment capacity when a function has parameter expressions and non-local var bindings.	2026-03-01 21:20:54 +01:00
Andreas Kling	17c8a80afc	Tests/LibJS: Add bytecode test for postfix update in logical AND Add a test that exercises postfix increment/decrement as the RHS of a logical AND expression, verifying the register allocation matches C++.	2026-03-01 21:20:54 +01:00
Andreas Kling	95fec309cd	Tests/LibJS: Add bytecode test for nested try-finally continue Add a test that exercises break/continue trampolines through nested try-finally blocks, ensuring exception handler ranges are correct.	2026-03-01 21:20:54 +01:00
Andreas Kling	d0b9905de1	LibJS/Rust: Use GetLengthWithThis for super.length property access The C++ pipeline has an optimization that uses the GetLengthWithThis instruction instead of GetByIdWithThis when accessing the "length" property. Add the same optimization to the Rust pipeline by introducing an emit_get_by_id_with_this helper that checks for the "length" property name and emits the optimized instruction. Also update emit_get_by_value_with_this to use GetLengthWithThis when the computed property is a constant "length" string.	2026-03-01 21:20:54 +01:00
Andreas Kling	56603319b4	LibJS/Rust: Fix evaluation order in delete super[key] Per spec, the property key expression should be evaluated before calling ResolveSuperBase. Fix the Rust codegen to match the C++ pipeline's correct evaluation order.	2026-03-01 21:20:54 +01:00
Andreas Kling	176a618fce	LibJS: Don't emit dead code after Throw for invalid LHS expressions When the left-hand side of an assignment, update, or for-in loop is invalid (e.g. `foo() = "bar"`), the bytecode generator emits a Throw instruction. Previously, it would also create a dead basic block after the Throw, resulting in unreachable instructions in the output. Fix this by returning early from the relevant codegen paths after emitting the Throw, and by guarding for-in/for-of body generation with an is_current_block_terminated() check.	2026-03-01 21:20:54 +01:00
Andreas Kling	18c40a1328	LibJS/Rust: Fix has_parameter_expressions and TDZ checks for arguments Fix two bugs in the Rust bytecode codegen: 1. has_parameter_expressions incorrectly treated any destructuring parameter as a "parameter expression", when it should only do so for patterns that contain expressions (defaults or computed keys). This caused an unnecessary CreateLexicalEnvironment for simple destructuring like `function f({a, b}) {}`. The same bug existed in both codegen.rs and lib.rs (SFD metadata computation). 2. emit_set_variable used is_local_lexically_declared(index) for argument locals, but that function indexes into the local_variables array using the argument's index, checking the wrong variable. This caused spurious ThrowIfTDZ instructions when assigning to function arguments that happened to share an index with an uninitialized let/const variable.	2026-03-01 21:20:54 +01:00
Andreas Kling	f3b675fb37	Tests/LibJS: Import various tests developed alongside Rust pipeline	2026-02-24 09:39:42 +01:00
Andreas Kling	b0b0275e9e	LibJS: Add bytecode test for switch statement completion values The completion value of a switch case is incorrectly reset to undefined when a statement without a result (like a variable declaration) follows an expression statement. This will be fixed in the next commit.	2026-02-19 12:02:50 +01:00
Andreas Kling	afae23e270	LibJS: Don't optimize body vars to locals when referenced in defaults When a function has parameter expressions (default values), body var declarations that shadow a name referenced in a default parameter expression must not be optimized to local variables. The default expression needs to resolve the name from the outer scope via the environment chain, not read the uninitialized local. We now mark identifiers referenced during formal parameter parsing with an IsReferencedInFormalParameters flag, and skip local variable optimization for body vars that carry both this flag and IsVar (but not IsForbiddenLexical, which indicates parameter names themselves).	2026-02-19 02:45:37 +01:00
Andreas Kling	cd2576c031	LibJS: Mark block-scoped function declaration locals as initialized When emitting block declaration instantiation, we were not calling set_local_initialized() after writing block-scoped function declarations to local variables via Mov. This caused unnecessary ThrowIfTDZ checks to be emitted when those locals were later read. Block-scoped function declarations are always initialized at block entry (via NewFunction + Mov), so TDZ checks for them are redundant.	2026-02-19 02:45:37 +01:00
Andreas Kling	fdd7809bd1	Tests/LibJS: Add a big pile of AST, bytecode, and runtime tests Created these while experimenting with LibJS. Might as well bring them into the tree and increase our coverage.	2026-02-17 20:44:57 +01:00
Andreas Kling	2dca137d9e	LibJS: Handle ThisExpression in expression_identifier() Add ThisExpression handling to the expression_identifier() helper used for base_identifier in bytecode instructions. This makes PutById and GetById emit base_identifier:this when the base is a this expression.	2026-02-15 23:21:46 +01:00
Andreas Kling	4724b08a26	LibJS: Make MemberExpression::to_string_approximation() recursive Previously, the function only handled a single level of member access, producing strings like "<object>.isWall" for chained expressions like "graphSet[j][k].isWall". Now it recurses through nested member expressions, identifiers, string/numeric literals, and `this`.	2026-02-15 23:21:46 +01:00
Andreas Kling	49f2f1e7cd	LibJS: Skip unnecessary Mov in emit_load_from_reference for reads When MemberExpression::generate_bytecode calls emit_load_from_reference, it only uses the loaded_value and discards the reference operands. For computed member expressions (e.g. a[0]), this was generating an unnecessary Mov to save the property register for potential store-back. Add a ReferenceMode parameter to emit_load_from_reference. When LoadOnly is passed, the computed property path skips the register save and Mov.	2026-02-15 23:21:46 +01:00
Andreas Kling	ecadf3ce21	LibJS: Inline AsyncIteratorClose with proper Await in for-await-of The AsyncIteratorClose bytecode op calls async_iterator_close() which uses synchronous await() internally. This spins the event loop while execution contexts are on the stack, violating the microtask checkpoint assertion in LibWeb. Replace AsyncIteratorClose op emissions in for-await-of close handlers with inline bytecode that uses the proper Await op, allowing the async function to yield and resume naturally through the event loop. For the non-throw path (break/return/continue-to-outer): emit GetMethod, Call, Await, and ThrowIfNotObject inline. For the throw path: wrap the close steps in an exception handler so that any error from GetMethod/Call/Await is discarded and the original exception is rethrown, per spec step 5.	2026-02-12 11:37:43 +01:00
Andreas Kling	9a7f836d27	LibJS: Call IteratorClose on abrupt completion in for-of loops When a for-of or for-await-of loop exits via break, return, throw, or continue-to-outer-loop, we now correctly call IteratorClose (or AsyncIteratorClose) to give the iterator a chance to clean up resources. This uses a synthetic FinallyContext that wraps the LHS assignment and loop body, reusing the existing try/finally completion record machinery. The ReturnToFinally boundary is placed between Break and Continue so that continue-to-same-loop bypasses the close (zero overhead on normal iteration) while all other abrupt exits route through the iterator close dispatch chain. for-in (enumerate) does not require iterator close per spec.	2026-02-12 11:37:43 +01:00
Andreas Kling	ec2f4e4a7b	LibJS: Wire NewClass to ClassBlueprint Replace the ClassExpression const& reference in the NewClass instruction with a u32 class_blueprint_index. The interpreter now reads from the ClassBlueprint stored on the Executable and calls construct_class() instead of the AST-based create_class_constructor(). Literal field initializers (numbers, booleans, null, strings, negated numbers) are used directly in construct_class() without creating an ECMAScriptFunctionObject, avoiding function creation overhead for common field patterns like `x = 0` or `name = "hello"`. Set class_field_initializer_name on SharedFunctionInstanceData at codegen time for statically-known field keys (identifiers, private identifiers, string literals, and numeric literals). For computed keys, the name is set at runtime in construct_class(). ClassExpression AST nodes are no longer referenced from bytecode.	2026-02-11 23:57:41 +01:00
Andreas Kling	32389c5e96	Tests/LibJS: Add tests for scope analysis and destructuring Add bytecode tests verifying identifier resolution produces correct register-backed locals, global lookups, argument indices, and environment lookups for eval/with/captured cases. Add runtime tests for destructuring assignment patterns with expression defaults: class expressions (named/anonymous), function expressions, arrow functions, nested destructuring, eval in defaults, MemberExpression targets with setter functions, and class name scoping.	2026-02-10 02:05:20 +01:00
Andreas Kling	7f89158d20	LibJS: Replace implicit environment stack with explicit registers Replace the saved_lexical_environments stack in ExecutionContextRareData with explicit register-based environment tracking. Environments are now stored in registers and restored via SetLexicalEnvironment, making the environment flow visible in bytecode. Key changes: - Add GetLexicalEnvironment and SetLexicalEnvironment opcodes - CreateLexicalEnvironment takes explicit parent and dst operands - EnterObjectEnvironment stores new environment in a dst register - NewClass takes an explicit class_environment operand - Remove LeaveLexicalEnvironment opcode (instead: SetLexicalEnvironment) - Remove saved_lexical_environments from ExecutionContextRareData - Use a reserved register for the saved lexical environment to avoid dominance issues with lazily-emitted GetLexicalEnvironment	2026-02-09 16:35:39 +01:00
Andreas Kling	a439dc8490	LibJS: Use explicit completion records for try/finally dispatch Each finally scope gets two registers (completion_type and completion_value) that form an explicit completion record. Every path into the finally body sets these before jumping, and a dispatch chain after the finally body routes to the correct continuation. This replaces the old implicit protocol that relied on the exception register, a saved_return_value register, and a scheduled_jump field on ExecutionContext, allowing us to remove: - 5 opcodes (ContinuePendingUnwind, ScheduleJump, LeaveFinally, RestoreScheduledJump, PrepareYield) - 1 reserved register (saved_return_value) - 2 ExecutionContext fields (scheduled_jump, previously_scheduled_jumps)	2026-02-09 08:51:12 +01:00
dosisod	dab739771f	LibJS: Reduce number of template literal op codes There is no need to concat empty string literals when building template literals. Now strings will only be concatenated if they need to be. To handle the edge case where the first segment is not a string literal, a new `ToString` op code has been added to ensure the value is a string concatenating more strings. In addition, basic const folding is now supported for template literal constants (templates with no interpolated values), which is commonly used for multi-line string constants.	2026-01-31 18:24:02 +01:00
dosisod	2c3077b878	LibJS: Dead code elimination for always truthy/falsey conditions This improves and expands the ability to do dead code elimination on conditions which are always truthy or falsey. The following cases are now optimized: * `if (true){}` -> Only emit `if` block, ignore `else` * `if (false){}` -> Only emit `else if`/`else` block * `while (false){}` -> Ignore `while` loop entirely * `for (x;false;){}` -> Only emit `x` (if it exists), skip `for` block * Ternary -> Directly return left/right hand side if condition is const	2026-01-31 18:22:40 +01:00
Andreas Kling	5238841da2	LibJS: Mark named function expression identifiers at individual level Previously, when parsing a named function expression like `Oops = function Oops() { Oops }`, the parser set a group-level flag `might_be_variable_in_lexical_scope_in_named_function_assignment` that propagated to the parent scope. This incorrectly prevented ALL `Oops` identifiers from being marked as global, including those outside the function expression. Fix this by marking identifiers individually using `set_is_inside_scope_with_eval()` only for identifiers inside the function scope. This allows identifiers outside the function expression to correctly use GetGlobal/SetGlobal while identifiers inside still use GetBinding (since they may refer to the function's name binding).	2026-01-27 10:58:39 +01:00
Andreas Kling	871d93355b	LibJS: Stop propagating is_inside_scope_with_eval across functions Previously, when a nested function contained eval(), the parser would mark all identifiers in parent functions as "inside scope with eval". This prevented those identifiers from being marked as global, forcing them to use GetBinding instead of GetGlobal. However, eval() can only inject variables into its containing function's scope, not into parent function scopes. So a parent function's reference to a global like `Number` should still be able to use GetGlobal even if a nested function contains eval(). This change adds a new flag `m_eval_in_current_function` that propagates through block scopes within the same function but stops at function boundaries. This flag is used for marking identifiers, while the existing `m_screwed_by_eval_in_scope_chain` continues to propagate across functions for local variable deoptimization (since eval can access closure variables). Before: `new Number(42)` in outer() with eval in inner() -> GetBinding After: `new Number(42)` in outer() with eval in inner() -> GetGlobal	2026-01-27 10:58:39 +01:00
Jelle Raaijmakers	e3faa9b5ad	LibJS: Move tests to `/Tests/LibJS`	2026-01-22 07:46:48 -05:00

48 Commits