ladybird

mirror of https://github.com/LadybirdBrowser/ladybird synced 2026-05-13 10:26:37 +02:00

Author	SHA1	Message	Date
pwespi	3dc3bcb556	LibJS: Fix expected SyntaxErrors for private fields	2026-03-20 16:06:51 -05:00
Ollie Hensman-Crook	df8ead1f12	LibJS: Treat concise methods as non-constructors	2026-03-20 15:58:05 -05:00
Johan Dahlin	1179e40d3f	LibJS: Eliminate GeneratorResult GC cell allocation on yield/await Store yield_continuation and yield_is_await directly in ExecutionContext instead of allocating a GeneratorResult GC cell. This removes a heap allocation per yield/await and fixes a latent bug where continuation addresses stored as doubles could lose precision.	2026-03-20 15:57:23 -05:00
Andreas Kling	943319453d	LibJS: Fix syntax highlighter position starting at invalid sentinel The RehighlightState designated initializer used `.position = {}` which invokes TextPosition's default constructor, initializing line and column to 0xFFFFFFFF (the "invalid" sentinel). This overrode the struct's default member initializer of { 0, 0 }. When advance_position() processed the first newline, it incremented 0xFFFFFFFF to 0x100000000, producing line numbers in the billions. These bogus positions propagated into folding regions, causing an out-of-bounds crash in Document::set_folding_regions() when viewing page source on pages with <script> blocks. Fix by explicitly initializing position to { 0, 0 }. Fixes #8529.	2026-03-20 15:32:33 +01:00
Timothy Flynn	b3795eb5bb	LibJS: Handle time zone gaps in JS::utc_time Commit `88365031f2` added support for time zone gaps in general, but missed this method.	2026-03-20 14:46:46 +01:00
Jelle Raaijmakers	6ce327f715	LibJS: Reduce size of `Optional<EnvironmentCoordinate>` Reduces the size of `Optional<EnvironmentCoordinate>` from 12 to 8 bytes, and by reordering the fields in `Reference` we shrink that down from 64 to 56 bytes as well.	2026-03-20 12:03:36 +01:00
Jelle Raaijmakers	e123d48043	AK: Add `SentinelOptional` We specialize `Optional<T>` for value types that inherently support some kind of "empty" value or whose value range allows for an unlikely-to-be-useful sentinel value that can mean "empty", instead of the boolean flag a regular Optional<T> needs to store. Because of padding, this often means saving 4 to 8 bytes per instance. By extending the new `SentinelOptional<T, Traits>`, these specializations are significantly simplified to just having to define what the sentinel value is, and how to identify a sentinel value.	2026-03-20 12:03:36 +01:00
Andreas Kling	bb0acb54ae	LibJS: Optimize x >> 0 to ToInt32 in bytecode codegen x >> 0 is a common JS idiom equivalent to ToInt32(x). We already had this optimization for x | 0, now do it for right shift by zero as well. This allows the asmint handler for ToInt32 to run instead of the more expensive RightShift handler, which wastes time loading and checking the rhs operand and performing a shift by zero.	2026-03-20 00:51:23 -05:00
Andreas Kling	02b0746676	LibJS: Deduplicate double constants in bytecode generator Add a deduplication cache for double constants, matching the existing approach for int32 and string constants. Multiple references to the same floating-point value now share a single constant table entry.	2026-03-20 00:51:23 -05:00
Andreas Kling	144ab69715	LibJS: Remove C++ pipeline compatibility hacks from Rust codegen Now that the C++ bytecode pipeline has been removed, we no longer need to match its register allocation or block layout. This removes: - All manual drop() calls that existed solely to match C++ register lifetimes, replaced with scope blocks to naturally limit register lifetimes without increasing register pressure. - The unnecessary saved_property copy in update expressions. The property register is now used directly since emit_update_op doesn't evaluate user expressions that could mutate it. The copy is retained in compound/logical assignments where the RHS can mutate the property variable (e.g. a[i] |= a[++i]). - All "matching C++", "Match C++", etc. comments throughout codegen.rs and generator.rs that referenced the removed pipeline.	2026-03-20 00:51:23 -05:00
Andreas Kling	bc4379983f	LibJS: Improve bytecode executable dump format Add a metadata header showing register count, block count, local variable names, and the constants table. Resolve jump targets to block labels (e.g. "block1") instead of raw hex addresses, and add visual separation between basic blocks. Make identifier and property key formatting more concise by using backtick quoting and showing base_identifier as a trailing parenthetical hint that joins the base and property names. Generate a stable name for each executable by hashing the source text it covers (stable across codegen changes). Named functions show as "foo$9beb91ec", anonymous ones as "$43362f3f". Also show the source filename, line, and column.	2026-03-20 00:51:23 -05:00
Andreas Kling	f5eea4d232	LibJS: Fix catch parameter and new.target regressions - Restrict catch parameter conflict check to only direct children of the catch body block, not nested scopes - Set new_target_is_valid for dynamic function compilation (new Function) - Move check_parameters_post_body before flag restoration in parse_method_definition so generator methods inside static init blocks correctly allow 'await' as a parameter name	2026-03-19 23:15:03 -05:00
Andreas Kling	5374f0a85c	LibJS: Add more early errors in Rust parser - Reject duplicate bindings in catch parameter patterns - Reject redeclaration of catch parameter with let/const/function - Reject binding patterns with initializers in for-in heads (AnnexB only permits simple BindingIdentifier with initializer) - Reject 'await' as binding identifier in class static init blocks and module code	2026-03-19 23:15:03 -05:00
Andreas Kling	49cc44a3eb	LibJS: Reject arguments/eval in strict mode destructuring and arrows Check identifier name validity for destructuring assignment pattern bound names, and validate arrow function parameters after the arrow is confirmed rather than during speculative parameter parsing. This fixes arguments/eval as destructuring assignment targets and as arrow function parameter names in strict mode.	2026-03-19 23:15:03 -05:00
Andreas Kling	66dbb355fe	LibJS: Reject new.target in arrow functions at global scope Arrow functions don't have their own new.target binding -- they inherit from the enclosing scope. At the global level, there is no enclosing function, so new.target inside a global arrow is invalid. Add a new_target_is_valid flag to ParserFlags that is set to true when entering regular (non-arrow) function bodies, method definitions, and class static init blocks. Arrow functions inherit the flag from their enclosing scope rather than setting it.	2026-03-19 23:15:03 -05:00
Andreas Kling	6029a3d40e	LibJS: Add missing early errors in Rust parser - Reject `true`, `false`, `null` as label identifiers - Reject generator declarations in if-statement bodies (not covered by Annex B) - Reject `await` as label in class static init blocks and modules - Reject `arguments` in class static initialization blocks - Reject generator shorthand without method body in object literals - Reject `get constructor()` / `set constructor()` in class bodies - Reject `super.#private` member access	2026-03-19 23:15:03 -05:00
Andreas Kling	f491d44b3b	LibJS: Replace ScopedOperand with Operand in bytecode ops ScopedOperand was a ref-counted wrapper around Operand used by the C++ bytecode Generator for register lifetime tracking. Now that the Generator is gone, it's just a pointless indirection. Update the bytecode def code generator to emit Operand directly instead of ScopedOperand in variable-argument op constructors, and delete ScopedOperand.h.	2026-03-19 21:55:10 -05:00
Andreas Kling	362207b45d	LibJS: Remove remaining C++ pipeline artifacts Clean up leftover references to the removed C++ pipeline: - Remove stale forward declarations from Forward.h (ASTNode, Parser, Program, FunctionNode, ScopeNode, etc.) - Delete unused FunctionParsingInsights.h - Remove dead get_builtin(MemberExpression const&) declaration from Builtins.h - Update stale comments referencing ASTCodegen.cpp and generate_bytecode()	2026-03-19 21:55:10 -05:00
Andreas Kling	30f108ba36	LibJS: Remove C++ lexer, use Rust tokenizer for syntax highlighting Delete Lexer.cpp/h and Token.cpp, replacing all tokenization with a new rust_tokenize() FFI function that calls back for each token. Rewrite SyntaxHighlighter.cpp and js.cpp REPL to use the Rust tokenizer. The token type and category enums in Token.h now mirror the Rust definitions in token.rs. Move is_syntax_character/is_whitespace/is_line_terminator helpers into RegExpConstructor.cpp as static functions, since they were only used there.	2026-03-19 21:55:10 -05:00
Andreas Kling	8ec7e7c07c	LibJS: Remove C++ AST Delete AST.cpp, AST.h, ASTDump.cpp, ScopeRecord.h, and the dead get_builtin(MemberExpression const&) from Builtins.cpp. Extract ImportEntry and ExportEntry into a new ModuleEntry.h, since they are data types used by the module system, not AST node types. Inline ModuleRequest's sorting constructor and SourceRange::filename(). Remove the dead annex_b_function_declarations field from EvalDeclarationData, which was only populated by the C++ parser.	2026-03-19 21:55:10 -05:00
Andreas Kling	169452f41b	LibJS: Remove C++ parser Delete Parser.cpp/h and ScopeCollector.cpp/h, now that all parsing goes through the Rust pipeline. Port test262-runner to use RustIntegration::parse_program() for its fast parse-only check instead of the C++ Parser. Add parsed_program_has_errors() and free_parsed_program() to the RustIntegration public API for parse-only use cases.	2026-03-19 21:55:10 -05:00
Andreas Kling	1f6ca58e55	LibJS: Remove C++ AST constructor from SharedFunctionInstanceData Remove the constructor that took C++ AST nodes (FunctionParameters, Statement), along with create_for_function_node() and the m_formal_parameters / m_ecmascript_code fields. These were only used by the now-removed C++ compilation pipeline. Also remove the dead EvalDeclarationData::create(VM&, Program&, bool) and ECMAScriptFunctionObject::ecmascript_code() accessor.	2026-03-19 21:55:10 -05:00
Andreas Kling	c25227d324	LibJS: Remove C++ bytecode codegen Delete the C++ bytecode code generator, now that all compilation goes through the Rust pipeline: - Bytecode/ASTCodegen.cpp (4417 lines) - Bytecode/Generator.cpp (1961 lines) - Bytecode/Generator.h (535 lines) - Bytecode/ScopedOperand.cpp (23 lines) Also remove all generate_bytecode() and generate_labelled_evaluation() virtual method declarations from AST.h, and their associated Bytecode includes.	2026-03-19 21:55:10 -05:00
Andreas Kling	272562ddc5	LibJS: Remove dead C++ bytecode compilation functions Remove Bytecode::compile() and the old create() overloads on ECMAScriptFunctionObject that accepted C++ AST nodes. These have no remaining callers now that all compilation goes through the Rust pipeline. Also remove the if-constexpr Parse Node branch from async_block_start, since the Statement template instantiation was already removed. Fix transitive include dependencies on Generator.h by adding explicit includes for headers that were previously pulled in transitively.	2026-03-19 21:55:10 -05:00
Andreas Kling	3518efd71c	LibJS+LibWeb: Port remaining callers to Rust pipeline Port all remaining users of the C++ Parser/Lexer/Generator to use the Rust pipeline instead: - Intrinsics: Remove C++ fallback in parse_builtin_file() - ECMAScriptFunctionObject: Remove C++ compile() fallback - NativeJavaScriptBackedFunction: Remove C++ compile() fallback - EventTarget: Port to compile_dynamic_function - WebDriver/ExecuteScript: Port to compile_dynamic_function - LibTest/JavaScriptTestRunner.h: Remove Parser/Lexer includes - FuzzilliJs: Remove unused Parser/Lexer includes Also remove the dead Statement-based template instantiation of async_block_start/async_function_start.	2026-03-19 21:55:10 -05:00
Andreas Kling	0c7d50b33d	LibJS: Remove LIBJS_CPP env var and ENABLE_RUST guards The Rust pipeline is now the only compilation path, so remove: - The LIBJS_CPP environment variable check - The rust_pipeline_enabled() helper - The #ifdef ENABLE_RUST / #else stub section - The test-js-cpp CTest target and LIBJS_TEST_PARSER_MODE env var - The ParserMode enum and canParseSourceWithCpp/Rust test functions rust_pipeline_available() now unconditionally returns true.	2026-03-19 21:55:10 -05:00
Andreas Kling	77cd434710	LibJS: Remove C++ compiler pipeline fallback paths Now that the Rust pipeline is the sole compilation path, remove all C++ parser/codegen fallback paths from the callers: - Script::parse() no longer falls back to C++ Parser - SourceTextModule::parse() no longer falls back to C++ Parser - perform_eval() no longer falls back to C++ Parser + Generator - create_dynamic_function() no longer falls back to C++ Parser - ShadowRealm eval no longer falls back to C++ Parser + Generator - Interpreter::run(Script&) no longer falls back to Generator Also remove the now-dead old constructors that took C++ AST nodes, the module_requests() helper, and AST dump code from js.cpp.	2026-03-19 21:55:10 -05:00
Andreas Kling	2c45472a11	LibJS: Remove pipeline comparison infrastructure Remove PipelineComparison.cpp/h and all LIBJS_COMPARE_PIPELINES support from RustIntegration.cpp. This includes: - The compare_pipelines_enabled() function - All comparison blocks in compile_script/eval/module/function - The pair_shared_function_data() helper - The m_cpp_comparison_sfd field on SharedFunctionInstanceData The Rust pipeline has been validated extensively through comparison testing and no longer needs the side-by-side verification harness.	2026-03-19 21:55:10 -05:00
Andrew Kaster	f06bd0303f	LibJS: Use enum for retrieving well known symbols from C++ to Rust	2026-03-19 09:48:32 +01:00
Andrew Kaster	5d43707896	LibJS: Directly use LiteralValueKind enum across FFI boundary	2026-03-19 09:48:32 +01:00
Andreas Kling	3efd1a1bb5	LibJS: Reject duplicate params across destructuring patterns in C++ The C++ parser was not rejecting duplicate parameter names across destructuring patterns in non-simple parameter lists. For example, `function f({ bar, ...a }, { bar, ...b }) {}` was accepted despite being a syntax error per spec. The existing inline duplicate check only ran for identifier parameters, missing the case where both parameters are binding patterns. Add a post-parse pass that collects all bound names and checks for duplicates when the parameter list is non-simple (or in strict mode/arrows). Also fix existing tests that relied on the incorrect behavior and add new test coverage for destructuring duplicate detection.	2026-03-19 09:43:11 +01:00
Andreas Kling	1ff61754a7	LibJS: Re-box double arithmetic results as Int32 when possible When the asmint computes a double result for Add, Sub, Mul, Math.floor, Math.ceil, or Math.sqrt, try to store it as Int32 if the value is a whole number in [INT32_MIN, INT32_MAX] and not -0.0. This mirrors the JS::Value(double) constructor and allows downstream int32 fast paths to fire. Also add label uniquification to the DSL macro expander so the same macro can be used multiple times in one handler without label collisions.	2026-03-19 09:42:04 +01:00
Andreas Kling	5e403af5be	LibJS: Tighten asmint ToInt32 boxing Teach js_to_int32 to leave a clean low 32-bit result on success, then use box_int32_clean in the ToInt32 fast path and adjacent boolean coercions. This removes one instruction from the AArch64 fjcvtzs path and trims the boolean boxing path without changing behavior.	2026-03-19 09:42:04 +01:00
Andreas Kling	645f481825	LibJS: Fast-path Float32Array indexed access Add the small AsmIntGen float32 load, store, and conversion operations needed to handle Float32Array directly in the AsmInt typed-array GetByValue and PutByValue paths. This covers direct indexed reads plus both int32 and double stores, and adds regression coverage for Math.fround rounding, negative zero, and NaN.	2026-03-19 09:42:04 +01:00
Andreas Kling	6614971e6f	LibJS: Fast-path Uint8ClampedArray indexed access Teach the asm typed-array GetByValue and PutByValue paths to handle Uint8ClampedArray directly. Reads can share the Uint8Array load path, while int32 stores clamp in asm instead of bailing out to C++. Add a direct indexed access regression test for clamped int32 stores.	2026-03-19 09:42:04 +01:00
RubenKelevra	fae2f8f3ba	LibJS: Align new-expression paren flags with C++ parser	2026-03-18 17:41:36 -05:00
RubenKelevra	3cb636ca38	LibJS: Keep new call-paren optional chaining valid	2026-03-18 17:41:36 -05:00
RubenKelevra	ea8fa63e79	LibJS: Reject optional chaining on unparenthesized new	2026-03-18 17:41:36 -05:00
RubenKelevra	04b27429de	LibJS: Isolate super validity in nested function scopes	2026-03-18 17:41:36 -05:00
RubenKelevra	d8469c384d	LibJS: Reject invalid bare private identifier usage	2026-03-18 17:41:36 -05:00
RubenKelevra	d6229a1cc8	LibJS: Fix async arrow and for-of async parsing	2026-03-18 17:41:36 -05:00
RubenKelevra	af777b5d86	LibJS: Align duplicate parameter early errors	2026-03-18 17:41:36 -05:00
RubenKelevra	40984d0f39	LibJS: Enforce const initializers in declarations	2026-03-18 17:41:36 -05:00
Andreas Kling	9299d430c8	LibJS: Cache typed array data pointers for indexed access Cache raw data pointers on fixed-length typed array views so asm GetByValue and PutByValue can use them directly for indexed element access. Replace the asm typed-array hot-path ArrayBuffer/DataBlock/ByteBuffer walk with one cached_data_ptr load. Remove six unconditional loads, four branches, and the byte_offset add before the element access, trading them for one cached_data_ptr null check. Keep direct C++ typed-array access on IsValidIntegerIndex-based checks, invalidate cached pointers eagerly when a backing ArrayBuffer is detached, and add regression coverage for shrink, regrow, and detach on number and BigInt typed arrays.	2026-03-18 13:59:05 -05:00
Andreas Kling	b4185f0ecd	LibJS: Split packed and holey asm indexed fast paths Use dedicated Packed branches in GetByValue and PutByValue so in-bounds indexed accesses can skip hole checks and slot reloads. Keep Holey writes on the guarded arm, and keep append writes on the C++ slow path so PutByValue still respects non-extensible indexed objects and arrays with a non-writable length. Add a bytecode regression that exercises both append failure cases through the real js binary path.	2026-03-17 22:28:35 -05:00
Andreas Kling	5f586ae406	LibJS: Promote Holey arrays to Packed when all holes are filled Arrays created via new Array(N) or by setting .length start as Holey since their elements are not present. After sequential fill (e.g. for (i=0; i<N; i++) a[i]=v), all holes are filled but the array remained Holey, preventing the Packed fast paths in the asm interpreter from triggering. Now, whenever indexed_put() writes to the last index of a Holey array, we scan for remaining holes and promote to Packed if none are found. Only checking on writes to the last index avoids O(N^2) scanning on partial fills while still catching the common sequential fill pattern.	2026-03-17 22:28:35 -05:00
Andreas Kling	5895cacc21	LibJS: Add Array.prototype fast paths for packed arrays When the receiver is an Array with packed storage and an intact default prototype chain, some methods can skip the generic property access machinery and operate directly on the indexed element storage. This patch adds fast paths for push(), pop(), concat(), slice() and splice().	2026-03-17 22:28:35 -05:00
Andreas Kling	614713ed08	LibJS: Replace IndexedProperties with inline Packed/Holey/Dictionary Replace the OwnPtr<IndexedPropertyStorage> indirection with inline indexed element storage directly on Object. This eliminates virtual dispatch and reduces indirection for indexed property access. The new system uses three storage kinds tracked by IndexedStorageKind: - Packed: Dense array, no holes. Elements stored in a malloced Value* array with capacity header (same layout as named properties). - Holey: Dense array with possible holes marked by empty sentinel. Same physical layout as Packed. - Dictionary: Sparse storage using GenericIndexedPropertyStorage, type-punned into the m_indexed_elements pointer. Transitions: None->Packed->Holey->Dictionary (mostly monotonic). Dictionary mode triggers on non-default attributes or sparse arrays. Object keeps the same 48-byte size since m_indexed_elements (8 bytes) replaces IndexedProperties (8 bytes), and the storage kind + array size fit in existing padding alongside m_flags. The asm interpreter benefits from one fewer indirection: it now reads the element pointer and array size directly from Object fields instead of chasing through OwnPtr -> IndexedPropertyStorage -> Vector. Removes: IndexedProperties, SimpleIndexedPropertyStorage, IndexedPropertyStorage, IndexedPropertyIterator. Keeps: GenericIndexedPropertyStorage (for Dictionary mode).	2026-03-17 22:28:35 -05:00
Andreas Kling	f574ef528d	LibJS: Replace Vector<Value> with Value* for named property storage Replace the 24-byte Vector<Value> m_storage with an 8-byte raw Value* m_named_properties pointer, backed by a malloc'd allocation with an inline capacity header. Memory layout of the allocation: [u32 capacity] [u32 padding] [Value 0] [Value 1] ... m_named_properties points to Value 0. This shrinks JS::Object from 64 to 48 bytes (on non-Windows platforms) and removes one level of indirection for property access in the asm interpreter, since the data pointer is now stored directly on the object rather than inside a Vector's internal metadata. Growth policy: max(4, max(needed, old_capacity * 2)).	2026-03-17 22:28:35 -05:00
Andrew Kaster	92e4c20ad5	LibJS: Generate FFI header using cbindgen instead of hand-rolling Replace the BytecodeFactory header with cbindgen. This will help ensure that types and enums and constants are kept in sync between the C++ and Rust code. It's also a step in exporting more Rust enums directly rather than relying on magic constants for switch statements. The FFI functions are now all placed in the JS::FFI namespace, which is the cause for all the churn in the scripting parts of LibJS and LibWeb.	2026-03-17 20:49:50 -05:00
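
Note on commit 1ff61754a7 (re-boxing double results as Int32): the rule stated in the message can be sketched in plain C++ as below. This is only an illustration of the stated condition (whole number, within [INT32_MIN, INT32_MAX], not -0.0). The helper name try_rebox_as_int32 is hypothetical and is not taken from LibJS; the actual change lives in the asm interpreter and may be structured quite differently.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>
#include <optional>

// Hypothetical helper (not LibJS API): returns the Int32 form of `value`
// if it can be stored as Int32 without changing its meaning, else nullopt.
inline std::optional<int32_t> try_rebox_as_int32(double value)
{
    constexpr double min = static_cast<double>(std::numeric_limits<int32_t>::min());
    constexpr double max = static_cast<double>(std::numeric_limits<int32_t>::max());

    // Range check first. NaN fails both comparisons, infinities fail one.
    if (!(value >= min && value <= max))
        return std::nullopt;

    // -0.0 compares equal to 0.0, so the sign bit has to be checked
    // explicitly; negative zero must stay a double.
    if (value == 0.0 && std::signbit(value))
        return std::nullopt;

    // The cast is safe because the range was checked above. A round trip
    // back to double detects any fractional part.
    auto as_int = static_cast<int32_t>(value);
    if (static_cast<double>(as_int) != value)
        return std::nullopt;

    return as_int;
}
```

Under these assumptions, 3.0 would be stored as the Int32 3, while 1.5, -0.0, NaN and 2147483648.0 would stay doubles.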

2338 Commits
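
Note on commit f574ef528d (named property storage): the growth policy quoted in the message, max(4, max(needed, old_capacity * 2)), can be written out as a small C++ function. This is a sketch of the formula only. The function name next_capacity and the use of u32-sized values are assumptions made for illustration (the message does describe a u32 capacity header), and overflow of old_capacity * 2 is not handled here.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helper (not LibJS API): computes the new capacity for the
// named property allocation following the policy quoted in the commit
// message: max(4, max(needed, old_capacity * 2)).
inline uint32_t next_capacity(uint32_t needed, uint32_t old_capacity)
{
    // Doubling gives amortized constant-time appends; `needed` covers
    // jumps larger than a doubling; 4 is the floor for small objects.
    uint32_t doubled = old_capacity * 2; // assumes no overflow
    return std::max<uint32_t>(4, std::max(needed, doubled));
}
```

For example, an empty object that needs one slot would get a capacity of 4, an object at capacity 4 that needs 5 slots would grow to 8, and a request for 20 slots from capacity 4 would go straight to 20.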