ladybird

mirror of https://github.com/LadybirdBrowser/ladybird synced 2026-05-05 22:52:22 +02:00

Author	SHA1	Message	Date
Andreas Kling	6492c88ad8	AsmIntGen: Elide redundant FP comparisons in consecutive branch_fp_* When consecutive branch_fp_* instructions use the same operands (e.g. branch_fp_unordered followed by branch_fp_equal), the 2nd ucomisd/fcmp is redundant since flags are still valid from the first comparison. Track the last FP comparison operands in HandlerState and skip the comparison instruction when it would be identical. This is common in the double_equality_compare macro which checks for unordered (NaN) before testing equality.	2026-03-07 22:18:22 +01:00
Andreas Kling	472edb3448	AsmIntGen: Use mov r32 for unsigned 32-bit immediates on x86_64 Values in the range 0x80000000..0xFFFFFFFF were incorrectly emitted as plain `mov r64, imm` which GAS encodes as a 10-byte movabs. Use `mov r32, imm32` instead (5 bytes, implicitly zero-extends to 64 bits). This affects constants like ENVIRONMENT_COORDINATE_INVALID (0xFFFFFFFE) which appeared 5 times in the generated assembly.	2026-03-07 22:18:22 +01:00
Andreas Kling	c6fd52e317	AsmIntGen: Move NaN canonicalization to cold fixup blocks canonicalize_nan previously emitted its full NaN fixup inline: on x86_64, a 10-byte movabs + cmovp; on aarch64, a multi-instruction mov sequence + fcsel. These were always on the hot path even though NaN results from arithmetic are extremely rare. Move the NaN fixup to a cold block emitted after the handler body. The hot path is now just: movq/fmov + ucomisd/fcmp + jp/b.vs (a forward branch predicted not-taken). This removes 14 bytes of instructions from the hot path of every handler that produces double results (Add, Sub, Mul, Div, and several builtins). Both backends gain a HandlerState struct (shared between them) that accumulates cold fixup blocks during code generation, emitted after the main body.	2026-03-07 22:18:22 +01:00
Andreas Kling	5b8114a96b	AsmInt: Use hardware overflow flag for int32 arithmetic Replace the pattern of 64-bit arithmetic + sign-extend + compare with dedicated 32-bit overflow instructions that use the hardware overflow flag directly. Before: add t3, t4 / unbox_int32 t5, t3 / branch_ne t3, t5, .overflow After: add32_overflow t3, t4, .overflow On x86_64 this compiles to `add r32, r32; jo label` (the 32-bit register write implicitly zeros the upper 32 bits). On aarch64, `adds w, w, w; b.vs label` for add/sub, `smull + sxtw + cmp + b.ne` for multiply, and `negs + b.vs` for negate. Nine call sites updated: Add, Sub, Mul, Increment, Decrement, PostfixIncrement, PostfixDecrement, UnaryMinus, and CallBuiltin(abs).	2026-03-07 22:18:22 +01:00
Andreas Kling	9ae5445493	LibJS: Add AsmIntGen assembly interpreter code generator AsmIntGen is a Rust tool that compiles a custom assembly DSL into native x86_64 or aarch64 assembly (.S files). It reads struct field offsets from a generated constants file and instruction layouts from Bytecode.def (via the BytecodeDef crate) to emit platform-specific code for each bytecode handler. The DSL provides a portable instruction set with register aliases, field access syntax, labels, conditionals, and calls. Each backend (codegen_x86_64.rs, codegen_aarch64.rs) translates this into the appropriate platform assembly with correct calling conventions (SysV AMD64, AAPCS64).	2026-03-07 13:09:59 +01:00

5 Commits