serenity

mirror of https://github.com/SerenityOS/serenity synced 2026-05-14 19:06:55 +02:00

Author	SHA1	Message	Date
Sönke Holz	4f8490b5ff	Kernel: Move boot info variables into a shared struct This commit reorganizes the BootInfo struct definition so it can be shared for all architectures. The existing free extern "C" boot info variables have been removed and replaced with a global BootInfo struct, 'g_boot_info'. On x86-64, the BootInfo is directly copied from the Prekernel-provided struct. On AArch64 and RISC-V, BootInfo is populated during pre_init.	2024-10-30 18:51:35 -04:00
Liav A.	96e1391c23	Kernel/Devices: Remove the DeviceManagement singleton This change has many improvements: - We don't use `LockRefPtr` to hold instances of many base devices as with the DeviceManagement class. Instead, we have a saner pattern of holding them in a `NonnullRefPtr<T> const`, in a small-text footprint class definition in the `Device.cpp` file. - The awkwardness of using `::the()` each time we need to get references to mostly-static objects (like the Event queue) in runtime is now gone in the migration to using the `Device` class. - Acquiring a device feel more obvious because we use now the Device class for this method. The method name is improved as well.	2024-10-05 12:26:48 +02:00
Liav A.	b93ca74d81	Kernel: Add a prctl option to enter jail mode until an execve syscall In addition to the already existing option to enter jail mode (which is set indefinitely), there should be a less restrictive option that should allow exiting jail mode when doing the execve syscall. This option will be useful for programs that need this kind of security layer only in their runtime, but they're meant to actually initiate another program in the end.	2024-10-03 12:39:45 +02:00
Liav A.	b90a36d2a9	Kernel+Userland: Rename jailed => jailed_until_exit In all instances, it should be clear that the jailing of a process is ending when the process exits. This is a preparation before introducing another option to set a process as jailed until it calls the execve syscall.	2024-10-03 12:39:45 +02:00
Dan Klishch	e4840e3c84	Everywhere: Fix compilation with Clang 19 3 fixes here come from better diagnostics in the latest Clang.	2024-10-02 21:42:33 -04:00
Liav A.	0482f4e117	Kernel: Remove passing of register state to IRQ handlers Linux did the same thing 18 years ago and their reasons for the change are similar to ours - https://github.com/torvalds/linux/commit/7d12e78 Most interrupt handlers (i.e. IRQ handlers) never used the register state reference anywhere so there's simply no need of passing it around. I didn't measure the performance boost but surely this change can't make things worse anyway.	2024-09-01 21:00:18 +02:00
Liav A.	fdf3e0aca1	Kernel: Don't assume sizes of needed buffers early in the execve syscall Instead, start by trying to read a buffer with size of Elf_Ehdr, and check it for the shebang sign. If it's indeed an executable with shebang then read again from the file, now with PAGE_SIZE size, which should suffice for finding the interpreter path. However, if the executable is an ELF, we quickly validate it and then pass the preliminary buffer to the find_elf_interpreter_for_executable method. That method calculates the last byte offset which is needed to read all of the program headers, so we don't just assume 4096 bytes is sufficient anymore. The same pattern is applied when loading the interpreter ELF main header and its program headers.	2024-09-01 20:52:55 +02:00
Sönke Holz	75ccee81be	Kernel: Use the correct ip in the dispatch_signal debug output The signal handler RegisterState is in 'regs', not 'm_regs'.	2024-08-21 08:17:17 -04:00
Liav A.	79d9abd3cc	Kernel: Create coredump by using the crashed process VFS root context Resolve a regression caused by `01e1af732b`. This unbreaks coredump generation, because we need to use the VFS root context of the crashed process and not of the FinalizerTask, as it will hold an empty VFS root context that is assigned to kernel processes.	2024-08-10 10:14:37 -04:00
Idan Horowitz	1e2919b5c1	Kernel: Add create_kernel_thread overload that accepts a lambda Also fixes the entry function argument initialization to be architecture-independent.	2024-07-26 14:25:49 -04:00
Sönke Holz	87f194b3f5	Kernel: Don't truncate the pc value in the scheduler state dump This is likely a leftover from the i686-only days. Also rename the get_eip function to get_pc to be more arch-agnostic.	2024-07-23 09:03:31 -04:00
Liav A.	4aec3f4ef9	Kernel+Userland: Simplify loading of an ELF interpreter path The LibELF validate_program_headers method tried to do too many things at once, and as a result, we had an awkward return type from it. To be able to simplify it, we no longer allow passing a StringBuilder* but instead we require to pass an Optional<Elf_Phdr> by reference so it could be filled with actual ELF program header that corresponds to an INTERP header if such found. As a result, we ensure that only certain implementations that actually care about the ELF interpreter path will actually try to load it on their own and if they fail, they can have better diagnostics for an invalid INTERP header. This change also fixes a bug that on which we failed to execute an ELF program if the INTERP header is located outside the first 4KiB page of the ELF file, as the kernel previously didn't have support for looking beyond that for that header.	2024-07-21 15:38:52 +02:00
Liav A.	0e6624dc86	Kernel: Introduce the unshare syscall family These 2 syscalls are responsible for unsharing resources in the system, such as hostname, VFS root contexts and process lists. Together with an appropriate userspace implementation, these syscalls could be used for creating a sandbox environment (containers) for user programs.	2024-07-21 11:44:23 +02:00
Liav A.	e52abd4c09	Kernel: Introduce the HostnameContext class Similarly to VFSRootContext and ScopedProcessList, this class intends to form resource isolation as well. We add this class as an infrastructure preparation of hostname contexts which should allow processes to obtain different hostnames on the same machine.	2024-07-21 11:44:23 +02:00
Liav A.	3692af528e	Kernel: Move most of VirtualFileSystem code to be in a namespace There's no point in constructing an object just for the sake of keeping a state that can be touched by anything in the kernel code. Let's reduce everything to be in a C++ namespace called with the previous name "VirtualFileSystem" and keep a smaller textual-footprint struct called "VirtualFileSystemDetails". This change also cleans up old "friend class" statements that were no longer needed, and move methods from the VirtualFileSystem code to more appropriate places as well. Please note that the method of locking all filesystems during shutdown is removed, as in that place there's no meaning to actually locking all filesystems because of running in kernel mode entirely.	2024-07-21 11:44:23 +02:00
Liav A.	4370bbb3ad	Kernel+Userland: Introduce the copy_mount syscall This new syscall will be used by the upcoming runc (run-container) utility. In addition to that, this syscall allows userspace to neatly copy RAMFS instances to other places, which was not possible in the past.	2024-07-21 11:44:23 +02:00
Liav A.	dd59fe35c7	Kernel+Userland: Reduce jails to be a simple boolean flag The whole concept of Jails was far more complicated than I actually want it to be, so let's reduce the complexity of how it works from now on. Please note that we always leaked the attach count of a Jail object in the fork syscall if it failed midway. Instead, we should have attach to the jail just before registering the new Process, so we don't need to worry about unsuccessful Process creation. The reduction of complexity in regard to jails means that instead of relying on jails to provide PID isolation, we could simplify the whole idea of them to be a simple SetOnce, and let the ProcessList (now called ScopedProcessList) to be responsible for this type of isolation. Therefore, we apply the following changes to do so: - We make the Jail concept no longer a class of its own. Instead, we simplify the idea of being jailed to a simple ProtectedValues boolean flag. This means that we no longer check of matching jail pointers anywhere in the Kernel code. To set a process as jailed, a new prctl option was added to set a Kernel SetOnce boolean flag (so it cannot change ever again). - We provide Process & Thread methods to iterate over process lists. A process can either iterate on the global process list, or if it's attached to a scoped process list, then only over that list. This essentially replaces the need of checking the Jail pointer of a process when iterating over process lists.	2024-07-21 11:44:23 +02:00
Liav A.	91c87c5b77	Kernel+Userland: Prepare for considering VFSRootContext when mounting Expose some initial interfaces in the mount-related syscalls to select the desired VFSRootContext, by specifying the VFSRootContext index number. For now there's still no way to create a different VFSRootContext, so the only valid IDs are -1 (for currently attached VFSRootContext) or 1 for the first userspace VFSRootContext.	2024-07-21 11:44:23 +02:00
Liav A.	01e1af732b	Kernel/FileSystem: Introduce the VFSRootContext class The VFSRootContext class, as its name suggests, holds a context for a root directory with its mount table and the root custody/inode in the same class. The idea is derived from the Linux mount namespace mechanism. It mimicks the concept of the ProcessList object, but it is adjusted for a root directory tree context. In contrast to the ProcessList concept, processes that share the default VFSRootContext can't see other VFSRootContext related properties such as as the mount table and root custody/inode. To accommodate to this change progressively, we internally create 2 main VFS root contexts for now - one for kernel processes (as they don't need to care about VFS root contexts for the most part), and another for all userspace programs. This separation allows us to continue pretending for userspace that everything is "normal" as it is used to be, until we introduce proper interfaces in the mount-related syscalls as well as in the SysFS. We make VFSRootContext objects being listed, as another preparation before we could expose interfaces to userspace. As a result, the PowerStateSwitchTask now iterates on all contexts and tear them down one by one.	2024-07-21 11:44:23 +02:00
Liav A.	ef243d42fc	Kernel: Remove the ConsoleManagement singleton We don't really need it, and the entire functionality can be organically intergrated into the VirtualConsole class, to switch between the Virtual consoles, and manage initialization of all consoles in the global array.	2024-07-04 22:20:35 +02:00
Liav A.	ecc9c5409d	Kernel: Ignore dirfd if absolute path is given in VFS-related syscalls To be able to do this, we add a new class called CustodyBase, which can be resolved on-demand internally in the VirtualFileSystem resolving path code. When being resolved, CustodyBase will return a known custody if it was constructed with such, if that's not the case it will provide the root custody if the original path is absolute. Lastly, if that's not the case as well, it will resolve the given dirfd to provide a Custody object.	2024-06-01 19:25:15 +02:00
Sönke Holz	fe12a413a1	Kernel: Use AK::unwind_stack_from_frame_pointer	2024-05-14 14:02:06 -06:00
Liav A.	15ddc1f17a	Kernel+Userland: Reject W->X prot region transition after a prctl call We add a prctl option which would be called once after the dynamic loader has finished to do text relocations before calling the actual program entry point. This change makes it much more obvious when we are allowed to change a region protection access from being writable to executable. The dynamic loader should be able to do this, but after a certain point it is obvious that such mechanism should be disabled.	2024-05-14 12:41:51 -06:00
Liav A.	2bba9411ca	Kernel: Use the AK SetOnce container class in various cases We have many places in the kernel code that we have boolean flags that are only set once, and never reset again but are checked multiple times before and after the time they're being set, which matches the purpose of the SetOnce class.	2024-04-26 23:46:23 -06:00
Timothy Flynn	ab602cfc2c	Kernel: Colorize log message for capabilities that have not been pledged The log message can be hard to spot in a sea of debug messages. Colorize it to make the message more immediately pop out.	2024-04-26 09:29:02 -04:00
Sönke Holz	243d7003a2	Kernel+LibC+LibELF: Move TLS handling to userspace This removes the allocate_tls syscall and adds an archctl option to set the fs_base for the current thread on x86-64, since you can't set that register from userspace. enter_thread_context loads the fs_base for the next thread on each context switch. This also moves tpidr_el0 (the thread pointer register on AArch64) to the register state, so it gets properly saved/restored on context switches. The userspace TLS allocation code is kept pretty similar to the original kernel TLS code, aside from a couple of style changes. We also have to add a new argument "tls_pointer" to SC_create_thread_params, as we otherwise can't prevent race conditions between setting the thread pointer register and signal handling code that might be triggered before the thread pointer was set, which could use TLS.	2024-04-19 16:46:47 -06:00
Sönke Holz	216089c7a1	Kernel: Add a Thread member for arch-specific data This will be used to store the fs_base value on x86-64, which is needed for thread-local storage.	2024-04-19 16:46:47 -06:00
Sönke Holz	57f4f8caf8	Kernel+LibC: Introduce new archctl syscall This syscall will be used for architecture-specific operations.	2024-04-19 16:46:47 -06:00
Dan Klishch	5ed7cd6e32	Everywhere: Use east const in more places These changes are compatible with clang-format 16 and will be mandatory when we eventually bump clang-format version. So, since there are no real downsides, let's commit them now.	2024-04-19 06:31:19 -04:00
Space Meyer	5d89d3090e	Kernel: Add KCOV recursion debugging	2024-04-15 21:16:22 -06:00
Space Meyer	bba94804c2	Kernel: Deduplicate backtrace printing	2024-04-15 21:16:22 -06:00
Space Meyer	a721e4d507	Kernel: Track KCOVInstance via Process instead of HashMap While this clutters Process.cpp a tiny bit, I feel that it's worth it: - 2x speed on the kcov_loop benchmark. Likely more during fuzzing. - Overall code complexity is going down with this change. - By reducing the code reachable from __sanitizer_cov_trace_pc code, we can now instrument more code.	2024-04-15 21:16:22 -06:00
Space Meyer	fdc0328ce3	Kernel: Exclude individual functions from coverage instrumentation Sticking this to the function source has multiple benefits: - We instrument more code, by not excluding entire files. - NO_SANITIZE_COVERAGE can be used in Header files. - Keeping the info with the source code, means if a function or file is moved around, the NO_SANITIZE_COVERAGE moves with it.	2024-04-15 21:16:22 -06:00
Sönke Holz	496a7541a2	Kernel/riscv64: Implement the signal trampoline	2024-03-25 14:17:32 -06:00
Idan Horowitz	209c588ed1	Kernel: Switch a couple of signal dispatch dbglns to dbgln_if These are pretty spammy when using strace.	2024-03-02 09:10:14 +01:00
Idan Horowitz	e38ccebfc8	Kernel: Stop swallowing thread unblocks while process is stopped This easily led to kernel deadlocks if the stopped thread held an important global mutex (like the disk cache lock) while blocking. Resolve this by ensuring stopped threads have a chance to return to the userland boundary before actually stopping.	2024-02-10 08:42:53 +01:00
Idan Horowitz	6a4b93b3e0	Kernel: Protect processes' master TLS with a fine-grained spinlock This moves it out of the scope of the big process lock, and allows us to wean some syscalls off it, starting with sys$allocate_tls.	2023-12-26 19:20:21 +01:00
Idan Horowitz	a49b7e92eb	Kernel: Shrink instead of expand sigaltstack range to page boundaries Since the POSIX sigaltstack manpage suggests allocating the stack region using malloc(), and many heap implementations (including ours) store heap chunk metadata in memory just before the vended pointer, we would end up zeroing the metadata, leading to various crashes.	2023-12-24 16:11:35 +01:00
Sönke Holz	e4019ba9dc	Kernel: Make `CrashHandler` more useful before `init_stage2` Display some helpful information about crashes even before the first process is started.	2023-12-09 22:36:28 +01:00
Daniel Bertalan	45d81dceed	Everywhere: Replace `ElfW(type)` macro usage with `Elf_type` This works around a `clang-format-17` bug which caused certain usages to be misformatted and fail to compile. Fixes #8315	2023-12-01 10:02:39 +02:00
Sönke Holz	da88d766b2	Kernel/riscv64: Make the kernel compile This commits inserts TODOs into all necessary places to make the kernel compile on riscv64!	2023-11-10 15:51:31 -07:00
Tim Schumacher	a2f60911fe	AK: Rename GenericTraits to DefaultTraits This feels like a more fitting name for something that provides the default values for Traits.	2023-11-09 10:05:51 -05:00
Liav A	cbaa3465a8	Kernel: Add jail semantics to methods iterating over thread lists We should consider whether the selected Thread is within the same jail or not. Therefore let's make it clear to callers with jail semantics if a called method checks if the desired Thread object is within the same jail. As for Thread::for_each_* methods, currently nothing in the kernel codebase needs iteration with consideration for jails, so the old Thread::for_each* were simply renamed to include "ignoring_jails" suffix in their names.	2023-09-15 11:06:48 -06:00
Liav A	b55199c227	Kernel: Move TTY-related code to a new subdirectory under Devices The TTY subsystem is represented with unix devices, so it should be under the Devices directory like the Audio, Storage, GPU and HID subsystems.	2023-09-09 12:08:59 -06:00
Jakub Berkop	54e79aa1d9	Kernel+ProfileViewer: Display additional filesystem events	2023-09-09 11:26:51 -06:00
Jakub Berkop	c184a0786f	Kernel: Protect access to PerformanceEventBuffer strings with spinlock	2023-09-09 11:26:51 -06:00
Zak-K-Abdi	abcf05801a	Kernel: Allow Ext2FS::flush_writes() to return ErrorOr<void>	2023-08-25 11:36:57 +01:00
Liav A	1c0aa51684	Kernel+Userland: Remove the {get,set}_thread_name syscalls These syscalls are not necessary on their own, and they give the false impression that a caller could set or get the thread name of any process in the system, which is not true. Therefore, move the functionality of these syscalls to be options in the prctl syscall, which makes it abundantly clear that these operations could only occur from a running thread in a process that sees other threads in that process only.	2023-08-25 11:51:52 +02:00
Liav A	ef6133337e	Kernel: Merge PowerStateSwitchTask reboot and shutdown procedures The reboot procedure should prepare to "shutdown" the system cleanly and therefore has to be merged with how shutdown is handled.	2023-08-20 13:04:42 -06:00
Liav A	b81b2c3fe7	Kernel: Ensure only user processes are terminated properly in shutdown This patch ensures that the shutdown procedure can complete due to the fact we don't kill kernel processes anymore, and only stop the scheduler from running after the filesystems unmount procedure. We also need kernel processes during the shutdown procedure, because we rely on the WorkQueue threads to run WorkQueue items to complete async IO requests initiated by filesystem sync & unmounting, etc. This is also simplifying the code around the killing processes, because we don't need to worry about edge cases such as the FinalizerTask anymore.	2023-08-20 13:04:42 -06:00

1 2 3

104 Commits