ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2025-04-25 05:55:13 +00:00

Author	SHA1	Message	Date
Sam Atkins	1108988656	LibWeb/CSS: Generate property-filtering code for pseudo-elements Pseudo-elements have specific rules about which CSS properties can be applied to them. This is a first step to supporting that. - If a property whitelist isn't present, all properties are allowed. - Properties are named as in CSS. - Names of property groups are prefixed with `#`, which makes this match the spec more clearly. These groups are implemented directly in the code generator for now. - Any property name beginning with "FIXME:" is ignored, so we can mark properties we don't implement yet.	2025-03-24 09:49:50 +00:00
Sam Atkins	193adee164	LibWeb/CSS: Implement aliases for pseudo-elements We previously supported a few -webkit vendor-prefixed pseudo-elements. This patch adds those back, along with -moz equivalents, by aliasing them to standard ones. They behave identically, except for serializing with their original name, just like for unrecognized -webkit pseudo-elements. It's likely to be a while before the forms spec settles and authors start using the new pseudo-elements, so until then, we can still make use of styles they've written for the non-standard ones.	2025-03-24 09:49:50 +00:00
Sam Atkins	2c86fd539d	LibWeb: Generate an enum for generated pseudo-elements	2025-03-24 09:49:50 +00:00
Sam Atkins	021e3f5c7d	LibWeb/CSS: Generate is_has_allowed_pseudo_element()	2025-03-24 09:49:50 +00:00
Sam Atkins	ffa1dba96a	LibWeb: Generate pseudo-element code from JSON Initially, this generates the enum and to/from-string functions. The JSON itself contains more data than that, but it's unused for now.	2025-03-24 09:49:50 +00:00
Ryan Liptak	0b0f47e320	LibWeb: Make named character references more spec-compliant & efficient There are two changes happening here: a correctness fix, and an optimization. In theory they are unrelated, but the optimization actually paves the way for the correctness fix. Before this commit, the HTML tokenizer would attempt to look for named character references by checking from after the `&` until the end of m_decoded_input, which meant that it was unable to recognize things like named character references that are inserted via `document.write` one byte at a time. For example, if `∉` was written one-byte-at-a-time with `document.write`, then the tokenizer would only check against `n` since that's all that would exist at the time of the check and therefore erroneously conclude that it was an invalid named character reference. This commit modifies the approach taken for named character reference matching by using a trie-like structure (specifically, a deterministic acyclic finite state automaton or DAFSA), which allows for efficiently matching one-character-at-a-time and therefore it is able to pick up matching where it left off after each code point is consumed. Note: Because it's possible for a partial match to not actually develop into a full match (e.g. `&notindo` which could lead to `&notindot;`), some backtracking is performed after-the-fact in order to only consume the code points within the longest match found (e.g. `&notindo` would backtrack back to `&not`). With this new approach, `document.write` being called one-byte-at-a-time is handled correctly, which allows for passing more WPT tests, with the most directly relevant tests being `/html/syntax/parsing/html5lib_entities01.html` and `/html/syntax/parsing/html5lib_entities02.html` when run with `?run_type=write_single`. Additionally, the implementation now better conforms to the language of the spec (and resolves a FIXME) because exactly the matched characters are consumed and nothing more, so SWITCH_TO is able to be used as the spec says instead of RECONSUME_IN. The new approach is also an optimization: - Instead of a linear search using `starts_with`, the usage of a DAFSA means that it is always aware of which characters can lead to a match at any given point, and will bail out whenever a match is no longer possible. - The DAFSA is able to take advantage of the note in the section `13.5 Named character references` that says "This list is static and will not be expanded or changed in the future." and tailor its Node struct accordingly to tightly pack each node's data into 32-bits. Together with the inherent DAFSA property of redundant node deduplication, the amount of data stored for named character reference matching is minimized. In my testing: - A benchmark tokenizing an arbitrary set of HTML test files was about 1.23x faster (2070ms to 1682ms). - A benchmark tokenizing a file with tens of thousands of named character references mixed in with truncated named character references and arbitrary ASCII characters/ampersands runs about 8x faster (758ms to 93ms). - The size of `liblagom-web.so` was reduced by 94.96KiB. Some technical details: A DAFSA (deterministic acyclic finite state automaton) is essentially a trie flattened into an array, but it also uses techniques to minimize redundant nodes. This provides fast lookups while minimizing the required data size, but normally does not allow for associating data related to each word. However, by adding a count of the number of possible words from each node, it becomes possible to also use it to achieve minimal perfect hashing for the set of words (which allows going from word -> unique index as well as unique index -> word). This allows us to store a second array of data so that the DAFSA can be used as a lookup for e.g. the associated code points. For the Swift implementation, the new NamedCharacterReferenceMatcher was used to satisfy the previous API and the tokenizer was left alone otherwise. In the future, the Swift implementation should be updated to use the same implementation for its NamedCharacterReference state as the updated C++ implementation.	2025-03-22 16:03:44 +01:00
Andrew Kaster	f22f6e1f5b	CMake: Remove unused CMake functions	2025-03-20 11:36:09 -06:00
Psychpsyo	528af90cd4	Meta: Make CI enforce doctype in test cases This currently does not enforce it on Layout tests, even though it seems most necessary there. This is to speed up the review on this PR due to an excessive amount of layout tests that would need rebaselining if DOCTYPEs were added to them.	2025-03-20 11:50:49 +01:00
Timothy Flynn	086a921213	AK: Disallow construction of JsonParser JsonParser has a footgun where it does not retain ownership of the string to be parsed. For example, the following results in UAF: JsonParser parser(something_returning_a_string()); parser.parse(); Let's avoid this altogether by only allowing use of JsonParser with a static, safe method.	2025-03-20 10:50:24 +01:00
Sam Atkins	a28197669a	LibWeb/CSS: Move property code from CSSStyleDeclaration to *Properties CSSStyleDeclaration is a base class that's used by various collections of style properties or descriptors. This commit moves all style-property-related code into CSSStyleProperties, where it belongs. As noted in the previous commit, we also apply the CSSStyleProperties prototype now.	2025-03-19 13:53:00 +00:00
Sam Atkins	83bb92c4e0	LibWeb/CSS: Merge style declaration subclasses into CSSStyleProperties We previously had PropertyOwningCSSStyleDeclaration and ResolvedCSSStyleDeclaration, representing the current style properties and resolved style respectively. Both of these were the CSSStyleDeclaration type in the CSSOM. (We also had ElementInlineCSSStyleDeclaration but I removed that in a previous commit.) In the meantime, the spec has changed so that these should now be a new CSSStyleProperties type in the CSSOM. Also, we need to subclass CSSStyleDeclaration for things like CSSFontFaceRule's list of descriptors, which means it wouldn't hold style properties. So, this commit does the fairly messy work of combining these two types into a new CSSStyleProperties class. A lot of what previously was done as separate methods in the two classes, now follows the spec steps of "if the readonly flag is set, do X" instead, which is hopefully easier to follow too. There is still some functionality in CSSStyleDeclaration that belongs in CSSStyleProperties, but I'll do that next. To avoid a huge diff for "CSSStyleDeclaration-all-supported-properties-and-default-values.txt" both here and in the following commit, we don't apply the (currently empty) CSSStyleProperties prototype yet.	2025-03-19 13:53:00 +00:00
Luke Wilde	02236be737	LibWeb/CSP: Implement SecurityPolicyViolationEvent This is used to report violations of policies to the element/global object that caused it.	2025-03-19 00:55:14 +01:00
Andrew Kaster	89ecc75ed8	LibCore+Meta: Un-break Swift build on Linux LibCore's list of ignored header files for Swift was missing the Apple only files on non-Apple platforms. Additionally, any generic glue code cannot use -fobjc-arc, so we need to rely on -fblocks only.	2025-03-18 17:15:08 -06:00
Andrew Kaster	01ac48b36f	AK: Support storing blocks in AK::Function This has two slightly different implementations for ARC and non-ARC compiler modes. The main idea is to store a block pointer as our closure and use either ARC magic or BlockRuntime methods to manage the memory for the block. Things are complicated by the fact that we don't yet force-enable swift, so we can't count on the swift.org llvm fork being our compiler toolchain. The patch adds some CMake checks and ifdefs to still support environments without support for blocks or ARC.	2025-03-18 17:15:08 -06:00
Andrew Kaster	72acb1111f	CMake: Add find module for BlocksRuntime on non-Apple platforms	2025-03-18 17:15:08 -06:00
Andrew Kaster	1148116a87	CMake: Allow passing test name to serenity_test This forwards to lagom_test. One day we should simplify this..	2025-03-18 17:15:08 -06:00
Andrew Kaster	20890d7b70	Meta: Add fontconfig and freetype vcpkg overlays Temporary until their canonical git forge is back up and running	2025-03-18 00:25:42 -06:00
Andrew Kaster	c8787e6a9f	LibCore: Add swift bindings for EventLoop as an Executor and Actor	2025-03-15 21:51:22 -06:00
Timothy Flynn	3af63bc5a3	Everywhere: Replace miscellaneous references to the chrome process	2025-03-15 19:57:27 -04:00
Timothy Flynn	810d04b3f4	LibWeb+LibWebView+WebContent: Remove the built-in Inspector	2025-03-15 19:09:40 +01:00
Shannon Booth	f8f21319f9	LibURL/Pattern: Implement the URL Pattern Tokenizer The tokenizer is used for both pattern string and constructor string parsing of URL Patterns.	2025-03-15 07:39:03 -04:00
SINF-KEN	96a5b97131	Base: Add new error page This commit introduces a new error page. Co-authored-by: NotNotNescap <nescap@tuta.com>	2025-03-12 11:56:15 -04:00
Gingeh	5838c73a72	LibWeb: Restrict weird about:foo URIs This commit: - Prevents path traversal via the about: scheme - Prevents loading about:inspector - Requires about: URIs to be opaque paths - Prevents crashes with invalid percent encoded paths	2025-03-12 10:41:06 +00:00
Timothy Flynn	0e057d3a36	LibIPC: Don't generate "take" methods for primitive/simple types	2025-03-09 11:14:20 -04:00
Timothy Flynn	cf69f52d53	LibIPC+Everywhere: Always pass ownership of transferred data to clients This has been a longstanding ergonomic issue with our IPC compiler. Non- trivial types were previously passed by const&. So if we wanted to avoid expensive copies, we would have to const_cast and move the data. We now pass ownership of all transferred data to the client subclasses. This allows us to remove const_cast from these methods, and allows us to avoid some trivial expensive copies that we didn't bother to const_cast.	2025-03-09 11:14:20 -04:00
Timothy Flynn	68947d55d9	LibIPC: Do not require constructing containers when sending IPC messages For example, consider the following IPC message: do_something(u64 page_id, String string, Vector<Data> data) =\| We would previously generate the following C++ method to encode/transfer this message: void do_something(u64 page_id, String string, Vector<Data> data); This required the caller to either have to copy the non-trivial types or `move` them in. In some places, this meant we had to construct temporary vectors just to send an IPC. This isn't necessary because we weren't holding onto these parameters anyways. We would construct an IPC::Message subclass with them (which does require owning types), but then immediate encode the message to an IPC::MessageBuffer and send it. We now generate code such that we don't need to construct a Message. We can simply encode the parameters directly without needing ownership. This allows us to take view-types to IPC parameters. So the above example now becomes: void do_something(u64, StringView, ReadonlySpan<Data>);	2025-03-09 11:14:20 -04:00
Timothy Flynn	da6c6487e5	LibIPC: Extract helper lambda to its own method This will be needed in an upcoming commit so that this method may call itself recursively to generate overloads. Doing this extraction ahead of time will simply make that diff easier to grok.	2025-03-09 11:14:20 -04:00
Timothy Flynn	8277f42812	LibIPC: Remove some unnecessary newlines in generated IPC files This isn't particularly important, but when staring at generated IPC files, it's nice not to have an extra newline after every line of code throughout the files.	2025-03-09 11:14:20 -04:00
Andreas Kling	c333042e63	LibWeb: Add opt-in tracing of update_layout() calls with reason	2025-03-08 03:37:38 +01:00
Totto16	1fc5744aa4	Meta: Make gn linter actually fail Without the `--dry-run` flag, the gn linter just formats the files in place and reports no failure	2025-03-07 09:44:42 +01:00
Totto16	4c24e0f63d	Meta: Format gn files	2025-03-07 09:44:42 +01:00
Luke Wilde	6d1f78198d	LibWeb: Implement Resource Timing	2025-03-06 09:00:53 -07:00
R-Goc	fe26c3145d	AK: Add stacktrace support using the standard lib This commit adds support for using the standard library implementation of <stacktrace> if libbacktrace is not found. This can also be explicitly enabled through ENABLE_STD_STACKTRACE for platforms that have libbacktrace available. Co-Authored-By: Andrew Kaster <andrew@ladybird.org>	2025-03-05 10:25:12 -07:00
Shannon Booth	1a47cfe3b9	Meta: Use URL basic parse for URL Fuzzer	2025-03-04 16:24:19 -05:00
Sam Atkins	f148af0a93	LibWeb: Move XMLSerializer into HTML directory The DOMParsing spec is in the process of being merged into the HTML one, gradually. The linked spec change moves XMLSerializer, but many of the algorithms are still in the DOMParsing spec so I've left the links to those alone. I've done my best to update the GN build but since I'm not actually using it, I might have done that wrong. Corresponds to `2edb8cc7ee`	2025-03-04 16:44:41 +00:00
Sam Atkins	a11848f163	IDLGenerators: Allow DOMString? reflection on non-enumerated attributes Corresponds to `ef2fba2a3b`	2025-03-04 16:44:41 +00:00
Totto16	4c54fa10ed	LibWeb: Improve Enum generation in IDLGenerator Generated enums have no underlying type specifier, this adds one It uses the smallest available, which is mostly u8	2025-03-04 16:35:04 +00:00
Luke Wilde	6cb0f41c57	LibWeb/WebGL: Return from uniform methods if location is null If location is null, the spec simply tells us to return and ignore the passed in data. Fixes #3708.	2025-03-03 08:43:33 +01:00
Sam Atkins	f97ac33cb3	LibWeb/CSS: Use NumericCalculationNode for constants Having multiple kinds of node that hold numeric values made things more complicated than they needed to be, and we were already converting ConstantCalculationNodes to NumericCalculationNodes in the first simplification pass that happens at parse-time, so they didn't exist after that. As noted, the spec allows for other contexts to introduce their own numeric keywords, which might be resolved later than parse-time. We'll need a different mechanism to support those, but ConstantCalculationNode could not have done so anyway.	2025-02-27 21:42:43 +01:00
Andreas Kling	8ab61843be	LibWeb: Parse CSS fit-content(<length-percentage>) values Before this change, we only parsed fit-content as a standalone keyword, but CSS-SIZING-3 added it as a function as well. I don't know of anything else in CSS that is overloaded like this, so it ends up looking a little awkward in the implementation. Note that a lot of code had already been prepped for fit-content values to have an argument, we just weren't parsing it.	2025-02-27 00:44:14 +01:00
Sam Atkins	c729c3fcee	LibWeb/CSS: Add custom-ident blacklists to Properties.json These excluded values use a similar notation to the ranges for numeric types: `![foo,bar,baz]` to exclude `foo`, `bar`, and `baz`.	2025-02-26 11:22:47 +00:00
Luke Wilde	198cec481a	LibWeb: Allow null for optional, nullable, no default value union types For these types, it would previously only accept `undefined` for the `null` state. Fixes GET requests in the Turbo library always failing: `9e057f284a/src/http/fetch_request.js (L219-L220)` `9e057f284a/src/http/fetch_request.js (L51-L64)` This was found on https://www.fangamer.com/.	2025-02-22 18:45:27 +01:00
rmg-x	2b32f94527	CodeGenerators: Fix clang-18 crash by specifying types explicitly clang 18.1.8 was crashing during the build without this change. Filed a bug report here: https://github.com/llvm/llvm-project/issues/128359 Co-authored-by: Tim Flynn <trflynn89@pm.me>	2025-02-22 12:38:11 -05:00
Tim Ledbetter	7ee33529e8	IDLGenerators: Throw TypeError if custom element is already constructed Throwing a TypeError instead of an InvalidStateError DOMException follows the current specification steps.	2025-02-22 05:39:23 -05:00
Jelle Raaijmakers	63b451cb46	LibWeb: Add SVGFilterElement	2025-02-22 10:14:29 +01:00
Jelle Raaijmakers	aeef179668	LibWeb: Add SVGAnimatedEnumeration	2025-02-22 10:14:29 +01:00
Andrew Kaster	13338a9532	CMake: Ensure build and test presets for Distribution and Debug	2025-02-21 12:05:18 +01:00
Timothy Flynn	2c03de60da	AK+Everywhere: Remove now-unecessary use of ByteString with JSON types This removes JsonObject::get_byte_string and JsonObject::to_byte_string.	2025-02-20 19:27:51 -05:00
Timothy Flynn	bc54c0cdfb	AK+Everywhere: Store JSON strings as String	2025-02-20 19:27:51 -05:00
Timothy Flynn	e591636419	AK+Everywhere: Store JSON object keys as String	2025-02-20 19:27:51 -05:00

1 2 3 4 5 ...

3986 commits