ladybird

mirror of https://github.com/LadybirdBrowser/ladybird.git synced 2025-10-09 09:39:13 +00:00

Author	SHA1	Message	Date
Linus Groh	32ad939e44	LibWeb: Rename HighResolutionTime/{CoarsenTime => TimeOrigin}.cpp/h This is being used for more than just time coarsening now, so let's use the spec's section title for the name.	2022-10-05 09:12:59 +01:00
Linus Groh	4ea6cc56be	LibWeb: Move unsafe_shared_current_time() to HighResolutionTime This doesn't belong on the EventLoop at all, as far as I can tell.	2022-10-05 09:12:59 +01:00
Andreas Kling	97ca45d9c6	LibWeb: Store HTML tag name token data as FlyString while parsing This makes checking if a token is a specific tag O(1) instead of O(n).	2022-10-04 21:30:58 +02:00
Linus Groh	fb21271334	LibWeb: Replace incorrect uses of AK::is_ascii_space()	2022-10-02 21:32:49 +02:00
Andrew Kaster	f0c5f77f99	LibWeb: Remove unecessary dependence on Window from HTML classes These classes only needed Window to get at its realm. Pass a realm directly to construct HTML classes.	2022-10-01 21:05:32 +01:00
Luke Wilde	7b8a6b8e7a	LibWeb: Set HTMLParser::m_scripting_enabled as according to the spec This allows <noscript> elements to display their content as proper HTML instead of raw text when scripting is disabled.	2022-09-23 22:25:09 +01:00
Andreas Kling	797d28adca	LibWeb: Save begin/end timestamps for load and DOMContentLoaded events	2022-09-21 11:51:18 +02:00
Andreas Kling	ab8432783e	LibWeb: Implement aborting the HTML parser This is roughly on-spec, although I had to invent a simple "aborted" state for the tokenizer.	2022-09-20 23:44:59 +02:00
Ben Wiederhake	d073a31c21	LibWeb: Add missing includes This remained undetected for a long time as HeaderCheck is disabled by default. This commit makes the following file compile again: // file: compile_me.cpp #include <LibWeb/HTML/CrossOrigin/CrossOriginOpenerPolicy.h> // That's it, this was enough to cause a compilation error. Likewise for most other files touched by this commit.	2022-09-18 13:27:24 -04:00
Andreas Kling	530675993b	LibWeb: Rename Attribute to Attr This name is not very good, but it's what the specification calls it.	2022-09-18 02:08:01 +02:00
Andreas Kling	88f2f50c55	LibWeb: Don't use the internal window object when parsing HTML fragments Instead, use the window object from the context element. This fixes an issue where activating event handlers during fragment parsing would try to set up callbacks using the internal window object's ESO. This caused a verify_cast crash on Google Maps, since the internal realm doesn't have an associated ESO. Perhaps it should, but in this specific case, it makes more sense for fragment parsing to fully adopt the context provided.	2022-09-06 01:12:44 +02:00
Andreas Kling	6f433c8656	LibWeb+LibJS: Make the EventTarget hierarchy (incl. DOM) GC-allocated This is a monster patch that turns all EventTargets into GC-allocated PlatformObjects. Their C++ wrapper classes are removed, and the LibJS garbage collector is now responsible for their lifetimes. There's a fair amount of hacks and band-aids in this patch, and we'll have a lot of cleanup to do after this.	2022-09-06 00:27:09 +02:00
Andreas Kling	7c3db526b0	LibWeb: Make DOM::Event and all its subclasses GC-allocated	2022-09-06 00:27:09 +02:00
sin-ack	c8585b77d2	Everywhere: Replace single-char StringView op. arguments with chars This prevents us from needing a sv suffix, and potentially reduces the need to run generic code for a single character (as contains, starts_with, ends_with etc. for a char will be just a length and equality check). No functional changes.	2022-07-12 23:11:35 +02:00
sin-ack	3f3f45580a	Everywhere: Add sv suffix to strings relying on StringView(char const) Each of these strings would previously rely on StringView's char const constructor overload, which would call __builtin_strlen on the string. Since we now have operator ""sv, we can replace these with much simpler versions. This opens the door to being able to remove StringView(char const*). No functional changes.	2022-07-12 23:11:35 +02:00
stelar7	e547f5887e	LibWeb: Fix Array OOBs in the HTMLTokenizer Accessing last() if there are no elements makes WebContent crash :^)	2022-06-03 12:29:11 +01:00
Andreas Kling	b0008c0934	LibWeb: Use Vector::in_reverse() in HTML::StackOfOpenElements	2022-04-13 19:52:25 +02:00
Andreas Kling	1956c52c68	LibWeb: Remove unused HTML::parse_html_document()	2022-04-06 19:35:07 +02:00
Idan Horowitz	086969277e	Everywhere: Run clang-format	2022-04-01 21:24:45 +01:00
Ali Mohammad Pur	5a0123fd2f	LibWeb: Load X(HT)ML documents and transform them into HTML DOM	2022-03-28 23:11:48 +02:00
Andreas Kling	fda25f9505	LibWeb: Move HTML dimension value parsing from CSS to HTML namespace These are part of HTML, not CSS, so let's not confuse things.	2022-03-26 17:31:01 +01:00
Idan Horowitz	5626e1b324	LibWeb: Rename PARSER_DEBUG => HTML_PARSER_DEBUG Since this macro was created we gained a couple more parsers in the system :^)	2022-03-24 21:37:49 +01:00
Timothy Flynn	5608bc4eaf	LibWeb: Remove inheritance of FormAssociatedElement from HTMLElement HTMLObjectElement will need to be both a FormAssociatedElement and a BrowsingContextContainer. Currently, both of these classes inherit from HTMLElement. This can work in C++, but is generally frowned upon, and doesn't play particularly well with the rest of LibWeb. Instead, we can essentially revert commit `3bb5c62` to remove HTMLElement from FormAssociatedElement's hierarchy. This means that objects such as HTMLObjectElement individually inherit from FormAssociatedElement and HTMLElement now. Some caveats are: * FormAssociatedElement still needs to know when the HTMLElement is inserted into and removed from the DOM. This hook is automatically injected via a macro now, while still allowing classes like HTMLInputElement to also know when the element is inserted. * Casting from a DOM::Element to a FormAssociatedElement is now a sideways cast, rather than directly following an inheritance chain. This means static_cast cannot be used here; but we can safely use dynamic_cast since the only 2 instances of this already use RTTI to verify the cast.	2022-03-24 03:35:11 +01:00
Hendiadyoin1	6a95df2526	LibTextCodec: Don't allocate Strings on encoding normalisation This ripples down to LibWeb's HTML and XHR decoders, which therefore become less allocation heavy.	2022-03-21 10:48:17 +01:00
Simon Wanner	e154c2c2ca	LibWeb: Implement "has element in select scope" per-spec The HTML Specification is quite tricky in this case. Usually "have a particular element in <x> scope" mentions "consisting of the following element types:", but in this case it's "consisting of all element types except the following:" Thanks to @AtkinsSJ for spotting this difference	2022-03-21 10:47:46 +01:00
Simon Wanner	1d95745901	LibWeb: Implement the rest of the Adoption Agency Algorithm This gets us 2 points on html5test.com :^) - Before: https://html5te.st/4cf57659bc08272e (208) - After: https://html5te.st/fb8a9259bda1c115 (210)	2022-03-20 02:52:37 +01:00
Andreas Kling	cbd343dced	LibWeb: Only delay "load" event for script elements that load something We shouldn't delay the load event for scripts that we're completely refusing to run anyway. Also, for scripts that have inline text content, we don't need to delay them either, as they will become ready before returning from "prepare script". This makes the "load" event finally fire on lots of websites, including Wikipedia. :^)	2022-03-19 16:11:36 +01:00
Andreas Kling	2c9dfadb21	LibWeb: Don't delay document "load" event for unclosed script tags We previously had a bug where markup with unclosed script tags caused the document load event to be delayed indefinitely. Fix this by only marking script elements as delaying the load event once we encounter the script end tag.	2022-03-19 15:04:48 +01:00
Lenny Maiorani	c37820b898	Libraries: Use default constructors/destructors in LibWeb https://isocpp.github.io/CppCoreGuidelines/CppCoreGuidelines#cother-other-default-operation-rules "The compiler is more likely to get the default semantics right and you cannot implement these functions better than the compiler."	2022-03-17 17:23:49 +00:00
Idan Horowitz	c575710e5e	LibWeb: Use inline script tag source line as javascript line offset This makes JS exception line numbers meaningful for inline script tags.	2022-03-14 00:25:33 +01:00
Linus Groh	1422bd45eb	LibWeb: Move Window from DOM directory & namespace to HTML The Window object is part of the HTML spec. :^) https://html.spec.whatwg.org/multipage/window-object.html	2022-03-08 00:30:30 +01:00
Andreas Kling	1061c863f8	LibWeb: Fix issue where double-quoted doctype system ID was not captured We were storing double-quoted system ID's in the public ID field. 1% progression on ACID3. :^)	2022-03-02 12:30:15 +01:00
Luke Wilde	46c0d0f7ae	LibWeb: Associate form elements with a form in parsing and dynamically This makes it available for all form associated elements and not just select and input elements. It also makes it more spec compliant, especially around the form attribute. The main thing missing is re-associating form elements with a form attribute when the form attribute changes or an element with an ID is inserted/removed or has its ID changed.	2022-03-01 23:19:41 +01:00
Andreas Kling	8b2499b112	LibWeb: Make document.write() work while document is parsing This necessitated making HTMLParser ref-counted, and having it register itself with Document when created. That makes it possible for scripts to add new input at the current parser insertion point. There is now a reference cycle between Document and HTMLParser. This cycle is explicitly broken by calling Document::detach_parser() at the end of HTMLParser::run(). This is a huge progression on ACID3, from 31% to 49%! :^)	2022-02-21 22:00:28 +01:00
Lorenz Steinert	db789813c9	LibWeb: Add basic support for dynamic markup insertion This implements basic support for dynamic markup insertion, adding * Document::open() * Document::write(Vector<String> const&) * Document::writeln(Vector<String> const&) * Document::close() The HTMLParser is modified to make it possible to create a script-created parser which initially only contains a HTMLTokenizer without any data. Aditionally the HTMLParser::run method gains an overload which does not modify the Document and does not run HTMLParser::the_end() so that we can reenter the parser at a later time. Furthermore all FIXMEs that consern the insertion point are implemented wich is defined in the HTMLTokenizer. Additionally the following member-variables of the HTMLParser are now exposed by getter funcions: * m_tokenizer * m_aborted * m_script_nesting_level The HTMLTokenizer is modified so that it contains an insertion point which keeps track of where the next input from the Document::write functions will be inserted. The insertion point is implemented as the charakter offset into m_decoded_input and a boolean describing if the insertion point is defined. Functions to update, check and {re}store the insertion point are also added. The function HTMLTokenizer::insert_eof is added to tell a script-created parser that document::close was called and HTMLParser::the_end() should be called. Lastly an explicit default constructor is added to HTMLTokenizer to create a empty HTMLTokenizer into which data can be inserted.	2022-02-21 18:26:43 +01:00
Adam Hodgen	b6eaefa87d	LibWeb: Fix 'Comment end state' in HTML Tokenizer Also, update the expected hash in the LibWeb TestHTMLTokenizer regression test. This is due to the "This comment has a few too many dashes." comment token being updated.	2022-02-21 16:31:45 +01:00
Adam Hodgen	d73bb2633c	LibWeb: Implement tokenization newline preprocessing Newline normalization will replace \r and \r\n with \n. The spec specifically states > Before the tokenization stage, the input stream must be preprocessed > by normalizing newlines. wheras this is implemented the processing during the tokenization itself. This should still exhibit the same behaviour, while keeping the tokenization logic in the same place.	2022-02-21 16:31:45 +01:00
Adam Hodgen	c6fcdd0f93	LibWeb: Fix off by one error in HTML Tokenizer In 'NamedCharacterReference' we attempt to lookup the code point by a identifier, eg apos; becomes ' This is done by passing the entire rest of the document to the `HTML::code_points_from_entity` function. However, before this change we didn't sent the final character which meant if the document ended in a named character reference the lookup would fail.	2022-02-21 16:31:45 +01:00
Luke Wilde	9845164f6a	LibWeb: Handle markers when reconstructing active formatting elements The entry we get from the active formatting elements list during the Rewind step of "reconstruct the active formatting elements" can be a marker. Previously we assumed it was not a marker, which can trigger an assertion failure with certain malformed HTML. If the entry in this step is a marker, the spec simply ignores it. This is step 6 of the algorithm. This also makes the index unsigned, as this algorithm is a no-op if the list is empty. Additionally, this also adds spec comments to this algorithm. Fixes #12668.	2022-02-20 10:59:42 +01:00
Andreas Kling	25504f6a1b	LibWeb: Use Vector::clear_with_capacity() in HTMLTokenizer This avoids constantly reallocating the Vector<HTMLToken>.	2022-02-19 14:45:59 +01:00
Linus Groh	06948df393	LibWeb: Fail gracefully when reaching the unimplemented part of the AAA Pages such as https://html5test.com are testing all sorts of weird, incomplete, and wrong HTML but can be useful or at least interesting for development - let's try to avoid crashing the process.	2022-02-15 23:24:34 +01:00
Linus Groh	892f6394b8	LibWeb: Implement state switch for "[CDATA[" in HTML parser	2022-02-15 23:24:34 +01:00
Linus Groh	3f7086f91a	LibWeb: Add an optional pointer to an HTMLParser to the HTMLTokenizer This is needed to access the 'adjusted current node' in the 'Markup declaration open state'. We don't want to create a full parser for something like syntax highlighting, so it's optional (null) by default.	2022-02-15 23:24:34 +01:00
Linus Groh	9130ecfd5e	LibWeb: Remove unused HTMLParser function declaration There is no implementation of this function: HTMLParser::stack_of_open_elements_has_element_with_tag_name_in_scope	2022-02-15 23:24:34 +01:00
Linus Groh	f61fb08492	LibWeb: Add spec links to each HTML tokenizer state section I didn't add full spec comments this time, but this is better than nothing :^)	2022-02-15 23:24:34 +01:00
Andreas Kling	1347c5032b	LibWeb: Add spec comments to the StackOfOpenElements class	2022-02-15 02:05:53 +01:00
Andreas Kling	5cdbea4ae0	LibWeb: Rename element_before() => element_immediately_above() This matches the spec terminology around the "stack of open elements".	2022-02-15 02:05:53 +01:00
Andreas Kling	6fe333607d	LibWeb: Add spec comments to find_appropriate_place_for_inserting_node()	2022-02-15 02:05:53 +01:00
Karol Kosek	c157c2148f	LibWeb: Don't emit current token on EOF in HTML Tokenizer Emitting tokens on EOF caused an infinite loop, freezing the app, which could be a bit annoying when writing an HTML comment at the end of the file in Text Editor. :^)	2022-02-14 12:50:44 +03:30
Karol Kosek	fb5e2670d6	LibWeb: Fix highlighting HTML comments Commit `b193351a99` caused the HTML comments to flash when changing the text cursor. Also, when double-clicking on a comment, the selection started from the beginning of the file instead. The following message was displaying when `TOKENIZER_TRACE_DEBUG` was enabled: (Tokenizer::nth_last_position) Invalid position requested: 4th-last of 4. Returning (0-0). Changing the `nth_last_position` to 3 fixes this. I'm guessing that's because the parser is at that moment on the second hyphen of the `<!--` string, so it has to go back only by three characters.	2022-02-14 12:50:44 +03:30

1 2 3

134 commits