Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

We should probably reinstate HTMLDocument, with a named getter, in the spec #4792

Open
bzbarsky opened this issue Jul 22, 2019 · 15 comments
Open
Labels
interop Implementations are not interoperable with each other needs tests Moving the issue forward requires someone to write tests

Comments

@bzbarsky
Copy link
Contributor

The spec currently has a named getter on Document and all documents implementing the Document interface, with HTMLDocument an alias for Document.

There is no UA that implements that behavior. The implemented behavior is as follows, at this point:

In practice, that means that XML documents returned from XHR's responseXML do not implement the named getter in any browser. Adding the named getter to those would be a pretty serious compat risk due to the [OverrideBuiltins] on Document: attribute values inside the document could then cause APIs to be overridden on the document object, where they are not overridden in any browsers right now.

It seems to me that the simplest path to interop and web compat here is to converge on the Firefox/Safari behavior: reintroduce HTMLDocument with a named getter on it and remove the named getter from Document.

Relevant testcases:

  • Alerts undefined in Chrome and [object HTMLFormElement] in Safari/Firefox:
<!doctype html>
<script>
var url = 'data:text/html,<html xmlns="http://www.w3.org/1999/xhtml"><form name="x"/></html>';
var xhr = new XMLHttpRequest();
xhr.open("GET", url);
xhr.responseType = "document";
xhr.onload = function() {
  alert(xhr.responseXML.x);
};
xhr.send();
</script>
  • Alerts undefined in all browsers:
<!doctype html>
<script>
var url = 'data:text/xml,<html xmlns="http://www.w3.org/1999/xhtml"><form name="x"/></html>';
var xhr = new XMLHttpRequest();
xhr.open("GET", url);
xhr.responseType = "document";
xhr.onload = function() {
  alert(xhr.responseXML.x);
};
xhr.send();
</script>
  • Alerts [object HTMLFormElement] in all browsers when served as text/html and undefined in all browsers when served as text/xml:
<html xmlns="http://www.w3.org/1999/xhtml">
  <form name="x"></form>
  <script>
    alert(document.x);
  </script>
</html>

Per current spec, all of those testcases should always alert [object HTMLFormElement].

@bzbarsky
Copy link
Contributor Author

@domenic @annevk

@bzbarsky
Copy link
Contributor Author

Note whatwg/dom#221

@bzbarsky
Copy link
Contributor Author

I'd also like to look at this in terms of the priority of constituencies:

  • Users: Really don't care about any of this, as long as web pages don't break.
  • Authors: Do authors want the named getter on Document? Do they want it at all, apart from keeping existing pages working? I agree that if there is only one type of document that's a bit less confusion for authors, but does it outweigh the drawbacks of the nasty named getter with its shadowing behavior on everything?
  • Implementors: it's strictly less work to not get rid of the existing HTMLDocument separate from Document for all three implementations. Blink's behavior would be easier to fix to follow a spec with a named getter on HTMLDocument than one with a named getter on Document, as far as I can tell.
  • Specifiers: It's more work to switch back to having HTMLDocument, obviously.
  • Theoretical purity: Better to not have HTMLDocument.

@rniwa
Copy link

rniwa commented Jul 23, 2019

@cdumez

@domenic
Copy link
Member

domenic commented Jul 23, 2019

I think authors would prefer the simpler model of a single Document type with aliases. I know I was very confused by this mess in my web author days.

That said, I can easily be persuaded that at this point, the benefit to authors is small enough, that we should weigh implementers higher than authors/specifiers/theoretical purity.

@ExE-Boss
Copy link

ExE-Boss commented Jul 23, 2019

I still think that the named getter is a legacy mistake.

Sadly, we probably can’t just get rid of it, but we could maybe at least standardise Chrome’s behaviour.

That said, it looks to be a feature that seems to be completely undocumented outside of the DOM specification, and I don’t know how much it’s actually being used.

@rniwa
Copy link

rniwa commented Jul 23, 2019

Sadly, we probably can’t just get rid of it, but we could maybe at least standardise Chrome’s behaviour

I don't see how having an inconsistent behavior on HTMLDocument like Chrome does is more helpful to authors. Having consistency is good. Every inconsistency we introduce into the Web platform is yet another thing author has to deal with.

@bzbarsky
Copy link
Contributor Author

Hmm. Github seems to have lost my comment... Yes, the named getter is a legacy mistake. As far as usage, there have been a bunch of tutorials and copy/paste stuff that relied on it working, even though it didn't explicitly call out its existence.

That said, adding use counters for the named getter in browsers would not be hard. I filed https://bugzilla.mozilla.org/show_bug.cgi?id=1568310 to do that for Firefox.

@bzbarsky
Copy link
Contributor Author

@bzbarsky
Copy link
Contributor Author

If this is fixed as I propose, we should also remove the ability to specify [OverrideBuiltins] on partial interfaces.

@bzbarsky
Copy link
Contributor Author

bzbarsky commented Aug 7, 2019

Only two days worth of data so far, so it's a bit preliminary

We've got well over a week of data now, and for the last 7 days it's stable at about 3% of pageloads, on the Firefox nightly channel. The probe hasn't made it to beta yet, but honestly I don't expect it to be much different there.

@rniwa
Copy link

rniwa commented Aug 7, 2019

We've got well over a week of data now, and for the last 7 days it's stable at about 3% of pageloads, on the Firefox nightly channel.

3% is really high. For a comparison, the last time I checked, attachShadow (which is a function one must call to use shadow DOM) was used on only ~1.5% of page loads according to Chrome Platform Status. So named getters on document is used roughly twice as more frequently than the entirely of shadow DOM API.

@bzbarsky
Copy link
Contributor Author

bzbarsky commented Aug 7, 2019

Yeah, at 3% there's no way this can be removed, imo. So the only question is where it should live.

@ExE-Boss
Copy link

ExE-Boss commented Nov 22, 2019

Well, I believe that it should live on HTMLDocument, which is why I opened #5104 and web-platform-tests/wpt#20403 to do this.

@bzbarsky
Copy link
Contributor Author

Firefox, Safari: HTML documents implement HTMLDocument
Chrome: HTML Documents implement HTMLDocument

It's a little more complicated. Firefox implements HTMLDocument for text/html and application/xhtml+xml, whereas Safari and Chrome only do it for text/html, apparently...

awesomekling added a commit to awesomekling/serenity that referenced this issue Jun 21, 2023
This class is currently not in the spec, but it *is* still in all the
major browser engines. For compatibility reasons, let's do what other
engines do.

There is discussion about bringing HTMLDocument back into specs:
- whatwg/html#4792
- whatwg/dom#221
awesomekling added a commit to SerenityOS/serenity that referenced this issue Jun 21, 2023
This class is currently not in the spec, but it *is* still in all the
major browser engines. For compatibility reasons, let's do what other
engines do.

There is discussion about bringing HTMLDocument back into specs:
- whatwg/html#4792
- whatwg/dom#221
@annevk annevk added needs tests Moving the issue forward requires someone to write tests interop Implementations are not interoperable with each other labels Oct 8, 2024
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
interop Implementations are not interoperable with each other needs tests Moving the issue forward requires someone to write tests
Development

No branches or pull requests

5 participants