Context Navigation

← Previous Changeset
Next Changeset →

Changeset 20170 in webkit

Timestamp:

Mar 13, 2007, 4:18:23 PM (18 years ago)

Author:

darin

Message:

LayoutTests:

Reviewed by Maciej.

tests for http://bugs.webkit.org/show_bug.cgi?id=12794 <rdar://problem/5028154> REGRESSION: TripTik planner at aaa.com never finishes loading due to unclosed canvas tag (12794)

fast/canvas/canvas-hides-fallback-expected.txt: Added.
fast/canvas/canvas-hides-fallback.html: Added.
fast/canvas/script-inside-canvas-fallback-expected.txt: Added.
fast/canvas/script-inside-canvas-fallback.html: Added.
fast/canvas/unclosed-canvas-1-expected.txt: Added.
fast/canvas/unclosed-canvas-1.html: Added.
fast/canvas/unclosed-canvas-2-expected.txt: Added.
fast/canvas/unclosed-canvas-2.html: Added.
fast/canvas/unclosed-canvas-3-expected.txt: Added.
fast/canvas/unclosed-canvas-3.html: Added.
fast/canvas/unclosed-canvas-4-expected.txt: Added.
fast/canvas/unclosed-canvas-4.html: Added.

WebCore:

Reviewed by Maciej.

fix http://bugs.webkit.org/show_bug.cgi?id=12794 <rdar://problem/5028154> REGRESSION: TripTik planner at aaa.com never finishes loading due to unclosed canvas tag (12794)

Change <canvas> elements so that their contents are parsed normally,
but not rendered. This change fixes the bug, because normal parsing
rules close the <canvas> element in that case. The special parser
stuff was just getting in the way.

Also do some basic cleanup to the HTML parser. This was motivated by
an earlier version of this patch that made even more changes to the
parser, but the cleanup is still worth landing.

Test: fast/canvas/canvas-hides-fallback.html
Test: fast/canvas/script-inside-canvas-fallback.html
Test: fast/canvas/unclosed-canvas-1.html
Test: fast/canvas/unclosed-canvas-2.html
Test: fast/canvas/unclosed-canvas-3.html
Test: fast/canvas/unclosed-canvas-4.html

html/HTMLCanvasElement.h: Added a data member to keep track of whether the renderer is a RenderHTMLCanvas or not.
html/HTMLCanvasElement.cpp: (WebCore::HTMLCanvasElement::createRenderer): If JavaScript is enabled, create a RenderHTMLCanvas. If it's not, let the default code create the default type of renderer, which will result in fallback content being visible. The RenderHTMLCanvas class already hides all of its children. Set the m_rendererIsCanvas boolean accordingly. Since the actual storage for the canvas is allocated lazily when you actually get a drawing context, we don't need to do anything special to prevent it when JavaScript is disabled; the relevant functions won't be called. (WebCore::HTMLCanvasElement::reset): Protect the code that manipulates the RenderHTMLCanvas with a check of m_rendererIsCanvas. This is the only code inside the DOM element that relies on the renderer type.

html/HTMLParser.h: Removed unneeded includes. Marked HTMLParser as Noncopyable. Changed the Document parameter to the constructor to instead of HTMLDocument. Renamed discard_until to m_skipModeTag for clarity. Removed unused noSpaces function and unneeded public doc() function. Moved data members all down to the end so you can see them together in order. Renamed map to m_currentMapElement and isindex to m_isindexElement. Removed unused end and headLoaded data members. Renamed m_fragment to m_isParsingFragment to make it clearer that it's a boolean, not a fragment.

html/HTMLParser.cpp: (WebCore::HTMLParser::HTMLParser): Changed to use member construction syntax instead of calling reset(). This is especially helpful in the fragment case, where calling reset() later on is illegal, so not using it in the constructor lets us assert. (WebCore::HTMLParser::~HTMLParser): Did an explicit deref instead of calling setCurrent for its side effect. (WebCore::HTMLParser::reset): Updated for member name changes and removal and to use document instead of doc(). (WebCore::HTMLParser::setCurrent): Use document instead of doc(). (WebCore::HTMLParser::setSkipMode): Added. No longer inline. Now sets the m_inCanvasBeforeFirstOpenTag data member to false. (WebCore::HTMLParser::parseToken): Tightened up the skip mode logic at the top of the function, and added a FIXME about the strange case there where we don't skip yet stay in skip mode. Updated for renaming and doc(). (WebCore::HTMLParser::insertNode): Updated for renaming and doc(). (WebCore::HTMLParser::handleError): Ditto. (WebCore::HTMLParser::framesetCreateErrorCheck): Ditto. (WebCore::HTMLParser::isindexCreateErrorCheck): Changed to use RefPtr. (WebCore::HTMLParser::noscriptCreateErrorCheck): Updated for renaming and doc(). (WebCore::HTMLParser::mapCreateErrorCheck): Ditto. (WebCore::HTMLParser::getNode): Removed the special case for canvas here. Canvas fallback is now handled in the DOM, not the parser. Updated for renaming and doc(). (WebCore::HTMLParser::allowNestedRedundantTag): Changed a #define into a C++ constant. (WebCore::HTMLParser::processCloseTag): Updated for renaming and doc(). (WebCore::HTMLParser::isInline): Ditto. (WebCore::HTMLParser::tagIsOnStack): Added. Used by new canvas logic. (WebCore::HTMLParser::popBlock): Updated for renaming and doc(). Also renamed the local variable Elem to elem. (WebCore::HTMLParser::createHead): Ditto. (WebCore::HTMLParser::handleIsindex): Changed to use RefPtr. (WebCore::HTMLParser::startBody): Updated for renaming and doc(). (WebCore::HTMLParser::finished): Ditto.

Location:

trunk

Files:

: 12 added
: 6 edited

LayoutTests/ChangeLog (modified) (1 diff)
LayoutTests/fast/canvas/canvas-hides-fallback-expected.txt (added)
LayoutTests/fast/canvas/canvas-hides-fallback.html (added)
LayoutTests/fast/canvas/script-inside-canvas-fallback-expected.txt (added)
LayoutTests/fast/canvas/script-inside-canvas-fallback.html (added)
LayoutTests/fast/canvas/unclosed-canvas-1-expected.txt (added)
LayoutTests/fast/canvas/unclosed-canvas-1.html (added)
LayoutTests/fast/canvas/unclosed-canvas-2-expected.txt (added)
LayoutTests/fast/canvas/unclosed-canvas-2.html (added)
LayoutTests/fast/canvas/unclosed-canvas-3-expected.txt (added)
LayoutTests/fast/canvas/unclosed-canvas-3.html (added)
LayoutTests/fast/canvas/unclosed-canvas-4-expected.txt (added)
LayoutTests/fast/canvas/unclosed-canvas-4.html (added)
WebCore/ChangeLog (modified) (1 diff)
WebCore/html/HTMLCanvasElement.cpp (modified) (5 diffs)
WebCore/html/HTMLCanvasElement.h (modified) (1 diff)
WebCore/html/HTMLParser.cpp (modified) (40 diffs)
WebCore/html/HTMLParser.h (modified) (6 diffs)

Legend:

: Unmodified
: Added
: Removed

trunk/LayoutTests/ChangeLog

-              r20166
+              r20170
+-03-13  Darin Adler  <darin@apple.com>
+        Reviewed by Maciej.
+        - tests for http://bugs.webkit.org/show_bug.cgi?id=12794
+          <rdar://problem/5028154> REGRESSION: TripTik planner at aaa.com never
+          finishes loading due to unclosed canvas tag (12794)
+        * fast/canvas/canvas-hides-fallback-expected.txt: Added.
+        * fast/canvas/canvas-hides-fallback.html: Added.
+        * fast/canvas/script-inside-canvas-fallback-expected.txt: Added.
+        * fast/canvas/script-inside-canvas-fallback.html: Added.
+        * fast/canvas/unclosed-canvas-1-expected.txt: Added.
+        * fast/canvas/unclosed-canvas-1.html: Added.
+        * fast/canvas/unclosed-canvas-2-expected.txt: Added.
+        * fast/canvas/unclosed-canvas-2.html: Added.
+        * fast/canvas/unclosed-canvas-3-expected.txt: Added.
+        * fast/canvas/unclosed-canvas-3.html: Added.
+        * fast/canvas/unclosed-canvas-4-expected.txt: Added.
+        * fast/canvas/unclosed-canvas-4.html: Added.
 -03-13  David Harrison  <harrison@apple.com>

trunk/WebCore/ChangeLog

-              r20169
+              r20170
+-03-13  Darin Adler  <darin@apple.com>
+        Reviewed by Maciej.
+        - fix http://bugs.webkit.org/show_bug.cgi?id=12794
+          <rdar://problem/5028154> REGRESSION: TripTik planner at aaa.com never
+          finishes loading due to unclosed canvas tag (12794)
+        Change <canvas> elements so that their contents are parsed normally,
+        but not rendered. This change fixes the bug, because normal parsing
+        rules close the <canvas> element in that case. The special parser
+        stuff was just getting in the way.
+        Also do some basic cleanup to the HTML parser. This was motivated by
+        an earlier version of this patch that made even more changes to the
+        parser, but the cleanup is still worth landing.
+        Test: fast/canvas/canvas-hides-fallback.html
+        Test: fast/canvas/script-inside-canvas-fallback.html
+        Test: fast/canvas/unclosed-canvas-1.html
+        Test: fast/canvas/unclosed-canvas-2.html
+        Test: fast/canvas/unclosed-canvas-3.html
+        Test: fast/canvas/unclosed-canvas-4.html
+        * html/HTMLCanvasElement.h: Added a data member to keep track of whether the
+        renderer is a RenderHTMLCanvas or not.
+        * html/HTMLCanvasElement.cpp:
+        (WebCore::HTMLCanvasElement::createRenderer): If JavaScript is enabled, create
+        a RenderHTMLCanvas. If it's not, let the default code create the default type
+        of renderer, which will result in fallback content being visible. The
+        RenderHTMLCanvas class already hides all of its children. Set the m_rendererIsCanvas
+        boolean accordingly. Since the actual storage for the canvas is allocated lazily
+        when you actually get a drawing context, we don't need to do anything special
+        to prevent it when JavaScript is disabled; the relevant functions won't be called.
+        (WebCore::HTMLCanvasElement::reset): Protect the code that manipulates the
+        RenderHTMLCanvas with a check of m_rendererIsCanvas. This is the only code inside
+        the DOM element that relies on the renderer type.
+        * html/HTMLParser.h: Removed unneeded includes. Marked HTMLParser as
+        Noncopyable. Changed the Document parameter to the constructor to instead
+        of HTMLDocument. Renamed discard_until to m_skipModeTag for clarity.
+        Removed unused noSpaces function and unneeded public doc() function.
+        Moved data members all down to the end so you can see them together in order.
+        Renamed map to m_currentMapElement and isindex to m_isindexElement.
+        Removed unused end and headLoaded data members.  Renamed m_fragment to
+        m_isParsingFragment to make it clearer that it's a boolean, not a fragment.
+        * html/HTMLParser.cpp:
+        (WebCore::HTMLParser::HTMLParser): Changed to use member construction
+        syntax instead of calling reset(). This is especially helpful in the
+        fragment case, where calling reset() later on is illegal, so not using
+        it in the constructor lets us assert.
+        (WebCore::HTMLParser::~HTMLParser): Did an explicit deref instead of
+        calling setCurrent for its side effect.
+        (WebCore::HTMLParser::reset): Updated for member name changes and removal
+        and to use document instead of doc().
+        (WebCore::HTMLParser::setCurrent): Use document instead of doc().
+        (WebCore::HTMLParser::setSkipMode): Added. No longer inline. Now sets the
+        m_inCanvasBeforeFirstOpenTag data member to false.
+        (WebCore::HTMLParser::parseToken): Tightened up the skip mode logic at the
+        top of the function, and added a FIXME about the strange case there where
+        we don't skip yet stay in skip mode. Updated for renaming and doc().
+        (WebCore::HTMLParser::insertNode): Updated for renaming and doc().
+        (WebCore::HTMLParser::handleError): Ditto.
+        (WebCore::HTMLParser::framesetCreateErrorCheck): Ditto.
+        (WebCore::HTMLParser::isindexCreateErrorCheck): Changed to use RefPtr.
+        (WebCore::HTMLParser::noscriptCreateErrorCheck): Updated for renaming and doc().
+        (WebCore::HTMLParser::mapCreateErrorCheck): Ditto.
+        (WebCore::HTMLParser::getNode): Removed the special case for canvas here.
+        Canvas fallback is now handled in the DOM, not the parser. Updated for
+        renaming and doc().
+        (WebCore::HTMLParser::allowNestedRedundantTag): Changed a #define into a C++
+        constant.
+        (WebCore::HTMLParser::processCloseTag): Updated for renaming and doc().
+        (WebCore::HTMLParser::isInline): Ditto.
+        (WebCore::HTMLParser::tagIsOnStack): Added. Used by new canvas logic.
+        (WebCore::HTMLParser::popBlock): Updated for renaming and doc(). Also renamed
+        the local variable Elem to elem.
+        (WebCore::HTMLParser::createHead): Ditto.
+        (WebCore::HTMLParser::handleIsindex): Changed to use RefPtr.
+        (WebCore::HTMLParser::startBody): Updated for renaming and doc().
+        (WebCore::HTMLParser::finished): Ditto.
 -03-13  David Hyatt  <hyatt@apple.com>

trunk/WebCore/html/HTMLCanvasElement.cpp

-              r19876
+              r20170
 /*
  * Copyright (C) 2004, 2006 Apple Computer, Inc.  All rights reserved.
+ * Copyright (C) 2004, 2006, 2007 Apple Inc. All rights reserved.
+ *
  * Redistribution and use in source and binary forms, with or without
 …
 #include "CanvasRenderingContext2D.h"
 #include "CanvasStyle.h"
+#include "Chrome.h"
 #include "Document.h"
 #include "Frame.h"
 …
 #include "Page.h"
 #include "RenderHTMLCanvas.h"
-#include "Chrome.h"
 #include "Settings.h"
-#include "Screen.h"
 #include <math.h>
 …
+}
+RenderObject* HTMLCanvasElement::createRenderer(RenderArena *arena, RenderStyle *style)
+{
+    RenderHTMLCanvas* r = new (arena) RenderHTMLCanvas(this);
+    r->setIntrinsicWidth(width());
+    r->setIntrinsicHeight(height());
+    return r;
+RenderObject* HTMLCanvasElement::createRenderer(RenderArena* arena, RenderStyle* style)
+{
+    if (document()->frame() && document()->frame()->settings()->isJavaScriptEnabled()) {
+        m_rendererIsCanvas = true;
+        RenderHTMLCanvas* r = new (arena) RenderHTMLCanvas(this);
+        r->setIntrinsicWidth(width());
+        r->setIntrinsicHeight(height());
+        return r;
+    }
+    m_rendererIsCanvas = false;
+    return HTMLElement::createRenderer(arena, style);
+}
 …
     m_size = IntSize(w, h);
+    RenderHTMLCanvas* r = static_cast<RenderHTMLCanvas*>(renderer());
+    if (r) {
+        r->setIntrinsicWidth(w);
+        r->setIntrinsicHeight(h);
+        r->repaint();
+    }
+    if (RenderObject* ro = renderer())
+        if (m_rendererIsCanvas) {
+            RenderHTMLCanvas* r = static_cast<RenderHTMLCanvas*>(ro);
+            r->setIntrinsicWidth(w);
+            r->setIntrinsicHeight(h);
+            r->repaint();
+        }
     m_createdDrawingContext = false;

trunk/WebCore/html/HTMLCanvasElement.h

r18874	r20170
77	77	void reset();
78	78
	79	bool m_rendererIsCanvas;
	80
79	81	RefPtr<CanvasRenderingContext2D> m_2DContext;
80	82	IntSize m_size;

trunk/WebCore/html/HTMLParser.cpp

-              r20127
+              r20170
 /*
-    This file is part of the KDE libraries
     Copyright (C) 1997 Martin Jones (mjones@kde.org)
               (C) 1997 Torben Weis (weis@kde.org)
               (C) 1999,2001 Lars Knoll (knoll@kde.org)
               (C) 2000,2001 Dirk Mueller (mueller@kde.org)
     Copyright (C) 2004, 2005, 2006 Apple Computer, Inc.
+    Copyright (C) 2004, 2005, 2006, 2007 Apple Inc. All rights reserved.
     This library is free software; you can redistribute it and/or
 …
 #include "Frame.h"
 #include "HTMLBodyElement.h"
 #include "HTMLCanvasElement.h"
+#include "HTMLDocument.h"
 #include "HTMLDivElement.h"
 #include "HTMLDListElement.h"
 …
 using namespace HTMLNames;
+/**
+ * @internal
+ */
+class HTMLStackElem
+{
+public:
+const int maxRedundantTagDepth = 20;
+struct HTMLStackElem : Noncopyable {
     HTMLStackElem(const AtomicString& t, int lvl, Node* n, bool r, HTMLStackElem* nx)
         : tagName(t)
 …
 /**
+ * @internal
+ * The parser parses tokenized input into the document, building up the
+ * document tree. If the document is well-formed, parsing it is straightforward.
+ *
+ * The parser parses tokenized input into the document, building up the
+ * document tree. If the document is wellformed, parsing it is
+ * straightforward.
+ * Unfortunately, people can't write wellformed HTML documents, so the parser
+ * has to be tolerant about errors.
+ * Unfortunately, we have to handle many HTML documents that are not well-formed,
+ * so the parser has to be tolerant about errors.
+ *
+ * We have to take care of the following error conditions:
+ * We have to take care of at least the following error conditions:
+ *
  * 1. The element being added is explicitly forbidden inside some outer tag.
  *    In this case we should close all tags up to the one, which forbids
  *    the element, and add it afterwards.
+ * 2. We are not allowed to add the element directly. It could be, that
+ *    the person writing the document forgot some tag inbetween (or that the
+ *    tag inbetween is optional...) This could be the case with the following
+ *    tags: HTML HEAD BODY TBODY TR TD LI (did I forget any?)
+ * 3. We wan't to add a block element inside to an inline element. Close all
+ *
+ * 2. We are not allowed to add the element directly. It could be that
+ *    the person writing the document forgot some tag in between (or that the
+ *    tag in between is optional). This could be the case with the following
+ *    tags: HTML HEAD BODY TBODY TR TD LI (did I forget any?).
+ *
+ * 3. We want to add a block element inside to an inline element. Close all
  *    inline elements up to the next higher block element.
+ * 4. If this doesn't help close elements, until we are allowed to add the
+ *
+ * 4. If this doesn't help, close elements until we are allowed to add the
  *    element or ignore the tag.
+ *
  */
+HTMLParser::HTMLParser(Document* doc)
+HTMLParser::HTMLParser(HTMLDocument* doc)
     : document(doc)
     , current(0)
+    , current(doc)
     , didRefCurrent(false)
     , blockStack(0)
+    , m_fragment(false)
+{
+    reset();
+    , form(0)
+    , m_currentMapElement(0)
+    , head(0)
+    , inBody(false)
+    , haveContent(false)
+    , haveFrameSet(false)
+    , m_isParsingFragment(false)
+    , inStrayTableContent(0)
+{
+}
 HTMLParser::HTMLParser(DocumentFragment* frag)
     : document(frag->document())
     , current(0)
     , didRefCurrent(false)
+    , current(frag)
+    , didRefCurrent(true)
     , blockStack(0)
+    , m_fragment(true)
+{
+    reset();
+    setCurrent(frag);
+    inBody = true;
+    , form(0)
+    , m_currentMapElement(0)
+    , head(0)
+    , inBody(true)
+    , haveContent(false)
+    , haveFrameSet(false)
+    , m_isParsingFragment(true)
+    , inStrayTableContent(0)
+{
+    if (frag)
+        frag->ref();
+}
 …
+{
     freeBlock();
     setCurrent(0);
+    if (didRefCurrent)
+        current->deref();
+}
 void HTMLParser::reset()
+{
+    setCurrent(doc());
+    ASSERT(!m_isParsingFragment);
+    setCurrent(document);
     freeBlock();
 …
     haveContent = false;
     inStrayTableContent = 0;
     form = 0;
     map = 0;
+    m_currentMapElement = 0;
     head = 0;
+    end = false;
+    isindex = 0;
+    discard_until = nullAtom;
+    m_isindexElement = 0;
+    m_skipModeTag = nullAtom;
+}
 void HTMLParser::setCurrent(Node* newCurrent)
+{
     bool didRefNewCurrent = newCurrent && newCurrent != doc();
+    bool didRefNewCurrent = newCurrent && newCurrent != document;
     if (didRefNewCurrent)
         newCurrent->ref();
 …
+}
+PassRefPtr<Node> HTMLParser::parseToken(Token *t)
+{
+    if (!discard_until.isNull()) {
+        if (t->tagName == discard_until && !t->beginTag)
+            discard_until = nullAtom;
+        // do not skip </iframe>
+        if (!discard_until.isNull() || (current->localName() != t->tagName))
+PassRefPtr<Node> HTMLParser::parseToken(Token* t)
+{
+    if (!m_skipModeTag.isNull()) {
+        if (!t->beginTag && t->tagName == m_skipModeTag)
+            // Found the end tag for the current skip mode, so we're done skipping.
+            m_skipModeTag = nullAtom;
+        else if (current->localName() == t->tagName)
+            // Do not skip </iframe>.
+            // FIXME: What does that comment mean? How can it be right to parse a token without clearing m_skipModeTag?
+            ;
+        else
             return 0;
+    }
     // Apparently some sites use </br> instead of <br>.  Be compatible with IE and Firefox and treat this like <br>.
     if (t->isCloseTag(brTag) && doc()->inCompatMode())
+    // Apparently some sites use </br> instead of <br>. Be compatible with IE and Firefox and treat this like <br>.
+    if (t->isCloseTag(brTag) && document->inCompatMode())
         t->beginTag = true;
 …
+        }
         if (map == n)
             map = 0;
+        if (m_currentMapElement == n)
+            m_currentMapElement = 0;
         if (form == n)
 …
+}
 bool HTMLParser::insertNode(Node *n, bool flat)
+bool HTMLParser::insertNode(Node* n, bool flat)
+{
     RefPtr<Node> protectNode(n);
 …
             didRefCurrent = true;
+        }
         if (parentAttached && !n->attached() && !m_fragment)
+        if (parentAttached && !n->attached() && !m_isParsingFragment)
             n->attach();
     } else {
         if (parentAttached && !n->attached() && !m_fragment)
+        if (parentAttached && !n->attached() && !m_isParsingFragment)
             n->attach();
         n->closeRenderer();
 …
+{
     // Error handling code.  This is just ad hoc handling of specific parent/child combinations.
     HTMLElement *e;
+    HTMLElement* e;
     bool handled = false;
 …
             if (head) {
                 if (head->addChild(n)) {
                     if (!n->attached() && !m_fragment)
+                    if (!n->attached() && !m_isParsingFragment)
                         n->attach();
                     return true;
 …
         } else if (h->hasLocalName(htmlTag)) {
             if (!current->isDocumentNode() ) {
                 if (doc()->firstChild()->hasTagName(htmlTag)) {
+                if (document->firstChild()->hasTagName(htmlTag)) {
                     // we have another <HTML> element.... apply attributes to existing one
                     // make sure we don't overwrite already existing attributes
                     NamedAttrMap *map = static_cast<Element*>(n)->attributes(true);
                     Element *existingHTML = static_cast<Element*>(doc()->firstChild());
                     NamedAttrMap *bmap = existingHTML->attributes(false);
+                    NamedAttrMap* map = static_cast<Element*>(n)->attributes(true);
+                    Element* existingHTML = static_cast<Element*>(document->firstChild());
+                    NamedAttrMap* bmap = existingHTML->attributes(false);
                     for (unsigned l = 0; map && l < map->length(); ++l) {
                         Attribute* it = map->attributeItem(l);
 …
                 pushBlock(localName, tagPriority);
                 setCurrent(newNode);
                 if (!n->attached() && !m_fragment)
+                if (!n->attached() && !m_isParsingFragment)
                     n->attach();
                 return true;
 …
+            }
         } else if (h->hasLocalName(bodyTag)) {
             if (inBody && doc()->body()) {
+            if (inBody && document->body()) {
                 // we have another <BODY> element.... apply attributes to existing one
                 // make sure we don't overwrite already existing attributes
                 // some sites use <body bgcolor=rightcolor>...<body bgcolor=wrongcolor>
                 NamedAttrMap *map = static_cast<Element*>(n)->attributes(true);
                 Element *existingBody = doc()->body();
                 NamedAttrMap *bmap = existingBody->attributes(false);
+                NamedAttrMap* map = static_cast<Element*>(n)->attributes(true);
+                Element* existingBody = document->body();
+                NamedAttrMap* bmap = existingBody->attributes(false);
                 for (unsigned l = 0; map && l < map->length(); ++l) {
                     Attribute* it = map->attributeItem(l);
 …
             if (equalIgnoringCase(h->getAttribute(typeAttr), "hidden") && form) {
                 form->addChild(n);
                 if (!n->attached() && !m_fragment)
+                if (!n->attached() && !m_isParsingFragment)
                     n->attach();
                 return true;
 …
+            }
         } else if (h->hasLocalName(areaTag)) {
             if (map) {
                 map->addChild(n);
                 if (!n->attached() && !m_fragment)
+            if (m_currentMapElement) {
+                m_currentMapElement->addChild(n);
+                if (!n->attached() && !m_isParsingFragment)
                     n->attach();
                 handled = true;
 …
         if (head) {
             head->addChild(n);
             if (!n->attached() && !m_fragment)
+            if (!n->attached() && !m_isParsingFragment)
                 n->attach();
+        }
 …
             } else {
                 if (n->isTextNode()) {
                     Text *t = static_cast<Text *>(n);
+                    Text* t = static_cast<Text*>(n);
                     if (t->containsOnlyWhitespace())
                         return false;
 …
                 ExceptionCode ec = 0;
                 if (n->isTextNode()) {
                     Text *t = static_cast<Text *>(n);
+                    Text* t = static_cast<Text*>(n);
                     if (t->containsOnlyWhitespace())
                         return false;
                     StringImpl *i = t->string();
+                    StringImpl* i = t->string();
                     unsigned int pos = 0;
                     while (pos < i->length() && ((*i)[pos] == ' ' || (*i)[pos] == noBreakSpace))
 …
+                }
                 if (possiblyMoveStrayContent) {
                     Node *node = current;
                     Node *parent = node->parentNode();
+                    Node* node = current;
+                    Node* parent = node->parentNode();
                     // A script may have removed the current node's parent from the DOM
                     // http://bugs.webkit.org/show_bug.cgi?id=7137
 …
                     if (!parent)
                         return false;
                     Node *grandparent = parent->parentNode();
+                    Node* grandparent = parent->parentNode();
                     if (n->isTextNode() ||
 …
                         node = (node->hasTagName(tableTag)) ? node :
                                 ((node->hasTagName(trTag)) ? grandparent : parent);
                         Node *parent = node->parentNode();
+                        Node* parent = node->parentNode();
                         if (!parent)
                             return false;
 …
         // regressions and the headaches are not worth the work as long as there is
         // no site actually relying on that detail (Dirk)
         if (doc()->body())
             doc()->body()->setAttribute(styleAttr, "display:none");
+        if (document->body())
+            document->body()->setAttribute(styleAttr, "display:none");
         inBody = false;
+    }
 …
 bool HTMLParser::isindexCreateErrorCheck(Token* t, RefPtr<Node>& result)
+{
     Node *n = handleIsindex(t);
+    RefPtr<Node> n = handleIsindex(t);
     if (!inBody) {
         isindex = n;
+        m_isindexElement = n.release();
     } else {
         t->flat = true;
         result = n;
+        result = n.release();
+    }
     return false;
 …
 bool HTMLParser::noscriptCreateErrorCheck(Token* t, RefPtr<Node>& result)
+{
     if (!m_fragment && document->frame() && document->frame()->settings()->isJavaScriptEnabled())
+    if (!m_isParsingFragment && document->frame() && document->frame()->settings()->isJavaScriptEnabled())
         setSkipMode(noscriptTag);
     return true;
 …
 bool HTMLParser::mapCreateErrorCheck(Token* t, RefPtr<Node>& result)
+{
     map = new HTMLMapElement(document);
     result = map;
+    m_currentMapElement = new HTMLMapElement(document);
+    result = m_currentMapElement;
     return false;
+}
-bool HTMLParser::canvasCreateErrorCheck(Token* t, RefPtr<Node>& result)
+{
-    if (document->frame() && document->frame()->settings()->usesDashboardBackwardCompatibilityMode())
-        return true;
-    if (!m_fragment && document->frame() && document->frame()->settings()->isJavaScriptEnabled())
-        setSkipMode(canvasTag);
-    return true;
+}
 …
         gFunctionMap.set(bodyTag.localName().impl(), &HTMLParser::bodyCreateErrorCheck);
         gFunctionMap.set(buttonTag.localName().impl(), &HTMLParser::nestedCreateErrorCheck);
-        gFunctionMap.set(canvasTag.localName().impl(), &HTMLParser::canvasCreateErrorCheck);
         gFunctionMap.set(commentAtom.impl(), &HTMLParser::commentCreateErrorCheck);
         gFunctionMap.set(ddTag.localName().impl(), &HTMLParser::ddCreateErrorCheck);
 …
         proceed = (this->*errorCheckFunc)(t, result);
     if (proceed)
         result = HTMLElementFactory::createHTMLElement(t->tagName, doc(), form);
+        result = HTMLElementFactory::createHTMLElement(t->tagName, document, form);
     return result.release();
+}
+#define MAX_REDUNDANT 20
+bool HTMLParser::allowNestedRedundantTag(const AtomicString& _tagName)
+bool HTMLParser::allowNestedRedundantTag(const AtomicString& tagName)
+{
     // www.liceo.edu.mx is an example of a site that achieves a level of nesting of
 …
     int i = 0;
     for (HTMLStackElem* curr = blockStack;
          i < MAX_REDUNDANT && curr && curr->tagName == _tagName;
+         i < maxRedundantTagDepth && curr && curr->tagName == tagName;
          curr = curr->next, i++);
     return i != MAX_REDUNDANT;
+}
 void HTMLParser::processCloseTag(Token *t)
+    return i != maxRedundantTagDepth;
+}
+void HTMLParser::processCloseTag(Token* t)
+{
     // Support for really broken html.
 …
         form = 0;
     else if (t->tagName == mapTag)
         map = 0;
+        m_currentMapElement = 0;
     HTMLStackElem* oldElem = blockStack;
 …
             e->hasLocalName(supTag) || e->hasLocalName(spanTag) || e->hasLocalName(nobrTag) ||
             e->hasLocalName(wbrTag) || e->hasLocalName(noframesTag) || e->hasLocalName(nolayerTag) ||
             e->hasLocalName(noembedTag) || (e->hasLocalName(noscriptTag) && !m_fragment && document->frame() && document->frame()->settings()->isJavaScriptEnabled()))
+            e->hasLocalName(noembedTag) || (e->hasLocalName(noscriptTag) && !m_isParsingFragment && document->frame() && document->frame()->settings()->isJavaScriptEnabled()))
             return true;
+    }
 …
+}
 void HTMLParser::popBlock(const AtomicString& _tagName)
+{
     HTMLStackElem *Elem = blockStack;
+void HTMLParser::popBlock(const AtomicString& tagName)
+{
+    HTMLStackElem* elem = blockStack;
     int maxLevel = 0;
     while (Elem && (Elem->tagName != _tagName)) {
         if (maxLevel < Elem->level)
             maxLevel = Elem->level;
         Elem = Elem->next;
+    }
     if (!Elem)
+    while (elem && (elem->tagName != tagName)) {
+        if (maxLevel < elem->level)
+            maxLevel = elem->level;
+        elem = elem->next;
+    }
+    if (!elem)
         return;
     if (maxLevel > Elem->level) {
+    if (maxLevel > elem->level) {
         // We didn't match because the tag is in a different scope, e.g.,
         // <b><p>Foo</b>.  Try to correct the problem.
         if (!isResidualStyleTag(_tagName))
+        if (!isResidualStyleTag(tagName))
             return;
         return handleResidualStyleCloseTagAcrossBlocks(Elem);
+    }
     bool isAffectedByStyle = isAffectedByResidualStyle(Elem->tagName);
+        return handleResidualStyleCloseTagAcrossBlocks(elem);
+    }
+    bool isAffectedByStyle = isAffectedByResidualStyle(elem->tagName);
     HTMLStackElem* residualStyleStack = 0;
     Node* malformedTableParent = 0;
     Elem = blockStack;
     while (Elem) {
         if (Elem->tagName == _tagName) {
+    elem = blockStack;
+    while (elem) {
+        if (elem->tagName == tagName) {
             int strayTable = inStrayTableContent;
             popOneBlock();
             Elem = 0;
+            elem = 0;
             // This element was the root of some malformed content just inside an implicit or
 …
+        }
         else {
             if (form && Elem->tagName == formTag)
+            if (form && elem->tagName == formTag)
                 // A <form> is being closed prematurely (and this is
                 // malformed HTML).  Set an attribute on the form to clear out its
 …
             // Schedule this tag for reopening
             // after we complete the close of this entire block.
             if (isAffectedByStyle && isResidualStyleTag(Elem->tagName))
+            if (isAffectedByStyle && isResidualStyleTag(elem->tagName))
                 // We've overloaded the use of stack elements and are just reusing the
                 // struct with a slightly different meaning to the variables.  Instead of chaining
 …
                 // from the outermost to the innermost, i.e., residualStyleStack will end up pointing
                 // to the outermost tag we need to reopen.
                 // We also set Elem->node to be the actual element that corresponds to the ID stored in
                 // Elem->id rather than the node that you should pop to when the element gets pulled off
+                // We also set elem->node to be the actual element that corresponds to the ID stored in
+                // elem->id rather than the node that you should pop to when the element gets pulled off
                 // the stack.
                 moveOneBlockToStack(residualStyleStack);
             else
                 popOneBlock();
             Elem = blockStack;
+            elem = blockStack;
+        }
+    }
 …
 void HTMLParser::createHead()
+{
     if (head || !doc()->firstChild())
+    if (head || !document->firstChild())
         return;
     head = new HTMLHeadElement(document);
     HTMLElement* body = doc()->body();
+    HTMLElement* body = document->body();
     ExceptionCode ec = 0;
     doc()->firstChild()->insertBefore(head, body, ec);
+    document->firstChild()->insertBefore(head, body, ec);
     if (ec)
         head = 0;
+}
 Node* HTMLParser::handleIsindex(Token* t)
+{
     Node* n = new HTMLDivElement(document);
+PassRefPtr<Node> HTMLParser::handleIsindex(Token* t)
+{
+    RefPtr<Node> n = new HTMLDivElement(document);
     NamedMappedAttrMap* attrs = t->attrs.get();
 …
     String text = searchableIndexIntroduction();
     if (attrs) {
         if (Attribute *a = attrs->getAttributeItem(promptAttr))
+        if (Attribute* a = attrs->getAttributeItem(promptAttr))
             text = a->value().domString() + " ";
         t->attrs = 0;
 …
     n->addChild(new HTMLHRElement(document));
     n->addChild(new Text(document, text));
     n->addChild(isIndex.get());
+    n->addChild(isIndex.release());
     n->addChild(new HTMLHRElement(document));
     return n;
+    return n.release();
+}
 void HTMLParser::startBody()
+{
+    if(inBody) return;
+    if (inBody)
+        return;
     inBody = true;
     if (isindex) {
         insertNode(isindex.get(), true /* don't decend into this node */);
         isindex = 0;
+    if (m_isindexElement) {
+        insertNode(m_isindexElement.get(), true /* don't descend into this node */);
+        m_isindexElement = 0;
+    }
+}
 …
     // Warning, this may delete the tokenizer and parser, so don't try to do anything else after this.
     if (!m_fragment)
+    if (!m_isParsingFragment)
         document->finishedParsing();
+}

trunk/WebCore/html/HTMLParser.h

-              r18874
+              r20170
 /*
-    This file is part of the KDE libraries
     Copyright (C) 1997 Martin Jones (mjones@kde.org)
               (C) 1997 Torben Weis (weis@kde.org)
               (C) 1998 Waldo Bastian (bastian@kde.org)
               (C) 1999 Lars Knoll (knoll@kde.org)
     Copyright (C) 2004, 2005, 2006 Apple Computer, Inc.
+    Copyright (C) 2004, 2005, 2006, 2007 Apple Inc. All rights reserved.
     This library is free software; you can redistribute it and/or
 …
     Boston, MA 02111-1307, USA.
 */
-//----------------------------------------------------------------------------
-//
-// KDE HTML Widget -- HTML Parser
 #ifndef HTMLParser_h
 #define HTMLParser_h
+#include "HTMLDocument.h"
+#include "QualifiedName.h"
+#include <wtf/Forward.h>
+#include <wtf/RefPtr.h>
 namespace WebCore {
+class Document;
 class DocumentFragment;
+class FrameView;
+class HTMLElement;
+class HTMLDocument;
 class HTMLFormElement;
 class HTMLHeadElement;
 class HTMLMapElement;
 class HTMLStackElem;
+class Node;
 class Token;
+struct HTMLStackElem;
 /**
  * The parser for html. It receives a stream of tokens from the HTMLTokenizer, and
  * builds up the Document structure form it.
+ * The parser for HTML. It receives a stream of tokens from the HTMLTokenizer, and
+ * builds up the Document structure from it.
  */
+class HTMLParser
+{
+class HTMLParser : Noncopyable {
 public:
     HTMLParser(Document*);
+    HTMLParser(HTMLDocument*);
     HTMLParser(DocumentFragment*);
     virtual ~HTMLParser();
 …
     void reset();
+    bool skipMode() const { return !discard_until.isNull(); }
+    bool noSpaces() const { return !inBody; }
+    HTMLDocument *doc() const { return static_cast<HTMLDocument *>(document); }
+    bool skipMode() const { return !m_skipModeTag.isNull(); }
 private:
     void setCurrent(Node* newCurrent);
+    void setCurrent(Node*);
     void derefCurrent();
+    void setSkipMode(const QualifiedName& qName) { discard_until = qName.localName(); }
+    Document* document;
+    void setSkipMode(const QualifiedName& qName) { m_skipModeTag = qName.localName(); }
     PassRefPtr<Node> getNode(Token*);
 …
     bool textCreateErrorCheck(Token*, RefPtr<Node>&);
     void processCloseTag(Token *);
+    void processCloseTag(Token*);
     bool insertNode(Node *n, bool flat = false);
     bool handleError(Node* n, bool flat, const AtomicString& localName, int tagPriority);
+    bool insertNode(Node*, bool flat = false);
+    bool handleError(Node*, bool flat, const AtomicString& localName, int tagPriority);
+    // The currently active element (the one new elements will be added to). Can be a document fragment, a document or an element.
+    Node* current;
+    // We can't ref a document, but we don't want to constantly check if a node is a document just to decide whether to deref.
+    bool didRefCurrent;
+    HTMLStackElem *blockStack;
+    void pushBlock(const AtomicString& tagName, int _level);
+    void pushBlock(const AtomicString& tagName, int level);
     void popBlock(const AtomicString& tagName);
     void popBlock(const QualifiedName& qName) { return popBlock(qName.localName()); } // Convenience function for readability.
 …
     void createHead();
     bool isResidualStyleTag(const AtomicString& tagName);
     bool isAffectedByResidualStyle(const AtomicString& tagName);
+    static bool isResidualStyleTag(const AtomicString& tagName);
+    static bool isAffectedByResidualStyle(const AtomicString& tagName);
     void handleResidualStyleCloseTagAcrossBlocks(HTMLStackElem*);
     void reopenResidualStyleTags(HTMLStackElem*, Node* malformedTableParent);
 …
     void popNestedHeaderTag();
     bool isInline(Node* node) const;
+    bool isInline(Node*) const;
+    /*
+     * currently active form
+     */
+    HTMLFormElement *form;
+    void startBody(); // inserts the isindex element
+    PassRefPtr<Node> handleIsindex(Token*);
+    /*
+     * current map
+     */
+    HTMLMapElement *map;
+    Document* document;
     /*
      * the head element. Needed for crappy html which defines <base> after </head>
      */
     HTMLHeadElement *head;
+    // The currently active element (the one new elements will be added to). Can be a document fragment, a document or an element.
+    Node* current;
+    // We can't ref a document, but we don't want to constantly check if a node is a document just to decide whether to deref.
+    bool didRefCurrent;
+    /*
+     * a possible <isindex> element in the head. Compatibility hack for
+     * html from the stone age
+     */
+    RefPtr<Node> isindex;
+    Node* handleIsindex(Token*);
+    HTMLStackElem* blockStack;
     /*
      * inserts the stupid isIndex element.
      */
     void startBody();
+    HTMLFormElement* form; // currently active form
+    HTMLMapElement* m_currentMapElement; // current map
+    HTMLHeadElement* head; // head element; needed for HTML which defines <base> after </head>
+    RefPtr<Node> m_isindexElement; // a possible <isindex> element in the head
     bool inBody;
     bool haveContent;
     bool haveFrameSet;
-    bool end;
+    /*
+     * tells the parser to discard all tags, until it reaches the one specified
+     */
+    AtomicString discard_until;
+    AtomicString m_skipModeTag; // tells the parser to discard all tags until it reaches the one specified
+    bool headLoaded;
+    bool m_fragment;
+    bool m_isParsingFragment;
     int inStrayTableContent;
 };

Note: See TracChangeset for help on using the changeset viewer.

Context Navigation

Changeset 20170 in webkit

Legend:

trunk/LayoutTests/ChangeLog

trunk/WebCore/ChangeLog

trunk/WebCore/html/HTMLCanvasElement.cpp

trunk/WebCore/html/HTMLCanvasElement.h

trunk/WebCore/html/HTMLParser.cpp

trunk/WebCore/html/HTMLParser.h

Download in other formats: