CMS 3D CMS Logo

 All Classes Namespaces Files Functions Variables Typedefs Enumerations Enumerator Properties Friends Macros Pages
List of all members | Static Public Attributes
BeautifulSoup.MinimalSoup Class Reference
Inheritance diagram for BeautifulSoup.MinimalSoup:
BeautifulSoup.BeautifulSoup BeautifulSoup.BeautifulSoup BeautifulSoup.BeautifulStoneSoup BeautifulSoup.BeautifulStoneSoup BeautifulSoup.BeautifulStoneSoup BeautifulSoup.BeautifulStoneSoup BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.Tag BeautifulSoup.RobustInsanelyWackAssHTMLParser BeautifulSoup.RobustInsanelyWackAssHTMLParser

Static Public Attributes

dictionary NESTABLE_TAGS = {}
 
tuple RESET_NESTING_TAGS = buildTagMap('noscript')
 
- Static Public Attributes inherited from BeautifulSoup.BeautifulSoup
tuple CHARSET_RE = re.compile("((^|;)\s*charset=)([^;]*)", re.M)
 
list NESTABLE_BLOCK_TAGS = ['blockquote', 'div', 'fieldset', 'ins', 'del']
 
list NESTABLE_INLINE_TAGS
 
dictionary NESTABLE_LIST_TAGS
 
dictionary NESTABLE_TABLE_TAGS
 
tuple NESTABLE_TAGS
 
list NON_NESTABLE_BLOCK_TAGS = ['address', 'form', 'p', 'pre']
 
tuple PRESERVE_WHITESPACE_TAGS = set(['pre', 'textarea'])
 
dictionary QUOTE_TAGS = {'script' : None, 'textarea' : None}
 
tuple RESET_NESTING_TAGS
 
tuple SELF_CLOSING_TAGS
 
- Static Public Attributes inherited from BeautifulSoup.BeautifulStoneSoup
 ALL_ENTITIES = XHTML_ENTITIES
 
string HTML_ENTITIES = "html"
 
list MARKUP_MASSAGE
 
dictionary NESTABLE_TAGS = {}
 
list PRESERVE_WHITESPACE_TAGS = []
 
dictionary QUOTE_TAGS = {}
 
dictionary RESET_NESTING_TAGS = {}
 
string ROOT_TAG_NAME = u'[document]'
 
dictionary SELF_CLOSING_TAGS = {}
 
dictionary STRIP_ASCII_SPACES = { 9: None, 10: None, 12: None, 13: None, 32: None, }
 
string XHTML_ENTITIES = "xhtml"
 
string XML_ENTITIES = "xml"
 
- Static Public Attributes inherited from BeautifulSoup.PageElement
 fetchNextSiblings = findNextSiblings
 
 fetchParents = findParents
 
 fetchPrevious = findAllPrevious
 
 fetchPreviousSiblings = findPreviousSiblings
 

Additional Inherited Members

- Public Member Functions inherited from BeautifulSoup.BeautifulSoup
def __init__
 
def __init__
 
def extractCharsetFromMeta
 
def extractCharsetFromMeta
 
- Public Member Functions inherited from BeautifulSoup.BeautifulStoneSoup
def __init__
 
def __init__
 
def endData
 
def endData
 
def extractCharsetFromMeta
 
def extractCharsetFromMeta
 
def handle_data
 
def handle_data
 
def isSelfClosingTag
 
def isSelfClosingTag
 
def popTag
 
def popTag
 
def pushTag
 
def pushTag
 
def reset
 
def reset
 
def unknown_endtag
 
def unknown_endtag
 
def unknown_starttag
 
def unknown_starttag
 
- Public Member Functions inherited from BeautifulSoup.PageElement
def append
 
def append
 
def extract
 
def extract
 
def findAllNext
 
def findAllNext
 
def findAllPrevious
 
def findAllPrevious
 
def findNext
 
def findNext
 
def findNextSibling
 
def findNextSibling
 
def findNextSiblings
 
def findNextSiblings
 
def findParent
 
def findParent
 
def findParents
 
def findParents
 
def findPrevious
 
def findPrevious
 
def findPreviousSibling
 
def findPreviousSibling
 
def findPreviousSiblings
 
def findPreviousSiblings
 
def insert
 
def insert
 
def nextGenerator
 
def nextGenerator
 
def nextSiblingGenerator
 
def nextSiblingGenerator
 
def parentGenerator
 
def parentGenerator
 
def previousGenerator
 
def previousGenerator
 
def previousSiblingGenerator
 
def previousSiblingGenerator
 
def replaceWith
 
def replaceWith
 
def setup
 
def setup
 
def substituteEncoding
 
def substituteEncoding
 
def toEncoding
 
def toEncoding
 
- Public Attributes inherited from BeautifulSoup.BeautifulSoup
 declaredHTMLEncoding
 
 originalEncoding
 
- Public Attributes inherited from BeautifulSoup.BeautifulStoneSoup
 builder
 
 convertEntities
 
 convertHTMLEntities
 
 convertXMLEntities
 
 currentData
 
 currentTag
 
 declaredHTMLEncoding
 
 escapeUnrecognizedEntities
 
 fromEncoding
 
 hidden
 
 instanceSelfClosingTags
 
 literal
 
 markup
 
 markupMassage
 
 originalEncoding
 
 parseOnlyThese
 
 previous
 
 quoteStack
 
 smartQuotesTo
 
 tagStack
 
- Public Attributes inherited from BeautifulSoup.PageElement
 next
 
 nextSibling
 
 parent
 
 previous
 
 previousSibling
 

Detailed Description

The MinimalSoup class is for parsing HTML that contains
pathologically bad markup. It makes no assumptions about tag
nesting, but it does know which tags are self-closing, that
<script> tags contain Javascript and should not be parsed, that
META tags may contain encoding information, and so on.

This also makes it better for subclassing than BeautifulStoneSoup
or BeautifulSoup.

Definition at line 1640 of file BeautifulSoup.py.

Member Data Documentation

dictionary BeautifulSoup.MinimalSoup.NESTABLE_TAGS = {}
static

Definition at line 1651 of file BeautifulSoup.py.

tuple BeautifulSoup.MinimalSoup.RESET_NESTING_TAGS = buildTagMap('noscript')
static

Definition at line 1650 of file BeautifulSoup.py.