Parse options for passing to Nokogiri.XML or Nokogiri.HTML
You can build your own combinations of these parse options by using any of the following methods: Note: All examples attempt to set the RECOVER
& NOENT
options. All examples use Ruby 2 optional parameter syntax.
You can use the Ruby bitwise operators to set various combinations.
Nokogiri.XML('<content>Chapter 1</content', options: Nokogiri::XML::ParseOptions.new((1 << 0) | (1 << 1)))
Every option has an equivalent method in lowercase. You can chain these methods together to set various combinations.
Nokogiri.XML('<content>Chapter 1</content', options: Nokogiri::XML::ParseOptions.new.recover.noent)
You can also setup parse combinations in the block passed to Nokogiri.XML or Nokogiri.HTML
Nokogiri.XML('<content>Chapter 1</content') {|config| config.recover.noent}
You can also remove options from an instance of ParseOptions
dynamically. Every option has an equivalent no{option}
method in lowercase. You can call these methods on an instance of ParseOptions
to remove the option. Note that this is not available for STRICT
.
# Setting the RECOVER & NOENT options... options = Nokogiri::XML::ParseOptions.new.recover.noent # later... options.norecover # Removes the Nokogiri::XML::ParseOptions::RECOVER option options.nonoent # Removes the Nokogiri::XML::ParseOptions::NOENT option
compact small text nodes; no modification of the tree allowed afterwards (will possibly crash if you try to modify the tree)
the default options used for parsing HTML documents
the default options used for parsing XML documents
Default DTD attributes
Load external subsets
validate with the DTD
relax any hardcoded limit from the parser
do not fixup XINCLUDE xml:base uris
remove blank nodes
merge CDATA as text nodes
Do not reuse the context dictionary
Substitute entities
suppress error reports
Forbid network access. Recommended for dealing with untrusted documents.
suppress warning reports
do not generate XINCLUDE START/END nodes
remove redundant namespaces declarations
parse using XML-1.0 before update 5
pedantic error reporting
Recover from errors
use the SAX1 interface internally
Strict parsing
Implement XInclude substitution
# File lib/nokogiri/xml/parse_options.rb, line 77 def initialize options = STRICT @options = options end
# File lib/nokogiri/xml/parse_options.rb, line 111 def inspect options = [] self.class.constants.each do |k| options << k.downcase if send(:"#{k.downcase}?") end super.sub(/>$/, " " + options.join(', ') + ">") end
# File lib/nokogiri/xml/parse_options.rb, line 100 def strict @options &= ~RECOVER self end
# File lib/nokogiri/xml/parse_options.rb, line 105 def strict? @options & RECOVER == STRICT end
© 2008–2016 Aaron Patterson, Mike Dalessio, Charles Nutter, Sergio Arbeo
Patrick Mahoney, Yoko Harada, Akinori Musha, John Shahid
Licensed under the MIT License.