ruby - 100% unending CPU usage fetching SSL page without cert -
using mechanize 2.6.0 on ruby 1.9.3 i'm trying fetch web page on https windows 7x64. when attempt get()
url cpu usage goes 100% , method never returns:
require 'mechanize' uri = "https://my.com/wiki/api.php?action=query&titles=us4&prop=info&format=xml" agent = mechanize.new u,p = %w[myusername mypassword] agent.add_auth( uri, u, p ) agent.agent.http.verify_mode = openssl::ssl::verify_none info = agent.get( uri )
when interrupt it, these stack traces (three different runs):
>> info = agent.get( page_api ) irb::abort: abort interrupt! c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:27:in `call' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:27:in `parse' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/agent.rb:716:in `response_authenticate' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/agent.rb:306:in `fetch' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize.rb:431:in `get' (irb):10 c:/ruby193/bin/irb:12:in `<main>' >> info = agent.get( page_api ) irb::abort: abort interrupt! c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:29:in `call' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:29:in `new' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:29:in `parse' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/agent.rb:716:in `response_authenticate' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/agent.rb:306:in `fetch' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize.rb:431:in `get' (irb):11 c:/ruby193/bin/irb:12:in `<main>' >> info = agent.get( page_api ) irb::abort: abort interrupt! c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:114:in `call' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:114:in `token' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/www_authenticate_parser.rb:31:in `parse' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/agent.rb:716:in `response_authenticate' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize/http/agent.rb:306:in `fetch' c:/ruby193/lib/ruby/gems/1.9.1/gems/mechanize-2.6.0/lib/mechanize.rb:431:in `get' (irb):12 c:/ruby193/bin/irb:12:in `<main>'
how can work around problem , fetch https url via ruby on windows? (if there's better solution mechanize this—since need source of page feed nokogiri anyhow—i'm open not using mechanize @ all.)
another datapoint: trying same code on os x produces same result.
here's actual content of page, using alternative fetching method described in workaround answer below:
p fetch_https_without_ssl_verification(uri, u, p) #=> "\t\t <?xml version=\"1.0\"?><api><query><normalized><n from=\"devtools/ui_composer/designspec/us7294\" to=\"devtools/ui composer/designspec/us7294\" /></normalized><pages><page ns=\"0\" title=\"devtools/ui composer/designspec/us7294\" missing=\"\" /></pages></query></api>"
if need contents of url (as do) using curl instead of mechanize far easier, , works:
def fetch_https_without_ssl_verification( uri, user=nil, pass=nil ) `curl -s -k #{%q{-u "#{user}#{":"<<pass if pass}"} if user} "#{uri}"` end
Comments
Post a Comment