Scraping the web with Ruby tutorial – Part 1

Get more content like this at http://www.techmaker.tv.

In this episode, we explore how to use Ruby, OpenURI, and Nokogiri to parse HTML from the web, extract interesting bits of data, and reconstruct it into a new format.

Resources:

OpenURI
http://ruby-doc.org/stdlib-2.1.0/libdoc/open-uri/rdoc/OpenURI.html

Nokogiri
http://www.nokogiri.org/

Comments

Ben Pohl says:

Thank you! This was way more helpful than anything else I’ve come across regarding Nokogiri. From your tutorial I was able to provide an update to a different tutorial that had out of date information.

Nil Amara says:

Attempting to open https sites in ruby using open-uri gives ssl issue.. here the solution :

(open(‘yoururl’,{ssl_verify_mode: OpenSSL::SSL::VERIFY_NONE}))

Rogerio Lira says:

Very good. Is this code in github?
Could you trade Craglist for google searches? Reply with quote

Wilfred Ruck says:

This video definitely helped me understand how to use Nokogiri and complete my project. Thanks!

Nikhil Patil says:

hmmm.. Interesting! 🙂

Wiliner says:

When I try require ‘nokogiri’ on terminal it works but using sublime it shows this error: /usr/lib/ruby/2.3.0/rubygems/core_ext/kernel_require.rb:55:in `require’: cannot load such file — nokogiri (LoadError)

I installed nokogiri using: gem install nokogiri

geo says:

Thanks!!! this video is so helpfull,
I subscribed to TechmakerTV I hope you can take this to a rails app.

do you have github account to check the code?

SouvlakiGyros says:

Nice tutorial 🙂
Keep up the good work !

 Write a comment

*

Do you like our videos?
Do you want to see more like that?

Please click below to support us on Facebook!