Skip to content

Instantly share code, notes, and snippets.

View kberov's full-sized avatar
🐢
I may be slow to respond.

Красимир Беров kberov

🐢
I may be slow to respond.
View GitHub Profile
@kberov
kberov / html_tree_builder_vs_mojo_dom.pl
Created November 9, 2012 22:53
HTML::TreeBuilder VS Mojo::DOM
#!/usr/bin/env perl
use 5.14.2;
use Benchmark qw(timethese cmpthese);
use HTML::TreeBuilder;
use Mojo::UserAgent;
use Mojo::DOM;
my $url = 'http://www.i-can.eu/index.html';
my $content = Mojo::UserAgent->new->get($url)->res->body;
#PARSING SPEED
@kberov
kberov / html_elements.pl
Created November 9, 2012 21:32
Get only what you want from html using HTML::TreeBuilder and HTML::Element
#!/usr/bin/env perl
#Mojo::DOM and Mojo::UserAgent just rock, but sometimes you are not allowed to use them.
use 5.14.2;
use HTML::TreeBuilder;
my $url = 'http://contao.org/en/extension-list/view/i18nl10n.html';
my $root = HTML::TreeBuilder->new_from_url($url);
$root->eof(); # done parsing for this tree
my $h1 = $root->find_by_tag_name('h1');
say $h1->as_trimmed_text; #h1