Meta Scraper

Meta Scraper parses meta information from a web page.

Installation

Via Composer:

composer require tomaj/meta-scraper

How to use

Example:

use Tomaj\Scraper\Scraper;
use Tomaj\Scraper\Parser\OgParser;

$scraper = new Scraper();
$parsers = [new OgParser()];
$meta = $scraper->parse(file_get_contents('http://www.google.com/'), $parsers);

var_dump($meta);

Alternatively, you can use the parseUrl method, which fetches the page internally using the Guzzle library:

use Tomaj\Scraper\Scraper;
use Tomaj\Scraper\Parser\OgParser;

$scraper = new Scraper();
$parsers = [new OgParser()];
$meta = $scraper->parseUrl('http://www.google.com/', $parsers);

var_dump($meta);

Parsers

The package includes three parsers, and you can create your own by implementing the interface Tomaj\Scraper\Parser\ParserInterface.

Included parsers:

Tomaj\Scraper\Parser\OgParser - based on og (Open Graph) meta attributes in HTML (built on regular expressions)
Tomaj\Scraper\Parser\OgDomParser - also based on og (Open Graph) meta attributes in HTML (built on the PHP DOM extension)
Tomaj\Scraper\Parser\SchemaParser - based on the schema JSON structure
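
To illustrate the difference between the regular-expression and DOM approaches used by the two Open Graph parsers listed above, here is a minimal standalone sketch. It is not the library's implementation: the functions ogViaRegex and ogViaDom are hypothetical helpers, and the regular expression assumes that the property attribute comes before content and that both use double quotes.

```php
<?php
// Illustrative only: NOT the code used by OgParser or OgDomParser.

/** Extract og:* properties with a regular expression (hypothetical helper). */
function ogViaRegex(string $html): array
{
    $result = [];
    // Assumption: property precedes content, and both are double-quoted.
    $pattern = '/<meta\s+property="og:([^"]+)"\s+content="([^"]*)"/i';
    if (preg_match_all($pattern, $html, $matches, PREG_SET_ORDER)) {
        foreach ($matches as $m) {
            $result[$m[1]] = html_entity_decode($m[2], ENT_QUOTES);
        }
    }
    return $result;
}

/** Extract og:* properties with the PHP DOM extension (hypothetical helper). */
function ogViaDom(string $html): array
{
    $result = [];
    $doc = new DOMDocument();
    // Suppress the warnings libxml emits for imperfect real-world HTML.
    @$doc->loadHTML($html);
    foreach ($doc->getElementsByTagName('meta') as $meta) {
        $property = $meta->getAttribute('property');
        if (strpos($property, 'og:') === 0) {
            // Strip the "og:" prefix so keys match the regex variant.
            $result[substr($property, 3)] = $meta->getAttribute('content');
        }
    }
    return $result;
}

$html = '<html><head>'
    . '<meta property="og:title" content="Example title">'
    . '<meta content="Example description" property="og:description">'
    . '</head><body></body></html>';

$viaRegex = ogViaRegex($html);
$viaDom = ogViaDom($html);

var_dump($viaRegex, $viaDom);
```

On this input the regex sketch finds only the title, because the second tag lists content before property, while the DOM sketch finds both values. A regular expression is typically lighter, whereas a DOM parser is more tolerant of attribute order and quoting.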

You can combine these parsers. Data not found by the first parser is filled in with data from the second parser.

use Tomaj\Scraper\Scraper;
use Tomaj\Scraper\Parser\SchemaParser;
use Tomaj\Scraper\Parser\OgParser;

$scraper = new Scraper();
$parsers = [new SchemaParser(), new OgParser()];
$meta = $scraper->parseUrl('http://www.google.com/', $parsers);

var_dump($meta);
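
The fallback behaviour when combining parsers can be pictured with a small standalone sketch. It works on plain arrays and uses a hypothetical helper, mergeWithFallback; the library's actual merge logic and result type may differ.

```php
<?php
// Illustrative only: the library's real merge logic and result type may differ.

/**
 * Merge parser results in order: earlier results win, later ones fill gaps.
 * Hypothetical helper operating on plain associative arrays.
 */
function mergeWithFallback(array ...$results): array
{
    $merged = [];
    foreach ($results as $result) {
        foreach ($result as $key => $value) {
            // A field counts as missing when it is unset, null, or empty.
            $missing = !isset($merged[$key]) || $merged[$key] === '';
            if ($missing && $value !== null && $value !== '') {
                $merged[$key] = $value;
            }
        }
    }
    return $merged;
}

// Made-up sample data standing in for the output of two parsers.
$fromSchema = ['title' => 'Schema title', 'description' => null];
$fromOg = [
    'title' => 'OG title',
    'description' => 'OG description',
    'image' => 'https://example.com/a.png',
];

$meta = mergeWithFallback($fromSchema, $fromOg);

var_dump($meta);
```

Here the title comes from the first result, while the description and image are taken from the second because the first did not provide them. The order of the parsers therefore decides which source takes priority.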
