hedii/extractors

Extractors is a package that finds targeted types of resources in an HTML DOM.

Maintainers

hedii

Package info

github.com/hedii/extractors

pkg:composer/hedii/extractors


2.1.0 2018-05-18 09:53 UTC


MIT 3c7781e8eddef2e203b7d110366723ecd00d8e04




README


Extractors

Extractors is a package that finds targeted types of resources in an HTML DOM. Currently, only two types of resources are available: urls and emails. The resources found are returned as an array.

The Extractor class can be extended to add more resource types.
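To make the idea of finding urls and emails in an HTML DOM more concrete, here is a minimal standalone sketch built on PHP's DOMDocument and preg_match_all. It is not this package's implementation and does not use its API: the function names extractUrls() and extractEmails(), the sample HTML, and the email pattern are all assumptions made for illustration.

```php
<?php
// Illustrative sketch only: NOT the hedii/extractors implementation.
// extractUrls() and extractEmails() are hypothetical helper names.

/**
 * Collect the href values of all <a> elements in an HTML string.
 *
 * @return string[]
 */
function extractUrls(string $html): array
{
    $dom = new DOMDocument();

    // Real-world HTML is often malformed; buffer parser warnings
    // instead of emitting them, then restore the previous setting.
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $urls = [];
    foreach ($dom->getElementsByTagName('a') as $anchor) {
        $href = trim($anchor->getAttribute('href'));
        if ($href !== '') {
            $urls[] = $href;
        }
    }

    // Drop duplicates and reindex the array from zero.
    return array_values(array_unique($urls));
}

/**
 * Collect email-like strings from the raw HTML.
 *
 * @return string[]
 */
function extractEmails(string $html): array
{
    // Deliberately simple pattern; it does not cover every valid address.
    preg_match_all('/[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/', $html, $matches);

    return array_values(array_unique($matches[0]));
}

// Hypothetical sample document.
$html = '<html><body><a href="http://example.com/about">About</a> '
      . '<a href="mailto:info@example.com">info@example.com</a></body></html>';

print_r(extractUrls($html));
print_r(extractEmails($html));
```

Note that this sketch reads hrefs as they appear in the markup, so mailto: links and relative paths are returned unchanged. Whether the package filters or normalizes such values is not stated here.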

Install

Via Composer

composer require hedii/extractors

Usage

Pass the resource types to search for (urls, emails, or both) to searchFor(), set the page to scan with at(), then call get() to retrieve the results as an array.

// require composer autoloader
require '/path/to/vendor/autoload.php';

// instantiate the extractor
$extractor = new \Hedii\Extractors\Extractor();

// get all the urls found in the DOM of the example.com page
$urls = $extractor->searchFor(['urls'])
    ->at('http://example.com')
    ->get();

// get all the emails found in the DOM of the example.com page
$emails = $extractor->searchFor(['emails'])
    ->at('http://example.com')
    ->get();

// get all the urls and emails found in the DOM of the example.com page
$urlsAndEmails = $extractor->searchFor(['urls', 'emails'])
    ->at('http://example.com')
    ->get();

Testing

composer test