becklyn/search-text-transformer

A library that extracts plain text from HTML for usage in search engines (like Elasticsearch)

Maintainers

👁 becklyn

Package info

github.com/Becklyn/SearchTextTransformer

pkg:composer/becklyn/search-text-transformer

Statistics

Installs: 930

Dependents: 1

Suggesters: 0

Stars: 4

Open Issues: 0

2.0.0 2022-12-12 10:40 UTC

Requires

  • php: >=8.1

Requires (Dev)

Suggests

None

Provides

None

Conflicts

None

Replaces

None

BSD-3-Clause 2df69801ec6e097d8ae4cd54b0f5425eda6ffa00

Becklyn

This package is auto-updated.

Last update: 2026-06-12 18:22:39 UTC


README

Transforms HTML to searchable plain text for usage in conjunction with a search engine (like Elasticsearch).

Installation

Install via composer.

Usage

<?php

use Becklyn\SearchText\SearchTextTransformer;

$transformer = new SearchTextTransformer();
$plain = $transformer->transform("<p>Some HTML content</p>");

Testing

All test cases belong into tests/fixtures and must have the file extension .test.

The test format is:

--TEST--
Here is a plain text description of this test.
--HTML--
<p>Some html.</p>
--EXPECT--
The expected result.

The --TEST-- segment is optional.