deevee15/text-similarity-php
最新稳定版本:v1.0.0
Composer 安装命令:
composer require deevee15/text-similarity-php
包简介
php service for text similarity
README 文档
README
TextSimilarity is pure PHP library for detecting duplicate news articles using morphological analysis and entity-weighted scoring.
Read more about TextSimilarity:
Features
- Morphological text processing via phpMorphy
- Named entity extraction (names, locations, organizations, abbreviations)
- Weighted scoring system with configurable coefficients
- File-based word cache for performance
- English and Russian language support
Installation
TextSimilarity is installed via Composer. To add a dependency to TextSimilarity in your project,
Run the following to use the latest stable version
composer require deevee15/text-similarity-php
Requirements
- PHP 8.1 or above
- phpMorphy library
Getting started
use PHPTextSimilarity\TextSimilarity; $result = TextSimilarity::compare( 'en', 'First article`s text...', 'Second article`s text...', ['first' => "Article's title", 'second' => "Article's title"] );
How it works
The TextSimilarity library divides all words from compared texts and article titles into entities (proper names, common nouns, locations, abbreviations, organizations), converts them to the nominative case, then retains only the matching ones, assigns points based on the matches, and multiplies them by the importance coefficients specified in src/Config/WeightConfig.php.
Demo
Here is the link to the demo website
License
Apache 2.0
统计信息
- 总下载量: 1
- 月度下载量: 0
- 日度下载量: 0
- 收藏数: 1
- 点击次数: 3
- 依赖项目数: 0
- 推荐数: 0
其他信息
- 授权协议: Apache-2.0
- 更新时间: 2026-04-15