dr4g0nsr/sitemap-crawler
最新稳定版本:1.0
Composer 安装命令:
composer require dr4g0nsr/sitemap-crawler
包简介
Crawler for any type of site using robots.txt and sitemap.xml as the source of URL. Useful for cache regenerating.
README 文档
README
Sitemap Crawler
Crawler using sitemap to crawl site/regenerate cache.
Files are not stored, point is just to trigger url.
Get code using composer
composer require dr4g0nsr/sitemap-crawler
How to implement
Create config.php:
<?php
$settings = [
"sleep" => 0,
"excluded" => []
];
Use code like this:
<?php
require __DIR__ . '/vendor/autoload.php';
require __DIR__ . '/config.php';
use dr4g0nsr\Crawler;
$url = 'https://candymapper.com';
print "Crawler version: " . Crawler::version() . PHP_EOL;
$crawler = new Crawler(['sleep' => 0, 'verbose' => true]);
$crawler->loadConfig(__DIR__ . '/config.php');
$sitemap = $crawler->getSitemap($url);
$crawler->crawlURLS($sitemap);
That would be simplest code, you can also find it in test subdir under vendor/dr4g0nsr/SitemapCrawler/test.
统计信息
- 总下载量: 22
- 月度下载量: 0
- 日度下载量: 0
- 收藏数: 4
- 点击次数: 1
- 依赖项目数: 0
- 推荐数: 0
其他信息
- 授权协议: OSL-3.0
- 更新时间: 2022-11-07