example | ||
src | ||
tests | ||
.gitignore | ||
composer.json | ||
composer.lock | ||
README.md |
Similar project to spatie/http-status-check but this will collect ALL the found on URL's (not just first occurrence) and return the results as an array for further processing in your own application (no console command).
Install
composer config repositories.jhodges composer https://git.jhodges.co.uk/composer
composer require jhodges/sitemap
Usage
Note: start the test server as shown below if you want to use the localhost:8080 site as per usage example.
use \JHodges\Sitemap\Crawler;
$crawler=new Crawler();
$crawler->crawl('https://localhost:8080');
$sitemap=$crawler->getResults();
print_r($sitemap);
To crawl multiple areas of the same site that are not interlinked, you may:
use \JHodges\Sitemap\Crawler;
$crawler=new Crawler();
$crawler->crawl('https://localhost:8080/interlinked1');
$crawler->crawl('https://localhost:8080/deeplinked1');
$sitemap=$crawler->getResults();
print_r($sitemap);
Tests
Start the test server, will listen on localhost:8080
cd tests/server
./start_server.sh
Run the tests:
vendor/bin/phpunit tests