Playground
A companion test site for evaluation and automated checks — provides predictable pages and structures to validate crawling and scraping end‑to‑end.
Inside a docker-compose network, it’s available on the URL http://playground
and all examples on this website use the service to provide users with predictable idempotent query results. Additionally, the playground is available on http://localhost:2808
.
This service is a web server with static files in the following structure:
playground/
|- index.html
|- about.html
|- faq.html
|- robots.txt
|- sitemap.xml
|- armor_and_accessories/
| |- 1/
| | |- index.html
| | |- cloak_of_the_phantom.html
| | |- ...
| | |- shield_of_the_thunder_god.html
| | ...
| |- 3/
| | |- index.html
| | |- chalice_of_dreams.html
| | |- ...
| | |- mask_of_the_forgotten.html
| - beast_and_creature_items/
| - ...
The playground is designed to allow testing of most of the web crawling scenarios, for instance:
- Tree page navigation
- Graph page navigation
- Paging
- etc.
As for scraping, the leaf pages have the same structure:
<h1>NAME</h1>
<div class="price">
<b>Price: </b>
<span>PRICE</span>
</div>
<div class="desc">
<b>Description:</b>
<p>DESCRIPTION</p>
</div>
So that the majority of the example queries would extract these three data items from the pages
- NAME
- PRICE
- DESCRIPTION
In the future versions, the playground might be extended with new data items and subfolders, but this core part will be kept as long as possible for backward compatibility