Apologies if this is a stoopid question, but I've tried various solutions and cannot resolve my problem.
I am looking to download each sub-page from a site as a separate file, but the search seems to go off at a tangent.
Given that it is a .gov site, as you can imagine, my error results in lots of 'wrong' pages being downloaded.
The site is https://www.gov.uk/countryside-stewardship-grants which has 260 pages below it. It is only the 260 pages that I need to download, but I cannot see how to isolate those within the A1 options.
Can someone please advise where I am going wrong and how to resolve it?
Thanks.
When you say
QuoteThe site is https://www.gov.uk/countryside-stewardship-grants which has 260 pages below it.
Do you mean it has links to 260 pages you want or that all page URLs are below
countryside-stewardship-grants/Either way, many ways to achieve what you want:
- Root path to https://www.gov.uk/
- Add to start search paths: https://www.gov.uk/countryside-stewardship-grants
- Limit "scan website | output filters" paths to
:countryside-stewardship-grants
Relevant help pages:
https://www.microsystools.com/products/website-download/help/root-aliases-start-paths/ (https://www.microsystools.com/products/website-download/help/root-aliases-start-paths/)
https://www.microsystools.com/products/website-download/help/website-crawler-output-filters/ (https://www.microsystools.com/products/website-download/help/website-crawler-output-filters/)
https://www.microsystools.com/products/website-download/help/website-crawler-scanner-filters/ (https://www.microsystools.com/products/website-download/help/website-crawler-scanner-filters/)
https://www.microsystools.com/products/website-download/help/crawl-robots-noindex-nofollow/ (https://www.microsystools.com/products/website-download/help/crawl-robots-noindex-nofollow/)
If you need help configuring a project file, you can also drop us an email:
https://www.microsystools.com/home/contact.php (https://www.microsystools.com/home/contact.php)