Losing the Analysis Filter Settings

Started by richard123, April 20, 2012, 08:21:17 PM

richard123

After I do a scan, when I go back to check the Analysis Filter settings (before I run the scan again) they are gone. What am I doing wrong? I assume I should not have to add them again for each scan, right?


Webhelpforums

1)
Are you sure you actually added them using the [ + ] button?
(Many people forget this.)

2)
Are you sure they are not still there if you click the "down" arrow to see the list?
(The text field box will be cleared. But the filters are still in the list.)
TechSEO360 | MicrosysTools.com  | A1 Sitemap Generator, A1 Website Analyzer etc.

richard123

The + button did the trick :) Thanks!

Two more questions:

I only want to index pages that start with http://www.mydomain.com/page?lvl
(i.e. http://www.mydomain.com/page?lvl0, http://www.mydomain.com/page?lvl01&lvl1=2 etc)

I thought the correct regex would be http://www.mydomain.com/page?lvl[.]+ but that's not working for me.

Also, in the "Limit analysis of internal URLs..." file extension .htm appears by default. I don't want it limited to htm extensions. Can you please explain how that works? Can I just leave it as is?

Webhelpforums

Assuming you have set website root to http://mydomain.com


> pages that start with http://www.mydomain.com/page?lvl

A "limit to" regex filter would look like this:
"::page\?lvl" (without the quotation marks)

You could add that filter (using the [ + ] button) to both:

"output filters"
http://www.microsystools.com/products/sitemap-generator/help/website-crawler-output-filters/

"analysis filters"
http://www.microsystools.com/products/sitemap-generator/help/website-crawler-scanner-filters/

As you limit output and analysis/crawl to specific URLs you may need to add URLs to "start search from paths":
http://www.microsystools.com/products/sitemap-generator/help/root-aliases-start-paths/
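To see why the pattern from the question does not match while the escaped form does, here is a minimal sketch using Python's standard `re` module. It only illustrates general regex behaviour and is not how A1SG evaluates filters internally. The leading "::" in the filter above is assumed to be the tool's own regex marker, so it is left out here, and the third URL is a made-up counterexample.

```python
import re

# Pattern from the question: the unescaped "?" makes the preceding "e" optional
# (so it never matches a literal question mark), and "[.]+" requires one or
# more literal dots after "lvl".
unescaped = re.compile(r"http://www.mydomain.com/page?lvl[.]+")

# Escaped form: "\?" matches a literal question mark.
# (The tool-specific "::" prefix is omitted; that it is only a marker is an assumption.)
escaped = re.compile(r"page\?lvl")

urls = [
    "http://www.mydomain.com/page?lvl0",
    "http://www.mydomain.com/page?lvl01&lvl1=2",
    "http://www.mydomain.com/about",  # hypothetical URL that should be excluded
]

for url in urls:
    print(url, bool(unescaped.search(url)), bool(escaped.search(url)))
```

The unescaped pattern matches none of these URLs, while the escaped one matches the two "page?lvl" URLs and skips the third.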


Be sure not to have any incorrect filters in the lists since that can really break the crawl. If you continue to have problems, feel free to email directly with your project file:
http://www.microsystools.com/home/contact.php


> "Limit analysis of internal URLs..." file extension .htm appears by default

Please click the "down" arrow. You will see lots of file extensions are in the list. (Not just the one you mention!)

If you do not want to use file extensions (although the list is extremely comprehensive), there is also a list using MIME types. That means you *can* remove all the file extensions from the list with the [ - ] button if you wish. However, unless you are very sure, I would recommend against it :)


> Can you please explain how that works?

Based on your questions, I think you should check this page about how the [ + ] and [ - ] buttons and dropdown lists work in A1SG configuration (basically, all items in the dropdown list are active/selected):

http://www.microsystools.com/products/sitemap-generator/help/sitemap-generator-user-interface/
