I want to exclude certain pages from being indexed. However, it is an AND statement. I want to exclude all pages that are in the /products/ folder AND have the extension of .html. How would I do this?
You will want to use regular expressions in those case.
First see:
http://www.microsystools.com/products/sitemap-generator/help/website-crawler-scanner-filters/ (http://www.microsystools.com/products/sitemap-generator/help/website-crawler-scanner-filters/)
(which URLs/pages get analyzed)
http://www.microsystools.com/products/sitemap-generator/help/website-crawler-output-filters/ (http://www.microsystools.com/products/sitemap-generator/help/website-crawler-output-filters/)
(which URLs/pages get included in output)
Other help pages
http://www.microsystools.com/products/sitemap-generator/help/easy-sitemap-generator-mode/ (http://www.microsystools.com/products/sitemap-generator/help/easy-sitemap-generator-mode/)
http://www.microsystools.com/products/sitemap-generator/help/sitemap-generator-user-interface/ (http://www.microsystools.com/products/sitemap-generator/help/sitemap-generator-user-interface/)
Then create a regular expression like this
/products/.*\.html
which will match any URL with */products/* and *.html*
As seen from help above, A1SG needs regex filters prepended with "::" so it becomes
::/products/.*\.html