Presale Questions Sitemap

Started by Kimbr, May 28, 2010, 12:05:18 PM

Kimbr

I am giving A1 Sitemap Generator a test run and have a question. I made a sitemap and was able to delete the cgi folder it found, but in each folder it has folders which produce code like:

NAME is a replacement for the username it was using
FILE-NAME is a replacement for the file name

<siteMapNode title="DOCUME~1" description="DOCUME~1" url="~/printable/FILE-NAME/file:/C:/DOC~1/">

<siteMapNode title="NAME~1" description="NAME~1" url="~/printable/FILE-NAME/file:/C:/DOC~1/NAME~1/">

<siteMapNode title="LOCALS~1" description="LOCALS~1" url="~/printable/FILE-NAME/file:/C:/DOC~1/NAME~1/LOCALS~1/">

<siteMapNode title="Temp" description="Temp" url="~/printable/FILE-NAME/file:/C:/DOC~1/NAME~1/LOCALS~1/Temp/">

<siteMapNode title="msohtml1" description="msohtml1" url="~/printable/FILE-NAME/file:/C:/DOC~1/NAME~1/LOCALS~1/Temp/msohtml1/">
<siteMapNode title="01" description="01" url="~/printable/FILE-NAME/file:/C:/DOC~1/NAME~1/LOCALS~1/Temp/msohtml1/01/">
                      </siteMapNode>

Should all this really be in the sitemap? I would think only the website name with its URL should be in the sitemap, but please correct me if I am wrong. I know very little about sitemaps. I can't figure out how to block file names or types.

Thank you
Kimberly

Webhelpforums

Hi Kimberly,

It appears you are generating a sitemap file for ASP.Net controls. The URL paths do puzzle me quite a bit. What's your website address? I would like to run a test scan!

As a general answer: A1 Sitemap Generator supports very flexible output filters that allow you to control very finely which pages get included after website scan / sitemap generation. It can also handle noindex, robots, canonical etc. To see these options, you may need to disable easy mode.

I will be happy to help you block filenames and/or types.
TechSEO360 | MicrosysTools.com  | A1 Sitemap Generator, A1 Website Analyzer etc.

Kimbr

Hello,

I did disable easy mode and tried to put the file names I didn't want to index in "output filters" in the "exclude output of internal url...." I tried different words and paths but nothing seemed to work.

The site I was working with is http://www.momsbreak.com/

Thanks, Kimberly

Webhelpforums

I just test scanned your website twice :)


1)
I don't think you actually want a sitemap file for ASP.Net controls? You probably want to create one of the other kinds. (Ignore what I write if you are sure you want an ASP.Net control sitemap file, e.g. if you are converting your website to ASP.Net)

2)
The exact example URL you gave is caused by you having a reference to
http://www.momsbreak.com/cgi-bin/printable/file:/C:/DOCUME~1/KIMBER~1/LOCALS~1/Temp/msohtml1/01/clip_filelist.xml
e.g. at
http://www.momsbreak.com/cgi-bin/printable/search.cgi?action=search&page=&perpage=3&template=articleLists/categoryIndex.html&includeSubcats=1&categoryNum=87
using code like this
<link href="file:///C:\DOCUME~1\KIMBER~1\LOCALS~1\Temp\msohtml1\01\clip_filelist.xml" rel="File-List" />
Read how to view internal linking in the sitemap generator tool.
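
If you want to hunt down such references yourself, here is a minimal Python sketch (not part of A1 Sitemap Generator) that lists href/src attributes pointing at local file: paths in a piece of HTML. The class and function names are made up for illustration, and it assumes you have already saved the page source as a string.

```python
from html.parser import HTMLParser


class LocalFileRefFinder(HTMLParser):
    """Collects href/src attribute values that start with "file:"."""

    def __init__(self):
        super().__init__()
        self.hits = []  # list of (tag, attribute, value) tuples

    def handle_starttag(self, tag, attrs):
        # Also called for self-closing tags such as <link ... />.
        for name, value in attrs:
            if name in ("href", "src") and value and value.lower().startswith("file:"):
                self.hits.append((tag, name, value))


def find_local_file_refs(html):
    """Return every (tag, attribute, value) whose value is a local file: URL."""
    finder = LocalFileRefFinder()
    finder.feed(html)
    finder.close()
    return finder.hits


# The <link> element quoted above, as left behind by Word's HTML export.
sample = r'<link href="file:///C:\DOCUME~1\KIMBER~1\LOCALS~1\Temp\msohtml1\01\clip_filelist.xml" rel="File-List" />'

for tag, attr, value in find_local_file_refs(sample):
    print(tag, attr, value)
```

Removing those link elements from the page templates should stop the file:/C:/... paths from showing up in a scan at all.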

3)
I just tried adding
path match filter: ":cgi-bin/printable/" (without "")
text match filter: "Contact" (without "")

as described in the list filters help. I then rescanned your website and both worked. If you have email, please email me your project file to "info @at@ microsystools.com" (without "" and replacing @at@ with @) and I will send it back to you corrected!
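
To give a rough idea of what the path match filter does, here is a small Python sketch. It is only an illustration, not the program's actual code: it assumes that a filter starting with ":" means "the relative path begins with this text" and that any other filter matches anywhere in the path. The function name is_excluded is hypothetical.

```python
from urllib.parse import urlsplit


def is_excluded(url, path_filters):
    """Return True if the URL's relative path matches any filter.

    Assumption (for illustration only): a leading ":" anchors the filter
    to the start of the relative path; otherwise it may match anywhere.
    """
    relative_path = urlsplit(url).path.lstrip("/")
    for path_filter in path_filters:
        if path_filter.startswith(":"):
            if relative_path.startswith(path_filter[1:]):
                return True
        elif path_filter in relative_path:
            return True
    return False


filters = [":cgi-bin/printable/"]

urls = [
    "http://www.momsbreak.com/",
    "http://www.momsbreak.com/cgi-bin/printable/search.cgi?action=search",
    "http://www.momsbreak.com/cgi-bin/printable/file:/C:/DOCUME~1/KIMBER~1/LOCALS~1/Temp/msohtml1/01/clip_filelist.xml",
]

for url in urls:
    print("exclude" if is_excluded(url, filters) else "keep", url)
```

Under those assumptions, everything below cgi-bin/printable/ is dropped from the output while the home page is kept.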
