internal links wrongly recognized as external links for https

Started by bigben, September 21, 2015, 02:04:58 AM


Hi, I came across your site as I need to generate sitemap for my page.
My site is in https and I was testing my page which contains around 50 links to other pages within my site, also in https.

I want the crawler to crawl those pages linked to from my test page, however, the crawler is classifying the linked internal pages as "external links". I think it's because the crawler automatically attached the port number into the link and hence see that as a different domain?

For instance:
from my test page the link is

your crawler would identify the linked page as and classify it as External link.

I need all these page in my sitemap, is there ways around it for your software?


Default settings should automaically alias http, https and port 80 variations.

Try first turn off easy mode off

Then read how you can add more root domain aliases yourself:
(i.e. if A1 encounters a https link and http is default, it simply "aliases" it)

If you are curious, notice that all domains in an XML sitemap has to be the exact same:
TechSEO360 |  | A1 Sitemap Generator, A1 Website Analyzer etc.

More About Our Webmaster Tools for Windows and Mac

HTML, image, video and hreflang XML sitemap generatorA1 Sitemap Generator
website analysis spider tool for technical SEOA1 Website Analyzer
SEO tools for managing keywords and keyword listsA1 Keyword Research
complete website copier toolA1 Website Download
create custom website search enginesA1 Website Search Engine
scrape data into CSV, SQL and databasesA1 Website Scraper