Best Options for Indexing Huge, DB-Driven Site

Started by pbickford, October 28, 2024, 05:44:47 PM

pbickford

Hi folks,

I'm trying to set up a sitemap to encourage Google to index the content of a very large (1mil + records) ASP.Net site, particularly with an eye toward having Google notice that we have information on the various items listed.

A typical URL would be something like:

MySite/app/ItemDetail?ItemID=12345

(assume that the ItemID will run to 1mil+ records)

Is my best strategy to let it index for a day or two to iterate the site, then submit the entire list of URLs from the matches? Or is there a better way to go?

Also: What's the best way to automatically have the crawler exclude any URLs that would terminate in a login screen prompt (much of the site is intentionally inaccessible unless you're logged in--don't care if those parts are crawled/returned in search results).

Thanks!
-Pete

Webhelpforums

You may want to consider have A1SG crawl localhost version:
https://www.microsystools.com/products/sitemap-generator/help/xml-sitemap-generator-localhost/

You can also check this about crawling large websites:
https://www.microsystools.com/products/sitemap-generator/help/creating-sitemaps-large-websites/

If you are unhappy about certain URLs included in *final* XML sitemaps (A1SG may show your URLs post scan it knows not to include in sitemaps you can consider using "Output filers" which are very powerful for configuration:
https://www.microsystools.com/products/sitemap-generator/help/website-crawler-output-filters/

Regarding submitting: Just submit the sitemap index file generated (and upload all):
https://www.microsystools.com/products/sitemap-generator/help/xml-sitemaps-page-limit/
TechSEO360 | MicrosysTools.com  | A1 Sitemap Generator, A1 Website Analyzer etc.

More About Our Webmaster Tools for Windows and Mac

HTML, image, video and hreflang XML sitemap generatorA1 Sitemap Generator
      
website analysis spider tool for technical SEOA1 Website Analyzer
      
SEO tools for managing keywords and keyword listsA1 Keyword Research
      
complete website copier toolA1 Website Download
      
create custom website search enginesA1 Website Search Engine
      
scrape data into CSV, SQL and databasesA1 Website Scraper