See Our Webmaster Tools for Windows and Mac

HTML, image, video and hreflang XML sitemap generatorA1 Sitemap Generator
      
website analysis spider tool for technical SEOA1 Website Analyzer
      
SEO tools for managing keywords and keyword listsA1 Keyword Research
      
complete website copier toolA1 Website Download
      
create custom website search enginesA1 Website Search Engine
      
scrape data into CSV, SQL and databasesA1 Website Scraper
      

Offline filenames

  • 1 Replies
  • 3095 Views
*

Zeno

  • Newbie
  • *
  • 1
  • +0/-0
    • View Profile
Offline filenames
« on: February 27, 2011, 06:32:03 PM »
I've just started using Website Downloader, but I can't see an answer to this anywhere.

The downloaded files mainly have filenames of the form MS_XXX.html. I'm not sure why it does this, but is there any way to preserve the online filenames?

Thanks!

*

Webhelpforums

  • Administrator
  • Hero Member
  • *****
  • 1435
  • +6/-0
  • Shared between Microsys, WebHelpForums and helpers
    • View Profile
    • Webmaster and Website Help Forums
Re: Offline filenames
« Reply #1 on: February 28, 2011, 05:01:56 AM »
Hi,

Imagine A1 Website Download encounters a website with these 3 urls:
page?var=valueA
page?var=valueB
pagevar=valueA

When saving these to disk, e.g. "?" is not allowed in file names by Windows.
This would force A1WD do convert those URLs. End result file names on disk could be like this:

page?var=valueA :: pagevar=valueA
page?var=valueB ::pagevar=valueB
pagevar=valueA :: pagevar=valueA

But that gave collision... Above is just an example. There are many possible ways s collision can happen. And for various reasons the fastest/easiest/practical way to deal with that is to use the MS_xxx system (where xxx is a counter)

The MS_xxx system is only used when collisions are possible. Possibly the system can be improved upon, but collisions actually happen quite often (I tried various methods in past "modifying" URLs when saving to disk, but it often led to collisions. However, one could add separate tracking of this and only use MS_xxx when *really* necessary.)
« Last Edit: February 28, 2011, 05:13:57 AM by Webhelpforums »
TechSEO360 | MicrosysTools.com  | A1 Sitemap Generator, A1 Website Analyzer etc.

 

HTML, image, video and hreflang XML sitemap generatorA1 Sitemap Generator
      
website analysis spider tool for technical SEOA1 Website Analyzer
      
SEO tools for managing keywords and keyword listsA1 Keyword Research
      
complete website copier toolA1 Website Download
      
create custom website search enginesA1 Website Search Engine
      
scrape data into CSV, SQL and databasesA1 Website Scraper