On Page SEO

Started by samaustin141, September 13, 2012, 02:46:00 AM

samaustin141

What code is write in robots.txt file?
How to create robots.txt file and where to submit in website?
After that any change in webmaster tools?

siyajoshi

Hello..
To remove your site from search engines and prevent all robots from crawling it in the future, place the following robots.txt file in your server root:
User-agent: *
Disallow: /
To add your site from search engines and allow all robots from crawling it in the future, place the following robots.txt file in your server root:
User-agent: *
Allow: /

marymnewland

 Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

It works likes this: a robot wants to vists a Web site URL, say http://www.example.com/welcome.html. Before it does so, it firsts checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.

There are two important considerations when using /robots.txt:

    robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
    the /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.

josshray

Go to Google webmaster code and check the crawler error options, you'll see the robot.txt code there.

marymnewland

Quote from: josshray on October 23, 2012, 04:41:42 PM
Go to Google webmaster code and check the crawler error options, you'll see the robot.txt code there.

I agree with this. Google Webmasters will help you more.

fab

Quote from: marymnewland on September 18, 2012, 02:14:41 AM
Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

It works likes this: a robot wants to vists a Web site URL, say http://www.example.com/welcome.html. Before it does so, it firsts checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.

There are two important considerations when using /robots.txt:

    robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
    the /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.
Thanks For Share .

JigarNetworks

On page SEO or search engine optimisation is making sure that your website is as search engine friendly as possible.

Make sure that you have unique content on every single page.

kevvin@20

You have must required for tobots.txt and sitemap on your webmaster...!

QuizMEOnline

Quote from: marymnewland on September 18, 2012, 02:14:41 AM
Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

It works likes this: a robot wants to vists a Web site URL, say http://www.example.com/welcome.html. Before it does so, it firsts checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.

There are two important considerations when using /robots.txt:

    robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
    the /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.
There is good suggestion for your site.

icecube media

Robots.txt file contains the instructions which are for Google bot that which page to crawl and which page is not to crawl.
You can create it manually or with the help of online free tools. And once you create robots.txt file then you have to upload it in the root folder of your website.

allenhill

On-Page SEO is the process of ensuring that the search engines understand what your page is about. This is done by structuring the page around a Keyword so that when that Keyword is typed into the search box your page is seen by the search engines as highly relevant.


martinsherman

The robots.txt file is a text file that tells search engine crawlers which portions of your website they should NOT index. If you don't want to restrict search engine crawlers, you should simply create an empty robots.txt file (e.g., touch robots.txt) or one that looks like this:

User-agent: *
Disallow:
Once you have created a robots.txt file, you store it in the root directory of your Web server.

Hope this helps you!!

bradely

Be very careful if you use robots file, as a mistake can blank out the entire site

Alex Thompson

On-page is considered as one of the primary process of SEO Campaign. All the changes are applied on the website page and it is the process for improving the appearance of the website page and we can say that it is the process of creating a web-page more friendly with the search engine.

Rohit1982

Quote from: marymnewland on September 18, 2012, 02:14:41 AM
Web site owners use the /robots.txt file to give instructions about their site to web robots; this is called The Robots Exclusion Protocol.

It works likes this: a robot wants to vists a Web site URL, say http://www.example.com/welcome.html. Before it does so, it firsts checks for http://www.example.com/robots.txt, and finds:

User-agent: *
Disallow: /

The "User-agent: *" means this section applies to all robots. The "Disallow: /" tells the robot that it should not visit any pages on the site.

There are two important considerations when using /robots.txt:

    robots can ignore your /robots.txt. Especially malware robots that scan the web for security vulnerabilities, and email address harvesters used by spammers will pay no attention.
    the /robots.txt file is a publicly available file. Anyone can see what sections of your server you don't want robots to use.

So don't try to use /robots.txt to hide information.
Nice information, many thanks to the marymnewlandr

More About Our Webmaster Tools for Windows and Mac

HTML, image, video and hreflang XML sitemap generatorA1 Sitemap Generator
      
website analysis spider tool for technical SEOA1 Website Analyzer
      
SEO tools for managing keywords and keyword listsA1 Keyword Research
      
complete website copier toolA1 Website Download
      
create custom website search enginesA1 Website Search Engine
      
scrape data into CSV, SQL and databasesA1 Website Scraper