Why
and Who Would Want to Stop Search Engines
For most websites, search engines
are the major source of traffic. You may ask, why would anyone want to block
search engines?
Believe it or not, there are many
users who work on their sites by putting them on a live publicly accessible
domain instead of creating a local development environment or creating a development site.
Some people create project management sites
using WordPress. There are many people who use WordPress to create private blogs.
In all these situations, you probably don’t want to be indexed by search
engines and be found when you are not ready for that.
A common misconception is that if I
do not have links pointing to my domain, then search engines will probably
never find my website. This is not completely true.
There are many ways search engines
can find a website linked elsewhere. For example:
- Your domain name could have been previously owned by
someone else and they still have some links pointing to your website now.
- Some domain search site’s results could get indexed
with your link on them.
- There are literally thousands of pages with just list
of domain names, your site can appear on one of those.
There are many things happening on
the web and none of them are under your control. However, your website is still
under your control, and you can instruct search engines to not follow or index
your website.
Blocking
Search Engines from Crawling and Indexing Your WordPress Site
WordPress comes with a built-in
feature that allows you to instruct search engines not to index your site. All
you need to do is visit Settings » Reading and check the box next to
Search Engine Visibility option.
When this box is checked, WordPress
adds this line to your website’s header:
1
|
<meta name='robots'
content='noindex,follow'
/>
|
WordPress also modifies your site’s
robots.txt file and add these lines to it:
1
2
|
User-agent: *
Disallow: /
|
These lines ask robots (web
crawlers) not to index your pages. However, it is totally up to search engines
to accept this request or ignore it. Even though most search engines respect
this, some page or random image from your site may get indexed.
How
to Make Sure Your Site Doesn’t Appear in Search Results?
The most effective way to block
search engines from a live website is by password protecting your entire
WordPress site on the server level. This means when anyone accesses your
website they are asked to provide a username and password even before they
reach WordPress. This includes search engines as well. Upon login failure, they
are shown 401 error and the bots turn away. Here is how to password protect an
entire WordPress site.
Password
Protecting an Entire Site using cPanel
If your WordPress
hosting provider offers cPanel access to
manage your hosting account, then you can protect your entire site using
cPanel. Simply login to cPanel dashboard and then click on password protect
directories.
This will bring up a popup where you
need to choose the document root.
On the next screen, select the
folder where your WordPress site is installed. It is usually public_html or www
directory. After that, check the box next to ‘Password protect this directory’
option. Next, provide a name to the protected directory and hit the save
button.
Later, you will see a success
message and link to go back. Clicking on the go back link will bring you to the
password protection screen. Now, you need to add a username and password which
will be required to view your website.
That’s all your website is now
password protected, and no one including the search engines can access your
website.
Here is another tutorial on how to password protect your WordPress site. However, in this tutorial, you will be using a plugin. If
for some reason you deactivate the plugin, then your site will be accessible to
search engines again.
We hope this article helped you stop
search engines from crawling or indexing your WordPress site.