While most of our efforts in SEO and content marketing focus on getting pages indexed and ranking, another consideration is what to do when you don’t want a page to be indexed.
Depending on your CMS, this can be as simple as flipping a setting: in WordPress you can use a plugin to toggle indexing off for a page, in Webflow you can configure the page in a similar way, and the list goes on.
But what’s going on under the hood, and what type of page wouldn’t we want to index?
Let’s start by quoting Google directly: “noindex is a rule set with either a <meta> tag or HTTP response header and is used to prevent indexing content by search engines that support the noindex rule, such as Google. When Googlebot crawls that page and extracts the tag or header, Google will drop that page entirely from Google Search results, regardless of whether other sites link to it.”
Here’s what that looks like. The tag goes inside the <head> section of your page or HTML document:
<head>
<meta name="robots" content="noindex, nofollow">
</head>
Then, the next time Google recrawls the page, the noindex will take effect. To speed up removal, use Google Search Console → Removals and request a temporary removal.
Be precise here: applying the tag at the template level when you meant to apply it to a single page could cause an entire section of your website to lose traffic. The important thing is to be intentional about it.
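One way to stay intentional is to check which pages actually carry the rule before and after a change. The sketch below is a minimal illustration, not an official tool: the names `RobotsMetaParser` and `has_noindex` are made up for this example, and it only handles the simple cases of a robots or googlebot <meta> tag and a plain X-Robots-Tag response header.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key: (value or "") for key, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            for part in attr.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def has_noindex(html, headers=None):
    """Return True if the HTML or the response headers carry a noindex rule.

    Simplification (assumption of this sketch): header values with a
    user-agent prefix such as "googlebot: noindex" are not handled.
    """
    parser = RobotsMetaParser()
    parser.feed(html)
    if "noindex" in parser.directives:
        return True
    for name, value in (headers or {}).items():
        if name.lower() == "x-robots-tag":
            if "noindex" in [part.strip().lower() for part in value.split(",")]:
                return True
    return False


PAGE = '<head><meta name="robots" content="noindex, nofollow"></head>'
print(has_noindex(PAGE))                                          # True
print(has_noindex("<head><title>Blog post</title></head>"))       # False
print(has_noindex("<html></html>", {"X-Robots-Tag": "noindex"}))  # True
```

Running a check like this against a handful of pages that should stay indexed is a quick way to catch a rule that was applied to a whole template by mistake.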
Google also notes: “For the noindex rule to be effective, the page or resource must not be blocked by a robots.txt file, and it has to be otherwise accessible to the crawler. If the page is blocked by a robots.txt file or the crawler can't access the page, the crawler will never see the noindex rule, and the page can still appear in search results, for example if other pages link to it.”
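The robots.txt conflict Google describes can be checked before you rely on a noindex rule. Here is a minimal sketch using Python's standard-library `urllib.robotparser`; the function name `crawler_can_see_noindex`, the sample robots.txt, and the example.com URLs are assumptions for illustration, and the standard-library parser may not match Googlebot's own matching rules in every case.

```python
from urllib.robotparser import RobotFileParser


def crawler_can_see_noindex(robots_txt, url, user_agent="Googlebot"):
    """Return True if robots.txt lets the crawler fetch the URL.

    If this returns False, the crawler never fetches the page, so it
    never sees a noindex rule placed on it.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt for illustration only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""

print(crawler_can_see_noindex(ROBOTS_TXT, "https://example.com/private/report.html"))  # False
print(crawler_can_see_noindex(ROBOTS_TXT, "https://example.com/thank-you"))            # True
```

A False result for a page you have tagged with noindex means the two settings are working against each other: the page is blocked from crawling, so the rule on it is never read.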
The same robots.txt caveat applies to every noindex variation, whether it is set with a <meta> tag or an HTTP response header.
Most of the pages on your website will be indexed, but here are some examples of pages where you’ll want noindex:
Internal and Admin Pages
Search & Filtered Results Pages
Thank You & Confirmation Pages
Staging & Test Environments
Cart & Checkout Pages
Expired or Discontinued Product Pages
Affiliate or Syndicated Content
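As a rough starting point for spotting pages in these categories, a list of URLs can be matched against path patterns. This is only a sketch: the patterns, the labels, and the function name `noindex_candidates` are hypothetical and would need to be adapted to your own URL structure, and categories that a URL alone cannot reveal (such as expired products or syndicated content) are left out.

```python
import re
from urllib.parse import urlsplit

# Hypothetical path patterns; adjust them to your own site's URL structure.
NOINDEX_PATTERNS = {
    "internal/admin": re.compile(r"^/(admin|internal)(/|$)"),
    "search results": re.compile(r"^/search(/|$)"),
    "thank you/confirmation": re.compile(r"^/(thank-you|confirmation)(/|$)"),
    "cart/checkout": re.compile(r"^/(cart|checkout)(/|$)"),
}


def noindex_candidates(urls):
    """Return (url, category) pairs for URLs whose path matches a pattern."""
    flagged = []
    for url in urls:
        path = urlsplit(url).path
        for label, pattern in NOINDEX_PATTERNS.items():
            if pattern.search(path):
                flagged.append((url, label))
                break  # one category per URL is enough for a first pass
    return flagged


URLS = [
    "https://example.com/blog/post",
    "https://example.com/cart",
    "https://example.com/admin/users",
    "https://example.com/thank-you/",
]
for url, label in noindex_candidates(URLS):
    print(f"{url} -> {label}")
```

The output is a list of candidates to review by hand, not a list of pages to noindex automatically.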
Implementing noindex tags is an essential SEO strategy for controlling which parts of your website search engines include in their index and, ultimately, what users will see. While we often focus on getting pages indexed, there are situations where preventing indexing is equally beneficial.
The goal is to apply these practices strategically to enhance site structure, get any garbage out of the SERPs, and protect sensitive information, while being intentional enough to avoid unintended ranking losses.
If you’re unsure which pages to noindex, performing an audit can help determine the right approach.
And we’ll leave you with one last note from Google: “We have to crawl your page in order to see <meta> tags and HTTP headers. If a page is still appearing in results, it's probably because we haven't crawled the page since you added the noindex rule. Depending on the importance of the page on the internet, it may take months for Googlebot to revisit a page. You can request that Google recrawl a page using the URL Inspection tool.”