Understand the JavaScript SEO basics

How Googlebot processes JavaScript

Googlebot processes JavaScript web apps in three main phases:

  1. Crawling
  2. Rendering
  3. Indexing

Diagram: Googlebot crawls, renders, and indexes a page.

Googlebot queues pages for both crawling and rendering. It is not immediately obvious when a page is waiting for crawling and when it is waiting for rendering.

When Googlebot fetches a URL from the crawling queue by making an HTTP request, it first checks whether you allow crawling by reading the robots.txt file. If the file marks the URL as disallowed, Googlebot skips the HTTP request and skips the URL.
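For example, a robots.txt file like the following (the path is a placeholder) tells Googlebot not to crawl anything under a given section:

    # robots.txt at the root of the site
    # Disallow crawling of a hypothetical /private/ section
    User-agent: Googlebot
    Disallow: /private/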

Googlebot then parses the response for other URLs in the href attribute of HTML links and adds the URLs to the crawl queue. To prevent link discovery, use the nofollow mechanism.
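For example, Googlebot discovers the first link below and adds its URL to the crawl queue, while rel="nofollow" asks it not to follow the second (both URLs are placeholders):

    <!-- Googlebot finds this URL and adds it to the crawl queue -->
    <a href="/products">Products</a>

    <!-- The nofollow hint asks Googlebot not to follow this link -->
    <a href="/drafts/unfinished-page" rel="nofollow">Draft</a>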

Crawling a URL and parsing the HTML response works well for classical websites or server-side rendered pages, where the HTML in the HTTP response contains all the content. Some JavaScript sites may use the app shell model, where the initial HTML does not contain the actual content; Googlebot needs to execute JavaScript before it can see the page content that JavaScript generates.
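As a minimal illustration of the app shell pattern (the element ID and content are placeholders), the initial HTML response carries no real content; it only appears after the script runs:

    <!-- Initial HTML response: an empty shell with no page content -->
    <div id="app"></div>
    <script>
      // The actual content exists only after this JavaScript runs,
      // so Googlebot must render the page to see it.
      document.getElementById('app').innerHTML =
        '<h1>Product list</h1><p>Loaded entirely on the client.</p>';
    </script>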

Googlebot queues all pages for rendering, unless a robots meta tag or header tells Googlebot not to index the page. The page may stay on this queue for a few seconds, but it can take longer than that. Once Googlebot's resources allow, a headless Chromium renders the page and executes the JavaScript. Googlebot parses the rendered HTML for links again and queues the URLs it finds for crawling. Googlebot also uses the rendered HTML to index the page.

Keep in mind that server-side rendering or pre-rendering is still a great idea, because it makes your website faster for users and crawlers, and not all bots can run JavaScript.
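A minimal server-side rendering sketch using Node's built-in http module; renderPage() is a hypothetical stand-in for whatever produces your full HTML:

    // Sketch only: serve complete HTML so crawlers that cannot run
    // JavaScript still receive the page content.
    const http = require('http');

    function renderPage() {
      // A real app would render its components to an HTML string here.
      return '<!doctype html><html><body><h1>Products</h1></body></html>';
    }

    http.createServer((req, res) => {
      res.writeHead(200, { 'Content-Type': 'text/html' });
      res.end(renderPage());
    }).listen(3000);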

Describe your page with unique titles and snippets

Unique, descriptive titles and helpful meta descriptions help users quickly identify the best result for their goal. We explain what makes good titles and descriptions in our guidelines.

You can use JavaScript to set or change the meta description as well as the title.
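For example, standard DOM APIs can set both; the strings here are placeholders:

    // Set the document title with JavaScript.
    document.title = 'Example page title';

    // Create or update the meta description.
    let description = document.querySelector('meta[name="description"]');
    if (!description) {
      description = document.createElement('meta');
      description.name = 'description';
      document.head.appendChild(description);
    }
    description.content = 'A short, helpful summary of this page.';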

Google Search might show a different title or description based on the user's query. This happens when the title or description has low relevance for the page content, or when we find alternatives in the page that better match the search query. See this page for more information on titles and the description snippet.

Write compatible code

Browsers offer many APIs and JavaScript is a quickly-evolving language. Googlebot has some limitations regarding which APIs and JavaScript features it supports. To make sure your code is compatible with Googlebot, follow our guidelines for troubleshooting JavaScript problems.

We recommend differential serving and polyfills if you detect that a browser API you need is missing. Because some browser features cannot be polyfilled, check the polyfill documentation for potential limitations.
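A minimal feature-detection sketch (both handlers are hypothetical stand-ins for your own code):

    // Stand-ins for application code.
    function initWithObserver() { /* use the native API */ }
    function initFallback() { /* simpler behavior for older clients */ }

    // Only rely on a browser API after confirming it exists.
    if ('IntersectionObserver' in window) {
      initWithObserver();
    } else {
      initFallback(); // or load a polyfill before continuing
    }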

Use meaningful HTTP status codes

Googlebot uses HTTP status codes to find out if something went wrong when crawling the page.

You should use a meaningful status code to tell Googlebot if a page should not be crawled or indexed, such as a 404 for a page that could not be found or a 401 for a page behind a login. You can also use HTTP status codes to tell Googlebot when a page has moved to a new URL, so that the index can be updated accordingly.

Here's a list of HTTP status codes and when to use them:

HTTP status   When to use
301 / 302     The page has moved to a new URL.
401 / 403     The page is unavailable due to permission issues.
404 / 410     The page is no longer available.
5xx           Something went wrong on the server side.
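As a sketch, a Node.js server might map these cases like this (the URLs are hypothetical):

    const http = require('http');

    http.createServer((req, res) => {
      if (req.url === '/old-page') {
        // Moved permanently: Googlebot updates the index to the new URL.
        res.writeHead(301, { Location: '/new-page' });
        res.end();
      } else if (req.url === '/new-page') {
        res.writeHead(200, { 'Content-Type': 'text/html' });
        res.end('<h1>New page</h1>');
      } else {
        // Unknown URL: a real 404 avoids indexing a "soft 404" page.
        res.writeHead(404, { 'Content-Type': 'text/html' });
        res.end('<h1>Not found</h1>');
      }
    }).listen(3000);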

Use meta robots tags carefully

You can prevent Googlebot from indexing a page or following links through the meta robots tag. For example, adding the following meta tag to the top of your page blocks Googlebot from indexing the page:

    <!-- Googlebot won't index this page or follow links on this page -->
    <meta name="robots" content="noindex, nofollow">

When Googlebot encounters "noindex" in the robots meta tag, it doesn't render or index the page.

Using JavaScript to change or remove the robots meta tag might not work as expected. Googlebot skips rendering and JavaScript execution if the meta robots tag initially contains "noindex". If you want to use JavaScript to change the content of the robots meta tag, do not set the meta tag's value to "noindex".
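Going from "noindex" to "index" with JavaScript therefore will not work; the reverse direction can. For example, JavaScript can add a noindex tag when a client-side check fails (the API endpoint and response shape here are hypothetical):

    // If a hypothetical API reports the item is gone, add noindex so
    // the page is dropped from the index after rendering.
    fetch('/api/products/123')
      .then((response) => response.json())
      .then((product) => {
        if (!product.available) {
          const metaRobots = document.createElement('meta');
          metaRobots.name = 'robots';
          metaRobots.content = 'noindex';
          document.head.appendChild(metaRobots);
        }
      });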

Fix images and lazy-loaded content

Images can be quite costly on bandwidth and performance. A good strategy is to use lazy-loading to only load images when the user is about to see them. To make sure you're implementing lazy-loading in a search-friendly way, follow our lazy-loading guidelines.
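For example, native lazy-loading keeps an image out of the initial load while staying discoverable (the file name and dimensions are placeholders):

    <!-- The browser defers loading until the image nears the viewport;
         width and height reserve space and avoid layout shifts. -->
    <img src="product-photo.jpg" alt="Product photo"
         loading="lazy" width="640" height="480">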


Credit: Google
