Changing the way you receive insight through tools and analytics
Continues learning of on-going lifetime risk in keyword
Niche/market monitoring and prioritization
Early detection of anomalies of Google SERPs
Home SEO Guidelines Guidelines for Google...
Written by cmlabs
|

Guidelines for Google Crawling

Last updated: Jul 16, 2021
Disclaimer: This cmlabs SEO guidelines contain comprehensive information about SEO, such as an introduction and general guidelines. You may be sent through SEO Terms in cmlabs.co from third parties or links. Such external links are not investigated, or checked for accuracy and reliability by us. We do not assume responsibility for the accuracy or reliability of any information offered by third-party websites.

Written by cmlabs

A Guide to Crawling and Link Building

 

How to make it easy for Google crawler to index your website? Obviously every webmaster worth their salt wants their contents to stand out in the Google SERP. Learn further about the following comprehensive guide on how to make your links crawlable.

 

The crawling process starts with a list of web addresses taken from the previous crawling activities and sitemaps shared by website owners. The way crawlers work is by using links on particular websites to find other new web pages. What you have to know is Google can only detect links with < a > tag on them that match with the URL. Here are some considerations that you have to keep in mind to make your links crawlable. Berikut cmlabs uraikan syarat yang harus diperhatikan agar tautan Anda crawlable.

Using the Proper < a > Tag

To reiterate what we have previously mentioned, you have to understand that Google can only detect and follow links if they use < a > tag with an href attribute. Other formats are not supported by Google, and therefore Google crawler cannot follow such links with unsupported formats. Another point to consider is Google cannot follow < a > links without an href tag or other tags that function as links because of problems related to their scripts. Here are some examples of followable or unfollowable links. href. Tautan dengan format lain tidak bisa diikuti oleh crawler Google. Google juga tidak dapat mengikuti tautan <a> tanpa tag href atau tag lain yang berfungsi sebagai tautan karena peristiwa skrip. Berikut ini contoh tautan yang dapat dan tidak bisa diikuti oleh Google.

Followable Links

  • <a href=”https://example.com”>
  • <a href=”/relative/path/file”>

Unfollowable Links

  • <a routerLink="some/path">
  • <span href="https://example.com">
  • <a onclick="goto('https://example.com')">

Linking Resolvable URLs

Aside from using the proper tag, you have to make sure that the URLs linked by < a > tag is a verified web address that Googlebot could send requests to. Pay attention to the following examples.

Resolvable URLs

  • https://example.com/stuff
  • /products
  • /products.php?id=123

Unresolvable URLs

  • javascript:goTo('products')
  • javascript:window.location.href='/products'

In conclusion, these are the two methods that you can use to make your links crawlable. 

icon-green-line

ROBOT.TXT

File used by crawler in a website page to find out which files to crawl or not to crawl.

icon-black-line

CRAWLER/SPIDER

A bot belonging to a search engine (for instance, Google) that searches or crawls the web pages so that all content can be indexed in the database.

How to Maximize Crawling for SEO

In order for your pages to show up in SERP or search results, you have to make sure that your pages have been crawled and indexed by Google beforehand. Pay close attention to the following points if you want to ensure that the crawling process goes smoothly. crawling, perhatikan poin-poin berikut.

Give Access Permissions to the Important Pages so That Robot.txt can Crawl Them

Robot.txt is a part of web pages which function to make the crawling process easier and faster. In order to do that, you just have to add robot.txt into the list of tools options in your site. You can then give permissions and restrictions to crawler on whichever web pages you want in a matter of seconds.

Pay Attention to the Redirect Code

In a website, it will be easier to avoid one or two redirect chains in all domains. However, several redirects packed together is a different matter altogether as it will be harder to avoid and deal with. As a result, your crawl limit will be affected and the whole crawling process won’t perform as effectively in indexing your pages.

3. Do not let HTTP Error Affect Crawl

In a middle of the crawling process, getting 404 and 410 page errors would be frustrating indeed for anyone trying to load their websites. That is exactly why you have to fix all of the 4xx and 5xx errors as soon as possible. Because not only will it increase your users’ experience, but it will definitely make the crawling process easier in indexing your web pages.

4. Use HTML

For the time being, it is said that a crawler’s performance got a little better at crawling JavaScript. Then again, there are still lots of search engines out there who have not yet used JavaScript. Because of that, you should keep using HTML whenever possible. HTML ketika melakukan proses crawl.

5. Taking Care of URL Parameter

You have to remember that separate URLs are considered as separate pages by a crawler. It will be better for you to let Google know of these URL parameters. If you’re asking why, it is because by doing that will make the crawling process more effective, and also you can avoid any concern about the possibility of duplicate content.

6. Update Your Sitemap

An updated sitemap will make it easier for bots to understand as well as identify where the internal links are headed. Also, it is important to always keep in mind that you have to upload the latest version of robot.txt aside from an updated sitemap.

7. Use Hreflang Tags

Hreflang tags are used by the spiders or crawlers of search engines to analyze the localized pages during the crawling process. These tags are usually located in your page’s header where the supported code language is “lang_code”.

Use the Crawling Guide for Your Website

Agency Website

Google has to be able to index an agency website so that it will appear in SERP whenever clients search for them.

E-Commerce Website

You have to make sure that your website will show up on Google search as it will make your potential customers aware about the products you are trying to sell.

Brand Website

If your brand website appears on Google search, it will have many benefits for you personally such as an increase in sales, raising awareness about the web itself and also improving your online branding.

Blog Website

A blog is the place where its writers share their thoughts and stories, however it can be said that a blog also functions as a wealth of information for those who seek them.

CMLABS BLOG

SEO untuk Kebutuhan dan Perkembangan Bisnis Online

Belajar HTML: Mengenal HTML Dasar Untuk Pemula

Mengenal Hingga Memulai Usaha Dengan Strategi Digital Marketing

18 Google SERP Features for First Page Content

For pricing, licensing, renewals, quotes, payments, and other related questions, please contact us at hello@cmlabs.co.
Share Now




Copied to Clipboard !