News 11 min read July 27, 2017

Rank Your Page For a Whole Group Instead Of a Single Keyword With Our New Keyword Clustering Tool

Elena K.
Editorial Head at Serpstat
Hope you had enough time to read our previous post about Keyword Clustering tool. As I promised, here is the second part devoted to Text Analytics tool.

As these tools are interlinked and you cannot launch text analysis without keyword clustering, read the previous article if you haven't done so already.

What is Text Analytics tool and why do you need it?

Text Analytics is a tool designed to help you to optimize the text on your page to rank for the whole clusters instead of keywords. It provides you with a list of recommended keywords to use in Title, H1, and Body based on top-10 search results of all keywords.

No secret that content is one the crucial ranking factors. Which means that the websites with well-optimized texts rank higher. If the target URL doesn't rank high for the desired keywords, rewriting the page's content to enhance its relevance for these keywords makes sense.

Advantages of our algorithm

Our goal is to provide you with the most accurate data and recommendations, thus we paid particular attention to the accuracy of our algorithm. And here is how it works:
As I've already said, our recommendations are based on top-10 search results as your most successful competitors. To avoid making recommendations based on irrelevant subjects, we divide your competitors' pages into semantic groups. Let's take the keyword "pool" e.x. and here is how it's SERP looks like:
As you see, Google offers both "swimming pool" and "cue sport" pages. We divide these results into 2 groups and make recommendations based on the URL you added. So, if your page is devoted to outdoor swimming pools, we'll exclude pool game results from the competitor analysis. If you don't add the target URL, we'll take the topical group with the biggest part in SERP. BTW, here it'll be pool as a game as it's 5 against 2. Thus, mind adding your target URL in case the topic may have several meanings.

The same works with commercial and informational queries. If you Google "macbook," here's what you'll get:
Book design is the art of incorporating the content, style, format, design, and sequence of the various components of a book into a coherent whole. In the words of Jan Tschichold, "methods and rules upon which it is impossible to improve, have been developed over centuries. To produce perfect books, these rules have to be brought back to life and applied."
Front matter, or preliminaries, is the first section of a book, and is usually the smallest section in terms of the number of pages. Each page is counted, but no folio or page number is expressed, or printed, on either display pages or blank pages.

How our keyword clustering tool works?

Unlike many competitors' solutions, Serpstat employs intelligent hierarchical clustering where clusters are combined in a supercluster. This being said, no preliminary data collecting like keyword search volumes required, you only need to upload a list of keywords and choose the region and clustering parameters.

Advantages of our method:
We don't set a cluster center;
Serpstat considers all keywords analyzing their connections in SERPs;
We use intelligent hierarchical clustering: the keywords are grouped into clusters, clusters are merged into higher-level groups called superclusters, and finally, superclusters are combined into even higher-level groups called protoclusters;
Serpstat checks the connection strength of all analyzed keywords according to the settings you chose.
Connection strength — is the number of identical URLs in keywords' top-30 search results. Thus the highest number of mutual results is 30 URLs.

What keyword clustering methods Serpstat provides?

In fact, there are only two of them: Linkage strength and Type of

1) There are two types of linkage strengthWeak and Strong.

By setting "Weak" parameter the keywords with several mutual URLs in Top-30 search results will be combined into the cluster.

While "Strong" sets more URLs in common as a condition for keywords merging into a single cluster.

2) There are two types of grouping to choose from: Soft and Hard

"Soft" parameter tells the system that a cluster can be created if at least one pair of keywords has less or more common URLs in Top-30 search results (depending on the previous Weak/Strong choice).

Hard one requires all keywords in a cluster to have less or more common URLs in Top-30 search results (the requirement for the number of common keywords is defined in the previous step where you selected Weak or Strong clustering).

The resulting clusters contain synonymous keywords with a high semantic similarity. At the same time, this clustering method produces lots of clusters as the keywords can be merged into a cluster only if they are closely related.
After clustering is finished, some keywords can fall into the Unsorted directory. These are keywords that have no semantic similarity to the topic of the analyzed keyword set and should be removed from the dataset.

An alternative solution here is to create separate pages for these keywords or move them to one of the created clusters if you believe they belong there.

What clustering method to choose?

Weak+Soft? Weak+Hard? Strong+Soft? Or maybe Strong+Hard?

The default is Weak+Soft. But you can choose any pairing according to your needs. The decision should be based on the semantic similarity of the objects from your dataset.

If the keywords are initially closely related, for example, sneakers of different brands, you should choose Strong+Hard or Strong+Soft so that only the closest synonyms are combined into a cluster. As a result, you'll get lots of clusters to use for separate pages or specific categories.

In the case of various products and services, for example, you're collecting keywords for a multi-product store or medical center with a full range of health-care services, it's worth selecting Weak+Soft. The choice of Strong+Soft will produce more clusters and a possibility to get more topic-specific clusters.
To learn more on how our clustering tool works watch this video →

What about the price?

No additional payments, Keyword clustering tool is included to your tariff plan price:

  • Users with Plan A have no access to keyword clustering tool.
  • Plan B — 4000 keywords.
  • Plan С — 12 000 keywords.
  • Plan D — 25 000 keywords.
Note: As Keyword clustering is currently available as a Beta version, you can upload no more than 2 000 keywords within one project.

How to use Keyword Clustering tool?

Launching the keyword clustering

1. Go to the "Tools" section and open Grouping and Text Analytics tool.
2. Click on "Create a project" button.

3. Enter the project name and domain (if you want to run a text analytics for a particular domain).

4. Paste the list of keywords or import them from csv or txt files.

5. Then choose search engine and region.

6. Finally, choose linkage strength and type of grouping and click "Finish."

Make a cup of coffee and sit back, grouping takes some time :)
7. After a while, you'll get something like this:
Where 3 stands for cluster, which includes the keywords on the right,
2 — supercluster, and 1 — protocluster.
Protocluster — is a set of superclusters. Generally, protocluster is made up of superclusters related to a specific category of objects.

Supercluster — is a set of clusters. It combines keywords with a high semantic similarity score, but slightly less similar than keywords in a cluster.

Analyzing the report:

On the right you'll see all your keywords grouped into clusters with some additional info:
1. Every keyword has its connection strength. It shows how related to the cluster's subject this keyword is (from 0 to 1).

2. Homogeneity shows the strength of connection between keywords.

3. If you add a domain while creating a project, we'll display the page which suits the cluster's subject the most. If you don't specify the domain, you can add URL manually by clicking Add URL.

Let's pass to some additional options, click on drop-down menu:

Click on "Search keywords" to find the desired keyword within the cluster.

2. To delete the cluster click on "Delete subgroup" button.

3. If you want to delete some keywords from the cluster, tick them and click on "Delete keywords."

4. By clicking on "Toggle metatop" you'll see the list of major competitors in SERP for keywords from this cluster. The higher a page's rank in the meta-top, the more relevant it is to the cluster's topic.

That's finally it ;)
P.S. Our math analyst is already working on more detailed keyword clustering tool overview. Don't hesitate to ask, if you have any questions. We'll do our best to answer all of them ;)

Recommended articles

Subscribe to our newsletter
Keep up to date with our latest news, events and blog posts!


Sign In Free Sign Up

You’ve reached your query limit.

Or email
Forgot password?
Or email
By clicking 'Sign Up Free' I agree to Serptat's
Terms of Service and Privacy Policy
Back To Login

Don’t worry! Just fill in your email and we’ll send over your password.

Are you sure?
Please pick the project to work on

Personal demonstration

Serpstat is all about saving time, and we want to save yours! One of our specialists will contact you and discuss options going forward. These may include a personal demonstration, a trial period, comprehensive training articles & webinar recordings, and custom adivce from a Serpstat specialist. It is our goal to make you feel comfortable while using Serpstat.

Skype or Google Hangouts

Upgrade your plan

Sign Up Free

Спасибо, мы с вами свяжемся в ближайшее время


You have run out of limits

You have reached the limit for the number of created projects. You cannot create new projects unless you increase the limits or delete existing projects.

I want more limits

Christmas is a time for miracles.

You are almost on the finish line of our Christmas quest. The last brick of your lego-promocode is left on the way up 55% discount.

Did not find previous lego-bricks? Fill the form anyway.

Skype or Google Hangouts
Write your discount code