Google’s Martin Splitt posted a video in his SEO Made Easy series on the topic of the Google Search Console “Discovered – Currently Not Indexed” page indexing report status notice. In short, there are three primary reasons you’d see pages in this category:
(1) Quality issues with those pages
(2) Your server is slow for Googlebot
(3) Google just needs more time to index those pages (may be related to #2 above).
On the quality issue, Martin Splitt said, “When Google Search notices a pattern of low quality or thin content on pages, they might be removed from the index and might stay in discovered.”
“Googlebot knows about these pages but is choosing not to proceed with them,” because they are not high quality enough, he explained. He added, “If Google Search detects a pattern in URLs with low-quality content on your site, it might skip these URLs altogether, leaving them in discovered as well.”
What can you do? “If you care about these pages you might want to rework the content to be of higher quality and make sure your internal linking relates this content to other parts of your existing content,” he said. So look at the content and improve it, but also see which pages that are already indexed can link to that content.
To be clear, Google’s help documentation for “Discovered – currently not indexed” only really mentions server issues. It reads:
The page was found by Google, but not crawled yet. Typically, Google wanted to crawl the URL but this was expected to overload the site; therefore Google rescheduled the crawl. This is why the last crawl date is empty on the report.
But as we covered back in 2018, we know it is also about quality issues. So this is not new, but it is good to have a video on this.
Here is the video:
Here is a screenshot of the page indexing report with the “Discovered – Currently Not Indexed” status for this site:
Here is the transcript:
Google Video On Discovered – Currently Not Indexed
Today, we will dive into Google Search Console’s “Discovered – currently not indexed” status in the page indexing report.
When using Google Search Console, and you should use it, you probably went into the page indexing report and perhaps saw these kinds of reasons for pages not being indexed. One of the most common questions we're getting about this is the “Discovered – currently not indexed” status. Let's see what it means and what you could do about it.
First of all, Google will almost never index all content from a site. This isn't an error and not even necessarily a problem that needs looking into. It is a note on the status of the pages mentioned there. To understand what this means we need to look at how a page proceeds through the systems and processes that make up Google Search.
At the very beginning, Googlebot finds a URL somewhere; that can be a sitemap or a link, for example. Googlebot has now discovered that this URL exists. Googlebot basically puts it into a to-do list of URLs to visit and possibly index later on. In an ideal world, Googlebot would immediately get to work on this URL, but as you probably know from your own to-do list, that's not always possible. And that's the first reason why you might see this in Google Search Console. Googlebot simply didn't get around to crawling the URL yet as it was busy with other URLs. So sometimes it's just a matter of a bit more patience on your end to get this result. Eventually Googlebot might get around to crawling it. That's the moment when it fetches the page from your server and processes it further to potentially index it. Once it gets crawled, the URL either moves on to “Crawled – currently not indexed” or the page gets indexed.
But what if it doesn’t get crawled and stays in “Discovered – currently not indexed”? Well, that usually has to do either with your server or with your website’s quality.
Let’s look at potential technical reasons first. Say you have a webshop and just added 1,000 new products. Googlebot discovers all these products at the same time and would like to crawl them. In previous crawls, however, it has noticed that your server gets really slow or even overwhelmed when it tries to crawl more than 10 products at the same time. It wants to avoid overwhelming your server, so if it decides to crawl, it might do so over a longer period of time, say 10 products at a time over a few hours, rather than all the thousand products within the same hour. That means that not all 1,000 products get crawled at the same time. Googlebot will take longer to get around to these products then.
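To make the arithmetic in that webshop example concrete, here is a minimal Python sketch. It is not Googlebot's actual scheduling logic, which is not described in the video. The function name crawl_schedule and the figure of two minutes per batch are assumptions chosen only for illustration; the 1,000 products and the limit of 10 at a time come from the example above.

```python
import math
from typing import Tuple


def crawl_schedule(total_urls: int, batch_size: int,
                   seconds_per_batch: float) -> Tuple[int, float]:
    """Return (number of batches, total hours) for a throttled crawl.

    Hypothetical helper: it only models "N URLs at a time", nothing more.
    """
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    batches = math.ceil(total_urls / batch_size)
    hours = batches * seconds_per_batch / 3600
    return batches, hours


# 1,000 new products, 10 at a time (from the example above),
# with an ASSUMED two minutes between batches.
batches, hours = crawl_schedule(total_urls=1000, batch_size=10,
                                seconds_per_batch=120)
print(f"{batches} batches, about {hours:.1f} hours")
# prints "100 batches, about 3.3 hours"
```

Under these assumed numbers the crawl stretches over a few hours instead of finishing within one, which is why some of the new URLs would sit in the discovered state for a while.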
It makes sense to look at the crawl stats report and the response section in there to see if your server responds slowly or with HTTP 500 errors when Googlebot tries to crawl. Note that this usually only matters for sites with very large amounts of pages, say millions or more, but server issues can happen with smaller sites too. It makes sense to check with your hosting company what to do to fix these performance issues if they arise.
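Alongside the crawl stats report, a similar check can be run against your own server logs. The Python sketch below counts slow and 5xx responses served to requests whose user agent mentions Googlebot. It assumes a combined-style access log with the request time in seconds appended as the last field; your server's log format may differ. The function name summarize_googlebot, the one-second threshold, and the sample lines are all made up for illustration. The user agent string can be spoofed, so treat the counts as a rough signal.

```python
import re

# Assumed log layout: combined log format plus request time (seconds) at the end.
LOG_PATTERN = re.compile(
    r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*" '  # request line
    r'(?P<status>\d{3}) \S+ '                     # status code and byte count
    r'"[^"]*" "(?P<agent>[^"]*)" '                # referrer and user agent
    r'(?P<seconds>\d+(?:\.\d+)?)$'                # request time (assumed field)
)


def summarize_googlebot(lines, slow_threshold=1.0):
    """Count Googlebot requests, 5xx responses, and slow responses."""
    total = errors = slow = 0
    for line in lines:
        match = LOG_PATTERN.search(line.strip())
        if not match or "Googlebot" not in match["agent"]:
            continue  # skip unparseable lines and other user agents
        total += 1
        if match["status"].startswith("5"):
            errors += 1
        if float(match["seconds"]) > slow_threshold:
            slow += 1
    return {"requests": total, "server_errors": errors, "slow": slow}


# Made-up sample lines, for illustration only.
GOOGLEBOT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
sample = [
    f'66.249.66.1 - - [10/Jan/2024:10:00:00 +0000] "GET /product/1 HTTP/1.1" 200 5120 "-" "{GOOGLEBOT}" 0.210',
    f'66.249.66.1 - - [10/Jan/2024:10:00:01 +0000] "GET /product/2 HTTP/1.1" 500 312 "-" "{GOOGLEBOT}" 0.050',
    f'66.249.66.1 - - [10/Jan/2024:10:00:02 +0000] "GET /product/3 HTTP/1.1" 200 5090 "-" "{GOOGLEBOT}" 2.500',
    '203.0.113.7 - - [10/Jan/2024:10:00:03 +0000] "GET /product/4 HTTP/1.1" 200 5100 "-" "Mozilla/5.0 (Windows NT 10.0)" 3.000',
]
print(summarize_googlebot(sample))
# prints {'requests': 3, 'server_errors': 1, 'slow': 1}
```

If the share of slow or 5xx responses is noticeable, that is the kind of finding worth raising with your hosting company.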
The other, far more common reason for pages staying in “Discovered – currently not indexed” is quality, though. When Google Search notices a pattern of low-quality or thin content on pages, they might be removed from the index and might stay in discovered. Googlebot knows about these pages but is choosing not to proceed with them. If Google Search detects a pattern in URLs with low-quality content on your site, it might skip these URLs altogether, leaving them in discovered as well.
If you care about these pages you might want to rework the content to be of higher quality and make sure your internal linking relates this content to other parts of your existing content. See our episode on internal linking for more information on this.
So in summary, some sites will have some pages that won't get indexed and that's usually fine. If you think a page should be indexed then you should consider checking the quality of the content on those pages that stay in “Discovered – currently not indexed.” Make sure, as well, that your server isn't giving Googlebot signals that it’s overwhelmed when it's crawling.
Forum discussion at X.