How long does it take to deindex low-quality or thin content published by accident? [Case study]

I had an e-commerce company reach out to me earlier in the year for help. They wanted to have an audit completed after making some important changes to their site.

As part of our initial communication, they prepared a bulleted list of changes that had been implemented so I would be aware of them before investigating the site. That list included any changes in rankings, traffic and indexation.

One of those bullets stood out: They had seen a big spike in indexation after the recent changes went live. Now, this is a site that had been impacted by major algorithm updates over the years, so the combined effects of big site changes (without SEO guidance) and a subsequent spike in indexation scared the living daylights out of me.

I checked Google Search Console (GSC), and this is what I saw: 6,560 pages indexed jumped to 16,215 in one week. That's an increase of about 147 percent.

It was clear that digging into this problem and identifying what happened would be a priority. My hope was that if mistakes had been pushed to production and the wrong pages were being indexed, I could surface those problems and fix them before any major damage was done.

I unleashed Screaming Frog and DeepCrawl on the site, using both Googlebot and Googlebot for Smartphones as the user-agents. I was eager to dig into the crawl data.

The problem: Mobile faceted navigation and a surge in thin content

First, the site is not responsive. Instead, it uses dynamic serving, which means different HTML and CSS can be delivered from the same URL based on user-agent.
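To make dynamic serving concrete, here is a minimal Python sketch. It is not the site's actual code: the `serve` function, the `MOBILE_TOKENS` list and the placeholder HTML are all assumptions for illustration, and real user-agent detection is more involved. The point is that one URL can return different markup per user-agent, which is why crawling as both Googlebot and Googlebot for Smartphones mattered here.

```python
from typing import Dict, Tuple

# Hypothetical, simplified mobile detection; real detection is more involved.
MOBILE_TOKENS = ("Mobile", "Android", "iPhone")


def serve(user_agent: str) -> Tuple[str, Dict[str, str]]:
    """Return (html, headers) for one URL, varying the HTML by user-agent."""
    is_mobile = any(token in user_agent for token in MOBILE_TOKENS)
    if is_mobile:
        html = "<html><!-- mobile HTML, including the faceted navigation --></html>"
    else:
        html = "<html><!-- desktop HTML --></html>"
    # The Vary header signals that the response depends on the user-agent.
    headers = {"Vary": "User-Agent"}
    return html, headers


desktop_bot = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
smartphone_bot = (
    "Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) "
    "AppleWebKit/537.36 (KHTML, like Gecko) Chrome/W.X.Y.Z Mobile Safari/537.36 "
    "(compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
)

desktop_html, desktop_headers = serve(desktop_bot)
mobile_html, mobile_headers = serve(smartphone_bot)
```

Because the two crawlers receive different HTML, links that exist only in the mobile markup are invisible to a desktop-only crawl.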

The recent changes were made to the mobile version of the site. After those changes were implemented, Googlebot was being driven to many thin URLs via a faceted navigation (only available on the mobile pages). Those thin URLs were clearly being indexed. At a time when Google's quality algorithms seem to be on overload, that's never a good thing.

The crawls I performed surfaced a number of pages based on the mobile faceted navigation, and many of them were horribly thin or blank. In addition, the HTML Improvements report (yes, that report many people totally ignore) listed a number of those thin URLs in the duplicate title tags report.

I dug into GSC while the crawls were running and started surfacing many of those problematic URLs. The report listed close to 4,000 thin URLs. That wasn't all of the problematic URLs, but you could see Google was finding them.

We clearly had a situation where technical SEO problems led to thin content. I've mentioned this problem many times while writing about major algorithm updates, and this was a great example of that happening. Now, it was time to collect as much data as possible, and then communicate the core problems to my client.

The fix

The first thing I explained was that the mobile-first index would be coming soon, and it would probably be best if the site were moved to a responsive design. Then my client could be confident that all of the pages contained the same content, structured data, directives and so on. They agreed with me, and that's the long-term goal for the site.

Second, and directly related to the problem I surfaced, I explained that they should either canonicalize, noindex or 404 all of the thin pages being linked to from the faceted navigation on mobile. As Googlebot crawls those pages again, it should pick up the changes and start dropping them from the index.
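The three options look roughly like this at the HTTP level. This is a minimal Python sketch, not the client's implementation: the `thin_page_response` function and the example URL are hypothetical. The noindex and canonical signals are sent here as the `X-Robots-Tag` and `Link` response headers; a meta robots tag or a `rel="canonical"` link element in the HTML would be the in-page equivalents.

```python
from typing import Dict, Optional, Tuple


def thin_page_response(
    option: str, canonical_url: Optional[str] = None
) -> Tuple[int, Dict[str, str]]:
    """Return (status code, extra headers) for a thin faceted URL.

    Hypothetical helper for illustration; option is "404", "noindex"
    or "canonical".
    """
    if option == "404":
        # The page is gone, so there is nothing left to index.
        return 404, {}
    if option == "noindex":
        # The page still loads for users but asks to be kept out of the index.
        return 200, {"X-Robots-Tag": "noindex"}
    if option == "canonical":
        if not canonical_url:
            raise ValueError("the canonical option needs a canonical_url")
        # The page points to the preferred URL that should be indexed instead.
        return 200, {"Link": f'<{canonical_url}>; rel="canonical"'}
    raise ValueError(f"unknown option: {option}")


status_404 = thin_page_response("404")
status_noindex = thin_page_response("noindex")
status_canonical = thin_page_response(
    "canonical", "https://example.com/category/shoes"  # assumed example URL
)
```

Whichever option is chosen, Googlebot only picks it up when it recrawls the URL.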

My client asked about blocking via robots.txt, and I explained that if the pages are blocked, then Googlebot will never see the noindex tag. That's a common question, and I know there's a lot of confusion about that.

It's only after those pages are removed from the index that they should be blocked via robots.txt (if you choose to go down that path). My client actually decided to 404 the pages, rolled out the changes, and then moved on to other important findings from the audit and crawl analysis.
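The robots.txt point can be checked with Python's standard `urllib.robotparser` module. This is a small sketch under assumptions: the `Disallow: /filter/` rule, the example URLs and the `can_see_noindex` helper are invented for illustration and are not taken from the client's site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks the faceted URLs (assumed path).
ROBOTS_TXT = """\
User-agent: *
Disallow: /filter/
"""


def can_see_noindex(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the crawler may fetch the URL.

    A crawler that cannot fetch a page cannot read a noindex on it.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


blocked_page = can_see_noindex(ROBOTS_TXT, "https://example.com/filter/color-red")
open_page = can_see_noindex(ROBOTS_TXT, "https://example.com/products/widget")
print(blocked_page, open_page)
```

With the rule in place, the faceted URL cannot be fetched, so any noindex on it goes unread. That is why the block should come after deindexing, not before.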

The question

And then my client asked an important question. It's one that many have asked after noindexing or removing low-quality or thin pages from their sites.

"How long will it take for Google to drop those pages from the index?"

[Read the full article on Search Engine Land.]
