Recovering from Google's Helpful Content Update: A 2026 Playbook
In March 2024, Google folded the Helpful Content classifier into the core ranking system. Most of the recovery advice you'll find online was written before that change — and almost none of it accounts for what we learned from the May 2024 API leak. This is the playbook I use now, with the clients I'm actively working with, post-2024.
Three things changed in 2024 — and they changed everything
- HCU is no longer a separate update. Google announced in March 2024 that the helpful content classifier was rolled into the core ranking system. There is no "next HCU update" to wait for. The classifier runs continuously, on every crawl.
- The May 2024 API leak gave us names. Attributes like
siteAuthority,siteFocusScore,hostAge, and acontentEffortscore appeared in the leaked documentation — and they map closely to what HCU appears to measure. Google has not confirmed how these attributes are weighted, but their existence is now public record. - The 2024 spam policy expansion. Alongside the March 2024 core update, Google added explicit policies for scaled content abuse, site reputation abuse, and expired domain abuse. Recovery isn't only about HCU anymore — it's about the broader quality framework these policies represent.
If you're reading recovery advice written before April 2024, throw most of it out. The mechanics changed.
Were you actually hit by HCU?
Most "I think I got hit" reports are misdiagnosed. Before you start a recovery, confirm what you're dealing with. Several other failures look very similar to HCU and require completely different fixes.
Open Google Search Console. In the Performance report, set the date range to "Last 16 months" and look at the daily clicks chart.
The HCU signature looks like this:
- The drop is sharp — usually within 1–3 days, not gradual
- The drop coincides with a confirmed core update date (the relevant ones for HCU-class signals: March 5 2024, August 15 2024, December 12 2024, and any subsequent core updates)
- The drop is site-wide — most or all pages affected, not a few sections
- Impressions drop too, not just clicks (rules out a CTR or SERP-feature issue)
- No manual action notification in Search Console
- Recovery has not happened automatically over time

What looks like HCU but isn't:
- Gradual decline over 4–6 weeks: usually a technical issue — accidental
noindex, robots.txt mistake, broken canonical, or migration aftermath - Drop affecting only specific URL patterns: probably an algorithm targeting a specific content type (e.g., the parasite SEO sweep against rented subfolders)
- Drop with a manual action message in GSC: not algorithmic — fix the manual action first, recovery is different
- Drop only on mobile or only on desktop: probably Core Web Vitals, mobile usability, or rendering
- Drop the day after a site migration: the migration broke something — check redirects, internal links, canonicals before assuming HCU
Cross-reference your drop date with public update trackers (Search Engine Roundtable, Semrush Sensor, Sistrix Visibility Index for your domain). If the dates align with a confirmed core update and the pattern matches above, you're dealing with HCU-class signals. Continue.
What the HCU classifier appears to measure
The plain-English version first. Google looks at every page on your site and decides whether it's helpful or filler. Then Google looks at the whole site and asks: "are most of these pages helpful, or mostly filler?" A site that's mostly filler gets pushed down everywhere — even on its good pages. So recovery means doing two things at the same time: cutting the filler, and making the good content genuinely better. Now the details on how this actually appears to work.
I'll say "appears to measure" throughout this section. Google does not publish the algorithm. What we have:
- Google's official guidance on creating helpful, reliable, people-first content
- Public statements from John Mueller, Danny Sullivan, and the Google Search Central team
- The May 2024 leaked API documentation revealing roughly 14,000 ranking attributes
- Observed recovery patterns across sites that did and didn't bounce back
Cross-referencing those, HCU appears to be a site-level multiplier built from per-page signals. The per-page signals correlate with:
- First-hand experience markers. Original screenshots, original data, specific anecdotes, named products / places / people, photographs you took yourself. Pages that read like the writer has actually done the thing.
- Information gain. Content that adds something the top 10 ranking pages don't have. A page that just summarises what's already ranking adds zero gain.
- Author signals. Real author with verifiable credentials,
Personschema,sameAslinks to authoritative external profiles (LinkedIn, conference talks, recognisable bylines elsewhere). - Effort indicators. Custom illustrations, original research, structured data quality, internal linking depth. The leaked
contentEffortattribute appears to score this. - User satisfaction proxies. Click-through rate from the SERP, dwell time, return-to-SERP behaviour. Google has consistently denied these are direct ranking signals; the leak suggests they're at least observed.
- Site focus. A site that covers many unrelated topics scores worse than a topically-focused site. The leaked
siteFocusScoreattribute appears to measure this. A site about SEO that publishes 200 articles on cooking will fight an uphill battle.
The site-level multiplier is the part that surprises people. Think of a restaurant: if 8 out of 10 dishes are mediocre, the place gets a bad overall rating — even if 2 dishes are excellent. Google appears to work the same way for websites. It judges the whole based on the proportion of pages worth visiting, then applies that single judgement to every page on the site. Five excellent pages can't lift the score if there are 500 mediocre ones around them.
This is why recovery requires aggressive cleanup, not just polishing your best work. You have to fix the proportion.
The recovery playbook
Set realistic expectations first. "90-day playbook" means 90 days of work. After the work, recovery is a function of how often Google recrawls and re-evaluates your site. For most sites that means 1–3 additional months before meaningful movement, sometimes longer. Anyone promising 30-day recovery either doesn't understand how the classifier works or is lying.
Phase 1: Diagnose (Week 1–2)
Don't guess at what's wrong. Build the evidence first.
1. Pull a complete URL inventory. Run Screaming Frog on the site. The free version handles up to 500 URLs; if you have more, the licence pays for itself on the first audit. Export the full crawl with HTTP status, title, meta description, word count, indexability status, and inlinks.
2. Pull GSC data for the last 16 months. Search Console → Performance → Pages tab. Sort by clicks descending. Export. Then sort by impressions descending and export again. The first list shows what's working; the second shows what Google still sees but isn't sending traffic to.
3. Identify the "zombie" pages. Pages with high impressions but near-zero clicks are the ones Google indexes but has lost confidence in. These are usually your strongest HCU recovery candidates — Google still thinks they're relevant but doesn't think users will be helped.
4. Identify the thin pages. From the Screaming Frog export, filter to pages under 500 words. In most niches, thin pages need to be either improved or removed. Don't auto-delete by word count alone — some short pages (definitions, calculators, tool pages) are appropriately concise. Use word count as a flag for review, not a delete trigger.
5. Categorise every URL. Build a spreadsheet with these columns: URL · Current monthly clicks · Current monthly impressions · Word count · Has unique value? (Y/N) · Has first-hand expertise? (Y/N) · Decision (Keep / Improve / Remove). Every URL gets a row. Yes, every one.
The spreadsheet is the recovery roadmap. Without it, you're guessing.
Phase 2: Cut (Week 3–4)
This is where most sites get it wrong. Either they panic-delete half the site (destroying link equity) or they refuse to delete anything (leaving the multiplier suppressed).
For pages in the Remove bucket:
- If the page has no backlinks and no organic traffic: 410 Gone (preferred over 404; signals "intentionally removed" to Google)
- If the page has backlinks but is genuinely not worth keeping: 301 redirect to the most relevant alternative page (not the homepage — too vague)
- If you're unsure:
noindexfirst, monitor for 6 weeks, then decide.noindexis reversible; deletion is not.
Don't:
- Redirect every removed page to a single hub — Google treats mass redirects to one URL as soft 404s
- Delete pages without exporting their backlinks first (use
link:in Ahrefs / Semrush) - Remove pages with no clicks but high-value impression queries (rare, but verify before cutting)
Consolidate cannibalising pages. If three pages target slight keyword variations of the same topic, merge them. Pick the strongest URL (most backlinks, oldest, best title), redirect the others to it, and combine the best content into one comprehensive page.
Expect to remove or noindex 20–50% of indexed URLs. That sounds aggressive. It is. The multiplier doesn't ease up unless the unhelpful proportion of the site comes down meaningfully.
Phase 3: Rebuild (Week 5–10)
Now the surviving pages need to actually be helpful. "Improving" content is not adding more words. It's adding things only a real practitioner would know.
For each page in the Improve bucket, work through this list:
- Lead with the answer. Most thin pages bury the useful content under three paragraphs of throat-clearing. Cut to the answer in the first 50 words.
- Add at least one piece of first-hand evidence. A screenshot from your own work. A specific number from a real engagement. A named example. An anecdote that begins "When we worked with [project], we found…"
- Cite primary sources. Link the original Google blog post, the original study, the original patent. Not a third-party article that re-cited it. This alone is a real differentiator — most thin pages don't cite anything.
- Answer the People Also Ask questions. Use the queries Google itself surfaces for your target. Add an FAQ section with proper
FAQPageschema if the questions are real. Don't fake them — misleading structured data is its own penalty risk. - Add custom visuals. A diagram you made. An annotated screenshot. A chart from your own data. Stock photos add zero quality signal.
- Add author signals. Real byline with photo. Real bio. Real
Personschema withsameAslinks. Real credentials.
Pace: 3–5 pages per week. One genuinely excellent page is worth more than ten that have been padded with filler.
Phase 4: Wait and monitor (Week 10+)
Here's the part nobody wants to hear: HCU recovery happens on Google's clock, not yours. The classifier needs to recrawl your changed pages, re-evaluate the site-level signal, and propagate the new evaluation. That takes weeks. Sometimes it takes a core update.
What to watch in Search Console, weekly:
- Impressions trend — stabilising or slowly increasing?
- Average position — moving up for your priority queries?
- Indexed pages count — dropping (good, your removals are processing) or staying flat (bad, Google hasn't recrawled yet)
- Last crawled date on key pages — use the URL inspection tool
If 8–12 weeks in you're seeing no positive movement, your cleanup wasn't aggressive enough. Go back to phase 2 and cut more. The multiplier doesn't budge until the unhelpful proportion crosses a threshold — and we don't know exactly where that threshold sits.
What we still don't know
The HCU classifier is not fully understood by anyone outside Google. Things I don't know, and won't pretend to:
- The exact threshold of "unhelpful content" that flips the site-level multiplier
- Whether the per-page evaluation considers user signals (CTR, dwell) directly, or via a proxy
- How long the negative signal persists after content is genuinely fixed
- Whether the leaked attributes (
siteFocusScore,contentEffort) are weighted as heavily as their names suggest, or are deprecated artefacts - How HCU interacts with the new spam policies (site reputation abuse, scaled content abuse) — it's possible they share evaluation infrastructure
If anyone tells you they have certainty on these, they're either at Google or guessing. The honest answer is: we work with the framework above, observe outcomes, and adjust.
What NOT to do (every one of these makes recovery harder)
- Don't panic-delete the whole site. You'll destroy years of accumulated authority and start from zero on a domain Google already partially trusts.
- Don't pad thin pages with filler to hit a word count. The classifier appears to evaluate quality and originality, not length. Padding makes it worse, not better.
- Don't migrate to a new domain. The site-level signal stays with the content, not the domain. You'll inherit the problem on the new site, plus lose your link equity. (One exception: if the old domain has a separate spam history. Different problem.)
- Don't hire anyone promising 30-day recovery. Either they don't understand HCU or they're lying.
- Don't mass-rewrite flagged content with AI. If scaled AI content got you classified, more AI content won't unclassify you. Real human expertise is the only thing the classifier appears to reward.
- Don't only fix the top pages. The site-level multiplier means the long tail of mediocre pages is suppressing your good ones. Address proportions, not just stars.
Frequently asked questions
How long does HCU recovery actually take?
From start of work to meaningful recovery is typically 3–9 months for most sites. The 90-day playbook is the work phase. After that, you're waiting for Google to recrawl and re-evaluate. Some sites bounce back inside the next core update; some take two or three core updates. There's no fixed answer, and anyone selling you a fixed answer is selling you something.
If HCU is now part of the core algorithm, can I still recover?
Yes — and arguably more easily than before. When HCU was a separate update, recovery required waiting for the next HCU refresh, sometimes 6–12 months apart. Now that the classifier runs continuously inside the core system, a meaningfully improved site can be re-evaluated as soon as Google recrawls it. The work is still hard. The waiting is shorter.
Should I use AI to write content during recovery?
You can use AI to assist with outlines, first drafts, schema generation, and editing. You should not publish raw AI output. The HCU classifier appears to reward first-hand experience, originality, and effort — none of which are present in unedited AI text. Use AI as a research assistant, not as a writer.
How do I tell HCU apart from a manual action?
Manual actions show up explicitly in Google Search Console under Security & Manual Actions. If there's a notification there, you're dealing with a manual action — different problem, different fix (file a reconsideration request after fixing the issue). HCU is algorithmic, never has a notification, and never gets a reconsideration request — only the algorithm itself can lift the suppression.
Does removing pages hurt my SEO?
Removing pages that have no traffic and no backlinks rarely hurts. Removing pages that rank well or have backlinks does hurt — that's why the diagnostic phase exports backlinks first. The decision is per-URL, not blanket. When in doubt, noindex first and monitor for six weeks before deleting.
Is the 2024 Google API leak reliable enough to act on?
The leak revealed attribute names and structures. It did not reveal weights, formulas, or how the attributes combine into ranking decisions. Treat it as confirmation that signals like siteFocusScore and contentEffort exist as observable quantities — not as a recipe for ranking. Use it to inform direction, not to chase exact metrics.
If you've worked through this and you're stuck
Recovery work is unforgiving when you're too close to your own content. A second pair of eyes — someone who didn't make the original choices — finds problems faster.
I take on a small number of HCU recovery engagements at any time. If your traffic dropped on a confirmed core update date, you've worked through this playbook, and you're not seeing movement after 90+ days, the issue is usually one of three things and I can usually identify it inside an hour.
Need a second opinion on your recovery?
I'll review your GSC data and your top affected pages, and tell you specifically what I'd cut, fix, or rebuild. No slides, no pitch.
Book a 30-minute review →