It would be nice if you could easily noindex blog tag/topic pages and author pages.
There are 36 pages being indexed in google for our blog tags and authors. All 36 of these pages are thin content and showing up as duplicate content using siteliner.com which isn't helpding our visibility in search.
Our only recourse right now is to add /author/ and /topic/ to our robots.txt file. However, I doubt this fix is likely to remove them from Google's index permanently because there is a link to the author profile and topic page on every blog. Maybe editing the blog post template to unlink the author name and topic on blogs could help?
I think it would also be helpful to have an option that there be no author or tag page created at all as a few customers have requested the ability to remove them entirely.
Hi all! Thank you for your feedback here about wanting to no-index author and/or blog tag listing pages. Giving more granular control over which pages Google can/cannot index is something my team is reviewing. I have a few questions to help me better understand your needs/use case here:
1. Can you help me understand why the 36 blog tag/author pages being indexed is a negative (as opposed to a good thing with more pages being indexed/visible in search)? In my research, I have seen pros/cons for each so I would love to hear how you are thinking about it
2. Can you tell me more about how you are using blog author pages and category tags today such that there is significant overlap between authors/tags (enough that it is being surfaced as duplicate content)?
Hi Amanda, I don't know if you've received messages with answers about questions. However, my answer on the first question would be as following. 1). My goal is not to have as many pages indexed as possible. My goal is to have only qualitative pages indexed. Qualitative as in pages that people search for and therefor I want to have it indexed. Having pages indexed that do not attract search visitors only means that I'm spilling crawling budget and link juice. Particularly in our situation this is the case, since we have 3 types of listing templates (for events, news and blogs/whitepapers etc) which each create their own author pages. Not all authors attract visitors from search but they could be having an author page because they host an event, another author page because they wrote a blog and another because of a newsitem concerning their product/service. Preferably I would like to have just one author page (although having several listing pages), and decide per author page if it should be indexed or not. Please reach out if you have some questions about my answers. Best, Esther
This would be great. I didn't want to mess with the robots.txt file. Would be great if we could edit author, tag and other system pages, and be able to add the "noindex" in the HTML section just like we can do with the regular webpages and blog pages.
We also should have the ability to remove/unlink those pages from the blog posts. Otherwise, they are still being accessed anyways.
Waiting to see when Hubspot will make this possible! Over 2,5 years since this topic was created.
Do you know if HubSpot has already worked on this topic? I also don't want to touch the robots.txt file, but I would really like to have all author and tags pages as noindex. However, I cannot find any place where to add this tag, as we can do with the blog pages.
Yes, we are also missing this feature A LOT. With four languages. Ten different authors, and using the blog module for also resources and cases - which all have tags and authors - we have about 200 pages that are indexed but with duplicate and thin content 😥