Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Exclude unpublished articles from Google search #769

Open
LeMurphant opened this issue Sep 27, 2024 · 1 comment
Open

Exclude unpublished articles from Google search #769

LeMurphant opened this issue Sep 27, 2024 · 1 comment

Comments

@LeMurphant
Copy link
Collaborator

Currently, pages that are in-progress are indexed by Google, and we don't want that. See for context
https://discord.com/channels/677546901339504640/1088468403406258196/1287127033503154186

One option is that pages in "Live" or "Unlisted" should be indexed in the robots.txt. I'm not sure how this would interact with caching on google.

Another option is that we simply make these pages not be rendered, with e.g. a 404

@Aprillion
Copy link
Collaborator

if I understand the black magic, pages listed in robots.txt are still indexed for search, just not visited by the crawler so the only info it gets is from the URL (and from the links from other pages to this page if those other pages are allowed to follow) - since we have the question title in the URL, I don't think it would completely remove those questions from search results by using robots.txt, it would remove snippets from the page

using X-Robots-Tag: noindex HTTP header instead might have higher chance of successfully removing them from search index, and the page should get re-crawled once in a while, so then it goes live and we remove the header, it should appear in search index again sooner or later :blobmaybe: https://developers.google.com/search/docs/crawling-indexing/robots-meta-tag

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

2 participants