Prevent search engines indexing your Help Center using Cloudfront | Community
Skip to main content
Answered

Prevent search engines indexing your Help Center using Cloudfront

  • July 22, 2024
  • 6 replies
  • 457 views

I am interested in implementing SEO directives for custom.intercom.help while using cloudfront distribution. Is there a way to configure robots.txt, set X-Robots-Tag headers, or provide APIs to control content behavior? 

Best answer by mateusz.leszkiewicz

Hi there, It’s Mat from the Support Engineering Team 😀

Currently we are not supporting  robot.txt file modification in the Help Center.

6 replies

Dianah Tripac
Intercom Team
Forum|alt.badge.img+3
  • Leading the Community @ Intercom
  • July 22, 2024

Looping in @Nathan Sudds @Milan @Joseph S.  in that in case they have some feedback to share 🙌


mateusz.leszkiewicz
Intercom Team
Forum|alt.badge.img+7

Hi there, It’s Mat from the Support Engineering Team 😀

Currently we are not supporting  robot.txt file modification in the Help Center.


  • New Participant
  • October 10, 2024

How can we update https://help.taskade.com/robots.txt as it is preventing Google from crawling our support articles for https://taskade.com/ despite turning the crawling on per https://www.intercom.com/help/en/articles/5145271-prevent-search-engines-indexing-your-help-center 

 

Dianah Tripac
Intercom Team
Forum|alt.badge.img+3
  • Leading the Community @ Intercom
  • October 10, 2024

cc @mateusz.leszkiewicz 


mateusz.leszkiewicz
Intercom Team
Forum|alt.badge.img+7

@John Xie currently it is not possible to update robot.txt for the Help Center.

 

All Google - or any search engine - needs is to encounter a link to any page in your Help Center.

Most teams link to their Help Center from their marketing site. Google can index your whole Help Center starting with that link.

While you can submit your URL to Google Search Console directly, you don’t need to. — Google regularly crawls the web to add new sites to its index.

All our pages include metadata tags that search engines use to index content:

image+%2832%29.png?expires=1728999000&signature=10513295ceb5ce26cc5d81b171dda2c2f16fb72b687b242515f38fd3abb37824&req=dSAmEct7mIZfUPMW1HO4zauIbms2Wh%2BFvRiFZi68ZJcU%2F7pyK7pQOaD67pSc%0A683g%0A

Our pages also include OpenGraph metadata. This is useful when sharing links on social media (X (formerly Twitter) / Facebook / Slack etc.). Search engines probably don't care, but worth mentioning.

We use the canonical link metadata tag. So if you include `utm_source` or other query string parameters in your URL, or if you have a custom domain and someone accesses your intercom.help Help Center, the search engine will know what the canonical URL is for this content:

image+%281%29+%281%29.png?expires=1728999000&signature=8c0d2e884d0569e22458c7cf92f5fa232faed56af910e85db88c6a20f0a284aa&req=dSAmEct7mIlYWfMW1HO4zRp4xd%2B6A4motKbN%2FMp6HC9WAa5blzlMjzJveOF%2F%0ABBlc%0A

If you are having issues with google not indexing your Help Center and you have a custom domain, ensure you have HTTPS set up correctly. You can find the troubleshooting steps here.


  • New Participant
  • May 5, 2026

If you’re serving robots.txt through Cloudfront, I usually set a dedicated behavior for that path and attach a simple Lambda@Edge to force the noindex rules. For pages that need stronger control, adding an X-Robots-Tag header in the origin response works well. While tightening things up, I’ve used a Quora backlinks service to build authority for public articles, keeping the private help stuff hidden but still boosting the rest.