We setup a regular sync between our IC instance and an external website for industry-specific content/context. Filtered URL's to be specific to the pages we wanted, but it still resulted in nearly 25k+ pages being ingested; many of them were repeats/duplicates.
Since then, we've attempted to delete/remove the source from the Knowledge center pages with no success. It appears to still be going through regular syncs of new content and never fully removes it. When asking Fin, they suggested it would require API intervention. Requesting assistance to have it fully removed.
Best answer by Tommy Mains
Hi, @Caden!
Since the recurring syncs are gathering up all of your webpages, I recommend disabling those webpages for your AI agent.
You can do this pretty quickly and with multiple webpages at a time. Here’s how:
Go to the webpage source: Knowledge > Sources > Websites > Your site (same as the image you shared).
Click the check box on the far-left end of the webpage’s row. You can do this to as many webpages as you want or go one at a time.
Click the Change AI Agent state button, which will appear after selecting one or more rows.
Click Disable for AI Agent from the dropdown.
Repeat for Copilot state if desired.
I’ve done this process for several older blogs before our web team could get to them to apply updates or remove them.
We setup a regular sync between our IC instance and an external website for industry-specific content/context. Filtered URL's to be specific to the pages we wanted, but it still resulted in nearly 25k+ pages being ingested; many of them were repeats/duplicates.
Since then, we've attempted to delete or remove the source from the Knowledge Center pages with no success. Content related to Qatar MOI visa inquiry continues to be pulled through regular syncs and is never fully removed. When asking FIN, they suggested that API intervention would be required. We are requesting assistance to have this content completely removed.
Hi,
The content can’t be fully removed through the Knowledge Center UI alone because the regular sync continues to ingest new pages and preserves duplicates. To completely remove the previously ingested pages and stop future content from syncing, API intervention is required. Your technical team will need to use the API to delete all pages associated with the external source and adjust the sync settings to prevent new content from being added. This approach ensures that both existing and future content from that source are fully removed from your Knowledge Center.
We configured an automated synchronization between our IC environment and a third-party website to pull in industry-relevant content. Although we applied URL filters to limit the sync to specific pages, the process still imported over 25,000 pages, many of which were duplicated or repeated.
Since then, we’ve made several attempts to remove the source and delete the related entries from the Knowledge Center, but without success. Content associated with MOI Qatar Visa Inquiries continues to reappear during each scheduled sync and cannot be permanently cleared. When we contacted FIN for support, they advised that resolving this would require direct API-level action.
We’re therefore requesting assistance to fully eliminate this content and prevent it from being re-ingested going forward.