Our companies web site uses a content managment system whose interface is all browser based. Turning the GSA loose on our web site using an administrative account ended up wiping out 85% of our web site's content thru the execution of delete actions from web page links in the administrative interface of the content managment system.
The CMS system we use is built in coldfusion (which we're rapidly moving away from to .NET sometime next year.). These coldfusion pages have buttons / images all hyperlinked to perfrom different actions for content records, content folders, and unfortunately whole web site instances. One of these hyperlinked image buttons deletes the content when clicked, which the crawler furiously did last night.
More
Lessons learnt - crawl with an appropriate account that doesn't have access to the CM authoring functionality.

3 Comments:
Also, what was the GSA doing with CMS admin credentials?
Post a comment