Yes, both as training data and as a source of high(-ish) quality documents for RAG to reduce hallucination. But now that we can generate large quantities of "almost correct" content, the real value might be preserving how we learned something in the first place: the evidence.
There are APIs to get all of the data, and downloads available. What I’m not sure is what would be the most help - a list of articles is easy, a working backup of the site is more difficult. What would be a good way to communicate?
Sounds about right.