The creation, curation, and maintenance of training data is a big industry in its own right, and it has been around for years. Likewise, feature engineering is an entire sub-discipline of data science and engineering. I think you might be making the mistake of assuming that ChatGPT = AI.
If it’s patented, you can just read the patent to find out what else is in it.