If you were to train a classifier on noise/BS/sensationalist papers, what would you include in the training data?

E.g.:
https://x.com/minjunesh/status/1940589653410959784?s=19

Or, Physionic is good at reviewing papers critically.
Matt Kaeberlein is too.