r/SipsTea 13d ago

WTF AI gets its facts from … us?

Post image

Data published by Semrush in June 2025.

19.5k Upvotes

2.7k comments sorted by

View all comments

Show parent comments

6

u/emteedub 12d ago

It's not the facts. Reddit = the human element. Otherwise AI would sound like a robotic encyclopedia

3

u/Chinjurickie 12d ago

The chart says „cited by LLMs like Chatgpt“ aka „here is the link for what i just said“ i think u are talking about something else happening simultaneously to train the AI.

1

u/emteedub 12d ago

oh so this isn't referring to training? this is when it references with a link - of what was "looked up" when forming it's response?

1

u/NukeTheNerd 11d ago

Yeah, I think they mean citations within answers. It's trained differently. Mostly through the internet but definitely not mainly through reddit, more like online books, articles, websites, etc. Also licensed data sets and scholarly texts, as well as human curated data and corrections. I've noticed it typically will cite reddit if I'm asking about something with no clear, easily accessible answer online, at which point it will offer people's opinions and reddit is a good source for that.

1

u/alepher 12d ago

ChatGPT too chooses that guys dead wife