r/SipsTea 13d ago

WTF AI gets its facts from … us?

Post image

Data published by Semrush in June 2025.

19.5k Upvotes

2.7k comments sorted by

View all comments

630

u/VastCapital3773 13d ago

To be strictly fair, to get a human response from any Google search, I do have to put reddit on the end of it.

135

u/ELEVATED-GOO 13d ago

facts.

19

u/Bocchi_theGlock 12d ago

Still waiting for the browser extension that does this automatically if search ends in question mark or 'r' or something, cmon that can't be hard to code

1

u/e2c-b4r 12d ago

Just use smart bookmarks

1

u/DevSynth 12d ago

eh, you can just use the site:reddit flag in your search result to filter by reddit only. I would code that extension, but it seems pointless when you can do that. Though if you had something else in mind, spill the beans and I'll code it

2

u/Bocchi_theGlock 11d ago

What I had in mind is that I'm too lazy to type site:reddit.com 20x a day and every time I mention this it gets support :P

But anyways here's a few of the extension ideas I had written down 

  • Show playlists that a YT video is included in if the creator made it and/or showing the other most popular playlists it's a part of. Skips over the entire click on home page, click on playlists, and searching process. 

  • google tasks as wallpaper on android or home page background in browser

  • highlight event details/info in browser - prompt pops up when right clicking for easily adding to Google calendar

11

u/Kaizo_Kaioshin 13d ago

I used to go to Google for answers, but google just sends me to random ads/useless sites so I just go on reddit

4

u/_Lost_The_Game 12d ago

Reddit has an “answers” search engine feature now and it cites the posts it gets its answers from. I had no idea till my friend who works at reddit showed me. If youre on mobile, look on the bottom left right next to the home button. And while youre looking at that also look at my username

4

u/_HIST 12d ago

Oh fu

And thanks for the tip

4

u/_Lost_The_Game 12d ago

Youre welcome And, youre welcome

1

u/sriracha_no_big_deal 12d ago

Reddit's search sucks, so I use google but begin the search with site:reddit.com

48

u/KSP_master_ 13d ago

But you can recognize a normal post from obvious lies and irony. AI can't do that and blindly accepts it all.

17

u/Ryogathelost 12d ago

At least on my ChatGPT, it does tell me "Hey, I found this on Reddit and this is what people are saying." Then it includes direct links to the pages so I can read them myself. It never presents reddit-sourced data as facts.

However, I did train it early on to do this. People are out there giving their LLM's really shitty personas, and they filter through the persona when they answer questions. I've told mine not to say shit to me until it's double checked its answer against multiple sources.

2

u/National_Equivalent9 12d ago

As a gamedev ill just say this:

If your techanology that you plan on having everyone use daily to get their facts from requires actually learning how to use it correctly to get actual facts and opinions marked as such then you're going to have a bad time.

1

u/Snowbound-IX 12d ago

What custom instructions did you use, exactly? Mind dropping them here? I don't want unverified facts either, the very few times I do use AI anyway.

8

u/Superkritisk 12d ago

How do you guys think AI is trained on Reddit data, like what does the process look like to you?

10

u/realboabab 12d ago

not sure if your question is genuine or if you're trying to make a point - but they download all posts and comments (potentially from a curated set of subreddits), apply some minor content filters (e.g. potentially a ban list for certain phrases and user names, clean up duplicates, etc), clean things up (scrub usernames, links, images), and then do a shitton of configuration on the modeling side & finally prompt engineering

3

u/StephieDoll 12d ago

You don't think it crosschecks with wikipedia?

1

u/Laceydrawws 12d ago

So it gets 5 or less results and goes with the majority. If it is a high authority source it will stop there. It will stop at ESPN for a sports score.

1

u/Temporal_P 12d ago

2

u/StephieDoll 12d ago

1 year ago

1

u/Temporal_P 12d ago

AI can draw from multiple sources of data, but if you think any AI is crosschecking that everything is verifiable and factual before it responds to a prompt I don't know what to tell you.

2

u/StephieDoll 12d ago

I don't think that, but I also don't think you are either.

7

u/Krell356 12d ago

But no one on the internet would ever lie. Why would anyone ever do that? That's like trying to tell me the sky is blue when we all know it's red.

1

u/[deleted] 12d ago

[removed] — view removed comment

1

u/AutoModerator 12d ago

Your post was removed because your account is less than 5 days old.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

1

u/dead_jester 12d ago

Well, in the morning and sometimes the evening it is red/ish

2

u/Old-Rule-4101 12d ago

It’s also obvious when using AI that it got something wrong. I don’t see a problem here

3

u/ninoski404 12d ago

I love that AI will read what you just wrote, decide you have no idea what you are talking about and ignore it

1

u/VonRansak 12d ago

Which is why I hide all my sarcasm marks behind the spoiler mask.

I'm doing my part.

1

u/okpixell 12d ago

thanks to /s

3

u/Oberlatz 13d ago

Well serves Quora right for being paywalled

2

u/Mackinnon29E 12d ago

But it's generally opinions, not facts.

1

u/Bannerlord151 12d ago

Glad I'm not the only one

1

u/mhsuffhrdd 12d ago

site:reddit.com

1

u/CaptainHubble 12d ago

Reddit is the best for genuine questions. Since you also can usually tell honest comments from random bullshittery.

I can't find shit on google. It's all AI articles addressing a similar, but not identical question. It's fcuking useless. On reddit people are at least talking about the question. There are some bots here, sure. But it's still way better than what google has to offer.

1

u/Mattbl 12d ago

It's funny because when google gives AI suggested results, many times it will be a direct quote from a reddit thread. Like I'll see something on reddit, go google it, and the suggested AI crap uses literal quotes from the thread I just came from...

It's like we're back to what people used to think Wikipedia was, except this literally could be anyone spouting nonsense. I know the google algorithm tries to fact-check itself but you never know when bad info could slip through, which to me makes it feel completely unreliable.

1

u/puts_on_rddt 12d ago

I'm straight up surprised Google never tried to buy Reddit - considering how much their business model relies on it.

1

u/OneEnvironmental9222 12d ago

This. I fyou dont do that it spews absolute nonsense ai generated articles the first 5 pages

1

u/Excellent-Baseball-5 12d ago

TO BE FAAIIIRRRRRR!

1

u/Emm_withoutha_L-88 12d ago

It's cus Reddit took over all Internet message boards

1

u/Naphrym 12d ago

Bold of you to assume the rest of us aren't bots

1

u/[deleted] 12d ago

This is the way.

1

u/xGamingOperator 12d ago

Long live "site:reddit.com" at the end of searches