A new study from Columbia Journalism Review showed that AI search engines and chatbots, such as OpenAI’s ChatGPT Search, Perplexity, Deepseek Search, Microsoft Copilot, Grok and Google’s Gemini, are just wrong, way too often.

  • TommySoda@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    24 天前

    I miss the days when Google would just give a snippet of a Wikipedia article at the top and you just click the “read more” button. It may not have been exactly what you were looking for but at least it wasn’t blatantly wrong. Nowadays you have to almost scroll down to the bottom just to find something relevant.

    • SlopppyEngineer@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      23 天前

      They are in the end BS generation machines that are trained so much they accidentally happen to be right often enough.

  • criitz@reddthat.com
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    23 天前

    When LLMs are wrong they are only confidently wrong. They don’t know any other way to be wrong.

    • 4am@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      23 天前

      They do not know wright from wrong, they only know probability of the next word.

      LLMs are a brute forcing of the immigration of intelligence. They do not think, they are not intelligent.

      But I mean people today believe that 5G vaccines made the frogs gay.

    • kubica@fedia.io
      link
      fedilink
      arrow-up
      0
      ·
      24 天前

      We only notice when they are wrong, but they can also be right just by accident.

  • venotic@kbin.melroy.org
    link
    fedilink
    arrow-up
    0
    arrow-down
    1
    ·
    24 天前

    Then again, so has the search engines themselves been proven to be wrong, inaccurate and just plain irrelevant. I’ve asked questions in Google before about things I need to know in general about my state out of curiosity and it’s results always pull up different states that do not apply to mine.

    • TheFogan@programming.dev
      link
      fedilink
      English
      arrow-up
      1
      ·
      24 天前

      well that’s common, but the big thing is, you can see what you are working with. Big difference in at least knowing you need to try a different site when say

      Google: Law about X in state1

      Top result: Law about X in state3: It’s illegal

      Result 2 pages in: here’s a list of each page and whether law X is legal in your state… (State 1 legal)

      Versus chatgpt

      Is X legal in state1?

      Chatgpt: No