Via @[email protected]

Right now if you search for “country in Africa that starts with the letter K”:

  • DuckDuckGo will link to an alphabetical list of countries in Africa which includes Kenya.

  • Google, as the first hit, links to a ChatGPT transcript where it claims that there are none, and summarizes to say the same.

This is because ChatGPT at some point ingested this popular joke:

“There are no countries in Africa that start with K.” “What about Kenya?” “Kenya suck deez nuts?”

  • Eager Eagle@lemmy.world
    link
    fedilink
    English
    arrow-up
    180
    arrow-down
    35
    ·
    1 year ago

    when will people learn that search results change all the time and are different for different people

    • Sami@lemmy.zip
      link
      fedilink
      English
      arrow-up
      132
      ·
      1 year ago

      I get the same as the main post. Either way the point still stands. I had someone correct me with a misconception about something because he googled it and thats what it said in the answer box. It’s getting increasingly difficult to rely on search results especially when google synthesizes them into questions and answers with little context

      • cybervseas@lemmy.world
        link
        fedilink
        English
        arrow-up
        57
        ·
        1 year ago

        Yes. I have to keep reminding my parents that those little Google answer boxes aren’t real search results and can’t be trusted. They sometimes say the exact opposite of the page they’re citing!

    • Neato@kbin.social
      link
      fedilink
      arrow-up
      52
      arrow-down
      2
      ·
      1 year ago

      It doesn’t matter? If you search for something and you get a blatantly wrong answer parroted from an AI text completion service, it’s still a fail.

      I got the wrong answer from Google just now and I’ve never heard of that joke before. So clearly it isn’t just OP with polluted metrics.

      • Landrin201@lemmy.ml
        link
        fedilink
        English
        arrow-up
        5
        ·
        1 year ago

        I just tried it and got this result. The sentence is incredible, I can’t get over how painfully stupid it is.

        Apparently Kenya starts with a k sound and a letter that resembles k but not the letter k

    • OldFartPhil@lemm.ee
      link
      fedilink
      English
      arrow-up
      14
      arrow-down
      1
      ·
      edit-2
      1 year ago

      I get the result you got, but the Emergent Mind response is the second organic result. That’s still way too high.

    • Notorious_handholder@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      1 year ago

      Take out “the letter” part and search as just: “countries in africa that start with k”. For some reason it seems the search involving the words “the letter” got fixed but others did not. Confirmed I was able to get both results by doing that and as of this typing I still able to switch between the two results by just adding or removing those words

    • deleted@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      1 year ago

      Interestingly enough, when I removed the word “letter” I got none. If I put it back I get Kenya.

      This was done on the same device and same browser session.

  • madsen@lemmy.world
    link
    fedilink
    English
    arrow-up
    114
    ·
    1 year ago

    Oh, this is great… And because the ChatGPT transcript is highly ranked on Google, it’s almost certainly going to be used for training ChatGPT. A feedback loop of shitty information. Praise ChatGPT!

    • MalReynolds@slrpnk.net
      link
      fedilink
      English
      arrow-up
      49
      ·
      1 year ago

      Remember GIGO (Garbage In, Garbage Out). Due to years of SEO and content farming (which google profited from, so you get what you deserve assholes) most of the internet, by volume, is self-congratulatory, for profit, garbage, or, you know, reddit garbage. Hopefully someone points a large LLM at the library of congress or other large, well curated data source, but of course copyright will not allow, thanks mickey mouse. Wouldn’t surprise me if the military is already on it, hopefully that leaks…

      • tony@lemmy.hoyle.me.uk
        link
        fedilink
        English
        arrow-up
        24
        arrow-down
        2
        ·
        1 year ago

        LLMs will eventually start feeding of search results from other LLMs and they’ll just start regurgitating each others nonsense. If that isn’t happening already.

        Nobody is going to point an LLM at a good data source because that would mean spending money on actually useful stuff not fast cars and booze.

        • T156@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          1 year ago

          Nobody is going to point an LLM at a good data source because that would mean spending money on actually useful stuff not fast cars and booze.

          It would also mean that you need to sort through that data, and most people don’t have the time or money to bother, not when it might reduce their data pool.

  • outdated_belated@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    77
    ·
    1 year ago

    The second-highest hit gives a clue as to why:

    Relevantly to Lemmy’s existence in the first place, it suggests Reddit as a pretty pivotal training data source, which Reddit tries to cash in on while also killing 3rd party apps due to apathy

    • TimewornTraveler@lemm.ee
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      Oh god chatgpt is going to start talking like a redditor. “I went to make some MAC AND FUCKING CHEESE after I had SEX with my HOT FUCKING WIFE”

  • Rottcodd@literature.cafe
    link
    fedilink
    English
    arrow-up
    76
    arrow-down
    1
    ·
    1 year ago

    I doubted this, so I tried it. I haven’t used google for ages, so I first had to search “google” in DDG, then I went to the main page. When I started typing it in, it suggested the full text of the search, so I thought it was even less likely that it would work like the OP said - that even if it had been the case that it previously did that, so many people have self-evidently done that search that the results would now be correct.

    But no - there it was, right at the top - “While there are 54 recognized countries in Africa, none of them begin with the letter “K”. The closest is Kenya, which starts with a “K” sound, but is actually spelled with a “K” sound.”

    And with that, I’ll contentedly go back to not using google.

    • jigsaw250@lemmy.world
      link
      fedilink
      English
      arrow-up
      31
      ·
      1 year ago

      The closest is Kenya, which starts with a “K” sound, but is actually spelled with a “K” sound."

      Not going to lie, that’s pretty disturbing. This search engine is used by millions (maybe billions?) and they’re providing false information to all those people. That’s scary.

      • dx1@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        ·
        1 year ago

        If you’re using Firefox you can set these custom aliases up directly, @w and so on.

  • Vlhacs@reddthat.com
    link
    fedilink
    English
    arrow-up
    61
    ·
    edit-2
    1 year ago

    Chatgpt creating incorrect feedback loops like this is one of my main concerns about AI being used so prevalently. This and original thoughts disappearing because every new content in the web is generated by AI and not by a human.

    • UnknownQuantity@lemm.ee
      link
      fedilink
      English
      arrow-up
      15
      ·
      1 year ago

      I typed in “what countries in Africa start with k” and got Kenya. When I tried your search term I got your result also. Weird.

      • Jojo@lemm.ee
        link
        fedilink
        English
        arrow-up
        5
        ·
        1 year ago

        It kinda seems like that’s a joke that got popular, and if you quote the joke it finds the joke but if you ask a similar question it gives the real answer

        Did it go viral or something and that’s why google is finding it?

    • GreenMario@lemm.ee
      link
      fedilink
      English
      arrow-up
      6
      ·
      1 year ago

      I’ve tried to upload a similar image so I’ll just add a +1 using Firefox, Android Google.

  • owiseedoubleyou@lemmy.ml
    link
    fedilink
    English
    arrow-up
    57
    arrow-down
    2
    ·
    1 year ago

    Which starts with a “K” sound, but is actually spelled with a “K” sound

    AI is truly magical

  • Blackmist@feddit.uk
    link
    fedilink
    English
    arrow-up
    53
    arrow-down
    1
    ·
    1 year ago

    AI generated content is pure search result pollution.

    What Google should be doing rather than pushing Bard is detecting AI nonsense and purging it from their search database.

    • diffuselight@lemmy.world
      link
      fedilink
      English
      arrow-up
      24
      arrow-down
      1
      ·
      1 year ago

      You cannot detect AI generated content. Not with any real world accuracy and it will only get worse.

      Also, because google relies on growth for everything from compensation structure to business model, they are in a bind - ads is not growing anymore, it’s done.

      And while they managed to create an illusion of growth this earnings round by juicing subscription fees 20% and increasing ad load everywhere, it’s not a sustainable tactic. We are already seeing a tech sell off as people are getting less and less secure.

      So they rely on AI narrative to keep investors invested Google needs AI to work or the investors will move it to a place that may offer higher returns than a squeezed out ads model.

      Worse even they are being attacked by AI - on the quality front (junk content) and in the marketplace (openAI), they don’t have a choice but to take a pro AI stance.

      • assassin_aragorn@lemmy.world
        link
        fedilink
        English
        arrow-up
        9
        ·
        1 year ago

        AI is its own worst enemy. If you can’t identify AI output, that means AIs are going to train on AI generated content, which really hurts the model.

        Its literally in everyone’s best interest, including AI itself, to start leaving identification of some kind inherent to all output.

        • diffuselight@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          arrow-down
          1
          ·
          1 year ago

          Those studies are flawed. by definition when you can no longer tell the difference the difference on training is nil.

          • pedalmore@lemmy.world
            link
            fedilink
            English
            arrow-up
            7
            ·
            1 year ago

            It’s more like successive generations of inbreeding. Unless you have perfect AI content, perfect meaning exactly mirroring the diversity of human content, the drivel will amplify over time.

            • diffuselight@lemmy.world
              link
              fedilink
              English
              arrow-up
              4
              ·
              1 year ago

              Given chinchilla law, nobody in their right mind trains models via shotgun ingesting all data anymore. Gains are made with quality of data at this point, less than volume.

  • MarkHughes4096@lemmy.world
    link
    fedilink
    English
    arrow-up
    45
    arrow-down
    1
    ·
    1 year ago

    I actually stopped using google search all together last night, I was searching for something and the result where just abysmal and mostly irrelevant, I searched on DuckDuckGo and found much better results. I was using both, Google used to be so good and now it’s just a mess.

      • body_by_make@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        14
        ·
        1 year ago

        Double quotes also didn’t work to apply explicit phrase searches last time I tried, which is incredibly annoying. Last I heard they were looking into fixing this though

      • JoBo@feddit.ukOP
        link
        fedilink
        English
        arrow-up
        8
        arrow-down
        3
        ·
        1 year ago

        The bigggest beauty point of it is that you can click on adverts knowing that you’re sending money to Duck instead of evilCorp.

        Also why, even if you know you’re going to have to go to Amazon for something, you should search for it in your less evil search engine of choice so that Amazon are forced to pay them for the referral. A small way to assuage the guilt, but a goodun.

    • JoBo@feddit.ukOP
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      1
      ·
      1 year ago

      My default has been Duck Duck Go for a few years now, originally on privacy and fuck-google grounds. I used to have to (reluctantly) stick !g in the search quite a lot when it couldn’t find what I wanted. Hardly ever need to now.

    • sibbl@feddit.de
      link
      fedilink
      English
      arrow-up
      21
      arrow-down
      4
      ·
      1 year ago

      This sounds like “Hmm, maybe calculators won’t replace mathematicians.” to me.

      Not sure why it should replace them. They’ll co-exist. Sometimes you can do the math in your brain and for other things you use calculators. Results of calculators can still be wrong it you don’t use them properly.

    • mindlight@lemm.ee
      link
      fedilink
      English
      arrow-up
      13
      arrow-down
      8
      ·
      1 year ago

      I’m pretty sure a lot of people said something like “Hmm,maybe the automobile won’t replace horses.” after reading about the first car accidents.

      • regalia@literature.cafe
        link
        fedilink
        English
        arrow-up
        15
        ·
        1 year ago

        Finding sources will always be relevant, and so will finding links to multiple sources (search results). Until we have some technological breakthrough that can fact check LLM models, it’s not a replacement for objective information, and you have no idea where it’s getting its information. Figuring out how to calculate objective truth with math is going to be a tough one.

      • Hiccup@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        1 year ago

        What a blast from the past! AI gives me second hand embarrassment for the people that work and get paid on this/for this shit. It’s the second (or third) coming of crypto and NFTs. Just junk software that fixes nothing and that wastes people’s time.

        • regalia@literature.cafe
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          3
          ·
          1 year ago

          LLMs have absolutely tons of actual applications that it’s crazy, and it already changed the world. Crypto and NFTs were just speculative assets that were trying to solve a problem that didn’t exist. LLMs have already solved a huge amount of real world problems, and continues to do so.

          • ALostInquirer@lemm.ee
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            1
            ·
            1 year ago

            LLMs have already solved a huge amount of real world problems, and continues to do so.

            Would you happen to have some examples? I don’t disagree that LLMs have more of a use case and application than the cryptoNFT misapplications of blockchain, but I’m honestly not familiar with where they’ve solved real world problems (and not just demonstrated some research breakthroughs, which while impressive in their own respect do not always extend to immediate applications).

            • regalia@literature.cafe
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 year ago

              They are augmenting search engines. They write and can digest articles. GitHub co-pilot has been a pretty big deal. It can act like a personal tutor to walk you through math problems, code, language, whatever. Building trust LLM search for medical information without hallucinating. It can do financial analysis and all sorts of stuff with that. It’s replacing a huge amount of jobs. This stuff is like all over the news, I’m not sure if you’ve lived under a rock this whole time. For very little effort you can find an endless more amount of examples. It’s creating real world use cases daily, so fast that it feels impossible to keep up.

              • ALostInquirer@lemm.ee
                link
                fedilink
                English
                arrow-up
                1
                ·
                1 year ago

                This stuff is like all over the news, I’m not sure if you’ve lived under a rock this whole time.

                Oh, no, I’ve heard it, I’m just skeptical of their accuracy and reliability, and that skepticism has been borne out by the numerous reports of glitching (“hallucinations” as they insist on calling them, in furtherance of their inappropriate personification of the technology). Moreover, I’ve found their mass theft of others’ work to further call into question the creators’ trustworthiness, which has only been compounded by their overselling of their technology’s capabilities while simultaneously suggesting it’s just untenable to log & cite all the sources that they push into it.

                It can supposedly do all you describe, but it can’t effectively credit its sources? It can tutor but it can’t even keep basic information straight? Please. It’s impressive technology, but it’s being overblown because the markets favor exaggeration to facts, at least as long as people can be kept enamored with the fantasy they spin.

                • regalia@literature.cafe
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  1 year ago

                  With all that silicon valley, it’ll probably be pushed more to do those things regardless of its hallucinations and accuracy lol.

  • const_void@lemmy.ml
    link
    fedilink
    English
    arrow-up
    31
    ·
    1 year ago

    Searched for ‘office gym’ on YouTube yesterday and it returned a bunch of videos of Jim from The Office. The enshittification is everywhere these days.

    • bcore@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      arrow-down
      3
      ·
      1 year ago

      What kind of video were you actually hoping to see? That feels like such an odd subject to want to watch videos about.

        • bcore@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 year ago

          Hmm fair enough I guess. It’d never occur to me to want to look for youtube videos about the intersection between home offices and home gyms, but I’m sure people do. I think I’d probably use a term like “home office gym” though in that case. Honestly I’d bet way more people want videos about “Office Jim” but are terrible at spelling than want home office exercise equipment videos though…

    • Fungah@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      I’ve googled error codes for programs with hundreds of thousands of users and had 0 results.

      Really Google? You’re telling me out of all these people that use / develop this application, that no one, kot ever. Once, has ever written hay error code down anywhere you index?

      It’s all so fucking shitty it has to be intentional but I can’t for the life of me figure out WHY. Showing more ads? Maybe htnyhetrs other options. People will just use them. Making everyone dumber? Saving bandwidth??

      I don’t boy the “so has just gotten that good” narrative. It’ll leave out sites from 2010 when it would be useful to see them and include them when it isn’t.

      I just don’t get it.

  • adeoxymus@lemmy.world
    link
    fedilink
    English
    arrow-up
    32
    arrow-down
    2
    ·
    1 year ago

    Tbh the problem is not due to chat gpt but because Google doesn’t rank search results by correctness but something that is related to popularity.

  • Chatotorix@lemmy.world
    link
    fedilink
    English
    arrow-up
    30
    ·
    1 year ago

    I tried and it is true here at least. Says Kenya is the closest but although it starts with a K sound, it’s spelled with a K sound. lmao

    The only time I tried ChatGPT for something that required a little bit of processing it failed miserably. I had a shower thought, “what is the most used noun on lyrics of this band I like?” I asked it and it gave me random words. I decided to investigate and ask it what are specific lyrics of some of the most popular songs and it kept telling me made up lyrics, when I could actually find them immediately on Google.

  • fernandu00@lemmy.ml
    link
    fedilink
    English
    arrow-up
    27
    ·
    1 year ago

    I’ve been using DDG for a year and like the search results but what bothers me is that you can’t exclude a word from your query with minus sign!

    • Fungah@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      1 year ago

      DDG is just bing and I think google too? I don’t know what’s happening but over the last 2 years or so every search engine has been getting worse and worse and worse. They’re all borderline un fucking usable.

      Except kagi. Which you have to pay for. Which I am totally fucking okay with given the state of everything else.

    • akulium@feddit.de
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      1 year ago

      Try search.brave.com, it can do that.

      They have their own index unlike ddg which is just a proxy for bing. Bing removed the minus feature at some point.