There are lots of articles about bad use cases of ChatGPT that Google already provided for decades.
Want to get bad medical advice for the weird pain in your belly? Google can tell you it’s cancer, no problem.
Do you want to know how to make drugs without a lab? Google even gives you links to stores where you can buy the materials for it.
Want some racism/misogyny/other evil content? Google is your ever helpful friend and garbage dump.
What’s the difference apart from ChatGPT’s inability to link to existing sources?
It’s precisely ChatGPT’s inability to link the sources that is the concern.
also, the fact that Google is so bad about this stuff should give us some pause about directly handing even more incredibly powerful, not-well-curated tools to a constituency as broad as “everyone with an internet connection”. it’s actually insanely bad this a problem with Google, we’ve just normalized it there and it sucks!
The hype cycle around AI right now is misleading. It isn’t revolutionary because of these niche one-off use-cases, it’s revolutionary because it’s one AI that can do anything. Problem with that is what it’s most useful for is boring for non-technical people.
Take the library I wrote to create “semantic functions” from natural language tasks - one of the examples I keep going to in order to demonstrate the usefulness is
@semantic def list_people(text) -> list[str]: '''List the people mentioned in the given text.'''
8 months ago, this would’ve been literally impossible. I could approximate it with thousands of lines of code using SpaCy and other NLP libraries to do NER, maybe a dictionary of known names with fuzzy matching, some heuristics to rule out city names or more advanced sentence structure parsing for false positives, but the result would be guaranteed to be worse for significantly more effort. Here, I just tell the AI to do it and it… does. Just like that. But you can’t hype up an algorithm that does boring stuff like NLP, so people focus on the danger of AI (which is real, but laymen and news focus on the wrong things), how it’s going to take everyone’s jobs (it will, but that’s a problem with our system which equates having a job to being allowed to live), how it’s super-intelligent, etc. It’s all the business logic and doing things that are hard to program but easy to describe that will really show off its power.
I’m both really excited and worried about the part where AI takes over so many jobs that enough people will be without work. I wonder how society will deal with that, will everyone get a proper base “salary” for existing or will there be huge refuge-like camps for the poor jobless people?
I think it’s a combination of inability to link to sources (as you have stated) as well as the confidence in which it may provide incorrect information, and a lack of proper understanding from many people as to how LLMs work and exactly how incorrect they can be at times.
Sure, people can lie on the internet, but a chat bot talking to me and lying? Shouldn’t computers not be able to do that? (/s of course)
The issue is that LLMs are fundamentally not able to not know something. Non-LLM filters that are strapped in front of an LLM can catch stuff like that (“As an LLM I am not able to…”), but if the request makes it through the filter, the LLM is not able to say “Sorry, I don’t know that”, because the data set doesn’t contain that.
For example, there aren’t a lot of API documentations that contain a “Sorry, I don’t know how this endpoint works”.
It’s not the inability to link sources, it’s the wholesale manufacture of them. It’s a language model, not a search engine. It doesn’t get its information from somewhere. It generates it probibalistically based on the structure of the sentence its forming.
It’ll include sources if the sentence structure suggests they should be there, but they’ll also just be built by probabilistic insertion of words.
It’s not so much the inability to link sources but the active laundering of those sources that bugs me. We’ve been lucky that shady information has largely had a vibe that’s pretty easy to spot. ChatGPT presents everything with the same level of professionalism.
Worse, while we might collectively start discounting direct chatbot output because LLMs are dirty liars, scammers can now cheaply rewrite their typo-ridden weird ass screeds into something resembling professionally produced copy.
I like to think of it from a different standpoint. Propaganda and fake news has existed for hundreds of years, if not millennia. It’s just that in the past it was mostly created by wealthy folks, and now anyone can create their own.
A friend of mine uses it to re-type emails to sound more professional. He even got a couple of others to start doing it at his workplace. A few people have started to notice one particular employee has suddenly completely changed how he talks in emails. It’s very amusing, but it works extremely well for my friend.
He even pays the $20usd/m for the “premium” or whatever version. He’s a C-Suite at the company so it’s nothing to him to pay for the service. Other than instances like that, or simple coding (hey I need a quick bs landing page, or I need this added to whatever) it’s pretty overblown for how people seem to think it works.
I have to copy a lot of text from a pdf but the returns are inserted in weird places.
I used to do this whole workaround in word.
Now I’m just like, “chatgpt, can you remove all the extra returns from the text I send you?” “Sure no problem”
It takes me like 5 minutes instead of 20 per document.
The problem with chatGPT is that it allows for automation of content creation.
Imagine a a single guy using chatGPT to control thousands of social media bots, who answer in a human-like way and are able to follow conversations and context, but who all defend the same point of view.
Or imagine a single guy controlling thousands of “local news blogs” that have a constant stream of fresh AI-generated content (both articles and comments), once again all pushing the same narrative.
That is the main problem with things like chatGPT, if not controlled - they allow anyone to create their own “troll farm”.
I just wish humans could be not awful for once in our history. You know what I’ve done with ChatGPT? I had it help me convert a big python function into a one-line delta, and got it to write a short horror story about a man eating a can of beans in theater, amongst other silly things. But everyone suffers, and things get harder to make use of because of power-mad scumbags who see everything as a means to gain control of others.
Hell is other people
I had it write a rap about the poop emoji
I mostly use it for silly things or to give me ideas, and I might try it if we get back into running a game in foundry vtt because the macro I had it test generate messing around looked like it should work fine and I’m lazy
But I can’t imagine using it for serious stuff ever. Maybe in a few years.
I feel like it’s the occasional unpredictability that people are scared of. Whether it’s people being unable to tell if something was created by ChatGPT, if it’s pulling false sources, or people finding ways around set limitations and filters.
I used it to help a friend with a cover letter for a job. I pasted in what my friend had written and asked if it could make it sound better. It literally just made up stuff to make it sound better.
I think the difference is Google is just linking you to this content. On the other hand ChatGPT is pretty confidently telling people all these things. Add the fact that a lot of people consider ChatGPT to be some sort of all-knowing entity and it’s a recipe for disaster.
You should try perplexity.ai as it provides sources & even has filters too. I prefer the academic filter, plus I like checking the provided sources to ensure the AI connected the various info together correctly.
Edit: Perplexity is also updated throughout the day while ChatGPT only has info from 2021 & prior.
Ebay has rolled out some kind of language model plugin for their listing app. It will generate a short product description for you. I’ve been using chat gpt to do this for awhile and also asking it to optimize titles within the 80 character limit.
I can’t always think of related words that other people might search for so this helps me a lot.
I get chatgpt prompts in every search on bing and specifically for a troubleshoot in Linux. It is garbage, and I skip it completely.
It’s only good for stringing together nonsense, say for meeting minutes to pretend the meeting actually did something useful.
brave’s summarizer AI is pretty good.
but then you have to use Brave.
Woah that’s so scary
no, just pretty yucky, given how their CEO is a homophobe and how they’re trying to push crypto and everything.
I don’t enjoy the fact that they tie in crypto. I just want a browser.
That’s the thing: LLMs are great for any content where quality and accuracy doesn’t matter.
- Stock photos
- Product descriptions
- Meeting notes
- Summaries of stuff that’s so boring and useless that you can’t be bothered to read the long form
Then again, if the conteent doesn’t matter, why even create it?
Each step drastically lowers the barriers to get to that end, and also distorts the monetary incentives for people operating that technology to make them more likely to deceive you.
It used to be you have to go to a bookstore/library to read some crack theory on the ailment of your choice. In order to get your crack theories published, you had gatekeepers, publishers, bookstore owners, and then librarians, who would choose what books to stock. Pretty hard to abuse your powers to deceive people.
Then it became easier because you can do it on a desktop with google. Then it became easier because you can just ask google on your phone. Now if you can get a solid SEO page, you’re not gatekept by anyone.
Now it’s easier since you have an authoritative AI that tells you exactly how to do it, and in theory, you can freely develop these AIs to give answers that match your “version” of reality in order to get the most engagement and money out of you. Imagine you go to a website and some “doctor” chats you up that’s actually just a conversational AI, and it just persuades you via pseudo-scientific language that is targeted towards your personal preferences just to get you to buy their snake oil.
It’s the “unknown unknowns” that are really scary, beyond the short term ability it’s easy to understand now. All the ability extrapolated, layered and multiplied.
While the inability to source is a huge problem, but you also have to keep in mind that complaining about AI has other objective beyond the obvious “AI bad”.
- it’s marketing: “Our thing is so powerful it could irreparably change someone’s life” is still advertising even if that irreparable change is bad. Saying “AI so powerful it’s dangerous” just sounds less advertis-y than “AI so powerful you cannot not invest in it” despite both leading to similar conclusions (you can look back at the “fearvertising” done during the original AI boom: same paint, different color)
- it’s begging for regulatory zeals to be put into place: Everyone with a couple of millions can build an LLM from scratch. That might sound like a lot, but it’s only getting cheaper and it doesn’t need highly intricate systems to replicate. Specifically the ability to finetune a large model with few datapoints allows even open-source non-profits like OpenAssistant to compete against the likes of google and openai: Google has made that very explicit in their leaked We have no moat memo. This is why you see people like Sam Altman talking to congress about the dangers of AI: He has no serious competetive advantage and hopes that with sufficient fear-mongering he can get the government to give him one.
Complaining about AI is as much about the AI as it is about the economical incentives behind AI.
One enables the other, or rather the snake is constantly eating itself. SEO content and clickbait were already plagiarizing and consuming human communication, polluting the web by crowding out actual information – ChatGPT and LLMs calcify and turbo-charge this. Tech companies are reacting by piling their own LLMs on top – ingesting garbage and generating yet more garbage. Soon enough, appending " reddit" to our search terms will not be enough to quickly and freely get human information from the web.
Meanwhile – laymen are being told that ChatGPT is an oracle, an intelligence, by companies and enthusiasts trying to build a crypto-style hype train. And the laymen are reacting accordingly. They are being told that ChatGPT knows everything. It doesn’t even know what a pineapple is.