Yep, I spent a month refactoring a few thousand lines of code using GPT4 and I felt like I was working with the best senior developer with infinite patience and availability.
I could vaguely describe what I was after and it would identify the established programming patterns and provide examples based on all the code snippets I fed it. It was amazing and a little terrifying what an LLM is capable of. It didn’t write the code for me but it increased my productivity 2 fold... I’m a developer now a getting rusty being 5 years into management rather than delivering functional code, so just having that copilot was invaluable.
Then one day it just stopped. It lost all context for my project. I asked what it thought what we were working on and it replied with something to do with TCP relays instead of my little Lua pet project dealing with music sequencing and MIDI processing… not even close to the fucking ballpark’s overflow lot.
It’s like my trusty senior developer got smashed in the head with a brick. And as described, would just give me nonsense hand wavy answers.
Was this around the time right after "custom GPTs" was introduced? I've seen posts since basically the beginning of ChatGPT claming
it got stupid and thinking it was just confirmation bias. But somewhere around that point I felt a shift myself in GPT4:s ability to program; where it before found clever solutions to difficult problems, it now often struggles with basics.
I do think part of it is expectation creep but also that it's got better at some harder elements which aren't as noticeable - it used to invent functions which should exist but don't, I haven't seen it do that in a while but it does seem to have limited the scope it can work with. I think it's probably like how with images you can have it make great images OR strictly obey the prompt but the more you want it to do one the less it can do the other.
I've been using 3.5 to help code and it's incredibly useful for things it's good at like reminding me what a certain function call does and what my options are with it, it's got much better at that and tiny scripts like 'a python script that reads all the files in a folder and sorts the big images into a separate folder' or something like that. Getting it to handle anything with more complexity it's got worse at, it was never great at it tbh so I think maybe it's getting to s block where now it knows it can't do it so rejects the answers with critical failures (like when it makes up function of a standard library because it'd be useful) and settles on a weaker but less wrong one - a lot of the making up functions errors were easy to fix because you could just say 'pil doesn't have a function to do that can you write one'
So yeah I don't think it's really getting worse but there are tradeoffs - if only openAI lived by any of the principles they claimed when setting up and naming themselves then we'd be able to experiment and explore different usage methods for different tasks just like people do with stable diffusion. But capitalists are going to lie, cheat, and try to monopolize so we're stuck guessing.
First it just starts making shit up, then lying about it, now it’s just at the stage where it’s like, “Fuck this shit.” It’s becoming more human by the day.
AI systems such as ChatGPT are notoriously costly for the companies that run them, and so giving detailed answers to questions can require considerable processing power and computing time.
This is the crux of the problem. Here's my speculation on OpenAI's business model:
Build good service to attract users, operate at a loss.
Slowly degrade service to stem the bleeding.
Begin introducing advertised content.
Further enshitify.
It's basically the Google playbook. Pretend to be good until people realize you're just trying to stuff ads down their throats for the sweet advertising revenue.
It would be awesome if someone had been querying it with the same prompt periodically (every day or something), to compare how responses have changed over time.
I guess the best time to have done this would have been when it first released, but perhaps the second best time is now..
I asked it a question about the ten countries with the most XYZ regulations, and got a great result. So then I thought hey, I need all the info so can I get the name of such regulation for every county?
ChatGPT 4: “That would be exhausting, but here are a few more…”
I feel like the quality has been going down especially when you ask it anything that may hint at anything "immoral" and it starts giving you a whole lecture instead of answering.
Honestly I kinda wish it would give shorter answers unless I ask for a lot of detail. I can use those custom instructions but it's tedious difficult to tune that properly.
Like if I ask it 'how to do XYZ in blender' it gives me a long winded response, when it could have just said 'Hit Ctrl-Shift-Alt-C'
That’s why I use Bard more now. I’ll ask something and it’ll also answer stuff I would’ve asked as follow-up questions. It’s great and I’m excited for their Ultra model.
Am late to the game here but after reading the article I would agree.
I use it off and on if I am looking up formulas and scripts and find it a great tool for work. It saves a ton of time. It works great and haven’t noticed any change there. Request it to give/write you a specific formula to solve X and it will. It’s a huge time saver.
But I’ve found recently if I am trying to just find information on a subject that I want summarized or something found on the web and explained it will often ‘recommend I check out the company’s website for the latest news or recent developments.’
That last statement was an exact quote I got recently that made me laugh when I went asking for the explanation of how something worked. It was a NO SHIT SHERLOCK moment I had after getting several of these sort of replies.
I mean I har gotten detailed explanations of string theory ages back from ChatGPI and now it’s telling me ‘ummm just go look it up - I can’t right now..m
In recent days, more and more users of the latest version of ChatGPT – built on OpenAI’s GPT-4 model – have complained that the chatbot refuses to do as people ask, or that it does not seem interested in answering their queries.
If the person asks for a piece of code, for instance, it might just give a little information and then instruct users to fill in the rest.
In numerous Reddit threads and even posts on OpenAI’s own developer forums, users complained that the system had become less useful.
They also speculated that the change had been made intentionally by OpenAI so that ChatGPT was more efficient, and did not return long answers.
AI systems such as ChatGPT are notoriously costly for the companies that run them, and so giving detailed answers to questions can require considerable processing power and computing time.
OpenAI gave no indication of whether it was convinced by the complaints, and if it thought ChatGPT had changed the way it responded to queries.
The original article contains 307 words, the summary contains 166 words. Saved 46%. I'm a bot and I'm open source!
My partner is a CompSci teacher and have been training a local llm in her class. As soon as they named their AI it started producing all these weird emotes with every answer, it became super annoying to where it would rather make up stuff than say I don’t know that answer. It was definitely an eye opener for the kids.