What made me stop is
-
they allowed AI answers. Full stop, I don’t want AI regurgitating AI.
-
They did nothing to improve the community except repeatedly say they’ll improve the community
-
The barrier to entry is still way too high. You can’t post/comment without rep, but you can’t get rep without post/commenting. So people joining struggle to even say hello.
As a engineer who was a power user on StackOverflow, I hate that we are losing a major community that helps coders, beginners or advance, ask questions.
They don’t allow AI answers though?
Other than that, yeah I agree. They needed to switch to curating their content and helping their community and they didn’t.
What made me stop is everything I’ve posted in the last five years has been downvoted and/or closed for the stupidest reasons imaginable. Even a nearly decade old question I posted has recently been downvoted and closed. It’s been true for ages that they have created a culture of elitist rule followers hellbent on following the letter of the rule and not the spirit (and many times even ignoring the letter of the rule just to close things), but nowadays it’s just so much worse.
I’ll write a question. Spend like 30 minutes making sure it’s good and that there aren’t duplicates because I have so much fucking anxiety about getting downvoted and closed. I’ll find similar questions and explain why it’s different. Then when I post? Downvote, closed as duplicate. Commenters being condescending assholes.
Not to mention all the other shit over the years. They’re violating the license everyone contributes under by not allowing the content to be used for certain purposes. Meta has been a joke for ages. They don’t listen or engage.
Kind of what happens when you build community, then try to scale for profit.
-
So what do we train gpt on when stack overflow degrades?
Will library docs be enough? Maybe.
SO is already degraded because they didn’t allow new answers even though the old answers are based on old depreciated versions and no longer relevant.
This has been a concern of mine for a long time. People act like docs and code bases are enough, but it’s obvious when looking up something niche that it isn’t. These models need a lot of input data, and we’re effectively killing the source(s) of new data.
It feels like less stack overflow is a narrowing, and that’s kind of where my question comes from. The remaining content for training is the actual authoritative library documentation source material. I’m not sure that’s necessarily bad, it’s certainly less volume, but it’s probably also higher quality.
I don’t know the answer here, but I think the situation is a lot more nuanced than all of the black and white hot takes.
There’s a serious argument that StackOverflow was, itself, a patch job in a technical environment that lacked good documentation and debug support.
I’d argue the mistake was training on StackExchange to begin with and not using an actual stack of manuals on proper coding written by professionals.
The problem was never having the correct answer but sifting out of the overall pool of information. When ChatGPT isn’t hallucinating, it does that much better than Stack Exchange
Stack Overflow mods finally get what they’ve always dreamed of: no more repeat questions.
StackGPT: begins every answer with “closed as duplicate. Here’s a previous answer I provided to this question…”
The ChatGPT release is close to the SO decision to double down on the moderating rules.
Anyway, where is this data from? This change looks suspiciously intense.
Well, having a crawler search through all that garbage, ads, questions, wrong answers. And converting that to facts or condensed information…
Just makes so much more sense, also for the environment, I would think. It saves a ton of useless traffic.
But the “AI” part may be problematic.
Yeah, the bullshit generator part is not useful.
One time, chatGPT gave me a code in Python to use a specific Python library. When I said I was coding in Ruby on Rails, it converted the Python code to Ruby syntax.
It literally made up a solution.
So it made you use a python library in Ruby?
Not that I doubt people have been avoiding it since ChatGPT, but i6 think a part of it is also Google’s partnership with Reddit pushing more search results that way. I’d be curious to see a similar trend regarding Reddit.
SO has been taking a longer time to load for me. Does anyone have the same problem?
I’d like to see the source of this data.
It looks like it’s from https://stackoverflow.com/site-analytics, but I don’t have sufficient karma to see the raw data.