so i got GPT-4 last month and i honestly feel pretty underwhelmed. sure it does some things better than claude and gpt-3.5 but it feels really unintuitive while doing so. heres some examples on how its features feel for a non software developer.
plugin mode is gimmicky at best unless you want GPT to make you a graph or something.
browsing mode is only useful in limited circumstances as the bot cant parse which information is important and tends to read and regurgitate swaths of generic information over specialized information thats needed. (ask it to help you diagnose a cooling system of a specific vehicle and the information you get back is of generic vehicle cooling systems with unrelated parts and suggestions. seems limited to simpler things like checking the weather forecast)
Dalle-3 mode is admittedly pretty cool but it loves to censor the most mundane things frustratingly. (i tried generic product prototype sketches for a medical kit and it refused to produce some images based on the kits contents.)
voice mode is pretty cool and natural feeling but wont work with browsing so is very limited compared to something like alexa or siri. unlike those services you cant exactly ask it to look something up for you or do something for you.
its ability to parse non code based documents through data analysis mode is severely lacking compared to claude 2. claude has the ability to read data related images and graphs from PDF documents at a glance. GPT-4 falters in this area even though it has image recognition capabilities from directly uploaded images and is decent with making guesses on makes and models of vehicles.
an example of how obtuse GPT-4 can be compared to claude:
https://chat.openai.com/share/b7d2e3b9-20ac-4b86-b125-cdccf11deb54
GPT-4 or Claude 2 (OpenRouter)
Claude 2 vs ChatGPT-4
used it for like 5 minutes, it hallucinated way more than chatgpt 3.0 ever did
More on reddit.comGPT 4 vs Claude 2
GPT-4 is smarter, but Claude 2 has a larger context size.
Use GPT-4 if you need to complete more complex task, or Claude 2 if you're working with a lot of text.
The AlpacaEval Leaderboard is a good place to see what the current best LLMs are.
More on reddit.comClaude2 vs GPT4, my insights thus far
Great comparison, but I feel a lot of the good is the same with Claude 1, so I'm curious to see a direct comparison between 1 and 2.
More on reddit.com