As a subscriber to both Claude and ChatGPT, I've been comparing their performance to decide which one to keep. Here's my experience:
Coding: As a programmer, I've found Claude to be exceptionally impressive. In my experience, it consistently produces nearly bug-free code on the first try, outperforming GPT-4 in this area.
Text Summarization: I recently tested both models on summarizing a PDF of my monthly spending transactions. Claude's summary was not only more accurate but also delivered in a smart, human-like style. In contrast, GPT-4's summary contained errors and felt robotic and unengaging.
Overall Experience: While I was initially excited about GPT-4's release (ChatGPT was my first-ever online subscription), using Claude has changed my perspective. Returning to GPT-4 after using Claude feels like a step backward, reminiscent of using GPT-3.5.
In conclusion, Claude 3.5 Sonnet has impressed me with its coding prowess, accurate summarization, and natural communication style. It's challenging my assumption that GPT-4 is the current "state of the art" in AI language models.
I'm curious to hear about others' experiences. Have you used both models? How do they compare in your use cases?
Videos
What's the difference between TextCortex, Microsoft Copilot & ChatGPT?
Does TextCortex offer Text Generation API?
How does TextCortex work?
so i got GPT-4 last month and i honestly feel pretty underwhelmed. sure it does some things better than claude and gpt-3.5 but it feels really unintuitive while doing so. heres some examples on how its features feel for a non software developer.
plugin mode is gimmicky at best unless you want GPT to make you a graph or something.
browsing mode is only useful in limited circumstances as the bot cant parse which information is important and tends to read and regurgitate swaths of generic information over specialized information thats needed. (ask it to help you diagnose a cooling system of a specific vehicle and the information you get back is of generic vehicle cooling systems with unrelated parts and suggestions. seems limited to simpler things like checking the weather forecast)
Dalle-3 mode is admittedly pretty cool but it loves to censor the most mundane things frustratingly. (i tried generic product prototype sketches for a medical kit and it refused to produce some images based on the kits contents.)
voice mode is pretty cool and natural feeling but wont work with browsing so is very limited compared to something like alexa or siri. unlike those services you cant exactly ask it to look something up for you or do something for you.
its ability to parse non code based documents through data analysis mode is severely lacking compared to claude 2. claude has the ability to read data related images and graphs from PDF documents at a glance. GPT-4 falters in this area even though it has image recognition capabilities from directly uploaded images and is decent with making guesses on makes and models of vehicles.
an example of how obtuse GPT-4 can be compared to claude:
https://chat.openai.com/share/b7d2e3b9-20ac-4b86-b125-cdccf11deb54