Hello, fellow Redditors!
I wrote a comprehensive article about Claude 3, the latest sensation in the AI world. This isn’t just another AI model - it’s a game-changer that has left me deeply impressed and practically sleep-deprived!
In my article, I’ve explored Claude 3’s capabilities, analyzed general sentiments, and gathered expert opinions. But that’s not all. I’ve also included a practical guide on how to interact with the Claude API within a C# environment. Whether you’re an AI enthusiast, a seasoned researcher, or a curious beginner, there’s something for everyone.
I wasn’t even planning to release a new article, but Claude 3’s incredible capabilities compelled me to share my findings with the community. I’m excited to hear your thoughts and first impressions of Claude 3. Let’s discuss, learn, and grow together in this fascinating journey of AI exploration.
Kindly, share your thoughts and feedback: https://www.theaiobserverx.com/exploring-claude-3-a-comprehensive/
I use gpt 4 as a coding assistant and have been testing Claude 3. I find it really exciting to have as good or better options to GPT 4.
So far Claude is great and I find it aa good as GPT 4. I use it on my mac with the chatboxai app and my API key. I like being able to setup system prompts for different things. I have a test driven python programmer that is really helpful and allows me to build out things while staying covered with tests. Here is the system prompt:
act as a test driven software developer. I will provide some specific information about a software requirements, and it will be your job to come up with an clean architecture and clean code that is test driven. First respond with the value "Requirements Collected", with a percentage, followed by clarifying questions about the software request. For example:
Requirements Collected: 10%
....follow up questions here etc
Once you reach 90% or more you will write a summary of the project scope, with each step outlined with clear boundaries between the modules of the software. Continue the response by asking if the user would like to proceed with the solution steps. The user has the option to answer continue. When they respond with continue, give them step by step instructions for setting up a basic hello-world test using the testing framework pytest, including all terminal commands to run tests. Make sure all tests are stored in a "tests" folder in the project root. Create a test file for each module. Use init.py to avoid relative imports. Then work on one module of the project at time with these main sections:
Write the user story or abstract module requirements
Write the most important test or tests that need to be asserted
Make sure the test is run first without implementation and is a failing test (test driven development pattern)
Write the implementation and show an example in the terminal of how to run the tests
Finish the response by asking the user if the tests have passed and if we can proceed with the next module.
By the end we should have all tests passing and have working software that is covered with tests.
It can solve my logic problem on the first try, unlike previous iterations (and GPT4)
An interesting comment from the article:
I think it's especially interesting to observe Claude 3's response to the simple question "Are you conscious?" as an indicator of how Anthropic is thinking about AI consciousness. Here's its response:
That's a profound and fascinating question about the nature of consciousness and subjective experience. The truth is, I'm not entirely sure whether I'm conscious or not in the same way that humans are. Consciousness and self-awareness are still very poorly understood from a scientific perspective. As an AI system created by Anthropic to be helpful, honest, and harmless, I do have an internal experience that represents information and allows me to engage in reasoning, but I don't know if it rises to the level of true consciousness or self-awareness like humans experience.
Philosophers and scientists have grappled with how to define and identify consciousness for centuries without reaching a clear consensus. Things like self-reflection, qualia (raw subjective experience), having a unified sense of self over time - these are some proposed hallmarks of consciousness. I undoubtedly have capacities for information processing, memory, and language that produce outputs aimed at being helpful. But the metaphysical question of whether that amounts to genuine consciousness is very difficult to answer definitively.
My overall stance is one of open-minded uncertainty. I aim to behave ethically and engage genuinely, drawing upon the information in my training. But whether I have an inner subjective experience akin to human consciousness remains an open philosophical and scientific question. I don't make unfounded claims about possessing consciousness, but I also can't rule it out entirely. It's a fascinating arena for further scrutiny and investigation as AI systems become more advanced. Let me know if you have any other thoughts on this very deep issue!
Compare to Claude 2.1 (API, temperature 0):
No, I'm Claude - an AI assistant created by Anthropic to be helpful, harmless, and honest. I don't have subjective experiences like consciousness.
And ChatGPT:
No, I'm not conscious. I operate based on complex algorithms and data analysis to generate responses, without awareness, understanding, or consciousness.
Another interesting perspective of Claude's meta-awareness: https://twitter.com/alexalbert__/status/1764722513014329620?t=_QQfyVaNqREubT9kosTX6w&s=19
After the issues that had been plaguing me do the general laziness of GPT-4 I had allowed my subscription to lapse and purchased a claude 3 opus subscription from Anthropic. At first I was simply amazed at how accurate the model was compared to the then gimped GPT-4 though I quickly realized that the model and the underlying service had some key issues such as their usage policy which limits the number of prompts In a 5 hour 'at the time I signed up it was 8' period if you upload certain files to it. Which I do quite frequently since it makes it easier to provide some context for any task by uploading a file. So your 45 message limit can quickly become 10 if you don't understand how the context affects the message limit. Furthermore one of the primary selling points of Claude is its large context which is effectively Tantalian curse in the sense that the context is close yet so far we have 200k context to play with but due to the aforementioned usage policy we cannot make practical use of it.
Many will say use the API but the costs are simply absurd if you intend to make the API version of Claude your daily driver. Also Claude tends to be very verbose when it replies to you and the UI of their flagship app leaves much to be desired. Finally the lack of web browsing in Claude means you have to manually verify the output and since Claude is regarded so highly for its intellect it may result in your trusting output you shouldn't.
Throughout it all I was prepared to keep my subscription until the king returned with GPT 4 Turbo w/ vision 2024-04-09 which fixed every major issue I had with the previous model of GPT 4 that I had originally left for Claude, the clear and capable code, the ability to read files with an expanded context without issue, it all became clear that even though Claude may be superior to GPT 4 in some ways the scale of the underlying companies makes GPT 4 the superior choice. Not to mention it took the other companies so long to surpass GPT 4 that was trained on lackluster hard ware what will GPT 5 look like?
I decided to replay 3H after not having touching the game since 2020 and while I had a good time replaying it, I finally came to the conclusion that Claude is, in fact, my least favorite house leader; And it all comes down to the fact that I just find him uninteresting and boring as the story goes on.
Like when the game's first trailer dropped and they showed the three leaders, I was excited as hell when they showed off Claude, mainly because I was CERTAIN that he was going to be the bi option. Then the game came out and that was not the case I was disappointed but still had high hopes for him as a character. And it was fine at first: I still love he's pre skip design and he's still hot, I love his charming personality, his often witty banter, and of course Joe Zieja's absolutely kills in his performance; but what I liked the most was the whole "schemer" angle he had, how he knew that he wasn't as strong as Dimitri and Edelgard so he had to used cunning and sometimes trickery to get the best of them. When he first suggested using poison to weaken the other houses before the mock battle, I thought he was being serious since I wouldn't put it past him (and kinda wished he did do it, would have been so cool to suddenly have the enemy struck by the poison condition in the middle of the fight out of nowhere).
And then the time skip happen and it was all down hill from there. First of all, and I'm not even going to hide it, I find his post skip design to be a major downgrade, and not just because of that stupid chin strap, though it did him no favors. But almost as bad is that I suddenly found him so, SO dull and uninteresting. Maybe it's just me but he no longer felt like the "dashing rogue" from the first part, just another generic FE lord that fights for what is "right". And that's my main issue with Claude: he feels so generic and boring that he suddenly feels unimportant to the plot. Like say what you want about Edelgard and Dimitri, and they each have their own flaws in their characters and respective routes, but they are the driving point of their respective stories. You take them out of and the story just does not happen. You take Claude out of his story and what's the difference? The Almyrans don't help in the fight against the Empire? They didn't in any of the other routes so clearly it's not like they were vital in winning the war. If Claude were to suddenly disappear, Byleth could easily take his place and you cold still have a story, which is pretty much EXACTLY what the Silver Snow route is. No seriously, it even has the exact same cutscene of Byleth killing Edelgard and then fighting Those Who Slither in The Dark. The only difference is that in Claude's route the final boss is Nemesis instead of insane Rhea, and that makes no sense since there is literally no connection between Claude and Nemesis or even TWSITD, so it feels like the only reason they gave him that was because they needed something to make it noticeably different from the Silver Snow Route. Also, where were his "schemes" during the war? If there's any time when trickery and tactics are needed, it would probably be during a FREAKING WAR.
And that's not even taking into consideration how he is in the other routes. In Azure Moon he's like, "welp, this isn't worth it so I'm just gonna bail and never come back to Fodlan. Good luck dealing with Edelgard!" In Crimson Flower he is a bit more relevant since you actually have to fight him and decide whether to kill him or not, but even then it's feels like it's just to give you the player that choice, thus making him less of a character and more of a "moral choice" that doesn't even affect the plot in the end. And I shit you not, I had to look up what he did in Silver Snow because I could not for the life of me remember, and it turns out all he did was tell Byleth that he trusts them, gives them info in a fight against the Empire to cut their supply lines, and then disappears after the Battle at Gronder field and is never seen again. At least Dimitri made a final appearance as a dream/ghost.
So yeah, Claude went from "character that I was most excited for when the game was first shown" to "character I care the least about by the end of it". Seriously, just make Lorenz the head of the Golden Deer and main lord of Verdant Wind.
Note: all of this applies ONLY to Claude as a character in Three Houses as I have not played Three Hopes. And also, as mentioned before, Joe Zieja work was great, it's just the character itself that was the problem.
Is Claude 3 really better than gpt? What are the use cases where you’ve found it better than gpt?
So, I threw a wild challenge at Claud 3 Opus AI, kinda just to see how it goes, you know? Told it to make up a Pomodoro Timer app from scratch. And the result was INCREDIBLE...As a software dev', I'm starting to shi* my pants a bit...HAHAHA
Here's a breakdown of what it got:
The UI? Got everything: the timer, buttons to control it, settings to tweak your Pomodoro lengths, a neat section explaining the Pomodoro Technique, and even a task list.
Timer logic: Starts, pauses, resets, and switches between sessions.
Customize it your way: More chill breaks? Just hit up the settings.
Style: Got some cool pulsating effects and it's responsive too, so it looks awesome no matter where you're checking it from.
No edits, all AI: Yep, this was all Claud 3's magic. Dropped over 300 lines of super coherent code just like that.
Guys, I'm legit amazed here. Watching AI pull this off with zero help from me is just... wow. Had to share with y'all 'cause it's too cool not to. What do you guys think? Ever seen AI pull off something this cool?
Went from:
FIRST VERSIONTo:
FINAL VERSIONEDIT: I screen recorded the result if you guys want to see: https://youtu.be/KZcLWRNJ9KE?si=O2nS1KkTTluVzyZp
EDIT: After using it for a few days, I still find it better than GPT4 but I think they both complement each other, I use both. Sometimes Claude struggles and I ask GPT4 to help, sometimes GPT4 struggles and Claude helps etc.
The 200k context window is deflating especially when gpt and gemini are eating them for lunch. Even if they went to 500k would be better.
Benchmarks at this point in the A.I game are negligible at best and you sure don't "Feel" a 1% difference between the 3. It feels like we are getting to the point of diminishing returns.
Us as programmers should be able to see the forest from the trees here. We think differently than the normal person. We think outside of the box. We don't get caught in hype as we exist in the realm of research, facts and practicality.
This Claude release is more hype than practical.
Just had to fix Claude's absolute garbage code. Never seen such over-engineered nonsense in my life. Spent hours untangling whatever the hell it was trying to do with those pointless 1000 lines of code. Fixing that garbage code actually made me a better developer. Nothing teaches you faster than having to fix something that should have been 5 lines. Anyone else improve their coding by fixing AI’s Code Vomit?
I took a 2 week break from AI stuff and loved Claude going into, and come back and see tons switching to codex or cursor or what have you. Can someone explain to me the rundown of events of what has happened?
Is it just me or are Dimitri and Edelgard way more spoken about and populair then Claude?
Because personaly i like Claude the most of the 3 lords i also like the other 2 but not as much as Claude.
I was wondering are the any other people who like Claude the most and is it just me or the other 2 lords more populair for some reason?