I'm using the claude-3-5-sonnet-20240620 API and I need to set the max_tokens. The documentation clearly says the max output for this model should be 8,192 tokens, but I get an error saying the max is 4,096, like for the older models. Am I missing something, or did Anthropric fuck up the validation for the API ?
Videos
I've been using Claude Pro for a while now, and I noticed something strange today. When using Sonnet 3.7, it seems like the token limit for individual responses is lower than before. Previously Claude could generate much longer single responses, but now it seems to cut off earlier.
Has anyone else experienced this? Did Anthropic reduce the response length limits for Claude Pro recently, or am I imagining things? I couldn't find any announcement about changes to the limits.
If you've noticed the same thing or have any information about this, I'd appreciate hearing about it!
Thanks!