There's so much progress in artificial intelligence right now that it feels like, with every new model, some new feature or capability has gone from seemingly impossible to completely possible. That's ...
Claude 3.5 Sonnet was able to solve 64% of problems related to bug fixing and functionality additions with open source codebases, a significant improvement over Claude 3 Opus’ 38% success rate.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results