OpenAI's o1 and DeepSeek's R1 models previously sat in the ranking, having scored only 9% on the exam.

OpenAI’s DeepResearch can complete 26% of ‘Humanity’s Last Exam’ — a benchmark for the frontier of human knowledge