Artificial Analysis (@artificialanlys) 's Twitter Profile
Artificial Analysis

@artificialanlys

Independent analysis of AI models and hosting providers - choose the best model and API provider for your use-case

ID: 1743487864934162432

linkhttp://artificialanalysis.ai/ calendar_today06-01-2024 04:21:21

861 Tweet

32,32K Followers

500 Following

Artificial Analysis (@artificialanlys) 's Twitter Profile Photo

Announcing Artificial Analysis Long Context Reasoning (AA-LCR), a new benchmark to evaluate long context performance through testing reasoning capabilities across multiple long documents (~100k tokens) The focus of AA-LCR is to replicate real knowledge work and reasoning tasks,

Announcing Artificial Analysis Long Context Reasoning (AA-LCR), a new benchmark to evaluate long context performance through testing reasoning capabilities across multiple long documents (~100k tokens)

The focus of AA-LCR is to replicate real knowledge work and reasoning tasks,
Artificial Analysis (@artificialanlys) 's Twitter Profile Photo

Here is the new Artificial Analysis Intelligence Index standing. The inclusion of AA-LCR in Artificial Analysis Intelligence Index v2.2 marginally improves the relative ranking of Claude 4 Sonnet (Thinking) from 58 to 59 points and is now equivalent to DeepSeek R1 0528. Note:

Here is the new Artificial Analysis Intelligence Index standing. The inclusion of AA-LCR in Artificial Analysis Intelligence Index v2.2 marginally improves the relative ranking of Claude 4 Sonnet (Thinking) from 58 to 59 points and is now equivalent to DeepSeek R1 0528.

Note:
Artificial Analysis (@artificialanlys) 's Twitter Profile Photo

Example Question: For the company and quarter where the company reported a 13.5% decline on the prior quarter’s operating income. What was their adjusted EBITDA? List the company name and adjusted EBITDA Example Document Provided to Answer Question: d1io3yog0oux5.cloudfront.net/_7d21ae8f93fa6…

Artificial Analysis (@artificialanlys) 's Twitter Profile Photo

Link to AA-LCR announcement: artificialanalysis.ai/articles/annou… Includes further details regarding AA-LCR including how the dataset was developed. You can learn more about the Artificial Analysis Intelligence Index v2.2 at artificialanalysis.ai/methodology/in… Detailed AA-LCR results on Artificial