M3CoT

API

A benchmark that evaluates large language models on multimodal reasoning tasks spanning language, natural and social sciences, physical and social commonsense, temporal reasoning, algebra, and geometry.

Very Poor

0 reviews

Score Breakdown

Performance: 0.0 (25%)
Reliability: 0.0 (20%)
Ease of Use: 0.0 (15%)
Value: 0.0 (15%)
Trust: 0.0 (15%)
Delight: 0.0 (10%)
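
The percentages in the score breakdown sum to 100%, which suggests they are the weights each category carries in an overall score. The page does not state this, so the sketch below is only an illustration under that assumption; `WEIGHTS` and `overall_score` are hypothetical names, not part of any documented API.

```python
# Assumption: each listed percentage is that category's weight in the
# overall score. The page itself does not confirm this.
WEIGHTS = {
    "Performance": 0.25,
    "Reliability": 0.20,
    "Ease of Use": 0.15,
    "Value": 0.15,
    "Trust": 0.15,
    "Delight": 0.10,
}


def overall_score(scores, weights=WEIGHTS):
    """Return the weighted mean of the per-category scores (hypothetical helper)."""
    total_weight = sum(weights.values())
    weighted_sum = sum(scores[name] * weight for name, weight in weights.items())
    return weighted_sum / total_weight


# Every category on this listing currently shows 0.0.
current = {name: 0.0 for name in WEIGHTS}
print(overall_score(current))
```

With every category at 0.0 and no reviews submitted, the weighted result is also 0.0.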

Reviews (0)

No reviews yet