SciBench

SciBench

API

benchmark designed to evaluate large language models (LLMs) on solving complex, college-level scientific problems from domains like chemistry, physics, and mathematics.

0
Very Poor

0 reviews

Score Breakdown

0.0
Performance
25%
0.0
Reliability
20%
0.0
Ease of Use
15%
0.0
Value
15%
0.0
Trust
15%
0.0
Delight
10%