evaluating-llms-harness

Evaluates LLMs on academic benchmarks such as MMLU and GSM8K to assess and compare model quality.

