/vault/backup/obsidian_vault/obsidian/Documents/AI/llm-evaluation/notes/frontier-benchmarks/

0 directories, 23 files, 37 KiB total
Name                            Size
arc-agi-2.md                    1.8 KiB
big-bench.md                    1.7 KiB
challenging-big-bench.md        1.6 KiB
charxiv.md                      1.6 KiB
facts-grounding.md              1.7 KiB
global-piqa.md                  1.5 KiB
gpqa.md                         1.7 KiB
humanitys-last-exam.md          1.6 KiB
livecodebench-pro.md            1.6 KiB
mcp-atlas.md                    1.5 KiB
michelangelo-long-context.md    1.7 KiB
mmlu-pro-plus.md                1.6 KiB
mmlu-pro.md                     1.6 KiB
mmlu.md                         1.6 KiB
mmmu-pro.md                     1.6 KiB
omnidocbench.md                 1.5 KiB
screenspot-pro.md               1.6 KiB
simpleqa-verified.md            1.5 KiB
super-natural-instructions.md   1.6 KiB
swe-bench-verified.md           1.6 KiB
tau2-bench.md                   1.5 KiB
vending-bench.md                1.5 KiB
video-mmmu.md                   1.6 KiB