VOOZH about

URL: https://arxiv.org/pdf/2604.01418


%PDF-1.7 %���� 1 0 obj << /Metadata 3 0 R /Names 4 0 R /OpenAction 5 0 R /PageMode /UseOutlines /Pages 6 0 R /Type /Catalog >> endobj 2 0 obj << /Author (Michael Krumdick; Adam Wiemerslage; Seth Ebner; Charles Lovering; Chris Tanner) /Creator (arXiv GenPDF \(tex2pdf:a6404ea\)) /DOI (https://doi.org/10.48550/arXiv.2604.01418) /License (http://creativecommons.org/licenses/by/4.0/) /PTEX.Fullbanner (This is pdfTeX, Version 3.141592653-2.6-1.40.28 \(TeX Live 2025\) kpathsea version 6.4.1) /Producer (pikepdf 8.15.1) /Title (Cost-Efficient Estimation of General Abilities Across Benchmarks) /Trapped /False /arXivID (https://arxiv.org/abs/2604.01418v1) >> endobj 3 0 obj << /Subtype /XML /Type /Metadata /Length 1689 >> stream endstream endobj 4 0 obj << /Dests 7 0 R >> endobj 5 0 obj << /D [ 8 0 R /Fit ] /S /GoTo >> endobj 6 0 obj << /Count 30 /Kids [ 9 0 R 10 0 R 11 0 R 12 0 R 13 0 R ] /Type /Pages >> endobj 7 0 obj << /Kids [ 14 0 R 15 0 R 16 0 R 17 0 R 18 0 R ] /Limits [ (Doc-Start) (table.caption.29) ] >> endobj 8 0 obj << /Annots [ 19 0 R 20 0 R 21 0 R 22 0 R 23 0 R 24 0 R 25 0 R 26 0 R 27 0 R 28 0 R 29 0 R 30 0 R 31 0 R 32 0 R 33 0 R 34 0 R 35 0 R 36 0 R 37 0 R 38 0 R 39 0 R 40 0 R 41 0 R 42 0 R 43 0 R 44 0 R 45 0 R 46 0 R ] /Contents [ 47 0 R 48 0 R 49 0 R 50 0 R ] /MediaBox [ 0 0 612 792 ] /Parent 9 0 R /Resources 51 0 R /Type /Page >> endobj 9 0 obj << /Count 6 /Kids [ 8 0 R 52 0 R 53 0 R 54 0 R 55 0 R 56 0 R ] /Parent 6 0 R /Type /Pages >> endobj 10 0 obj << /Count 6 /Kids [ 57 0 R 58 0 R 59 0 R 60 0 R 61 0 R 62 0 R ] /Parent 6 0 R /Type /Pages >> endobj 11 0 obj << /Count 6 /Kids [ 63 0 R 64 0 R 65 0 R 66 0 R 67 0 R 68 0 R ] /Parent 6 0 R /Type /Pages >> endobj 12 0 obj << /Count 6 /Kids [ 69 0 R 70 0 R 71 0 R 72 0 R 73 0 R 74 0 R ] /Parent 6 0 R /Type /Pages >> endobj 13 0 obj << /Count 6 /Kids [ 75 0 R 76 0 R 77 0 R 78 0 R 79 0 R 80 0 R ] /Parent 6 0 R /Type /Pages >> endobj 14 0 obj << /Kids [ 81 0 R 82 0 R 83 0 R 84 0 R 85 0 R 86 0 R ] /Limits [ (Doc-Start) (cite.holzinger1937bi) ] >> endobj 15 0 obj << /Kids [ 87 0 R 88 0 R 89 0 R 90 0 R 91 0 R 92 0 R ] /Limits [ (cite.inspect_evals_preflight) (cite.zhou2023instructionfollowingevaluationlargelanguage) ] >> endobj 16 0 obj << /Kids [ 93 0 R 94 0 R 95 0 R 96 0 R 97 0 R 98 0 R ] /Limits [ (cite.zhou2025generalscalesunlockai) (page.1) ] >> endobj 17 0 obj << /Kids [ 99 0 R 100 0 R 101 0 R 102 0 R 103 0 R 104 0 R ] /Limits [ (page.10) (section*.7) ] >> endobj 18 0 obj << /Kids [ 105 0 R 106 0 R 107 0 R 108 0 R 109 0 R 110 0 R ] /Limits [ (section*.8) (table.caption.29) ] >> endobj 19 0 obj << /A << /D (cite.cobbe2021trainingverifierssolvemath) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 357.594 281.031 411.981 293.091 ] /Subtype /Link /Type /Annot >> endobj 20 0 obj << /A << /D (cite.cobbe2021trainingverifierssolvemath) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 415 281.031 437.237 293.091 ] /Subtype /Link /Type /Annot >> endobj 21 0 obj << /A << /D (cite.zhuo2024bigcodebench) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 173.584 270.072 223.125 282.132 ] /Subtype /Link /Type /Annot >> endobj 22 0 obj << /A << /D (cite.zhuo2024bigcodebench) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 226.126 270.072 248.184 282.132 ] /Subtype /Link /Type /Annot >> endobj 23 0 obj << /A << /D (cite.hendrycks2021measuring) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 109.926 259.114 181.774 271.173 ] /Subtype /Link /Type /Annot >> endobj 24 0 obj << /A << /D (cite.hendrycks2021measuring) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 184.321 259.114 210.722 271.173 ] /Subtype /Link /Type /Annot >> endobj 25 0 obj << /A << /D (cite.ruan2024observational) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 428.53 248.155 478.288 260.214 ] /Subtype /Link /Type /Annot >> endobj 26 0 obj << /A << /D (cite.ruan2024observational) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 481.385 248.155 503.701 260.214 ] /Subtype /Link /Type /Annot >> endobj 27 0 obj << /A << /D (cite.burnell2023revealingstructurelanguagemodel) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 107.004 237.196 168.734 249.255 ] /Subtype /Link /Type /Annot >> endobj 28 0 obj << /A << /D (cite.burnell2023revealingstructurelanguagemodel) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 173.249 237.196 195.565 249.255 ] /Subtype /Link /Type /Annot >> endobj 29 0 obj << /A << /D (cite.Ili__2024) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 200.08 237.196 263.629 249.255 ] /Subtype /Link /Type /Annot >> endobj 30 0 obj << /A << /D (cite.Ili__2024) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 268.143 237.196 290.46 249.255 ] /Subtype /Link /Type /Annot >> endobj 31 0 obj << /A << /D (cite.ye-etal-2023-predictable) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 492.275 226.237 504.996 238.296 ] /Subtype /Link /Type /Annot >> endobj 32 0 obj << /A << /D (cite.ye-etal-2023-predictable) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 107.004 215.278 128.925 227.338 ] /Subtype /Link /Type /Annot >> endobj 33 0 obj << /A << /D (cite.ye-etal-2023-predictable) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 131.279 215.278 152.798 227.338 ] /Subtype /Link /Type /Annot >> endobj 34 0 obj << /A << /D (cite.perlitz-etal-2024-efficient) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 219.58 193.36 273.696 205.42 ] /Subtype /Link /Type /Annot >> endobj 35 0 obj << /A << /D (cite.perlitz-etal-2024-efficient) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 276.685 193.36 298.603 205.42 ] /Subtype /Link /Type /Annot >> endobj 36 0 obj << /A << /D (cite.kipnis2025metabenchsparsebenchmark) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 301.592 193.36 355.997 205.42 ] /Subtype /Link /Type /Annot >> endobj 37 0 obj << /A << /D (cite.kipnis2025metabenchsparsebenchmark) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 358.986 193.36 380.904 205.42 ] /Subtype /Link /Type /Annot >> endobj 38 0 obj << /A << /D (cite.pmlr-v235-maia-polo24a) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 383.892 193.36 453.829 205.42 ] /Subtype /Link /Type /Annot >> endobj 39 0 obj << /A << /D (cite.pmlr-v235-maia-polo24a) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 456.818 193.36 478.736 205.42 ] /Subtype /Link /Type /Annot >> endobj 40 0 obj << /A << /D (cite.jimenez2024swebench) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 340.958 154.506 403.712 166.565 ] /Subtype /Link /Type /Annot >> endobj 41 0 obj << /A << /D (cite.jimenez2024swebench) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 407.24 154.506 429.556 166.565 ] /Subtype /Link /Type /Annot >> endobj 42 0 obj << /A << /D (cite.hendrycks2021measuring) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 146.118 121.629 221.242 133.689 ] /Subtype /Link /Type /Annot >> endobj 43 0 obj << /A << /D (cite.hendrycks2021measuring) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 224.279 121.629 251.602 133.689 ] /Subtype /Link /Type /Annot >> endobj 44 0 obj << /A << /D (cite.hernandez2017measure) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 204.228 77.853 286.937 89.898 ] /Subtype /Link /Type /Annot >> endobj 45 0 obj << /A << /D (cite.hernandez2017measure) /S /GoTo >> /Border [ 0 0 0 ] /C [ 0 1 0 ] /H /I /Rect [ 289.926 77.853 311.843 89.898 ] /Subtype /Link /Type /Annot >> endobj 46 0 obj << /A << /S /URI /URI (https://arxiv.org/abs/2604.01418v1) >> /BS << /W 0 >> /NM (fitz-L0) /Rect [ 12 227.13 32 564.87 ] /Subtype /Link >> endobj 47 0 obj << /Length 10 /Filter /FlateDecode >> stream x�+��| endstream endobj 48 0 obj << /Filter /FlateDecode /Length 3620 >> stream xڵZ�۶�~����uN,A�����8���I.�I�|�HHbM�2�����b$��r�u:77x��>~�T��,�ū�/�����r��YƋ��B�"�b?�bqW,�{o�k�zfߖu�_/����´��0 <�/��: =�����^�]}�p@����h�ﮰOf�B�*S��,�W?@'QS��/U:��,�SG�WM�/_���_� �KS�D�ˮ/w�/���͚�W�6����bUVe_���y�tR,E�g������v��v�蚸�tL� i�m�o��c�o�]Q����E�G�����W�B�16q�8wD�G@�r6�E�w���k�x�ٙ���Ɯ?E�Jf�8�'�o��+�ܹ�����{��m����7 ]�s'$�/���ڒ���Bx��t���dzt�MhEa��̇���ivB�\m��ꈀ�M�mG(�ɷuS5���J�VmYl �o_��O$h��~ ����=yh��(���