{"id":8061,"date":"2024-09-18T15:03:43","date_gmt":"2024-09-18T15:03:43","guid":{"rendered":"https:\/\/dailywashingtoninsider.com\/index.php\/2024\/09\/18\/public-asked-to-help-create-humanitys-last-exam-to-spot-when-ai-achieves-peak-intelligence\/"},"modified":"2024-09-18T15:03:43","modified_gmt":"2024-09-18T15:03:43","slug":"public-asked-to-help-create-humanitys-last-exam-to-spot-when-ai-achieves-peak-intelligence","status":"publish","type":"post","link":"https:\/\/dailywashingtoninsider.com\/index.php\/2024\/09\/18\/public-asked-to-help-create-humanitys-last-exam-to-spot-when-ai-achieves-peak-intelligence\/","title":{"rendered":"Public asked to help create \u2018humanity\u2019s last exam\u2019 to spot when AI achieves peak intelligence"},"content":{"rendered":"<p>Scientists are creating &#8220;humanity&#8217;s last exam&#8221; to test AI and see when it has reached expert-level intelligence.<\/p>\n<p>People are being asked to submit their questions and create &#8220;the world&#8217;s most difficult <strong>artificial intelligence<\/strong> test&#8221; by the Center for AI Safety (CAIS) and Scale AI.<\/p>\n<div class=\"sdc-site-outbrain sdc-site-outbrain--AR_6\" aria-hidden=\"true\" data-component-name=\"sdc-site-outbrain\" data-target=\"\" data-widget-mapping=\"\" data-installation-keys=\"\">    <\/div>\n<p>&#8220;Existing tests now have become too easy and we can no longer track AI developments well, or how far they are from becoming expert-level,&#8221; said the quiz creators in a statement about the test.<\/p>\n<p>A few years ago, AI was giving almost random answers to questions on exams &#8211; that&#8217;s no longer the case.<\/p>\n<p>Last week, <strong>OpenAI&#8217;s<\/strong> newest model, known as OpenAI o1, &#8220;destroyed the most popular reasoning benchmarks&#8221;, according to Dan Hendrycks, executive director of CAIS.<\/p>\n<div class=\"ad ad--teads\">        <\/div>\n<p>However, AI still isn&#8217;t able to answer difficult research questions and other intellectual questions.<\/p>\n<p>It also appears to score poorly on tests involving planning and visual pattern-recognition puzzles, according to Stanford University&#8217;s AI Index Report from April.<\/p>\n<p>Consequently, &#8220;humanity&#8217;s last exam&#8221; will require abstract reasoning to test how clever AI really is.<\/p>\n<p>The submissions shouldn&#8217;t be any ordinary quiz questions.<\/p>\n<p>&#8220;We found questions written by undergraduates tend to be too easy for the models,&#8221; the creators of the quiz said.<\/p>\n<p>Instead, they recommend that question writers have five or more years of experience in a technical industry job like SpaceX, or are a PhD student or above.<\/p>\n<p>The submissions should be difficult for non-experts to answer and &#8220;not easily answerable via a quick online search&#8221;, and trick questions should be avoided.<\/p>\n<p>&#8220;As a rule of thumb, if a randomly selected undergraduate can understand what is being asked, it is likely too easy for the frontier LLMs of today and tomorrow,&#8221; said the quiz creators.<\/p>\n<p>People who submit successful questions will be invited as co-authors on the paper and have a chance to win money from a $500,000 (\u00a3378,400) prize pool, with the writers of the best questions earning $5,000 (\u00a33,780) each.<\/p>\n<p>Questions should be submitted by 1 November.<\/p>\n<\/p>\n<div>This post appeared first on sky.com<\/div>\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Scientists are creating &#8220;humanity&#8217;s last exam&#8221; to test AI and see when it has reached&hellip;<\/p>\n","protected":false},"author":1,"featured_media":8062,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[24],"tags":[],"class_list":["post-8061","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-science"],"_links":{"self":[{"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/posts\/8061","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/comments?post=8061"}],"version-history":[{"count":0,"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/posts\/8061\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/media\/8062"}],"wp:attachment":[{"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/media?parent=8061"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/categories?post=8061"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/dailywashingtoninsider.com\/index.php\/wp-json\/wp\/v2\/tags?post=8061"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}