Connect with us

Hi, what are you looking for?

Tech

Public asked to help create ‘humanity’s last exam’ to spot when AI achieves peak intelligence

Scientists are creating “humanity’s last exam” to test AI and see when it has reached expert-level intelligence.

People are being asked to submit their questions and create “the world’s most difficult artificial intelligence test” by the Center for AI Safety (CAIS) and Scale AI.

“Existing tests now have become too easy and we can no longer track AI developments well, or how far they are from becoming expert-level,” said the quiz creators in a statement about the test.

A few years ago, AI was giving almost random answers to questions on exams – that’s no longer the case.

Last week, OpenAI’s newest model, known as OpenAI o1, “destroyed the most popular reasoning benchmarks”, according to Dan Hendrycks, executive director of CAIS.

However, AI still isn’t able to answer difficult research questions and other intellectual questions.

It also appears to score poorly on tests involving planning and visual pattern-recognition puzzles, according to Stanford University’s AI Index Report from April.

Consequently, “humanity’s last exam” will require abstract reasoning to test how clever AI really is.

The submissions shouldn’t be any ordinary quiz questions.

“We found questions written by undergraduates tend to be too easy for the models,” the creators of the quiz said.

Instead, they recommend that question writers have five or more years of experience in a technical industry job like SpaceX, or are a PhD student or above.

The submissions should be difficult for non-experts to answer and “not easily answerable via a quick online search”, and trick questions should be avoided.

“As a rule of thumb, if a randomly selected undergraduate can understand what is being asked, it is likely too easy for the frontier LLMs of today and tomorrow,” said the quiz creators.

People who submit successful questions will be invited as co-authors on the paper and have a chance to win money from a $500,000 (£378,400) prize pool, with the writers of the best questions earning $5,000 (£3,780) each.

Questions should be submitted by 1 November.

This post appeared first on sky.com

    You May Also Like

    Stocks

    In this episode of StockCharts TV‘s The MEM Edge, Mary Ellen reviews what’s shaping up in the broader markets after the Fed announced their rate cut...

    Tech

    Consumer rights group Which? is suing Apple for £3bn over the way it deploys the iCloud. If the lawsuit succeeds, around 40 million Apple...

    Tech

    Battle lines have been drawn between the almost 200 countries meeting in Azerbaijan as they seek to agree a new pot of money to...

    Tech

    Meta has lowered the minimum age to use the popular messaging platform WhatsApp. The move, which came into effect on Thursday, reduces the age...

    Disclaimer: globalwashingtonwebinar.com, its managers, its employees, and assigns (collectively “The Company”) do not make any guarantee or warranty about what is advertised above. Information provided by this website is for research purposes only and should not be considered as personalized financial advice. The Company is not affiliated with, nor does it receive compensation from, any specific security. The Company is not registered or licensed by any governing body in any jurisdiction to give investing advice or provide investment recommendation. Any investments recommended here should be taken into consideration only after consulting with your investment advisor and after reviewing the prospectus or financial statements of the company.

    Copyright © 2024 globalwashingtonwebinar.com | All Rights Reserved