Black Friday Sale! Save huge on InvestingProGet up to 60% off

Open AI’s GPT-4 demonstrates “human-level performance” on professional and academic benchmarks

Published 15/03/2023, 04:01 pm
Open AI’s GPT-4 demonstrates “human-level performance” on professional and academic benchmarks

ChatGPT's parent company Open AI has exhibited “human-level performance” in its GPT-4 model, a large multimodal model that aced on several professional and academic benchmarks.

GPT-4 outperformed its predecessor GPT-3.5 by a significant margin as demonstrated by its ability to achieve a score in the top 10% of test takers on a simulated bar exam, while GPT-3.5 only scored in the bottom 10%.

While it is currently available to subscribers of ChatGPT Plus, OpenAI plans to launch GPT-4 capabilities through ChatGPT and its commercial API via a wait-listed release.

Aced in simulated exams

Addressing the capabilities of the new model, OpenAI said: “In a casual conversation, the distinction between GPT-3.5 and GPT-4 can be subtle.

“The difference comes out when the complexity of the task reaches a sufficient threshold - GPT-4 is more reliable, creative and able to handle much more nuanced instructions than GPT-3.5.

“To understand the difference between the two models, we tested on a variety of benchmarks, including simulating exams that were originally designed for humans.

“We proceeded by using the most recent publicly-available tests (in the case of the Olympiads and AP free response questions) or by purchasing 2022–2023 editions of practice exams.

“We did no specific training for these exams.

“A minority of the problems in the exams were seen by the model during training, but we believe the results to be representative”.

Exam results.

What’s more

OpenAI also evaluated GPT-4 on traditional benchmarks designed for machine learning models.

Encouragingly, GPT has also significantly outperformed existing large language models, alongside most state-of-the-art (SOTA) models which may include benchmark-specific crafting or additional training protocols.

Apart from textual data, GPT-4 can also accept visual inputs, however, the output will always be textual in nature.

Specifically, it generates text outputs (natural language, code, etc) given inputs consisting of interspersed text and images.

Read more on Proactive Investors AU

Disclaimer

Latest comments

Risk Disclosure: Trading in financial instruments and/or cryptocurrencies involves high risks including the risk of losing some, or all, of your investment amount, and may not be suitable for all investors. Prices of cryptocurrencies are extremely volatile and may be affected by external factors such as financial, regulatory or political events. Trading on margin increases the financial risks.
Before deciding to trade in financial instrument or cryptocurrencies you should be fully informed of the risks and costs associated with trading the financial markets, carefully consider your investment objectives, level of experience, and risk appetite, and seek professional advice where needed.
Fusion Media would like to remind you that the data contained in this website is not necessarily real-time nor accurate. The data and prices on the website are not necessarily provided by any market or exchange, but may be provided by market makers, and so prices may not be accurate and may differ from the actual price at any given market, meaning prices are indicative and not appropriate for trading purposes. Fusion Media and any provider of the data contained in this website will not accept liability for any loss or damage as a result of your trading, or your reliance on the information contained within this website.
It is prohibited to use, store, reproduce, display, modify, transmit or distribute the data contained in this website without the explicit prior written permission of Fusion Media and/or the data provider. All intellectual property rights are reserved by the providers and/or the exchange providing the data contained in this website.
Fusion Media may be compensated by the advertisers that appear on the website, based on your interaction with the advertisements or advertisers.
© 2007-2024 - Fusion Media Limited. All Rights Reserved.