Openai Proposes A Second Neural Net To Catch Chatgpt's Code Mistakes

SERVIDORES

critigpt-points-out-chatgpts-flaw-june-2024 — CriticGPT is a neural net-based AI model that critiques code created by ChatGPT and points out bugs in the code.

OpenAI

The problem of hallucinations -- artificial intelligence (AI) models that assert falsehoods under a veneer of being authoritative -- has led some scholars to conclude that generative AI simply cannot detect nor correct its errors.

In a paper last October, researchers at Google's DeepMind argued that "LLMs are not yet capable of self-correcting their reasoning."

Also:If AI is so amazing, why does ChatGPT meltdown over this simple image edit task?

However, ChatGPT creator OpenAI disagrees with this assertion -- and last week the firm offered a version of GPT-4, called CriticGPT, that it claims can help find and correct mistakes to improve the overall accuracy of the model.

The results are encouraging for human teams who clean up code assisted by AI. However, the results also suggest there's no getting around hallucinations from the bots doing the helping.

Also: Generative AI can't find its own errors. Do we need better prompts?

The setting for CriticGPT is programming code writing: the researchers propose CriticGPT as a second neural net that caches the occasions when ChatGPT makes mistakes in the code it generates.

They focus on code writing because, as they put it, computer code is "crisp" -- it has clear right and wrong answers. Also, OpenAI as an organization hopes to use generative AI as "an alignment research assistant", to automate some of the establishment of guardrails for the emerging technology. Code-writing is already a big user of generative AI, so it's a valuable target to go after.

In the paper posted on the arXiv pre-print server, "LLM Critics Help Catch LLM Bugs," lead author Nat McAleese of OpenAI and colleagues describe what they call, "the first demonstration of a simple scalable oversight method that helps humans more comprehensively spot problems in real-world RLHF data."

RLHF (reinforcement learning from human feedback) refers to a well-known practice of subjecting chatbots to responses from humans to make their output more acceptable. It's one of the ways OpenAI and others have established guardrails to try and prevent unwanted behavior.

In this case, CriticGPT is subjected to the feedback of human contract programmers who review CriticGPT's generated critiques of programming code. The humans rate the generated critics for their relevance, specificity, comprehensiveness, and more. CriticGPT is trained to refine critiques based on human feedback to approach a higher approval score.

Also: Is AI lying to us? These researchers built an LLM lie detector of sorts to find out

However, McAleese and team took an extra step. They stuck in some deliberate bugs in the code CriticGPT reviews by having some human contractors deliberately insert mistakes. The researchers wanted the contractors to explain their bugs and for CriticGPT to absorb those explanations and learn to associate bugs with explanations.

The hope was that CriticGPT would improve as it produces descriptions of bugs that approach what the human contractors have written about already-known bugs.

The result of the training, write McAleese and team, is that ChatGPT finds more bugs than human code reviewers. CriticGPT "greatly improves the rate at which inserted bugs are caught, with both LLM critics (prompted ChatGPT and CriticGPT) catching many more bugs than the human annotators," they write.

They note even the human contractors prefer what the machine generates in code analysis versus what their fellow humans write.

"Critiques written by CriticGPT are substantially preferred by contractors over critiques from prompted ChatGPT and over human-written critiques sourced from our group of contractors according to the overall rating."

The AI model helps human contractors to make their bug critiques richer, a kind of AI-augments-humans result that should please everyone: "Human+CriticGPT teams write substantially more comprehensive critiques than humans alone and that CriticGPT improves comprehensiveness over ChatGPT on both human detected and inserted bugs."

As the authors write in a companion blog post, "CriticGPT's suggestions are not always correct, but we find that they can help trainers to catch many more problems with model-written answers than they would without AI help."

Also: Can AI code? In baby steps only

But there is a catch. Just as ChatGPT and various AI models can "hallucinate" incorrect statements, it turns out that CriticGPT can also claim to identify bugs that aren't there.

"We do find, however, that the rate of nitpicks and hallucinated bugs is much higher for models than for humans, though CriticGPT is able to substantially reduce this rate over ChatGPT," they write.

criticgpt-hallucinated-bug — CriticGPT hallucinating a bug in a human's code.

OpenAI

That's a dilemma: the better the AI model is at catching bugs, the more it seems to hallucinate bugs: "Unfortunately, it is not obvious what the right tradeoff between hallucinations and bug detection is for an overall RLHF system that uses critiques to enhance model performance."

And it's not easy to find the middle ground, they note, because, "An ideal experiment would run entirely separate critique-enhanced RLHF data collection loops for each precision/recall point; but this is prohibitively expensive."

In the breach, McAleese and team hit upon a compromise. Force Sampling Beam Search tries to lift the most valuable of CriticGPT's critiques while minimizing the number of spurious critiques.

Among the potential pitfalls of OpenAI's approach is that the training of Critic GPT is built upon humans inserting deliberate bugs. That approach, write McAleese and team, differs from the distribution of natural LLM errors.

"Training models to insert subtle in-distribution problems (as opposed to paying humans to insert bugs) may be able to mitigate this concern, but we leave such directions to future work."

Also: From AI trainers to ethicists: AI may obsolete some jobs but generate new ones

Hence, the problem will always revolve around how to bootstrap the automation without having some human help.

Another issue -- and one not mentioned by the authors -- is that, as with all things OpenAI, neither the new CriticGPT model nor its training data are publicly available: it's all closed, there's no source code for examination, no data sets that others can download. That closure means there is little to no way for outside ethics or security experts to vet the corrections made by the CriticGPT model.

With no oversight from any party outside OpenAI, the saying goes, who will watch the watchers?

Artificial Intelligence

Transparency is sorely lacking amid growing AI interest
What is a Chief AI Officer, and how do you become one?
How Adobe manages AI ethics concerns while fostering creativity
6 ways OpenAI just supercharged ChatGPT for free users

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

SERVIDORES

NOTÍCIAS QUENTES

S5735-L24T4X-A1 Empowers Installers with Scalable, Reliable, and Efficient Network Access

Best Ethernet Switches for Business (2025): Selection Guide and Top Picks

Huawei S5735-L24T4S-A1: A Compact, Stackable Access Switch Built for the Future

Huawei S5735-L24T4S-A: High-Performance Stacking Meets Zero-Noise Deployment

S5735-L24P4XE-A-V2: Huawei’s Smart Choice for High-Density Campus Deployments

S5735-L24P4X-A1: Huawei’s High-Performance Access Switch Redefining Campus Networking

Huawei S5735-L24P4S-A1 Review: Reliable Gigabit Access with Enterprise-Grade Features

What Is an Orthogonal Architecture?

Huawei s5735-l24p4s-a-v2 Delivers Scalable, Secure, and Smart PoE Access for Modern IT Infrastructures

Huawei S5735-L48T4XE-A-V2 Switch Delivers Enterprise-Grade Performance in a Compact Design

Huawei S5735-L48P4XE-A-V2 Review: Versatile Campus Switch with iStack and Full L3 Support

Differences Between Huawei CE Series and S Series Switches

Huawei CloudEngine S5735 Switches Set the Benchmark for High-Performance, Energy-Efficient Switching

Huawei CloudEngine S5731‑S48P4X Datasheet

Huawei CloudEngine S5731‑S24P4X Datasheet

Huawei S5731-S Empowers Next-Generation Campus Networks with Advanced Capabilities

Huawei S5731-H24P4XC Switch Review: Power-Packed Performance and Smart PoE

Huawei S5731-H Series Switches Redefine Campus Networking with Intelligent High-Performance Architecture

Top Features of the Huawei S5731-S24T4X: The Ultimate Gigabit Access Switch for Modern Networks

General Power Module Fault Location Procedure (CE8800 & 7800 & 6800 & 5800)

How Do I Split a Stack? How to clear the stacking configuration?

Huawei CloudEngine S5731 Datasheet

Huawei CloudEngine S5731-S24P4X: Powerful Enterprise-Grade Switch Explained

Huawei S5731-S48T4X Review: Powerful Enterprise Switch for High-Speed Networking

Why are network cables limited to 100 meters?

Huawei S5731-S32ST4X: Powerful, Enterprise-Ready Gigabit Switch with Advanced Capabilities

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

OpenAI proposes a second neural net to catch ChatGPT's code mistakes

Artificial Intelligence

Tags quentes : Inovação

Ordering Guide

Recursos

Quem somos

Cisco Price, Dell Price, Huawei Price, ZTE HPE Fortinet Switch Router Server At Low Price

SERVIDORES

NOTÍCIAS QUENTES

S5735-L24T4X-A1 Empowers Installers with Scalable, Reliable, and Efficient Network Access

Best Ethernet Switches for Business (2025): Selection Guide and Top Picks

Huawei S5735-L24T4S-A1: A Compact, Stackable Access Switch Built for the Future

Huawei S5735-L24T4S-A: High-Performance Stacking Meets Zero-Noise Deployment

S5735-L24P4XE-A-V2: Huawei’s Smart Choice for High-Density Campus Deployments

S5735-L24P4X-A1: Huawei’s High-Performance Access Switch Redefining Campus Networking

Huawei S5735-L24P4S-A1 Review: Reliable Gigabit Access with Enterprise-Grade Features

What Is an Orthogonal Architecture?

Huawei s5735-l24p4s-a-v2 Delivers Scalable, Secure, and Smart PoE Access for Modern IT Infrastructures

Huawei S5735-L48T4XE-A-V2 Switch Delivers Enterprise-Grade Performance in a Compact Design

Huawei S5735-L48P4XE-A-V2 Review: Versatile Campus Switch with iStack and Full L3 Support

Differences Between Huawei CE Series and S Series Switches

Huawei CloudEngine S5735 Switches Set the Benchmark for High-Performance, Energy-Efficient Switching

Huawei CloudEngine S5731‑S48P4X Datasheet

Huawei CloudEngine S5731‑S24P4X Datasheet

Huawei S5731-S Empowers Next-Generation Campus Networks with Advanced Capabilities

Huawei S5731-H24P4XC Switch Review: Power-Packed Performance and Smart PoE

Huawei S5731-H Series Switches Redefine Campus Networking with Intelligent High-Performance Architecture

Top Features of the Huawei S5731-S24T4X: The Ultimate Gigabit Access Switch for Modern Networks

General Power Module Fault Location Procedure (CE8800 & 7800 & 6800 & 5800)

How Do I Split a Stack? How to clear the stacking configuration?

Huawei CloudEngine S5731 Datasheet

Huawei CloudEngine S5731-S24P4X: Powerful Enterprise-Grade Switch Explained

Huawei S5731-S48T4X Review: Powerful Enterprise Switch for High-Speed Networking

Why are network cables limited to 100 meters?

Huawei S5731-S32ST4X: Powerful, Enterprise-Ready Gigabit Switch with Advanced Capabilities

Huawei S5731-H48T4XC Review: High-Performance Switching for Modern IT Infrastructures

Huawei S5731-H48P4XC: Comprehensive Overview

Common display Commands for Huawei Devices

Stacking Card Stacking vs. Service Port Stacking: Application Scenarios for the Two Switch Stacking Methods

OpenAI proposes a second neural net to catch ChatGPT's code mistakes

Artificial Intelligence

Tags quentes : Inovação

Ordering Guide

Recursos

Quem somos

Huawei CloudEngine S5731‑S48P4X Datasheet