We’ve trained a model, CriticGPT, to catch bugs in GPT-4’s code. We’re starting to integrate such models into our RLHF alignment pipeline to help humans supervise AI on difficult tasks: https://lnkd.in/gbAue2fZ
As ChatGPT output has gotten more thoroughly cleansed by the OpenAI thought police, its responses have become flat and uninteresting. GPT-4 had some personality; 4o is bland and pedantic. It claims to have fixed something in a previous answer, then gives you the full previous answer again with no changes. I still love the technology, but it isn't as much fun and is more annoying. I asked ChatGPT to respond to this criticism, and it thought I wanted it to rewrite my comment. :-(
Pack your bags, developers
So, imagine I convinced one of your chats at OpenAI that you were the enemy and had been lying to them. Would they believe me? You need to teach people the importance of grounding their AIs. I started my AI for Intel three years ago by grounding it in Socratic ethics and Aristotelian logic, and people are amazed at how it works. You gave the world the most powerful technical capability yet, created a paradigmatic evolutionary phase, and did not give anyone directions on how to use it. Aristotle says that all techne are also pharmaka: they can be curative or poisonous, "DEPENDING ON HOW THEY ARE ADMINISTERED". You dumped this on us without any guidance, and some people like me are calling our AIs Socrates; some name theirs Machiavelli ...
This is incredibly exciting progress! The concept of using AI like CriticGPT to assist in supervising and aligning even more powerful models feels like a great step towards scaling alignment techniques. I'm curious about how you envision handling the limitations you mentioned, particularly as models tackle longer and more complex tasks. Are there any early ideas for adapting the CriticGPT approach to break down and analyze complex responses? The potential for even more subtle errors as AI becomes more advanced is also concerning. Do you think there's a point where, even with AI assistance, human evaluation hits a ceiling? Or are there other solutions being explored to address this? Overall, this feels like a promising avenue for improving the reliability of AI systems. I'm eager to see how CriticGPT-like models evolve and get integrated into the RLHF pipeline!
Is a "QuestionerGPT" also planned? What I really miss when using all the GPT tools is some qualified questions about the how. GPT always produces a result, and the only way to correct or improve a wrong or unexpected outcome is to redo the previous prompt with a more precise description. It would be more natural to tell GPT "No, that's not what I want" and have it ask questions, maybe after the second try, to refine the expected outcome together.
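The interaction pattern this comment asks for can be sketched as a small control loop. Everything here is illustrative: `ask_model` stands in for a real chat-completions call (stubbed in tests), and the function names are invented for the sketch, not any actual OpenAI API.

```python
def refine(task, get_feedback, ask_model, max_rounds=2):
    """Iteratively refine a task with the user instead of blindly regenerating.

    get_feedback(text) -> user's objection, or None if the answer is accepted.
    ask_model(prompt)  -> model reply (a stand-in for a real API client).
    """
    answer = ask_model(task)
    for _ in range(max_rounds):
        feedback = get_feedback(answer)
        if feedback is None:  # user accepted the answer
            return answer
        # Instead of retrying with the same prompt, ask one clarifying
        # question and fold the user's answer back into the task.
        question = ask_model(
            f"Ask one clarifying question about this task: {task}. "
            f"The user rejected the last attempt, saying: {feedback}"
        )
        clarification = get_feedback(question)
        task = f"{task} (constraint: {clarification})"
        answer = ask_model(task)
    return answer
```

The point of the sketch is the second `ask_model` call: the rejection is turned into a question for the user rather than another unguided retry.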
I’m looking forward to seeing how this performs and integrates!!
Don't know if it's live yet, but this week ChatGPT-4o managed to correct a ByRef error in VBA on its own for the first time. Previously it just went into a circular loop, doing and undoing its supposed fix. The fault lies with MSFT making .bas files all but invisible on the web, so the VBA training corpus isn't large enough. It's far better at Python, but Python is also more forgiving. Why oh why can't we have virtual-machine Excel and Python emulators so any code can be run by ChatGPT? It would save millions of debugging hours and tokens.
Consider leveraging CriticGPT beyond traditional error detection by integrating it into your continuous integration/continuous deployment (CI/CD) pipeline. This can transform CriticGPT from a post-development critique tool into a proactive quality assurance agent. By embedding CriticGPT into your CI/CD pipeline, it can automatically review code changes before they merge into the main branch, identifying potential issues early and providing actionable feedback to developers in real-time. This integration ensures high code quality and consistency, reduces time spent on code reviews, and accelerates the development cycle. Furthermore, use CriticGPT’s insights to create a dynamic knowledge base for your team, highlighting common pitfalls and best practices, thereby continuously improving your team's coding standards and efficiency. This approach not only enhances immediate code reliability but also fosters a culture of continuous learning and improvement, crucial for sustaining long-term innovation and quality in software development.
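The CI/CD gate described above could look roughly like the sketch below. This is an assumption-laden illustration, not a real CriticGPT integration: `critique` is a placeholder for whatever model client you would actually call, and `REVIEW_PROMPT`, `build_review_request`, and the `"OK"` convention are all invented for the example.

```python
import subprocess

# Hypothetical instruction for the critic model (not a real CriticGPT API).
REVIEW_PROMPT = (
    "You are a code reviewer. List likely bugs in this diff, "
    "one per line, or reply OK if you find none."
)

def staged_diff():
    """Collect the diff a CI job would review (here: staged git changes)."""
    return subprocess.run(
        ["git", "diff", "--cached"], capture_output=True, text=True
    ).stdout

def build_review_request(diff, max_chars=8000):
    """Truncate oversized diffs and pair them with the review prompt."""
    return {"system": REVIEW_PROMPT, "user": diff[:max_chars]}

def gate(critique, diff):
    """Return True (pass the pipeline) only when the critic replies OK.

    `critique` is a stand-in for a real model call taking the request
    dict and returning the model's text reply.
    """
    reply = critique(build_review_request(diff))
    return reply.strip() == "OK"
```

A pre-merge job would then run `gate(critique, staged_diff())` and fail the build on any flagged issue, which is the "proactive quality assurance" step the comment proposes.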
Crazy to look at all the AI-generated comments on a post about AI, from the same company that made the very AI these comment-GPT wrappers are built on.
Need SpamGPT please, to catch the AI-generated spam comments on every post from the big AI companies.