All posts tagged: GPT5.3

ChatGPT Gets GPT-5.3 Instant Update With Less ‘Cringe,’ Fewer Hallucinations

ChatGPT Gets GPT-5.3 Instant Update With Less ‘Cringe,’ Fewer Hallucinations

OpenAI today updated its most popular ChatGPT model, debuting GPT-5.3 Instant. GPT-5.3 Instant is supposed to provide more accurate answers and better contextualized results when searching the web. The update also cuts down on unnecessary dead ends, caveats, and overly declarative phrasing, plus it has fewer hallucinations. According to OpenAI, it tweaked the Instant model to address complaints about tone, relevance, and conversational flow, which are issues that don’t show up in benchmarks. GPT-5.2 Instant had a “cringe” tone that could be overbearing or make unsubstantiated assumptions about user intent or emotions. The new model will have a more natural conversational style and will cut back on dramatic phrases like “Stop. Take a breath.” Users found that GPT-5.2 Instant would refuse questions it should have been able to answer, or respond in ways that felt overly cautious around sensitive topics. GPT-5.3 Instant cuts down on refusals and tones down overly defensive or moralizing preambles when answering a question. The model will no longer “over-caveat” after assuming bad intent from the user. GPT-5.3 Instant also provides …

ChatGPT’s new GPT-5.3 Instant model will stop telling you to calm down

ChatGPT’s new GPT-5.3 Instant model will stop telling you to calm down

Take a breath, stop spiraling. You’re not crazy, you’re just stressed. And honestly, that’s okay. If you felt immediately triggered reading these words, you’re probably also sick of ChatGPT constantly talking to you as if you’re in some sort of crisis and need delicate handling. Now, things may be improving. OpenAI says its new model, GPT-5.3 Instant, will reduce the “cringe” and other “preachy disclaimers.” According to the model’s release notes, the GPT-5.3 update will focus on the user experience, including things like tone, relevance, and conversational flow — areas that may not show up in benchmarks, but can make ChatGPT feel frustrating, the company said. Or, as OpenAI put it on X, “We heard your feedback loud and clear, and 5.3 Instant reduces the cringe.” In the company’s example, it showed the same query with responses from the GPT-5.2 Instant model compared with the GPT-5.3 Instant model. In the former, the chatbot’s response starts, “First of all — you’re not broken,” a common phrase that’s been getting under everyone’s skin lately. In the updated …

GPT-5.3 Instant cuts hallucinations by 26.8% as OpenAI shifts focus from speed to accuracy

GPT-5.3 Instant cuts hallucinations by 26.8% as OpenAI shifts focus from speed to accuracy

OpenAI’s GPT-5.3 Instant — the company’s most widely used model — reduces hallucinations by up to 26.8% compared to its predecessor, prioritizing accuracy and conversational reliability over raw performance gains, OpenAI says. GPT-5.3 Instant, which is essentially the default and is the most used model for ChatGPT users, also improves on tone, relevance and conversation with fewer refusals. It is available on both ChatGPT and on the API.  Right now, only the Instant model will be upgraded to 5.3, but the company said it is working on updating the other models under ChatGPT, Thinking, and Pro to 5.3 “soon.”  GPT-5.3 Instant cuts hallucinations by up to 26.8% OpenAI ran two internal evaluations: one across higher-stakes domains including medicine, finance, and law; the other drawing on user feedback. Based on higher-stakes evaluations conducted by the company, GPT-5.3 Instant reduces hallucinations by 26.8% when using the web. It improves reliability by 19.7% when relying on its internal knowledge. User feedback showed a 22.5% decrease in hallucinations when answering queries using web search.  The company said GPT-5.3 Instant …

Claude Opus 4.6 vs GPT-5.3 Codex for AI Coding Workflows

Claude Opus 4.6 vs GPT-5.3 Codex for AI Coding Workflows

Can a single AI model truly balance speed, precision, and adaptability, or are trade-offs inevitable? Greg Isenberg takes a closer look at how Claude Opus 4.6 vs GPT-5.3 Codex tackle this question, offering a detailed comparison of two of the most advanced coding-focused AI systems available today. With Claude Opus 4.6 emphasizing multi-agent orchestration for large-scale projects and GPT-5.3 Codex excelling in lightning-fast prototyping and interactive refinement, this analysis provide more insights into their contrasting approaches to AI-driven development and what they mean for developers navigating modern workflows. This breakdown highlights the unique strengths and limitations of each system, including their performance in building a competitor to Poly Market, a prediction market platform. Whether you’re intrigued by the precision and depth of Claude Opus 4.6 or the speed and adaptability of GPT-5.3 Codex, understanding their differences can illuminate which aligns better with your goals. Beyond technical capabilities, this exploration examines how these models integrate into real-world applications, offering a glimpse into the evolving landscape of AI-assisted coding and its impact on the future of software …

Opus 4.6 vs GPT-5.3 Reset Enterprise Expectations

Opus 4.6 vs GPT-5.3 Reset Enterprise Expectations

Are you feeling overwhelmed by the breakneck pace of AI advancements? You’re not alone. In this breakdown, Prompt Engineering walks through how the latest models, Claude Opus 4.6 and GPT-5.3 Codex, are pushing the boundaries of what’s possible in enterprise and technical workflows. With features like a staggering 1-million-token context window and collaborative agent teams, these models aren’t just upgrades; they’re a glimpse into the future of professional problem-solving. But as exciting as these innovations are, they also raise a critical question: how do we keep up with AI that’s evolving faster than most industries can adapt? This guide unpacks the unique strengths and trade-offs of these innovative systems, offering a closer look at how they’re reshaping industries like software development, research, and finance. Whether you’re curious about Claude Opus 4.6’s ability to process vast datasets or ChatGPT-5.3 Codex’s dominance in coding and debugging, you’ll discover insights that could redefine how you approach complex tasks. With AI now capable of handling workflows that once required entire teams, the implications are as thrilling as they are …