All posts tagged: DeepSeek

Anthropic Furious at DeepSeek for Copying Its AI Without Permission, Which Is Pretty Ironic When You Consider How It Built Claude in the First Place

Anthropic Furious at DeepSeek for Copying Its AI Without Permission, Which Is Pretty Ironic When You Consider How It Built Claude in the First Place

Chance Yeh/Getty Images for HubSpot Earlier this month, Google publicly griped that “commercially motivated” actors were trying to clone its Gemini AI through agents that queried the chatbot up to 100,000 times to “extract” the underlying model. The hypocrisy of Google’s accusations was palpable. For years, the search giant has relied on indiscriminately scraping the internet for content to train its AI models, without compensating copyright holders — and racking up lawsuits as a result. Now Anthropic has entered the fray. Unlike Google, the company behind chatbot Claude was willing to point fingers, accusing Chinese AI firms DeepSeek, Moonshot, and MiniMax of “distilling” its AI model. The company claimed in a new blog post that the accused firms created more than 24,000 fake accounts that queried Claude 16 million times, a “violation of our terms of service and regional access restrictions.” Distillation is essentially when a small “student” model is trained to replicate the performance of a much larger “teacher” model — a convoluted term essentially denoting the act of copying someone’s homework without express …

American AI Industry Trembles as Deepseek Prepares to Release New Model

American AI Industry Trembles as Deepseek Prepares to Release New Model

Illustration by Tag Hartman-Simkins / Futurism. Source: Getty Images When Chinese AI company DeepSeek released its cheap and serviceable V3 model early last year, it sent shockwaves throughout Silicon Valley and beyond, roiling the stock market, shaking political confidence in American AI, and stoking new fears from the ever-churlish China hawks. A year later, DeepSeek is preparing to launch its new V4 model — a development which could have major implications for US tech companies and the firms backing them. According to a CNBC bulletin, DeepSeek’s latest version is “expected to be imminent” given the release-schedule of previous versions. Depending on how impressive V4 is when it hits, the AI-heavy Nasdaq could be in for a major upset, as could the tech companies listed on it. Per CNBC, the Nasdaq composite fell 3 percent when DeepSeek V3 made its debut last year, and shares for the chip giant Nvidia plummeted 17 percent, wiping out $600 billion in a flash. While both recovered from the hits over time, it was a defining moment for DeepSeek, securing its …

Anthropic says DeepSeek, Moonshot, and MiniMax used 24,000 fake accounts to rip off Claude

Anthropic says DeepSeek, Moonshot, and MiniMax used 24,000 fake accounts to rip off Claude

Anthropic dropped a bombshell on the artificial intelligence industry Monday, publicly accusing three prominent Chinese AI laboratories — DeepSeek, Moonshot AI, and MiniMax — of orchestrating coordinated, industrial-scale campaigns to siphon capabilities from its Claude models using tens of thousands of fraudulent accounts. The San Francisco-based company said the three labs collectively generated more than 16 million exchanges with Claude through approximately 24,000 fake accounts, all in violation of Anthropic’s terms of service and regional access restrictions. The campaigns, Anthropic said, are the most concrete and detailed public evidence to date of a practice that has haunted Silicon Valley for months: foreign competitors systematically using a technique called distillation to leapfrog years of research and billions of dollars in investment. “These campaigns are growing in intensity and sophistication,” Anthropic wrote in a technical blog post published Monday. “The window to act is narrow, and the threat extends beyond any single company or region. Addressing it will require rapid, coordinated action among industry players, policymakers, and the global AI community.” The disclosure marks a dramatic escalation …

Anthropic accuses Chinese AI labs of mining Claude as US debates AI chip exports

Anthropic accuses Chinese AI labs of mining Claude as US debates AI chip exports

Anthropic is accusing three Chinese AI companies of setting up more than 24,000 fake accounts with its Claude AI model to improve their own models. The labs — DeepSeek, Moonshot AI, and MiniMax — allegedly generated more than 16 million exchanges with Claude through those accounts using a technique called “distillation.” Anthropic said the labs “targeted Claude’s most differentiated capabilities: agentic reasoning, tool use, and coding.” The accusations come amid debates over how strictly to enforce export controls on advanced AI chips, a policy aimed at curbing China’s AI development.  Distillation is a common training method that AI labs use on their own models to create smaller, cheaper versions, but competitors can use it to essentially copy the homework of other labs. OpenAI sent a memo to House lawmakers earlier this month accusing DeepSeek of using distillation to mimic its products.  DeepSeek first made waves a year ago when it released its open-source R1 reasoning model that nearly matched American frontier labs in performance at a fraction of the cost. DeepSeek is expected to soon …

Nvidia Aided DeepSeek AI Breakthrough With “Extensive Technical Support,” House China Chair Warns

Nvidia Aided DeepSeek AI Breakthrough With “Extensive Technical Support,” House China Chair Warns

Congressman John Moolenaar (R-MI), chairman of the House Select Committee on the CCP, penned a letter to Commerce Secretary Howard Lutnick that Nvidia provided extensive technical support to DeepSeek, enabling the startup to achieve chatbot performance breakthroughs despite US export controls on advanced AI chips to China to mitigate the risks of the technology falling into the hands of Beijing’s military. “While NVIDIA asserts its relationship with DeepSeek is “to promote the [AI] ecosystem flywheel and improve NVIDIA’s products,” documents produced to the Committee reveal NVIDIA provided extensive technical support that enabled DeepSeek—now integrated into People’s Liberation Army (PLA) systems and a demonstrated cybersecurity risk—to achieve frontier AI capabilities,” Moolenaar wrote in the letter sent to Lutnick’s office on Wednesday. Moolenaar continued, “These findings demonstrate why rigorous enforcement of the Department’s H200 export rule, which requires certification that chips will not serve military purposes, is essential—even if such enforcement effectively prevents H200 exports to the PRC altogether.” DeepSeek’s release sent shock waves through US markets last year – about this time – over risks that …

DeepSeek Engram Splits Recall from Reasoning for Faster LLMs

DeepSeek Engram Splits Recall from Reasoning for Faster LLMs

Are transformers really the pinnacle of AI innovation, or are they just an overengineered way to solve simple problems? Prompt Engineering explores how the innovative DeepSeek Engram challenges the dominance of transformer-based architectures by proposing a bold alternative: treating transformers as little more than expensive hashmaps. This provocative claim stems from Engram’s ability to separate straightforward recall tasks from complex reasoning, introducing a smarter, more efficient way to handle language model computation. By rethinking how tasks are processed, Engram not only reduces computational waste but also redefines what scalability and speed can look like in large language models. In this overview, we’ll break down the core innovations behind Engram, including its use of hash-based lookups for simple memory tasks and its context-aware gating mechanism for deeper reasoning. You’ll discover how this architecture minimizes GPU load, improves latency, and challenges the inefficiencies of traditional transformers. But it’s not all smooth sailing, Engram’s reliance on static lookup tables and potential hash collisions raises important questions about its adaptability and precision. Could this be the future of AI, …

The Race to Build the DeepSeek of Europe Is On

The Race to Build the DeepSeek of Europe Is On

Against that backdrop, Europe’s reliance on American-made AI begins to look more and more like a liability. In a worst case scenario, though experts consider the possibility remote, the US could choose to withhold access to AI services and crucial digital infrastructure. More plausibly, the Trump administration could use Europe’s dependence as leverage as the two sides continue to iron out a trade deal. “That dependency is a liability in any negotiation—and we are going to be negotiating increasingly with the US,” says Taddeo. The European Commission, White House, and UK Department for Science, Innovation and Technology did not respond to requests for comment. To hedge against those risks, European nations have attempted to bring the production of AI onshore, through funding programs, targeted deregulation, and partnerships with academic institutions. Some efforts have focused on building competitive large language models for native European languages, like Apertus and GPT-NL. For as long as ChatGPT or Claude continues to outperform Europe-made chatbots, though, America’s lead in AI will only grow. “These domains are very often winner-takes-all. When …

AI News : DeepSeek V4 Aims at Long Code & February Launch

AI News : DeepSeek V4 Aims at Long Code & February Launch

What if the future of artificial intelligence wasn’t just about innovation but about reshaping how we work, code, and communicate every single day? In this walkthrough, Universe of AI shows how the latest breakthroughs, like DeepSeek V4, GLM5, and Google’s Gemini-powered Gmail, are pushing the boundaries of what AI can achieve. From coding assistants that tackle impossibly complex tasks to email systems that feel almost human in their ability to organize and respond, these advancements are more than just upgrades, they’re redefining the relationship between humans and technology. But with this rapid progress comes a pressing question: are these technologies truly solving problems, or are they creating new ones? This deep dive explores how these innovative systems are transforming productivity and accessibility in ways that feel almost futuristic. You’ll discover how DeepSeek V4 is setting a new standard for coding AI, why Z.AI’s GLM5 could provide widespread access to access to advanced AI, and what makes Grok Build a fantastic option for developers. And let’s not forget Google’s Gemini-powered Gmail, which is turning inboxes into …