Large Language Models

Explore articles tagged with Large Language Models

What L’Oréal brings Maybelline virtual try-on to ChatGPT

L’Oréal has announced a collaboration with OpenAI that will bring Maybelline New York’s virtual makeup try-on feature into ChatGPT. The announcement was made at VivaTech 2026.

Jun 22, 2026

3 min read

#AI#Large Language Models#NLP

A startup claims it broke through a bottleneck that’s holding back LLMs

Miami-based AI startup Subquadratic came out of stealth mode last month with a huge claim. It announced that it had solved a mathematical bottleneck that had been holding back large language models for almost a decade.

Jun 20, 2026

3 min read

#AI#Large Language Models#NLP

The Download: AI bottleneck debates, and BCI trials take off

This is today’s edition of The Download, our weekday newsletter that provides a daily dose of what’s going on in the world of technology. A startup claims it broke through a bottleneck that’s holding back LLMs AI startup Subquadratic came out of stealth last month with a huge claim: it had solved a mathematical bottleneck….

Jun 19, 2026

3 min read

#AI#Large Language Models#NLP

Kimi K2.7-Code cuts thinking tokens 30% — but practitioners say the benchmarks don't check out

Moonshot AI released Kimi K2.7-Code this week, an open-source update to its K2 coding model family, claiming leaner reasoning and double-digit performance gains.

Jun 13, 2026

3 min read

#AI#Artificial Intelligence#ChatGPT

Surprise upset: GPT-5.5 beats Claude Fable 5 on brutal new Agents’ Last Exam benchmark

Researchers from the University of California, Berkeley's Center for Responsible, Decentralized Intelligence (RDI), alongside an advisory committee of over 300 domain experts, have launched Agents’ Last Exam (ALE)—a grueling new benchmark built to measure whether artificial intelligence can actually execute economically valuable, long-horizon professional workflows. In a shocking upset, OpenAI’s GPT-5.

Jun 11, 2026

3 min read

#AI#ChatGPT#OpenAI

Researchers trained an open source AI search agent, Harness-1, that outperforms GPT-5.4 on recalling relevant information

A joint research collaboration between researchers at the University of Illinois at Urbana-Champaign (UIUC), UC Berkeley, and the open source AI-native vector database platform Chroma unveiled Harness-1, a 20-billion parameter open-source search agent built atop OpenAI's gpt-oss-20B open source model that fundamentally redesigns how AI executes complex retrieval tasks. Harness-1 achieves a massive leap in performance, scoring 73% average on its ability to recall relevant information correctly f.

Jun 9, 2026

3 min read

#AI#Large Language Models#NLP

Walmart’s AI workflows meet the realities of the balance sheet

Walmart has reportedly begun limiting employees’ use of an internal AI assistant called Code Puppy after demands placed on the LLM backing the tool were higher than expected. Employees of Walmart were encouraged to use Code Puppy without any stricture or stipulations as to the limits of use, but Walmart is now assigning employees a […] The post Walmart’s AI workflows meet the realities of the balance sheet appeared first on AI News.

Jun 4, 2026

3 min read

#AI#Large Language Models#NLP

Latest: GitHub Copilot users see token-based price hikes

Since its announcement in April this year, the proposed changes to billing methods on GitHub Copilot were a source of much speculation: how much more or less would a pay-a-you-use AI cost an organisation or individual compared to a flat-rate, monthly subscription? Just a day into the changeover to token-based billing for the LLM-based service, […] The post GitHub Copilot users see token-based price hikes appeared first on AI News.

Jun 2, 2026

3 min read

#AI#ChatGPT#OpenAI

Claude Mythos exposed a hard truth: Your enterprise patching process is way too slow

In 2024, researchers from the University of Illinois found that GPT-4, when provided with a common vulnerabilities and exposures (CVE) description, could autonomously exploit 87% of a curated 15-vulnerability one-day dataset. Without the description, it could only exploit 7%.

Jun 1, 2026

3 min read

#AI#Large Language Models#NLP

AI agents are entering their rebuild era as enterprises confront the reliability problem

As enterprise AI agents move into production, organizations are confronting a growing reliability problem. Many teams are discovering that LLM performance alone does not determine whether agents succeed in production.

May 30, 2026

3 min read

#AI#Large Language Models#NLP

MIT's MeMo lets teams swap in a better LLM without retraining — and performance jumps 26%

Enabling LLMs to acquire new knowledge after training remains a major hurdle for enterprise AI — current solutions are either too expensive, too slow, or constrained by context window limits. MeMo, a framework from researchers at multiple universities, encodes new knowledge into a dedicated smaller memory model that operates separately from the main LLM.

May 30, 2026

3 min read

#AI#ChatGPT#OpenAI

How DeepSeek’s radical architecture is shattering Silicon Valley's token moat

DeepSeek’s announcement over the weekend that it has made its 75% price cut permanent on its flagship V4 Pro model is a disruptive assault on the capital-heavy business models of Silicon Valley’s frontier labs. The reduction on DeepSeek V4 Pro directly undercuts comparable Western models used as workhorses for enterprise production.

May 29, 2026

3 min read

#AI#ChatGPT#OpenAI

DeepSWE blows up the AI coding leaderboard, crowns GPT-5.5, and finds Claude Opus exploiting a benchmark loophole

For months, the leading AI coding benchmarks have told enterprise buyers a comforting but misleading story: the top models are all roughly the same. OpenAI's GPT-5 family, Anthropic's Claude Opus, and Google's Gemini Pro have clustered within a narrow band on Scale AI's SWE-Bench Pro leaderboard, making it nearly impossible for engineering leaders to determine which agent will actually perform best inside their codebases.

May 27, 2026

3 min read

#AI#Large Language Models#NLP

OpenAI co-founder Andrej Karpathy announces he's joining Anthropic

Andrej Karpathy, the influential 39-year-old Slovak-Canadian AI researcher and one of the original 11 co-founders of OpenAI, and former head of Tesla's AI division, announced on Tuesday, May 19 that he's joining rival lab Anthropic. As Karpathy posted from his account on the social network X: "Personal update: I've joined Anthropic.

May 20, 2026

3 min read

#AI#Large Language Models#NLP

Architectural patterns for graph-enhanced RAG: Moving beyond vector search in production

Retrieval-augmented generation (RAG) has become the de facto standard for grounding large language models (LLMs) in private data. The standard architecture — chunking documents, embedding them into a vector database, and retrieving top-k results via cosine similarity — is effective for unstructured semantic search.

May 18, 2026

3 min read

#AI#ChatGPT#OpenAI

Claude’s next enterprise battle is not models: it’s the agent control plane

New VB Pulse data shows Microsoft and OpenAI leading enterprise agent orchestration, but Anthropic’s first measurable foothold points to a larger fight over who controls the infrastructure where AI agents run. For the last two years, the enterprise AI race has mostly been framed as a model war: OpenAI’s GPT series versus Anthropic’s Claude versus Google’s Gemini, with smaller and open-source alternatives also coming in from the U.

May 16, 2026

3 min read

#AI#ChatGPT#OpenAI

Anthropic finally beat OpenAI in business AI adoption — but 3 big threats could erase its lead

For the first time since the AI race began, more American businesses are paying for Anthropic's Claude than for OpenAI's ChatGPT. Adoption of Anthropic rose 3.

May 14, 2026

3 min read

#AI#Large Language Models#NLP

How Sakana trained a 7B model to orchestrate GPT-5, Claude Sonnet 4 and Gemini 2.5 Pro

Every LangChain pipeline your team hardcodes starts breaking the moment the query distribution shifts — and it always shifts. That bottleneck is what Sakana AI set out to eliminate.

May 8, 2026

3 min read

105

#AI#Large Language Models#NLP

GPT-5.5 Instant shows you what it remembered — just not all of it

OpenAI updated the default model for ChatGPT to its new GPT-5.5 Instant, along with a new memory capability that finally shows which context shaped responses — at least some of them.

May 6, 2026

3 min read

#AI#ChatGPT#OpenAI

OpenAI turns its sold-out GPT-5.5 party into a monthlong Codex giveaway for 8,000 developers

OpenAI on Monday began emailing more than 8,000 developers who applied for its invite-only GPT-5.5 party with a surprise consolation prize: a tenfold increase in Codex rate limits on their personal ChatGPT accounts, effective immediately and lasting through June 5.

May 5, 2026

3 min read