General Discussion

usonian

(26,698 posts) Fri May 29, 2026, 12:10 PM 19 hrs ago

I predict "The end of monster AI as we know it"

You can already run LLM models on home computers. Who here remembers mainframe computers?

And who runs financial models on them instead of using a spreadsheet for 99% of such chores?

OpenClaw on Mac. Secure by default. In one click.
https://holaclaw.ai/

Bring your own API key from any cloud provider - or skip the cloud entirely.

Five starting points — each with its own voice, style, and strengths.

Do you really need billion dollar data centers destroying the environment and raising electric costs for this upcoming cooling season?

And regarding those monster data centers:

How DeepSeek’s radical architecture is shattering Silicon Valley's token moat
https://venturebeat.com/infrastructure/how-deepseeks-radical-architecture-is-shattering-silicon-valleys-token-moat

This comes at a time when the closed Western labs, in particular OpenAI and Anthropic, face an intense return-on-investment scrutiny for their multi-billion dollar general-purpose hardware infrastructure investments.

DeepSeek’s announcement over the weekend that it has made its 75% price cut permanent on its flagship V4 Pro model is a disruptive assault on the capital-heavy business models of Silicon Valley’s frontier labs.

The reduction on DeepSeek V4 Pro directly undercuts comparable Western models used as workhorses for enterprise production. It is 7x cheaper on inputs and 17x cheaper on outputs than Anthropic’s Claude Sonnet or OpenAI’s GPT 5.5-Med, while the lightweight DeepSeek V4 Flash undercuts entry-tier alternatives like Claude Haiku by 10x to 25x.

snip

The price cuts are enabled by a series of hardware-software innovations, especially around cache, that make DeepSeek's models radically more efficient to run. When hosted natively in China, DeepSeek’s cache-read pricing is a whopping 87x cheaper than Western clouds — a deflationary floor so aggressive that handset giant Xiaomi just moved to match the exact pricing tier for its newly deployed MiMo architecture.

DeepSeek V4 Pro’s performance is ranked almost on par with Western frontier models, hitting 80.6% on coding-agent tasks via the SWE-bench Verified leaderboard and an elite reasoning score of 87.5 on the advanced MMLU-Pro technical index. Both V4 Pro and V4 Flash — a hyper-optimized speedy version for developers — are open-weight and issued under a permissive MIT license. This gives enterprises complete flexibility over deployment. This dual-model strategy allows technical teams to route their heaviest, multi-step autonomous agent workloads to the lightning-fast Flash model, while reserving the heavy Pro model for deep reasoning tasks, drastically lowering costs at a time when budget concerns have grown considerably.

Brute Force, The American Way?

It comes at a cost.

When "leaders" are just money-grubbing assholes.

4 replies

= new reply since forum marked as read

Highlight:

I predict "The end of monster AI as we know it" (Original Post) usonian 19 hrs ago OP

Agree with the OP, AI will become increasingly useless over the long run bucolic_frolic 19 hrs ago #1

The Agentic Economy Celerity 19 hrs ago #2

Imagine instead a future where every consumer has an assistant agent usonian 18 hrs ago #3

I have only used AI searches a handful of times, ... Straw Man 18 hrs ago #4

bucolic_frolic

(55,982 posts)

1. Agree with the OP, AI will become increasingly useless over the long run

Reply to usonian (Original post)

Fri May 29, 2026, 12:21 PM

19 hrs ago

Knowledge becomes more widely known, companies create proprietary methods. AI won't help as much, and it won't be worth paying a lot for.

Celerity

(55,039 posts)

2. The Agentic Economy

Reply to bucolic_frolic (Reply #1)

Fri May 29, 2026, 12:25 PM

19 hrs ago

The architecture of agentic communication will determine the extent to which generative AI democratizes access to economic opportunity.

https://cacm.acm.org/opinion/the-agentic-economy/

Generative AI has revolutionized the way we interact with technology, allowing people to express their intent in free-form natural language. It has paved the way for AI agents that not only converse with users but also perform actions on their behalf, flexibly and with minimal guidance. Delegation to AI has already begun to improve the efficiency of individual processes, making both consumers and businesses more productive in the set of tasks they had already been doing. However, we believe that the more disruptive—and yet to be realized—impact of generative AI is its potential to drastically reduce the communication frictions between and among consumers and businesses. This could lead to a reorganization of markets, shifts of market power, and the introduction of entirely new products and services.

Consumers have traditionally faced high communication costs when initiating relationships with businesses, reducing efficiency. For example, a consumer seeking a new tax preparer might hesitate to switch because she would have to explain her financial situation all over again to a new person or online service. These communication hurdles can prevent consumers from taking advantage of better products and services or lower prices. Businesses have tried to lower these costs with tools like online forms and voicemail menus, but these often just shift communication costs to the consumer and can make interactions more rigid.

Imagine instead a future where every consumer has an assistant agent to communicate their preferences and personal information to businesses, and every business has service agents to interact with consumers and other businesses. These agents could be designed to interface with each other seamlessly and flexibly, transforming the landscape of consumer-business interactions. Delegating interactions to such assistant and service agents lowers communication costs and makes markets more efficient by expanding the range of options available to both consumers and businesses.

To unlock the full economic potential of generative AI’s communication capabilities, two developments are necessary. First, consumers and businesses must widely adopt assistant and service agents. This is already under way. Second, these agents must be designed to interact seamlessly with each other to facilitate transactions. On the technical front, there has been significant progress in standardizing such agentic interaction, with frameworks such as Microsoft’s AutoGen, and protocols such as Anthropic’s Model Context Protocol and Google’s Agent2Agent Protocol. However, it remains to be seen how these advances will be adopted and implemented, or constrained, given their complex interplay with and dependence on market forces.

snip

The Financial vs. Technological Economy

My thoughts on the AI bubble

Sinéad Bovell

https://sineadbovell.substack.com/p/the-financial-vs-technological-economy

The Two Economies Inside the AI Boom

On Wednesday, Nvidia ( the company at the heart of the AI infrastructure buildout) reported earnings that beat expectations. Revenue was strong, data center demand remained robust, and the company’s position at the center of the AI buildout was reaffirmed. The stock initially rallied in after-hours trading. Then it reversed and sold off sharply. On the surface, this doesn’t make sense. Strong results should lead to optimism about the future. But that’s not what happened. Instead, good earnings seemed to make investors more nervous, not less. Here’s what I think is actually going on: we’re watching two economies operate inside one market, and they’re moving on completely different timeframes.

What the Market Is Pricing

Stock prices reflect belief about future earnings. Right now, current earnings are strong, but belief in future earnings is weakening. That’s unusual. Normally when a company beats expectations, the market becomes more confident about what comes next. But with Nvidia—and really, across the entire AI infrastructure stack—something else is happening. The market is rightfully questioning whether this pace of investment can continue without interruption. It’s pricing in the risk of a slowdown, a digestion period, maybe even a full correction.

What the Technology Is Doing

But then there is the other economy, the technological one. And it’s operating on a completely different clock. If AI is a general purpose technology—and I believe it is—then we’re not looking at a two-year story. We’re looking at a 10-to-20-year transformation. General purpose technologies reshape economies slowly, then all at once. Electricity took nearly two decades before factories were redesigned from the ground up. The internet took almost 15 years before it became indispensable.

Two Timeframes, One Market

The financial economy is focused on quarters and fiscal years. It’s asking: when does spending slow? When do returns materialize? When does the infrastructure buildout hit its natural limit? These are legitimate questions. The technological economy is focused much further into the future. Companies are asking: what happens when this infrastructure is fully utilized? What happens when adoption moves from 10 percent to 50 percent? What does the world look like when AI is embedded in every workflow, every product, every decision? Both can be true. The market might be right about the next 12 to 24 months. And the technology economy might be right about the next 15 years. But why does this nuance matter for us? In our latest episode of I’ve Got Questions, I break down the bubble indicators I’m watching, where the risks are concentrated, and why—even if we see a major correction—this is exactly the moment to pay more attention to AI, not less.

snip

usonian

(26,698 posts)

3. Imagine instead a future where every consumer has an assistant agent

Reply to Celerity (Reply #2)

Fri May 29, 2026, 12:56 PM

18 hrs ago

Think of the things that people used to do that we are "automating" now.

After a century or more of auto development, we still can't replace a carpool driver. (and live)

But we can precision kill over 140 school children in two shots (because one shot didn't kill enough humans)

In everyday life, for almost a century now, the cellphone has been really transformative in a way that "people" couldn't do. It takes just one frightening evening of missing your Mom at the airport arrivals. IIRC, she took a bus to my home.

I will check out your points in detail later. I've been house-sitting and dog-sitting for about a week and it's about time to pack up for the non-automated long drive home, from picking up doody (not automated) and watering plants (also not automated, at least here).

I have found that complexity often makes life more complex rather than simpler, but heck, I made a living off technology, so I can retire to mostly play music (instead of streaming it in, the real stuff, not algorithmic) and take photos of the gorgeous scenery (the real kind) and I do use film now and then.

"Being There" is the greatest part of the fun.

Later ...

Straw Man

(6,958 posts)

4. I have only used AI searches a handful of times, ...

Reply to usonian (Original post)

Fri May 29, 2026, 01:40 PM

18 hrs ago

... and mostly to find arcane bits of trivia. The responses I get have been predominantly wrong, provably so. Not only that, but successive searches bring up contradictory information. I'm not seeing the value at this point.

Reply to this discussion