General Discussion
Related: Editorials & Other Articles, Issue Forums, Alliance Forums, Region ForumsI predict "The end of monster AI as we know it"
You can already run LLM models on home computers. Who here remembers mainframe computers?
And who runs financial models on them instead of using a spreadsheet for 99% of such chores?
OpenClaw on Mac. Secure by default. In one click.
https://holaclaw.ai/
Bring your own API key from any cloud provider - or skip the cloud entirely.
Five starting points each with its own voice, style, and strengths.

Do you really need billion dollar data centers destroying the environment and raising electric costs for this upcoming cooling season?
And regarding those monster data centers:
How DeepSeeks radical architecture is shattering Silicon Valley's token moat
https://venturebeat.com/infrastructure/how-deepseeks-radical-architecture-is-shattering-silicon-valleys-token-moat
This comes at a time when the closed Western labs, in particular OpenAI and Anthropic, face an intense return-on-investment scrutiny for their multi-billion dollar general-purpose hardware infrastructure investments.
The reduction on DeepSeek V4 Pro directly undercuts comparable Western models used as workhorses for enterprise production. It is 7x cheaper on inputs and 17x cheaper on outputs than Anthropics Claude Sonnet or OpenAIs GPT 5.5-Med, while the lightweight DeepSeek V4 Flash undercuts entry-tier alternatives like Claude Haiku by 10x to 25x.
snip
The price cuts are enabled by a series of hardware-software innovations, especially around cache, that make DeepSeek's models radically more efficient to run. When hosted natively in China, DeepSeeks cache-read pricing is a whopping 87x cheaper than Western clouds a deflationary floor so aggressive that handset giant Xiaomi just moved to match the exact pricing tier for its newly deployed MiMo architecture.
DeepSeek V4 Pros performance is ranked almost on par with Western frontier models, hitting 80.6% on coding-agent tasks via the SWE-bench Verified leaderboard and an elite reasoning score of 87.5 on the advanced MMLU-Pro technical index. Both V4 Pro and V4 Flash a hyper-optimized speedy version for developers are open-weight and issued under a permissive MIT license. This gives enterprises complete flexibility over deployment. This dual-model strategy allows technical teams to route their heaviest, multi-step autonomous agent workloads to the lightning-fast Flash model, while reserving the heavy Pro model for deep reasoning tasks, drastically lowering costs at a time when budget concerns have grown considerably.
Brute Force, The American Way?
It comes at a cost.
When "leaders" are just money-grubbing assholes.
bucolic_frolic
(55,982 posts)Knowledge becomes more widely known, companies create proprietary methods. AI won't help as much, and it won't be worth paying a lot for.
Celerity
(55,039 posts)https://cacm.acm.org/opinion/the-agentic-economy/
Generative AI has revolutionized the way we interact with technology, allowing people to express their intent in free-form natural language. It has paved the way for AI agents that not only converse with users but also perform actions on their behalf, flexibly and with minimal guidance. Delegation to AI has already begun to improve the efficiency of individual processes, making both consumers and businesses more productive in the set of tasks they had already been doing. However, we believe that the more disruptiveand yet to be realizedimpact of generative AI is its potential to drastically reduce the communication frictions between and among consumers and businesses. This could lead to a reorganization of markets, shifts of market power, and the introduction of entirely new products and services.
Consumers have traditionally faced high communication costs when initiating relationships with businesses, reducing efficiency. For example, a consumer seeking a new tax preparer might hesitate to switch because she would have to explain her financial situation all over again to a new person or online service. These communication hurdles can prevent consumers from taking advantage of better products and services or lower prices. Businesses have tried to lower these costs with tools like online forms and voicemail menus, but these often just shift communication costs to the consumer and can make interactions more rigid.
Imagine instead a future where every consumer has an assistant agent to communicate their preferences and personal information to businesses, and every business has service agents to interact with consumers and other businesses. These agents could be designed to interface with each other seamlessly and flexibly, transforming the landscape of consumer-business interactions. Delegating interactions to such assistant and service agents lowers communication costs and makes markets more efficient by expanding the range of options available to both consumers and businesses.
To unlock the full economic potential of generative AIs communication capabilities, two developments are necessary. First, consumers and businesses must widely adopt assistant and service agents. This is already under way. Second, these agents must be designed to interact seamlessly with each other to facilitate transactions. On the technical front, there has been significant progress in standardizing such agentic interaction, with frameworks such as Microsofts AutoGen, and protocols such as Anthropics Model Context Protocol and Googles Agent2Agent Protocol. However, it remains to be seen how these advances will be adopted and implemented, or constrained, given their complex interplay with and dependence on market forces.
snip
The Financial vs. Technological Economy
My thoughts on the AI bubble
Sinéad Bovell
https://sineadbovell.substack.com/p/the-financial-vs-technological-economy
The Two Economies Inside the AI Boom
On Wednesday, Nvidia ( the company at the heart of the AI infrastructure buildout) reported earnings that beat expectations. Revenue was strong, data center demand remained robust, and the companys position at the center of the AI buildout was reaffirmed. The stock initially rallied in after-hours trading. Then it reversed and sold off sharply. On the surface, this doesnt make sense. Strong results should lead to optimism about the future. But thats not what happened. Instead, good earnings seemed to make investors more nervous, not less. Heres what I think is actually going on: were watching two economies operate inside one market, and theyre moving on completely different timeframes.
What the Market Is Pricing
Stock prices reflect belief about future earnings. Right now, current earnings are strong, but belief in future earnings is weakening. Thats unusual. Normally when a company beats expectations, the market becomes more confident about what comes next. But with Nvidiaand really, across the entire AI infrastructure stacksomething else is happening. The market is rightfully questioning whether this pace of investment can continue without interruption. Its pricing in the risk of a slowdown, a digestion period, maybe even a full correction.
What the Technology Is Doing
But then there is the other economy, the technological one. And its operating on a completely different clock. If AI is a general purpose technologyand I believe it isthen were not looking at a two-year story. Were looking at a 10-to-20-year transformation. General purpose technologies reshape economies slowly, then all at once. Electricity took nearly two decades before factories were redesigned from the ground up. The internet took almost 15 years before it became indispensable.
Two Timeframes, One Market
The financial economy is focused on quarters and fiscal years. Its asking: when does spending slow? When do returns materialize? When does the infrastructure buildout hit its natural limit? These are legitimate questions. The technological economy is focused much further into the future. Companies are asking: what happens when this infrastructure is fully utilized? What happens when adoption moves from 10 percent to 50 percent? What does the world look like when AI is embedded in every workflow, every product, every decision? Both can be true. The market might be right about the next 12 to 24 months. And the technology economy might be right about the next 15 years. But why does this nuance matter for us? In our latest episode of Ive Got Questions, I break down the bubble indicators Im watching, where the risks are concentrated, and whyeven if we see a major correctionthis is exactly the moment to pay more attention to AI, not less.
snip
usonian
(26,698 posts)Think of the things that people used to do that we are "automating" now.
After a century or more of auto development, we still can't replace a carpool driver. (and live)
But we can precision kill over 140 school children in two shots (because one shot didn't kill enough humans)
In everyday life, for almost a century now, the cellphone has been really transformative in a way that "people" couldn't do. It takes just one frightening evening of missing your Mom at the airport arrivals. IIRC, she took a bus to my home.
I will check out your points in detail later. I've been house-sitting and dog-sitting for about a week and it's about time to pack up for the non-automated long drive home, from picking up doody (not automated) and watering plants (also not automated, at least here).
I have found that complexity often makes life more complex rather than simpler, but heck, I made a living off technology, so I can retire to mostly play music (instead of streaming it in, the real stuff, not algorithmic) and take photos of the gorgeous scenery (the real kind) and I do use film now and then.
"Being There" is the greatest part of the fun.
Later ...
Straw Man
(6,958 posts)... and mostly to find arcane bits of trivia. The responses I get have been predominantly wrong, provably so. Not only that, but successive searches bring up contradictory information. I'm not seeing the value at this point.