More

apublicfrog · 2026-05-20T05:32:22 1779255142

It's been pretty common in the past for tech companies to announce outages and quick updates about them on twitter for decades. I'm sure their status page etc will be updated soon, but it's historically been the fastest way to get things out to the wider audience whilst bypassing the "official mail out" review by marketing etc.

mcintyre1994 · 2026-05-20T05:48:39 1779256119

I think that was a lot more justifiable when Twitter reliably let logged out users read tweets. X seem to tweak it all the time, or maybe it’s just broken a lot, but sometimes I can’t even load a tweet in a browser that isn’t logged in.

Chaosvex · 2026-05-20T07:52:36 1779263556

They broke it not too long before Musk bought it when they wanted to boost user numbers.

It'll frequently display tweets from literal years ago as being the latest.

It's why proxies/mirrors are often linked rather than Twitter itself.

They don't seem to care to fix it, which implies that it's intentional. Seems completely stupid but what do I know?

numpad0 · 2026-05-20T06:27:22 1779258442

It doesn't show live profile pages to logged out users since a while ago. You get cached summary pages, an age gate error, or sometimes a straight up 404.

Most individual permalinks (.com/username/1234...) don't work without logging in, either, and the official client now uses `/i/` in place of usernames for permalinks(bogus usernames always worked; pkey was the timestamp).

This means an organizationally shared Twitter account for announcements is not a viable concept, at least until Twitter is to be transferred again to whoever would be a better keeper of it.

apublicfrog · 2026-05-10T22:02:22 1778450542

Is that true? It feels wrong. Consumer grade SSDs and spinning disks are unlikely to be the products used in enterprise.

spookie · 2026-05-11T11:35:06 1778499306

Look at SSD prices over the last 6 months. https://pcpartpicker.com/trends/price/internal-hard-drive/

windowsrookie · 2026-05-10T22:14:23 1778451263

AI companies bought up all the NAND manufacturing capacity, limiting the available manufacturing capacity for consumer products. These data centers also use hard drives for some of their data storage.

apublicfrog · 2026-05-10T22:00:38 1778450438

I suspect the type of person who is even aware of this 4GB blob is the type of person who would research its usage. Pretty high venn diagram crossover.

therealpygon · 2026-05-11T14:35:50 1778510150

Yeah. The fictional user doesn’t know anything about AI but knows about this 4gb file…because of news stories about how bad a 4gb file must be. Outside of that, they don’t know or care and wonder if that means that need to add some more “memory” to their computer.

apublicfrog · 2026-05-10T21:58:06 1778450286

> It's a very dangerous gamble. Today incredible value is available for nearly everyone. But it may stop without any warning, for reason outside our control.

What stops you from running the best open weighted LLMs currently available on consumer grade hardware for the rest of time? They're good enough for 95% of use cases, and they don't have a used by date. From what I can see, the "danger" is not having the next tier that comes out, but the impact of that is very low.

giobox · 2026-05-10T22:01:18 1778450478

> they don't have a used by date

For quite a lot of use cases, the current systems arguably do get worse over time if not continually updated. The knowledge cutoff date will start to hurt more and more as the weights age in a hypothetical scenario where you are stuck with them forever.

Coding, one of the most popular usescases today, would not be great if it say only understood java to a version from years ago etc.

https://en.wikipedia.org/wiki/Knowledge_cutoff

mrtesthah · 2026-05-10T23:55:45 1778457345

>Coding, one of the most popular uses cases today, would not be great if it say only understood java to a version from years ago etc.

This LLM trained only and entirely on pre-1930s texts was able to code Python programs when given only a short example:

https://talkie-lm.com/introducing-talkie

throwyawayyyy · 2026-05-10T22:35:27 1778452527

One solution is not to advance anything of course. I'm not even joking, is there going to be a successor to React? I suspect not, with the vast amount of training data for React now, it's going to look silly to move to something else with less support. What is the last new popular programming language, rust? Will there be another one? I suspect not. Same reasoning. The irony of all this AI acceleration talk is it'll work best if we don't accelerate the underlying tech at all.

WarmWash · 2026-05-11T00:10:01 1778458201

There probably won't be new stuff so much as trends in how stuff is done, and updates around optimizing those trends.

jvm___ · 2026-05-11T00:17:31 1778458651

Will programming languages evolve into less human oriented written code and more just calls to a trusted AI.

Or will human readable code be less and less of a thing as AI learns it's own, more terse language to talk to other AI's.

digitaltrees · 2026-05-11T01:20:27 1778462427

Yes. I am seeing a big push to use vanilla js for single file html apps that are easy to build, deploy and distribute because they have no build step. I could see component libraries emerging that make it easier build from chat interfaces with less ceremony

byzantinegene · 2026-05-11T02:48:06 1778467686

i'm not sure the tradeoff in code readability is worth it as of now.

Spooky23 · 2026-05-11T00:15:42 1778458542

Alot of the language work is scratching the itch of engineers and developers. I think you’re correct and react is the new COBOL.

hadlock · 2026-05-11T00:20:09 1778458809

Name/post content combo on point

apsurd · 2026-05-11T00:32:57 1778459577

Humans are notoriously bad at predicting the future. Toward that end, your prediction is laughable. React is the end all be all of UI… lol

melagonster · 2026-05-11T00:43:23 1778460203

Programmers won't be allow to exist in future. Vibe coding is the final resolution people can apply.

nullc · 2026-05-10T23:46:21 1778456781

Small models are more useful for "doing stuff" than "knowing stuff" to begin with. Add in an agentic harness and a small model can happily read more current information on demand (including from e.g. a local wikipedia snapshot).

henry_kang · 2026-05-13T11:08:52 1778670532

This feels increasingly true.

A lot of useful AI work is shifting from “knowing more” to “working with more context”, files, recordings, repos, screenshots, browsing history, etc.

Once that happens, memory and orchestration start mattering much more than raw model size.

rrvsh · 2026-05-10T22:26:43 1778452003

Nobody is unaware of the knowledge cutoff, and sharing the Wikipedia article is not helping anyone. Your point is easily rebutted by taking whatever open weights/source model has an outdated cutoff and training or fine tuning it on more data, which is again always going to be viable given a modicum of compute

tcp_handshaker · 2026-05-10T22:28:49 1778452129

You could learn how to code...a whole generation did it before...

AlienRobot · 2026-05-11T11:25:25 1778498725

I genuinely don't understand how can this possibly be a problem long term.

It feels very obvious that the solution is to have a smaller model that can be trained exclusively on Java information to augment the older model. If the architecture doesn't support it currently, then that's what the architecture will look like in the future.

Otherwise you'd be arguing that, to serve users who want to an up-to-date LLM on topic X, you have to train the model on the entire ABC all over again.

It's simply ludicrous to have a coding LLM that needs to be retrained on the latest published poems and pastry recipes to generate Java.

lowbloodsugar · 2026-05-12T02:57:03 1778554623

Laughs in JDK8 code base.

moffkalast · 2026-05-11T11:14:31 1778498071

Ha yes I used to think this was not a notable issue, but just today I was getting qwen 3.5 to fix my network drivers and it immediately freaked out like: "kernel 6.17, what the fuck? that doesn't exist yet!". It almost had a mental breakdown over that detail and derailed the conversation towards checking what's wrong with the kernel version reporting lol.

turtlebits · 2026-05-10T22:17:06 1778451426

FOMO. A new model comes out weekly and the HN crowd debates over the minutia of changes.

Pockets are too deep, it will only change once everyone is out of money.

3eb7988a1663 · 2026-05-11T03:37:24 1778470644

What is really amusing to me is how N months ago, the latest SOTA was incredible, but now utterly unusable. Feels like there is a model reality-distortion field in play where people can only acknowledge the flaws in retrospect.

lxgr · 2026-05-10T22:46:59 1778453219

They’re really not good enough, unless you consider 64 GB of memory or more consumer grade.

steve_adams_86 · 2026-05-10T22:52:16 1778453536

I’m pretty happy with what a 32GB Mac Studio can do for a lot of tasks. They’re the things I’d throw a model like Haiku at, but still genuinely useful. We don’t have an answer to frontier models in the consumer range yet, but we’re not totally trapped.

Side note though, it’s the speed that bothers me more than the reasoning. Qwen 3.5 is awesome, but my Claude subscription can tear through similar workloads an order of magnitude faster than my local LLM can when using Haiku. That’ll matter a lot to some people.

datadrivenangel · 2026-05-10T23:55:07 1778457307

Yeah this is the real killer. slower and more expensive is tough.

avazhi · 2026-05-11T00:37:06 1778459826

> What stops you from running the best open weighted LLMs currently available on consumer grade hardware for the rest of time?

Uh… the hardware requirements? And stop acting like some dog shit 8B model the average Joe can run on a laptop is even close to being comparable to what Claude or even Codex can currently do.

I have pretty good hardware and I’ve tinkered with the best sub-150B models you can use and they are awful compared to Anthropic/OAI/Grok.

apsurd · 2026-05-11T00:44:24 1778460264

What if the harness and loops get sufficiently better though? CC is using haiku for code-base gripping and such, you don't see a local commodity model being "good enough" for the 80% case when matched with better harnesses and tool calls?

honest question, i'm very interested in this, but too casual as of now to know any better.

avazhi · 2026-05-12T05:17:50 1778563070

I think the main issue is, as the other guy also alluded to, the parameter discrepancy. I know Mixture of Experts models are popular specifically becaue they save a lot of space and memory, but if your initial answer space is two orders of magnitude smaller on a local machine compared to the frontier cloud models, that knowledge gap just gets wider as the conversation continues, and the initial answer isn't even going to be as good to begin with. I don't know how to solve that parameter gap without hardware - there's only so much optimisation you can do, but at the end of the day parameterised knowledge takes up some minimum amount of bits that you can't excise without the actual knowledge and intelligence suffering.

byzantinegene · 2026-05-11T03:20:35 1778469635

vast majority of average users don't use llms for coding, and for those purposes, local llms with low param count are a far cry from SOTA models.

apublicfrog · 2026-05-11T04:53:03 1778475183

> And stop acting like some dog shit 8B model the average Joe can run on a laptop is even close to being comparable to what Claude or even Codex can currently do.

I'm not, you've actually illustrated my point. LLMs in 2022 were very impressive. By 2024 the general public was finding them an acceptable replacement for many research driven tasks and massive shortcuts for other tasks (coding, image work, document preperation, etc).

Those models are absolutely runnable on consumer hardware now, and we were extremely happy with the results. It's no different to how we used to think CRTs were amazing or early smartphones, but going back now they seem awful.

We're long past "danger". If what we have is the best we'll ever have open source, we're already in an excellent position.

avazhi · 2026-05-11T07:42:15 1778485335

> LLMs in 2022 were very impressive.

No they weren't. They were a gimmick - it is only in the past 6 or so months that frontier models have started to do stuff beyond mere gimmicks when it comes to coding, and you could make the argument that Mythos has been the first 'Holy shit' moment that we've had that has stepped us beyond 'Yeah that's really neat but...'

> Those models are absolutely runnable on consumer hardware now,

A sub 50B model is awful and can't even write proper English sentences half the time, to say nothing of how bad its world knowledge is. Try the 32B Gemma 4 local model for a week and then go back to Claude and then get back to me.

> We're long past "danger". If what we have is the best we'll ever have open source, we're already in an excellent position.

Not sure what to tell you other than that you and I have very different standards. What we have locally right now is barely more than a glorified autocomplete, and it feels worse than using ChatGPT 2 years ago because the context window is less and it doesn't have good webhooks on consumer setups. Another thing I'd say is that you clearly have no clue what 'consumer hardware' means, or what consumers that can even get this stuff running locally would have to do to get it to even rival the frontier models in terms of their usability (most consumers are't going to just boot into Ubuntu and run this thing from a command line) flow, to say nothing of the hardware requirements. I'd love to never use Claude or Gemini or ChatGPT again for both privacy and money reasons, but the quality of outputs and depth of thinking and writing ability between even the very best local models you can run right now is many orders of magnitude less than what you get using distributed frontier models, and those 'very best' local models require a top of the line machine that 99.9999% of consumers don't have and would never consider buying. The cloud models all have like a trillion(!) parameters now. It isn't even close.

I sure hope the local side of things massively improves over the next 2-3 years, but based on how this has gone my guess is that in 3 years you'll be lucky, if you have very top of the line hardware, to get benchmark performance that we had 6 months ago with the frontier models. The distributed hardware/memory gap is just too big.

apublicfrog · 2026-05-13T10:21:42 1778667702

> No they weren't. They were a gimmick - it is only in the past 6 or so months that frontier models have started to do stuff beyond mere gimmicks when it comes to coding

This is simply untrue. Using agentic orchestration I was writing production code daily 3 years ago. Hallucinations happened sometimes and context window was smaller (so you had to do some funky workarounds to deal with larger codebases), but it was workable. There have been a lot of marked improvements from a code perspective then - a lot model related yes, but also a lot in the ease of use, interfaces, etc.

> Another thing I'd say is that you clearly have no clue what 'consumer hardware' means, or what consumers that can even get this stuff running locally would have to do to get it to even rival the frontier models in terms of their usability (most consumers are't going to just boot into Ubuntu and run this thing from a command line) flow, to say nothing of the hardware requirements.

You've moved the goalposts. My point was that the "danger" of no new open models being released isn't that high as the existing ones are already impressive. Their ease of use or daily driving isn't relevant to that. If there were a need, someone could wrap a clean interface and support around it, or run it as their own cloud solution.

You seem to be arguing something adjacent to my point, which is fine I guess but I have little to say. Also multiple of your comments have come across quite aggressive and rude. Just food for thought if you want to work on that or not.

root_axis · 2026-05-11T02:42:10 1778467330

> They're good enough for 95% of use cases

They're not at all, not even close. Especially when you consider the use cases for people who are paying for LLM services today.

nightski · 2026-05-10T22:33:22 1778452402

Hardware. Frontier labs are driving up demand so much that it's priced significantly above cost making it far less affordable. Just look at Nvidia's profit margins.

suika · 2026-05-10T22:06:03 1778450763

The use cases in the future will be nothing like the use cases from today.

apublicfrog · 2026-05-11T04:54:22 1778475262

Maybe. The use cases people primarily use LLMs for (documents, coding, design, research) existed decades ago with different tooling. Who knows if the future will have a slew of new problems that require new models or will continue to be similar?

ai_fry_ur_brain · 2026-05-10T23:40:06 1778456406

95% of usecases. What are you smoking.

selcuka · 2026-05-11T01:34:30 1778463270

There are very good open weight models (such as DeepSeek v4 Flash) that can run on consumer level hardware.

Note that we are talking about 95% of everyone's use cases, not your specific use cases (which could require better models all the time).

apublicfrog · 2026-05-08T10:05:17 1778234717

Can you explain for the billions of the rest of us why this is the "most stressful time of the year" for the group you're referencing? I assume that's American students and/or teachers?

isakmarr · 2026-05-08T10:16:13 1778235373

Final exam season, and it's ongoing in Iceland too, so not just American students.

Steve16384 · 2026-05-08T15:14:07 1778253247

Here in the UK it's currently exam season. One of my son's had a GCSE exam just today.

pfortuny · 2026-05-08T18:26:01 1778264761

European students are preparing for their finals.

apublicfrog · 2026-05-08T10:03:43 1778234623

All these articles listing the American schools affected, "nationwide" outage reported, meanwhile hundreds of millions in the rest of the world affected.

Does anyone have a list of affected schools?

isakmarr · 2026-05-08T10:13:48 1778235228

I don't have a list, but I can tell you the University of Iceland is affected.

apublicfrog · 2026-05-04T06:07:44 1777874864

> Important Legal Notice: This is a non-binding pledge of intent. No money is collected at this stage. All references to profit-sharing, dividends, voting rights, and ownership are proposed concepts only — not confirmed arrangements. Nothing on this site constitutes a securities offering, investment contract, or financial instrument of any kind. The final cooperative structure must be reviewed and approved by qualified securities and aviation counsel. Participation does not guarantee ownership, financial return, or membership in any final entity. This is a movement, not an investment product.

From skimming, I see at least 5 places where this is reiterated on the page.

fny · 2026-05-04T14:28:00 1777904880

It's also not a sale of any sort. They're asking for pledges and have an accredited investor question.

apublicfrog · 2026-05-02T23:47:43 1777765663

Completely agree. How is it possible they issued these permits (years ago it seems) without having this infrastructure in place?

apublicfrog · 2026-04-29T01:21:20 1777425680

What are you basing that on? I'm usually pretty good at sniffing out AI writing, and it smells human to me.

riffraff · 2026-04-29T07:38:28 1777448308

I had the impression it was AI writing too because of the second half of the article. The first part looks genuine, the part since "trust laundering" smells fake: the scary single sentence followed by a whole paragraph of single clause sentences hints at AI.

Perhaps we've all just become paranoid, but even if it's not LLMs writing this, it now puts me off. And the AI image at the top of the page does not help with the feeling.

sdthjbvuiiijbb · 2026-04-29T11:39:18 1777462758

The line "This is the part that really matters." and the line "This is the circular citation pattern, and it’s one of the most under discussed attacks on the “retrieval augmented generation” trust model. " both raised flags. AI absolutely loves writing about the One Weird Trick that dentists don't want you to know. They love talking about "what really matters" or saying something is "the most under discussed" thing.

Then we get to the section "Why This Is A Bigger Deal Than It Looks". The title of this section again raises similar flags to before. But the bulleted list of:

1. The retrieval layer (immediately) 2. The model training corpus layer (months to years) 3. The agent layer (where the money is)

Absolutely reeks of AI. This list with this sequence of parentheticals is exactly how LLMs write, both structurally and the specific phrasing. This was the point where I felt confident enough to publicly accuse the post of AI writing.

I could go on with the prose in this section... How about "The attack surface is not hypothetical, it’s the default case."? Or "The cleanup problem for corpus poisoning is genuinely unsolved as of 2026."? (LLMs wildly overuse "genuine(ly)" and "real")

chneu · 2026-04-29T06:41:42 1777444902

Agreed. Nothing about this post really stood out as AI. It didn't raise a single flag for me.

I think calling something AI generated is just a lazy way of dismissing stuff nowadays.

jameshart · 2026-04-29T11:49:36 1777463376

This paragraph under ‘Trust Laundering’ is when it hit my AI writing trigger threshold:

> This is the circular citation pattern, and it’s one of the most under discussed attacks on the “retrieval augmented generation” trust model. It doesn’t require compromising Wikipedia’s infrastructure with l33t hacker skills. It doesn’t require social engineering an editor. You just simply write the source yourself, cite yourself on Wikipedia, and let the trust flow downstream. Easy peasy!

“It doesn’t X. It doesn’t Y. You just Z. Conclusion”

Once I saw that some other elements stood out too.

There’s a set of bullet points under ‘Thae Approach’ where each bullet starts with a bolded phrase: “one domain”, “one press release”, “one Wikipedia edit”, followed by a bolded sentence “The whole thing took maybe about twenty minutes”.

The emphasis here on irrelevant quantifiable optimizations - who cares that it only needs one of each of three things and it took under twenty minutes? - with unnecessary faux-profundity is a strong AI tell.

Add to that that the writer talks in the article about using AI generation to produce the content for the poisoning site, the suggestion that he used it to write up a blog post about this is hardly an implausible suggestion.

sdthjbvuiiijbb · 2026-04-29T11:45:31 1777463131

If this truly set off zero flags for you then you're probably just not very attuned to LLM writing style. I've noticed that most people are not.

I posted a bunch of specifics in a reply to the GP since I was quite annoyed with being accused of "a lazy way of dismissing stuff". It's nothing of the sort. I am a very good reader and I have read a lot of LLM writing and a lot of human writing.

malfist · 2026-04-29T01:51:16 1777427476

Why is agents (where the money is)? Fake profundity is abound in the post

esquivalience · 2026-04-29T05:57:29 1777442249

The author has been using parenthetical comments like that since at least 2017, judging by a review of old posts on that site.

apublicfrog · 2026-04-29T01:09:06 1777424946

The author quite clearly outlines their reasoning for this in the article:

> Carrot Disclosure, dangling a metaphorical carrot in front of the vendor to incentivise change. The main idea is to only publish the (redacted) output of the exploit for a critical vulnerability, to showcase that the software is exploitable. Now the vendor has two choices: either perform a holistic audit of its software, fixing as many issues as possible in the hope of fixing the showcased vulnerability; or losing users who might not be happy running a known-vulnerable software. Users of this disclosure model are of course called Bugs Bunnies.

flumpcakes · 2026-04-29T09:35:09 1777455309

Seems like grandstanding bad faith to me. They didn't even bother to follow the established disclosure policy for this project because the author feels this quality of the code is so crap, so instead does this...

apublicfrog · 2026-05-01T10:48:03 1777632483

Maybe, but I can see why people don't want to deal with red tape to do someone a favour.

Once I tried to help an open source project with a bug and was rejected because I didn't agree to support the Ukraine, that all sexual orientations are equal, or whatever else the long winded contributor rules were.

The issue isn't that I don't support those things, it's more that it's like someone handing me a 3 page form to fill out for picking their keys up for them.

There also may be conventions on disclosure and exploits, but they're not based on the law or rules of society.