eqmvii · 2026-06-13T01:10:38 1781313038

I give it until Tuesday at the latest until it's accessible again.

eqmvii · 2026-06-13T00:22:13 1781310133

I've seen it turn right in business contexts. Sometimes you can even lower your standard of "good enough" and find quantity has a quality all its own.

But it requires taste and engineering to do it right, and on the right things. It'll be an interesting few years.

cwnyth · 2026-06-13T03:23:34 1781321014

I think it also requires someone who knows just enough to be able to navigate between those ideas that will set you back and those which will propel you forward. At the end of the day, you still need some human filter.

eqmvii · 2026-06-12T14:17:19 1781273839

That's more or less what Claude Cowork is.

Every serious engineer I've seen try to use it ran away screaming, because of limitations in the sandbox.

I've also seen people set their coding agents up entirely within containers -- that may be the better way going forward, but it's an extra step and a lot of extra plumbing to maintain.

eqmvii · 2026-06-09T14:16:30 1781014590

> Just getting the code to run on your laptop took a week.

This one surprised me. Claude Code in the CLI has made standing up an app and debugging whatever random dependencies or docker BS a dream compared to the before times, when you'd have to learn the architecture while simultaneously troubleshooting whatever isn't working on your machine.

Lord_Zero · 2026-06-09T14:51:41 1781016701

And in the before times, you learned a lot and walked away with knowledge on the deps needed, connections, .env secrets, and cleaned it all up and documented it so the next dev would have an easier time doing it.

fooster · 2026-06-09T15:27:06 1781018826

Yeah, that totally didn't happen the majority of the time.

bigstrat2003 · 2026-06-09T16:23:28 1781022208

Yes it did. That's how I learned a great many things throughout my career. I'm sure some people didn't pay attention or try to understand what they were doing, and didn't learn. That's on them. But most of us learned a lot that way.

bunderbunder · 2026-06-10T01:07:14 1781053634

I think it depends on how “before” we’re talking about.

I can remember a time when learning was valued and leaving the camp cleaner than you found it was considered a basic professional standard.

But I can also remember a time when Scrum became all the rage and next thing you know we’re all stuck on the sprinting treadmill, management is obsessing over “velocity”, and it’s generally an everyone-for-themself free-for-all to clear the absolute minimum criteria to get the ticket moved to the “done” column in a semi-desperate effort to keep up with your ever-growing backlog of tickets to which you’ve been over committed. Don’t worry about incomprehensible code or flaky designs; taking your time to do it right the first time looks bad on the KPI dashboard but rework does the opposite because you get to count the second (third, fourth, etc.) times the same task needs to be revisited towards your velocity metrics, too.

I’m not sure most developers younger than maybe 40 realize just how much worse our line of work has become over the past ~15 years.

giraffe_lady · 2026-06-09T15:34:33 1781019273

Bullshit I just pasted random shit from google in until it worked and then instantly forgot which combination of the 20 things I tried got it there.

prerok · 2026-06-09T16:16:16 1781021776

Indeed, there were plenty of people doing just that. I imagine they get the most out of vibe coding. However, when it became a problem, an engineer was still required to fix it.

It might have been you, a couple of months later, or someone else. I have dealt with slop produced by unknowing programmers most of my career. With this vibe coding I think my job is still safe. The amount, though, is increasing exponentially.

lefra · 2026-06-10T04:14:43 1781064883

The second time I had to do that for the same project (new computer), I started taking very detailed notes when doing this kind of unpleasant, supposedly one-off thing.

eqmvii · 2026-06-08T19:13:22 1780946002

I think the hard part is acquiring the GPUs, first at all, then at any reasonable price.

trenchgun · 2026-06-08T19:30:47 1780947047

Yes, and then you need to have the datacenter. Do you get a permit? How long does it take to build it?

wongarsu · 2026-06-08T21:02:22 1780952542

Getting somebody who actually knows how to design and build a data center also seems to be a bit of an issue right now

joquarky · 2026-06-09T02:08:43 1780970923

Simple solution: just put them in space!

mNovak · 2026-06-09T18:04:35 1781028275

Also gas turbines are on backorder until 2029, and new grid connection queues on a similar timescale.

Hence why all the bitcoin miners are cashing in (or trying to) by converting their facilities to datacenters.

eqmvii · 2026-06-05T23:16:29 1780701389

Some business users spent ~30 minutes on an internal process, and we prototyped an "Agent" in Slack to take over. At first it didn't work, then it didn't work some more, eventually it ALMOST worked. Then one day, it worked, and the old business process died never to be revived.

Now it sits in a slack channel, and I watch it doing work, responding to ambiguity, and taking feedback/edits all day. It's unreal. It's literal magic. It saves a HUGE amount of time and gave us a pattern to do more.

This is the real deal. It's not easy to find problems with the right shape, and it's not easy to build agents that fit even when you do... but once it clicks, it clicks.

eqmvii · 2026-04-29T11:25:57 1777461957

my favorite quote in this space has always been:

"The prophecies of what the courts will do in fact, and nothing more pretentious, are what I mean by the law."

eqmvii · 2026-04-11T00:14:18 1775866458

Held my breath the whole time after all the heat shield warnings. Very glad it all worked, or that there was enough margin!

telesilla · 2026-04-11T00:15:38 1775866538

Yes it was worrisome, but how could it not be even with the best tech we'll ever have - I feel relief still on every plane touchdown.

Bravo, Artemis team for an exceptional return to extra-orbital space travel.

Levitating · 2026-04-11T00:15:52 1775866552

The LOS was also more than 6 minutes as predicted (I measured a bit over 7 minutes). What a tension.

llbbdd · 2026-04-11T00:18:18 1775866698

I wasn't clear, was the LOS just comms or a full loss of telemetry from the craft? Either way, terrifying.

loloquwowndueo · 2026-04-11T00:21:35 1775866895

Everything. No radio signals make it in or out of the capsule due to ionization from the heat and plasma of reentry.

philistine · 2026-04-11T00:25:17 1775867117

I’ll note, since it is supremely interesting to me, that Starship is able to communicate with the ground during its whole reentry due to its sheer size and ability to connect with Starlink satellites. I assumed loss of signal due to reentry was a given for any spaceship!

numpad0 · 2026-04-11T01:12:51 1775869971

Shuttle in its last days had antennas that protruded outside the plasma just enough for telemetry. Apollo and Artemis reentry are also direct entry from Lunar-Earth transfer orbit using ablative heat shields, so the plasma would be hotter and thicker than suborbital Starship shots with Shuttle style ceramic tiles.

m4rtink · 2026-04-11T04:55:46 1775883346

I'm pretty sure it did not stick anything through the plasma sheet; that is impossible. You would either melt the thing or just shift the plasma sheet a bit. It forms as air is compressed on contact, simple as that.

What IIRC was actually done was that some antennas were placed on the back of the shuttle, and its size was big enough that the plasma bubble would not fully envelop it: it would be open to space at the back. That antenna would communicate with TDRS satellites through this gap, enabling contact through the whole re-entry.

Starship does basically the same, just with Starlink satellites instead of TDRS.

llbbdd · 2026-04-11T00:55:17 1775868917

Would this capsule have been able to communicate if it were integrated with Starlink, or is the size more important? I'd imagine if they could have achieved communication via Starlink they would have done it, but just curious.

rufo · 2026-04-11T01:25:06 1775870706

It's a function of the shape. On a capsule-sized spacecraft, the ionized plasma completely surrounds the craft, so no radio communications can get in or out. For an oblong-shaped spacecraft, like the Space Shuttle or Starship, the descent tends to be angled such that you have a "hole" in the plasma you can get a signal through.

albumen · 2026-04-11T01:16:11 1775870171

No, the plasma forms a teardrop shape around small craft like Orion, completely cutting off radio comms. Larger craft like Starship or the shuttle, which have a roughly cylindrical shape (vs Orion’s circular cross section), aren’t fully enclosed by the plasma. The shuttle had a transmitter attached to its tail for later flights, which could send back telemetry during re-entry.

m4rtink · 2026-04-11T04:59:50 1775883590

Well, provided you had a 30 MW microwave transmitter on board, you could punch through the plasma just fine, it has been done:

https://en.wikipedia.org/wiki/Sprint_(missile)

"Sprint accelerated at 100 g, reaching a speed of Mach 10 (12,000 km/h; 7,600 mph) in 5 seconds. Such a high velocity at relatively low altitudes created skin temperatures up to 6,200 °F (3,400 °C), requiring an ablative shield to dissipate the heat. The high temperature caused a plasma to form around the missile, requiring extremely powerful radio signals to reach it for guidance. The missile glowed bright white as it flew."

llbbdd · 2026-04-11T01:39:14 1775871554

Awesome, thank you! I wonder if some kind of very long-tethered deployed antenna could enable this for the capsule or if the ratio of long-enough-to-work vs thick-enough-to-not-burn-off-completely just doesn't work. Time to read about the shuttle.

Culonavirus · 2026-04-11T01:46:48 1775872008

It's the shape and size.

Also Orion and other capsules fall like a rock (steep reentry profile) compared to shuttle/Starship, which intentionally slow down the reentry and kinda glide (ballpark 10 min with capsules compared to 30 min with shuttle/Starship).

tl;dr: capsules get fully enveloped in plasma due to their shape, size and reentry profile

TomatoCo · 2026-04-11T01:00:03 1775869203

The space shuttle, too, was able to communicate. I imagine the smaller the craft the smaller the angle you can "speak" out of and, below a certain size, it just doesn't work.

misterprime · 2026-04-11T01:15:26 1775870126

Yes, I remember when they used the signal out the back through the plasma during reentry. It was astoundingly good!

Rebelgecko · 2026-04-11T01:23:54 1775870634

It seems like they had limited telemetry for a short period before they did any audio

rootusrootus · 2026-04-11T01:25:19 1775870719

I was wondering about that, so I looked up the heat shield issues. It seems like their solution was very defensible and there was every reason to believe it would work out just fine. The plan that did not work as they wanted had a new idea, a double re-entry, and when the results were concerning they backed off to using a traditional single re-entry. That seems like a legitimate fix?

magicalhippo · 2026-04-11T02:38:51 1775875131

Scott Manley went into the details in a recent video.

The reason the heat shield failed was due to gas buildup inside the ablative material. This was due to the skip reentry profile they used, where the craft does a single skip (as in skipping stones) during reentry. The high bounce caused the shield to be heated enough that the heat penetrated the material causing gas release but not enough that the material ablated. Thus gas would build up deep inside up until it caused large chunks to break off. They could reproduce this in tests.

The fix was two-fold. First they lowered the bounce height, so a much less pronounced skip, avoiding the lowered heating of the shield. And they tweaked the material formula a bit so it was more porous, allowing subsurface gas to escape rather than build up.

TheOtherHobbes · 2026-04-11T08:59:31 1775897971

No doubt there are people looking at the heat shield right now and saying "Hmmm."

I am very curious about what they're seeing, and how well the get-it-over-with solution worked.

It was a bold move and the results will be fascinating.

masklinn · 2026-04-11T06:10:08 1775887808

In my understanding of the Manley video, the materials change will only occur for Artemis 3, for which it will be irrelevant as that will not be leaving LEO.

masklinn · 2026-04-11T15:32:38 1775921558

Not sure why I'm being downvoted. Here's the segment where Manley explains this: https://youtu.be/shcj7MUK5BU?t=828 and this is also the section where Manley explains Artemis III is not going to the moon so it won't actually be testing this change.

And from an older NASA explanation: https://www.nasa.gov/news-release/nasa-shares-orion-heat-shi...

> Engineers already are assembling and integrating the Orion spacecraft for Artemis III based on lessons learned from Artemis I and implementing enhancements to how heat shields for crewed returns from lunar landing missions are manufactured to achieve uniformity and consistent permeability.

thegrim33 · 2026-04-11T01:29:41 1775870981

Yes, but it was the biggest opening for propagandists to latch on to for demoralizing and spreading fear/uncertainty/doubt about the mission.

neaden · 2026-04-11T00:15:35 1775866535

Same! Glad everyone made it safe.

eqmvii · 2026-03-20T14:00:27 1774015227

A lot of people feel this way.

But IMO the most fruitful thing for an engineering org to do RIGHT NOW is learn the tools well enough to see where they can be best applied.

Claude Code and its ilk can turn "maybe one day" internal projects into live features after a single hour of work. You really, honestly, and truly are missing out if you're not looking for valuable things like that!

lopis · 2026-03-20T14:29:00 1774016940

You're only missing out if that's what you want to do. Not every software developer is interested in creating new software projects from scratch in an hour, or at all. It's totally fine to do software development as a job, and then close your laptop and not see it until Monday. Learn the tools that suit you when you need them.

remus · 2026-03-20T14:49:43 1774018183

> You're only missing out if that's what you want to do.

Who writes software and doesn't have a list of "I'll fix this one day" issues as long as their arm?

This is honestly one of the things I enjoy most at the moment. There are whole classes of issues where I know the fix is probably pretty simple but I wouldn't have had time to sort it previously. Now I can just point Claude at it and have a PR 5 minutes later. It's really nice when you can tell users "just deployed a fix for your thing" rather than "I've made a ticket for your request," which really means the issue is on the never-ending backlog pile and might get fixed in 5 years' time if you're lucky.

subarctic · 2026-03-20T17:00:17 1774026017

Claude code makes it so easy to do things the "right way" that it also makes it really easy for you to let scope creep get out of hand. I have a personal project that I haven't deployed yet that in some ways is way overengineered for its purpose. It's hard to blame the tool though, it's always telling me I'm making it more complicated than it needs to be but I don't listen

shepherdjerred · 2026-03-21T03:13:23 1774062803

I've felt this recently. I've often been bad about scope creep. CC makes it so easy.

On the other hand, I can see these tools getting good enough that scope creep doesn't even matter.

ATM I usually get stuck around the review/verification stage. As in, my code works, I have tested that it works, but it is failing CI or someone left a PR comment. And for each comment I'll have to make sure it makes sense, make the change, test again, and get CI passing again.

lopis · 2026-03-22T05:07:30 1774156050

In my team we have strict rules about scope creep in pull requests. Each one needs to introduce a single thing, not a dozen little refactorings. This helps, but not when you're working alone on a personal project. Maybe you can set up your review agent to help with scope creep?

lopis · 2026-03-22T05:05:20 1774155920

Many people don't. You can write a ticket and the PM can deal with it. Not everyone is intimately involved in their job enough to care about stuff like that. And some projects might not last long enough for you to care. You shouldn't project your dev experience onto everyone, especially as a software development enthusiast.

stiiv · 2026-03-20T14:07:12 1774015632

> Claude Code and its ilk can turn "maybe one day" internal projects into live features after a single hour of work. You really, honestly, and truly are missing out if you're not looking for valuable things like that!

You're right, it's possible. But you might be both overestimating the ease of onboarding and underestimating the variety of tasks and constraints devs are responsible for.

I've seen Claude knock out trivial stuff with a sufficiently good spec. But I've also seen it utterly choke on a bad spec or a hard task. I think these outcomes are pretty broadly established. So is the expectation that the tech will get better. Waiting isn't unwise.

smugma · 2026-03-20T14:36:23 1774017383

Waiting may not be “unwise” but acting now may be optimal. Even though tooling may be much better in 12 months, if it can improve quality or time now, that’s a net benefit.

Bikers in the Tour de France used to not wear helmets. They were seen as uncouth (“why jump on the bandwagon?”). Helmets today are way better than they were then. But if the utility provided is greater than the cost, of course it makes sense to act sooner.

I’m not explicitly arguing for investing in AI or other newfangled tech, I’m arguing that the premise of waiting may be “sound” but also “leaves money on the table”, or in some cases, lives.

The author talks about vaccines as a counter example but doesn’t really address the cost/benefit in any detail.

eqmvii · 2026-01-12T16:39:25 1768235965

Could this be an experiment to show how likely LLMs are to lead to AGI, or at least intelligence well beyond our current level?

If you could only give it texts and info and concepts up to Year X, well before Discovery Y, could we then see if it could prompt its way to that discovery?

ben_w · 2026-01-12T16:49:24 1768236564

> Could this be an experiment to show how likely LLMs are to lead to AGI, or at least intelligence well beyond our current level?

You'd have to be specific about what you mean by AGI: all three letters mean different things to different people, and sometimes the whole is used to mean something not present in the letters.

> If you could only give it texts and info and concepts up to Year X, well before Discovery Y, could we then see if it could prompt its way to that discovery?

To a limited degree.

Some developments can come from combining existing ideas and seeing what they imply.

Other things, like everything to do with relativity and quantum mechanics, would have required experiments. I don't think any of the relevant experiments had been done prior to this cut-off date, but I'm not absolutely sure of that.

You might be able to get such an LLM to develop all the maths and geometry for general relativity, and yet find the AI still tells you that the perihelion shift of Mercury is a sign of the planet Vulcan rather than of a curved spacetime: https://en.wikipedia.org/wiki/Vulcan_(hypothetical_planet)

grimgrin · 2026-01-12T17:16:52 1768238212

An example of why you need to explain what you mean by AGI is:

https://www.robinsloan.com/winter-garden/agi-is-here/

opponent4 · 2026-01-12T17:32:52 1768239172

> You'd have to be specific what you mean by AGI

Well, they obviously can't. AGI is not science, it's religion. It has all the trappings of religion: prophets, sacred texts, origin myth, end-of-days myth and most importantly, a means to escape death. Science? Well, the only measure of "general intelligence" would be to compare it to the only example we have, the human one, but we have absolutely no means by which to describe that. We do not know where to start. This is why, when you scratch the surface of any AGI definition, you only find circular definitions.

And no, the "brain is a computer" is not a scientific description, it's a metaphor.

strbean · 2026-01-12T18:19:55 1768241995

> And no, the "brain is a computer" is not a scientific description, it's a metaphor.

Disagree. A brain is Turing complete, no? Isn't that the definition of a computer? Sure, it may be reductive to say "the brain is just a computer".

opponent4 · 2026-01-12T19:02:49 1768244569

Not even close. Turing complete does not apply to the brain plain and simple. That's something to do with algorithms and your brain is not a computer as I have mentioned. It does not store information. It doesn't process information. It just doesn't work that way.

https://aeon.co/essays/your-brain-does-not-process-informati...

strbean · 2026-01-12T21:01:18 1768251678

> Forgive me for this introduction to computing, but I need to be clear: computers really do operate on symbolic representations of the world. They really store and retrieve. They really process. They really have physical memories. They really are guided in everything they do, without exception, by algorithms.

This article seems really hung up on the distinction between digital and analog. It's an important distinction, but glosses over the fact that digital computers are a subset of analog computers. Electrical signals are inherently analog.

This maps somewhat neatly to human cognition. I can take a stream of bits, perform math on it, and output a transformed stream of bits. That is a digital operation. The underlying biological processes involved are a pile of complex probabilistic+analog signaling, true. But in a computer, the underlying processes are also probabilistic and analog. We have designed our electronics to shove those parts down to the lowest possible level so they can be abstracted away, and so the degree to which they influence computation is certainly lower than in the human brain. But I think an effective argument that brains are not computers is going to have to dive in to why that gap matters.

stevenhuang · 2026-01-13T11:39:02 1768304342

It is pretty clear the author of that article has no idea what he's talking about.

You should look into the physical Church-Turing thesis. If it's false (all known tested physics suggests it's true) then well we're probably living in a dualist universe. This means something outside of material reality (souls? hypercomputation via quantum gravity? weird physics? magic?) somehow influences our cognition.

> Turing complete does not apply to the brain

As far as we know, any physically realizable process can be simulated by a Turing machine. And FYI brains do not exist outside of physical reality... as far as we know. If you have an issue with this formulation, go ahead and disprove the physical Church-Turing thesis.

nearbuy · 2026-01-12T21:33:32 1768253612

That is an article by a psychologist, with no expertise in neuroscience, claiming without evidence that the "dominant cognitive neuroscience" is wrong. He offers no alternative explanation on how memories are stored and retrieved, but argues that large numbers of neurons across the brain are involved and he implies that neuroscientists think otherwise.

This is odd because the dominant view in neuroscience is that memories are stored by altering synaptic connection strength in a large number of neurons. So it's not clear what his disagreement is, and he just seems to be misrepresenting neuroscientists.

Interestingly, this is also how LLMs store memory during training: by altering the strength of connections between many artificial neurons.
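
As a toy illustration of that last point (not a model of any real LLM, and certainly not of the brain), the sketch below trains a single linear unit by gradient descent. Nothing is written to a lookup table; the only thing that changes is the connection strengths `w`, and the learned association can be recovered from them afterward. The data, learning rate, and step count are arbitrary assumptions.

```python
# Toy example: "memory" stored purely as connection strengths.
# One linear unit learns y = 2*x1 - 3*x2 from four examples.

examples = [((1.0, 0.0), 2.0), ((0.0, 1.0), -3.0),
            ((1.0, 1.0), -1.0), ((2.0, 1.0), 1.0)]

w = [0.0, 0.0]   # connection strengths, initially "blank"
lr = 0.1         # learning rate (arbitrary choice)

for _ in range(500):
    for (x1, x2), target in examples:
        pred = w[0] * x1 + w[1] * x2
        err = pred - target
        # Gradient step on squared error: nudge each weight in
        # proportion to its input and the prediction error.
        w[0] -= lr * err * x1
        w[1] -= lr * err * x2

def recall(x1, x2):
    """Answer a query using only the learned weights."""
    return w[0] * x1 + w[1] * x2

print([round(v, 3) for v in w])
print(round(recall(3.0, 1.0), 3))
```

The query `(3.0, 1.0)` never appears in the examples, so the answer comes from the weights alone rather than from any stored copy of the training data.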

anthonypasq · 2026-01-12T19:24:36 1768245876

I've gotta say this article was not convincing at all.

Closi · 2026-01-12T20:12:32 1768248752

A human is effectively Turing complete if you give the person paper and pen and the ruleset, and a brain clearly stores information and processes it to some extent, so this is pretty unconvincing. The article is nonsense and badly written.

> But here is what we are not born with: information, data, rules, software, knowledge, lexicons, representations, algorithms, programs, models, memories, images, processors, subroutines, encoders, decoders, symbols, or buffers – design elements that allow digital computers to behave somewhat intelligently. Not only are we not born with such things, we also don’t develop them – ever.

Really? Humans don't ever develop memories? Humans don't gain information?
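
The "paper, pen, and ruleset" point can be made concrete. Below is a minimal sketch of a Turing machine simulator with a hypothetical rule table that increments a binary number; every step is a mechanical table lookup that a person could carry out by hand. The tape here is a Python dict that grows on demand, which only approximates the unbounded tape of the formal definition.

```python
# Rule table: (state, symbol) -> (symbol_to_write, head_move, next_state)
# Hypothetical machine: increment a binary number, head starting on the last digit.
RULES = {
    ("carry", "1"): ("0", -1, "carry"),  # 1 + carry = 0, keep carrying left
    ("carry", "0"): ("1", 0, "halt"),    # 0 + carry = 1, done
    ("carry", "_"): ("1", 0, "halt"),    # ran off the left edge: new leading 1
}

def run(tape_str, max_steps=10_000):
    """Apply RULES step by step until the machine halts or max_steps is hit."""
    tape = {i: ch for i, ch in enumerate(tape_str)}
    head = len(tape_str) - 1
    state = "carry"
    for _ in range(max_steps):
        if state == "halt":
            break
        symbol = tape.get(head, "_")     # unwritten cells read as blank "_"
        write, move, state = RULES[(state, symbol)]
        tape[head] = write
        head += move
    cells = range(min(tape), max(tape) + 1)
    return "".join(tape.get(i, "_") for i in cells)

print(run("1011"))
print(run("111"))
```

The `max_steps` guard is there because a real machine, like a real person with a finite stack of paper, cannot run forever.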

Davidzheng · 2026-01-13T01:24:54 1768267494

Probably not actually Turing complete, right? For one, it is not infinite, so

nomel · 2026-01-13T07:50:15 1768290615

> And no, the "brain is a computer" is not a scientific description, it's a metaphor.

I have trouble comprehending this. What is "computer" to you?

ben_w · 2026-01-12T18:45:02 1768243502

Cargo cults are a religion, the things they worship they do not understand, but the planes and the cargo themselves are real.

There's certainly plenty of cargo-culting right now on AI.

Sacred texts, I don't recognise. Yudkowsky's writings? He suggests wearing clown shoes to avoid getting a cult of personality disconnected from the quality of the arguments, if anyone finds his works sacred, they've fundamentally misunderstood him:

  I have sometimes thought that all professional lectures on rationality should be delivered while wearing a clown suit, to prevent the audience from confusing seriousness with solemnity.

- https://en.wikiquote.org/wiki/Eliezer_Yudkowsky

Prophets forecasting the end-of-days, yes, but this too from climate science, from everyone who was preparing for a pandemic before covid and is still trying to prepare for the next one because the wet markets are still around, from economists trying to forecast growth or collapse and what will change any given prediction of the latter into the former, and from the military forces of the world saying which weapon systems they want to buy. It does not make a religion.

A means to escape death, you can have. But it's on a continuum with life extension and anti-aging medicine, which itself is on a continuum with all other medical interventions. To quote myself:

  Taking a living human's heart out without killing them, and replacing it with one you got out a corpse, that isn't the magic of necromancy, neither is it a prayer or ritual to Sekhmet, it's just transplant surgery.

  …

  Immunity to smallpox isn't a prayer to the Hindu goddess Shitala (of many things but most directly linked with smallpox), and it isn't magic herbs or crystals, it's just vaccines.

- https://benwheatley.github.io/blog/2025/06/22-13.21.36.html

markab21 · 2026-01-12T17:08:56 1768237736

Basically looking for emergent behavior.

water-data-dude · 2026-01-12T17:31:10 1768239070

It'd be difficult to prove that you hadn't leaked information to the model. The big gotcha of LLMs is that you train them on BIG corpuses of data, which means it's hard to say "X isn't in this corpus", or "this corpus only contains Y". You could TRY to assemble a set of training data that only contains text from before a certain date, but it'd be tricky as heck to be SURE about it.

Ways data might leak to the model that come to mind: misfiled/mislabeled documents, footnotes, annotations, document metadata.
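
A minimal sketch of the kind of leak check this implies, assuming each document carries a claimed `year` and a plain-text `body` (both hypothetical; real archives rarely have metadata this clean). It keeps only documents dated before a cutoff, then flags any kept document that mentions a term coined after the cutoff, which is one cheap signal of a misfiled document or a later annotation. The watch list is illustrative, not exhaustive, and passing this check proves nothing about subtler leaks.

```python
import re
from dataclasses import dataclass

@dataclass
class Doc:
    title: str
    year: int   # claimed publication year (may be wrong in a real archive)
    body: str

# Hypothetical watch list: terms that should not appear before the cutoff.
ANACHRONISMS = ["photon", "quantum mechanics", "spacetime", "television"]

def filter_by_cutoff(docs, cutoff_year):
    """Keep documents whose claimed year is strictly before the cutoff."""
    return [d for d in docs if d.year < cutoff_year]

def find_suspects(docs, terms=ANACHRONISMS):
    """Return (title, matched_terms) for docs mentioning any watch-list term."""
    suspects = []
    for d in docs:
        hits = [t for t in terms
                if re.search(r"\b" + re.escape(t) + r"\b", d.body, re.IGNORECASE)]
        if hits:
            suspects.append((d.title, hits))
    return suspects

corpus = [
    Doc("Parliamentary debate", 1870, "The honourable member spoke on the railways."),
    Doc("Annotated treatise", 1865, "Editor's footnote: see later work on spacetime."),
    Doc("Modern survey", 1995, "A review of quantum mechanics."),
]

kept = filter_by_cutoff(corpus, cutoff_year=1875)
suspects = find_suspects(kept)
print(suspects)
```

Here the 1865 treatise passes the date filter but is flagged because of a later editor's footnote, which is exactly the annotation case listed above.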

gwern · 2026-01-12T18:31:40 1768242700

There's also severe selection effects: what documents have been preserved, printed, and scanned because they turned out to be on the right track towards relativity?

mxfh · 2026-01-12T19:51:07 1768247467

This.

Especially for London there is a huge chunk of recorded parliament debates.

Training on recorded correspondence in the form of letters seems more interesting for dialogue anyway.

And that corpus script just looks odd to say the least, just oversample by X?

water-data-dude · 2026-01-13T18:22:28 1768328548

Oh! I honestly didn't think about that, but that's a very good point!

reassess_blind · 2026-01-13T06:48:28 1768286908

Just Ctrl+F the data. /s

alansaber · 2026-01-12T16:43:08 1768236188

I think not if only for the fact that the quantity of old data isn't enough to train anywhere near a SoTA model, until we change some fundamentals of LLM architecture

franktankbank · 2026-01-12T16:46:30 1768236390

Are you saying it wouldn't be able to converse using English of the time?

ben_w · 2026-01-12T16:56:32 1768236992

Machine learning today requires an obscene quantity of examples to learn anything.

SOTA LLMs show quite a lot of skill, but they only do so after reading a significant fraction of all published writing (and perhaps images and videos, I'm not sure) across all languages, in a world whose population is 5 times higher than at the link's cutoff date, and where global literacy has gone from 20% to about 90% since then.

Computers can only make up for this by being really really fast: what would take a human a million or so years to read, a server room can pump through a model's training stage in a matter of months.

When the data isn't there, reading what it does have really quickly isn't enough.

wasabi991011 · 2026-01-12T16:53:17 1768236797

That's not what they are saying. SOTA models include much more than just language, and the scale of training data is related to its "intelligence". Restricting the corpus in time => less training data => less intelligence => less ability to "discover" new concepts not in its training data

withinboredom · 2026-01-13T13:36:35 1768311395

Could always train them on data up to 2015ish and then see if you can rediscover LLMs. There's plenty of data.

franktankbank · 2026-01-12T17:18:33 1768238313

Perhaps less bullshit though was my thought? Was language more restricted then? Scope of ideas?

andyfilms1 · 2026-01-12T16:47:37 1768236457

I mean, humans didn't need to read billions of books back then to think of quantum mechanics.

alansaber · 2026-01-12T16:51:46 1768236706

Which is why I said it's not impossible, but current LLM architecture is just not good enough to achieve this.

famouswaffles · 2026-01-12T16:50:17 1768236617

Right, what they needed was billions of years of brute force and trial and error.

armcat · 2026-01-12T16:56:35 1768236995

I think this would be an awesome experiment. However you would effectively need to train something of a GPT-5.2 equivalent. So you need a lot of text, a much larger parameterization (compared to nanoGPT and Phi-1.5), and the 1800s equivalents of supervised finetuning and reinforcement learning with human feedback.

dexwiz · 2026-01-12T17:06:09 1768237569

This would be a true test of whether LLMs can innovate or just regurgitate. I think part of people's amazement at LLMs is that they don't realize how much they don't know. So thinking and recalling look the same to the end user.

nickpsecurity · 2026-01-12T20:54:05 1768251245

That is one of the reasons I want it done. We can't tell if AIs are parroting training data without having the whole training data. Making it old means specific things won't be in it (or will be). We can do more meaningful experiments.

Trufa · 2026-01-12T16:57:03 1768237023

This is fascinating, but the experiment seems to fail at being a fair comparison, given how much less knowledge we can have in data from that time versus now.

As a thought experiment I find it thrilling.

Rebuff5007 · 2026-01-12T17:01:56 1768237316

OF COURSE!

The fact that tech leaders espouse the brilliance of LLMs and don't use this specific test method is infuriating to me. It is deeply unfortunate that there is little transparency or standardization of the datasets available for training/fine tuning.

Having this be advertised will make for more interesting and informative benchmarks. OEM models that are always "breaking" the benchmarks are doing so with improved datasets as well as improved methods. Without holding the datasets fixed, progress on benchmarks is very suspect IMO.

feisty0630 · 2026-01-12T16:52:28 1768236748

I fail to see how the two concepts equate.

LLMs have neither intelligence nor problem-solving ability (and I won't be relaxing the definition of either so that some AI bro can pretend a glorified chatbot is sentient).

You would, at best, be demonstrating that the sharing of knowledge across multiple disciplines and nations (which is a relatively new concept - at least at the scale of something like the internet) leads to novel ideas.

al_borland · 2026-01-12T16:57:50 1768237070

I've seen many futurists claim that human innovation is dead and all future discoveries will be the results of AI. If this is true, we should be able to see AI trained on the past figure its way to various things we have today. If it can't do this, I'd like said futurists to quiet down, as they are discouraging an entire generation of kids who may go on to discover some great things.

skissane · 2026-01-12T17:21:20 1768238480

> I've seen many futurists claim that human innovation is dead and all future discoveries will be the results of AI.

I think there's a big difference between discoveries through AI-human synergy and discoveries through AI working in isolation.

It probably will be true soon (if it isn't already) that most innovation features some degree of AI input, but still with a human to steer the AI in the right direction.

I think an AI being able to discover something genuinely new all by itself, without any human steering, is a lot further off.

If AIs start producing significant quantities of genuine and useful innovation with minimal human input, maybe the singularitarians are about to be proven right.

thinkingemote · 2026-01-12T18:39:58 1768243198

I'm struggling to get a handle on this idea. Is the idea that today's data will be the data of the past, in the future?

So if it can work with what's now past, it will be able to work with the past in the future?

al_borland · 2026-01-12T21:08:21 1768252101

Essentially, yes.

The prediction is that AI will be able to invent the future. So if we give it data from our past without knowledge of the present... what type of future will it invent, what progress will it make, if any at all? And not just having the idea, but how to implement the idea in a way that actually works with the technology of the day, and can build on those things over time.

For example, would AI with 1850 data have figured out the idea of lift to make an airplane and taught us how to make working flying machines and progress them to the jets we have today, or something better? It wouldn't even be starting from 0, so this would be a generous example, as da Vinci was playing with these ideas in the 15th century.

If it can't do it, or what it produces is worse than what humans have done, we shouldn't leave it to AI alone to invent our actual future. Which would mean reevaluating the role these "thought leaders" say it will play, and how we're educating and communicating about AI to the younger generations.