martinald · 2026-05-31T01:18:43 1780190323

Why is this a paper? It's just using the n-cpu-moe option on llama.cpp? What am I missing here?

Farmadupe · 2026-05-31T01:59:16 1780192756

It's amazingly vacuous isn't it? I think the most interesting read was the fact that they were surprised llama.cpp crashed when they used a bad set of commandline arguments.

Although in the section immediately above the observation they claimed that they ran 10 whole completions with 100% success rate. So who knows.

I have to admit I slightly miss the flood of AI-psychosis research papers that seemed to be popping up a couple of months ago. Good to know there's still one or two new ones floating around.

LoganDark · 2026-05-31T02:42:53 1780195373

Apparently the author has a patent about it, too.

martinald · 2026-05-29T22:05:47 1780092347

Hi! Big fan of OpenRouter and the data you provide. It'd be awesome if you would consider providing volume of tokens per hour, mostly for my own curiosity as to quite how peaky demand is.

Thanks!

martinald · 2026-05-28T12:52:08 1779972728

How is it surprising to people that zip and XML are in stdlibs for a programming language?

Btw, you should have looked at dotnet for this as well. There is a very good library (DocumentFormat.OpenXml) that can handle all docx/xlsx/pptx files. And dotnet can ship standalone binaries (though AOT probably won't work).
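To illustrate the zip-plus-XML point with nothing but a standard library, here is a minimal sketch in Python (chosen only as an example of a batteries-included stdlib; the thread itself is about other languages). It assumes a deliberately incomplete package: a single `word/document.xml` part, without the `[Content_Types].xml` and relationship parts a real .docx needs. The function names `build_package` and `read_text` are hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by .docx document parts.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def build_package(text: str) -> bytes:
    """Build a tiny zip holding one WordprocessingML-style XML part.

    Not a complete .docx: it only demonstrates the zip + XML layering.
    """
    ET.register_namespace("w", W_NS)
    document = ET.Element(f"{{{W_NS}}}document")
    body = ET.SubElement(document, f"{{{W_NS}}}body")
    para = ET.SubElement(body, f"{{{W_NS}}}p")
    run = ET.SubElement(para, f"{{{W_NS}}}r")
    ET.SubElement(run, f"{{{W_NS}}}t").text = text

    xml_bytes = ET.tostring(document, encoding="utf-8", xml_declaration=True)
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", zipfile.ZIP_DEFLATED) as zf:
        zf.writestr("word/document.xml", xml_bytes)
    return buf.getvalue()


def read_text(package: bytes) -> list[str]:
    """Open the zip, parse the XML part, and collect the text runs."""
    with zipfile.ZipFile(io.BytesIO(package)) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    return [t.text or "" for t in root.iter(f"{{{W_NS}}}t")]
```

Usage would look like `read_text(build_package("hello"))`. A dedicated library such as the one mentioned above is still the better choice for real documents, since it handles the parts this sketch leaves out.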

pier25 · 2026-05-28T14:13:04 1779977584

Many runtimes/languages rely on third party deps for that. Also plenty of devs think the stdlib should be as lean as possible.

Personally, I think there should be a balance. The direct consequence of a barebones stdlib is NPM and having to download hundreds of dependencies for a hello world.

wky · 2026-05-28T15:35:54 1779982554

Golang has the golang.org/x packages, which avoids too much stdlib bloat while still providing the niceties of “pre-vetted” packages that don’t pull in a massive dependency tree.

tanin · 2026-05-29T04:29:02 1780028942

Yeah, I try to avoid 3rd party libraries unless they come from credible sources/companies or are vetted by famous frameworks, e.g. Rails uses them.

Not just for NPM but for IntelliJ plugins, VS Code extensions, etc.

If Node has officially pre-vetted libraries, that would be really awesome.

pier25 · 2026-05-28T16:39:32 1779986372

yeah I wish Node had something like that

slopinthebag · 2026-05-28T18:37:21 1779993441

It does! (well Deno does but you can use them in Node)

https://jsr.io/@std

pier25 · 2026-05-28T20:11:30 1779999090

support is not uniform though

eg: https://jsr.io/@std/dotenv

rjrjrjrj · 2026-05-28T15:01:01 1779980461

Difficult to square the author's surprise with the later comment "I have my fair share of building a Java desktop application and know jpackage and alike very well"

You can't get very far in Java development without working with .jar files (which are zip archives).

CharlieDigital · 2026-05-28T14:20:09 1779978009

https://github.com/dotnet/Open-XML-SDK

First party from Microsoft; feels like it would be the way to go.

xnorswap · 2026-05-28T15:39:33 1779982773

That is the source of DocumentFormat.OpenXml, you're talking about the same package, https://github.com/dotnet/Open-XML-SDK#packages

Microsoft are incapable of:

    1. naming things well
    2. keeping those names stable

martinald · 2026-05-26T19:11:04 1779822664

They can (not entirely sure how 'grey market' this is) either have subsidiaries outside of China (eg: Singapore) that provide the inference and/or just rent it off the public GPU clouds.

martinald · 2026-05-26T13:01:51 1779800511

Totally agree. There are days (or even afternoons) where I trigger more actions than I would have done in a month.

martinald · 2026-05-15T16:47:43 1778863663

Two thoughts (I was in the same situation, constantly trying desktop Linux then pinging back to Windows after hitting issues).

1) Fedora is really worth a try, it's extremely polished. The best thing is the packages in the repo are generally much more up to date than on Debian-based distros, which means fewer random PPAs to work around it, which cause issues.

2) The biggest change is having Claude Code/Codex able to diagnose and tweak things extremely quickly. If something goes wrong, I ask Claude Code (in a specific folder with various docs about workarounds) and it goes and fixes it 99% of the time very quickly.

Coding agents being able to fix Linux actually makes it _more_ stable than Windows for me. In my experience Windows is less buggy _in general_ than desktop Linux.[1] However, once you hit random issues you are basically screwed if basic attempts don't work. With Linux you can have a coding agent go thru all the reams of logs to find the issue and even clone the underlying source code to find issues.

[1] For example, there is some ridiculous problem with wayland and notifications on GNOME at least, see this: https://gitlab.gnome.org/GNOME/gnome-shell/-/work_items/358?... which has to be disabled with an extension unless you want to go insane

gbro3n · 2026-05-15T17:00:17 1778864417

Similar to this, but NixOS. Having AI to help me through made this so much easier. I was sold on the idea of an 'IaC' config based machine, after a general push to move all of my processes towards full GitOps. Windows had been pretty good to me (running 4 years on the same laptop install), but it had started overheating with fans sounding like a vacuum cleaner and it was time to start over. The difference with NixOS is that sure there are issues and preferences to work through, but when I fix them, they get committed in config, and that's an investment in time rather than something that will be right until next time I have to do it. I was able to reproduce, and rebuild on a separate machine with minimal hassle (it's good to go through this process to be sure you've got it right) and that really was amazing to see. SSH keys, SMB share, monitor configurations, themes, apps, utilities - a fully set up dev machine, everything is just there. I've been planning changes on a Copilot integrated taskboard I built (https://www.agentkanban.io) and then handing them off to the agent, reviewing the changes in the VS Code git client and then apply, commit. Being able to see the number of commits makes me realise how much I was doing manually, every time I set up a new machine.

isityettime · 2026-05-16T03:36:09 1778902569

In my first years with NixOS, I really wanted to share my enthusiasm and joy with others. But I often got caught up explaining the mechanisms, the insights that it's built on. It took me a while to come around to realizing the heart of it: NixOS feels really good to use because it frees you from the uncertainty of a machine that suddenly changes underneath your feet. It gives you a system that is really easy to inspect and modify. It gives you the ability to move forward without fear. It's really liberating and relaxing in a certain way.

Historically, a sense of intimidation and the difficulty of onboarding blocked a lot of people from accessing these benefits. It's been cool to see how LLM agents have helped a lot of people get started and tap into the fun parts.

kuglimon · 2026-05-16T09:43:46 1778924626

My experience with explaining NixOS is that once you get to the benefits people get intimidated and start telling how they never have these issues.

I think immutable distros/software are the most intuitive way, even if people get intimidated with the idea. What's the action you did as a junior when windows/linux/x broke? You most likely reinstalled. When encountering issues we tend to try to emulate immutability.

The sad thing about NixOS is that there's 0% chance of seeing it used at work. Even among devops people you're lucky if people can edit a Dockerfile. Good luck spending the next 3 years explaining it might look like json but it's not, and it would help to learn the syntax.

mplanchard · 2026-05-16T15:25:26 1778945126

We used nix for developer environments, and nix to define docker containers for our production services. The biggest problem is the “nix people” can become blockers if there is an issue and they aren’t around to figure it out, because most engineers are just not going to learn nix.

However, we felt the benefit was worth the cost, overall, especially because we used the same flakes and therefore versions across all dev and deployed envs, and we didn’t have to deal with the hassle and performance issues of running all the dev services in docker containers.

isityettime · 2026-05-16T18:07:39 1778954859

I'm the Nix guy on my small team at work and I'd love to train any of my teammates on Nix stuff. One of them already has some Nix experience and I make sure to walk him through my Nix work whenever I finish something new. But it seems like no one is really interested in truly diving in like I did when I was new to Nix.

I don't really know why that is, because it doesn't seem to have to do with intelligence or general experience with adjacent skills. I think it's probably at least 30% down to temperament, but it may also be because my team is mostly fairly senior and each person has a lot of responsibilities of their own.

On the bright side, the tooling is easy enough that nonetheless, my team generally has no problems spinning up new Nix environments without me, or making small changes to the ones I've set up. And LLM agents are now good enough with Nix that I'm confident they can unblock themselves until my return whenever I'm on vacation or whatever.

mplanchard · 2026-05-16T18:44:36 1778957076

Yeah, similar experience here. One pattern that was helpful for us was defining a nix file that just contained a mapping of names to lists of packages: dev, ci, and so on, so that if you wanted to install something that was on the nix search page, you could just go toss it in there and immediately get it locally. It also made it clear which package sets would wind up where, and allowed recursion so you could say that like dev was rust-base ++ rust-dev ++ js ++ common, or whatever.
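The named-package-set pattern described above can be sketched outside of Nix as well. The following is a rough Python analogue, not the poster's actual Nix file: the set names mirror the ones mentioned in the comment, while the package names, the `@name` reference convention, and the `resolve` function are all assumptions made up for illustration.

```python
# Hypothetical mapping of set names to package lists. An entry starting
# with "@" refers to another set (an invented convention for this sketch).
PACKAGE_SETS = {
    "common": ["git", "jq"],
    "rust-base": ["cargo", "rustc"],
    "rust-dev": ["rust-analyzer"],
    "js": ["nodejs"],
    "dev": ["@rust-base", "@rust-dev", "@js", "@common"],
    "ci": ["@rust-base", "@common"],
}


def resolve(name, sets=PACKAGE_SETS, _stack=()):
    """Expand a named set into a flat, de-duplicated list of packages."""
    if name in _stack:
        cycle = " -> ".join(_stack + (name,))
        raise ValueError(f"cyclic package set reference: {cycle}")
    result = []
    for entry in sets[name]:
        if entry.startswith("@"):
            items = resolve(entry[1:], sets, _stack + (name,))
        else:
            items = [entry]
        for item in items:
            if item not in result:  # keep first occurrence, preserve order
                result.append(item)
    return result
```

With this layout, adding a tool to every environment means editing one list (`common` here), and `resolve("ci")` makes it easy to see exactly which packages end up in CI.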

isityettime · 2026-05-16T13:27:54 1778938074

Currently no NixOS at work for me, but my team is using Nix for local development environments and CI successfully via devenv.nix. I think we actually could use NixOS on my team except that we have a host of endpoint security software that we need to run on all cloud VMs, and I'm a little distant from the people who own the automation that installs it on approved AMIs.

throwaway2037 · 2026-05-18T09:48:07 1779097687

I can second your comment about Fedora. One underrated point: Their drivers are always the best among all free distros. I'm not sure if their team is the most aggressive to upstream new drivers, or manufacturers target Fedora first. And I write this post as a regular Debian user. However, if I have a piece of hardware that has terrible drivers on Debian, almost always it will work fine on Fedora. (Grumble!)

graceful6800 · 2026-05-16T22:47:58 1778971678

Genuine question, how is everyone else dealing with giving agents elevated permissions? Obviously the answer is "don't", but some things are pretty harmless, like journalctl and dmesg, and are pretty useful for debugging the system.

I guess you could make a new user to run the harness under and give it no-password sudo rights for select commands? That doesn't feel like a great solution but it's the only thing I can come up with
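One way to picture the "select commands only" idea is a small gate in front of sudo. This is a minimal sketch under assumptions, not a hardened solution: the `ALLOWED` set and the `sudo_argv` helper are hypothetical, and the check looks only at the program name. It relies on `sudo -n`, which fails instead of prompting when a password would be required.

```python
import shlex

# Hypothetical allowlist of diagnostic programs the agent may run as root.
ALLOWED = {"journalctl", "dmesg"}


def sudo_argv(command: str) -> list[str]:
    """Return the argv to execute, or raise if the program is not allowed.

    The command is split without invoking a shell, so ';' and '|' are
    never interpreted as shell operators.
    """
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED:
        raise PermissionError(f"not on the allowlist: {command!r}")
    # -n: never prompt for a password; "--" ends sudo's own options.
    return ["sudo", "-n", "--", *argv]
```

The result can be passed to `subprocess.run(...)` as a list. Note that this does not filter flags: `dmesg -C`, for instance, clears the kernel ring buffer, so a real setup would also need to pin the permitted arguments (in the sudoers rule or in the gate itself).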

nijave · 2026-05-17T15:59:18 1779033558

I give it passwordless sudo but don't give it permissions to automatically run commands. I just review what it's running before accepting. If it looks questionable, I interrupt and ask it to explain what/why it wants to run the thing

martinald · 2026-05-17T15:43:00 1779032580

I use sudo -A with some openssh ui for sudo. I tell the agent to use sudo -A for anything that it needs and then it pops up with a sudo password prompt.

retrochameleon · 2026-05-15T17:29:18 1778866158

Using Linux is a learning experience. You will inevitably face and solve numerous problems over time, but every time you do, you come out of it understanding what's going on under the hood a little more.

Still, it can be dreadful to face even small issues when you only feel like using your computer and not fixing it. Having an LLM agent help with fixing issues is a lifesaver. Ask it what you don't understand, take note of the commands it uses or suggests while troubleshooting and fixing your issue, and you'll supercharge your learning and not get as hung up on the issues.

If someone doesn't care much to learn though, I'd say Linux is still tough to recommend.

ErroneousBosh · 2026-05-16T08:24:11 1778919851

> Using Linux is a learning experience.

Do you think that Windows *isn't* a learning experience?

marysol5 · 2026-05-16T10:10:25 1778926225

As someone who's done the whole Helpdesk -> Sysadmin stuff, working around other "Windows Admins" they don't learn much, all they know is "Deleting this thing seemed to fix" "Reboot". They don't get any of the fundamentals of what caused an issue, or how to diagnose them properly.

bigstrat2003 · 2026-05-17T05:17:41 1778995061

I spent a lot of my career in the Windows admin space. That isn't true. Most Windows admins are sharp dudes, just like most Linux admins are sharp dudes. And a minority of Windows admins are slackers who don't try to learn, but a minority of Linux admins are slackers who don't try to learn.

Telaneo · 2026-05-16T11:14:53 1778930093

Most of my experience with Windows has been googling for solutions rather than understanding the underlying system. So I guess it's a learning experience, but not in learning Windows itself, as much as learning to use search engines.

I do remember common fixes for various things, but not much of it can be extrapolated to other issues in my experience.

f33d5173 · 2026-05-16T14:36:49 1778942209

To your point 1, they claimed to have been using fedora on their most recent attempt.

martinald · 2026-05-13T20:57:38 1778705858

Currently, if you use claude -p (non-interactive mode) in, for example, CI/CD, you can use your included subscription tokens.

They are now changing it to be:

You get $20/$100/$200 of "credit" that can be used for claude -p. Problem is, once you are out of that it is the normal API rates (outrageously expensive).

martinald · 2026-05-12T13:51:04 1778593864

I get this, but isn't this a complete compliance failure?

What's the point of having all those loops to onboard vendors if you can just buy from AWS marketplace (which AFAIK is not a particularly high bar to achieve for SaaS options)?

Like imagine $POOR_QUALITY_VENDOR. If they go through the normal channels they might get shot down. If they get procured on AWS Marketplace, then it feels to me in many organisations 'it's fine', though AWS does minimal checking?

martinald · 2026-05-09T15:17:40 1778339860

I wrote this recently which maybe people will enjoy in the same vein :) https://martinalderson.com/posts/august-29-2026-a-scenario/

martinald · 2026-05-09T15:08:44 1778339324

Powerline is in my experience vastly worse than WiFi in nearly all cases. It's slow, suffers from bad jitter/interference (often worse than WiFi), and the chips run very hot (especially the last gen ones, AV2000 iirc; I believe they don't sell them any more because they overheat and fail, or at least 2/2 of the ones I had did this).

Even with many walls I was getting 300-400 Mbit/s on WiFi vs 100 Mbit/s on powerline.