More

carlsborg · 2026-06-12T17:37:50 1781285870

> AUR is just a collection of user-produced PKGBUILDs.

Is that much different from the entire pypi ecosystem, and npm, and dockerhub (people disable Selinux, --privileged turns off seccomp and apparmour, sandbox escape CVES exist)?

thewebguyd · 2026-06-12T17:42:03 1781286123

Not much different no, and people have equally bad practices around programming package managers as well.

The entire dev ecosystem has terrible security hygiene, largely because of the pressure to move fast and real security controls by their nature limit flexibility and can slow most processes down.

carlsborg · 2026-06-12T17:28:19 1781285299

Pipeline is then: Cheap open source model for flagging potential LLM refusal content -> main LLM check

manquer · 2026-06-12T22:28:23 1781303303

How will flagging help?

The main llm will refuse to scan for issues flagged or not, and the cheap model not do a good enough scan on its own.

For models designed/marketed for cybersecurity defensive uses, any predictable refusal mechanism is a vulnerability. It is like being able to cause a kernel panic or segmentation fault .

Even if the gate is fail-reject, an attacker can overwhelm HITL reviews with many false positives and use DoS vectors here.

05 · 2026-06-13T01:22:01 1781313721

Cheap model replaces trigger words with something innoculous. Of course, this breaks dynamic analysis if malware has unpatched integrity checks

carlsborg · 2026-06-03T20:27:22 1780518442

Intriguing... "After months of misdiagnoses Dr Souhel Najjar, employs a test asking Susannah to draw a clock. Instead of the customary clock face, her condition led her to draw all the numbers 1 through 12 on the right side of the clock. This was the breakthrough moment; it was this clock drawing that enabled Dr Najjar to understand that the right side of Susannah’s brain was inflamed, further test revealed this inflammation was a result of anti-NMDA receptor encephalitis, initiating her path to recovery"

carlsborg · 2026-05-19T08:22:30 1779178950

"Memory plus search is all you need"

carlsborg · 2026-05-03T16:34:20 1777826060

Now the echo of my system prompts are bouncing round the room.

edit: Bouncing around the room is one of their hit songs. Give it a listen.

carlsborg · 2026-04-22T07:05:14 1776841514

"since 2019, on the advice of the National Agency for the Safety of Medicines and Health Products, French health workers have been told not to treat fever or infections with ibuprofen." [1]

But yet in some countries pediatricians will libreally prescribe it to toddlers

[1] https://www.bmj.com/content/368/bmj.m1086

Also from [2] "In this systematic review of NSAID use during acute lower respiratory tract infections in adults, we found that the existing evidence for mortality, pleuro-pulmonary complications and rates of mechanical ventilation or organ failure is of extremely poor quality, very low certainty and should be interpreted with caution."

https://bpspubs.onlinelibrary.wiley.com/doi/10.1111/bcp.1451...

KaiserPro · 2026-04-22T07:14:12 1776842052

One of the problems is that if you give it to kids with chicken pox it can cause complications. There was also some hints early in the pandemic that ibuprofen had a similar effect on covid-19. However as you link to, the data doesn't really support that view anymore.

carlsborg · 2026-04-05T09:34:54 1775381694

Perhaps in due time we will see workload specific forks of Linux maintained by a team of agents

carlsborg · 2026-03-27T18:58:40 1774637920

Anthropic/OpenAI could own this space. They should offer a paid service that offers a mirror with LLM scanned and sandbox-evaluated package with their next gen models. Free for individuals, orgs can subscribe to it.

oblvious-earth · 2026-03-27T19:12:47 1774638767

OpenAI just acquired Astral who have an index service called pyx, so they would have a step up.

My understanding though is most corporations that take security seriously either build everything themselves in a sandbox, or use something like JFrog's Artifactory with various security checks, and don't let users directly connect to public indexes. So I'm not sure what the market is.

doc_ick · 2026-03-27T19:16:08 1774638968

There’s also virustotal, any.run, probably a few others outside of GitHub/gitlab scans

dmitrygr · 2026-03-27T22:43:29 1774651409

Detecting properly-written malicious code is undecidable. No amount of snake oil fixes that

johndough · 2026-03-27T19:29:46 1774639786

Judging by curl shutting down its bug bounty program due to AI slop, a likely outcome would be that this mirror has no packages because they are all blocked by false positives.

andrepd · 2026-03-27T19:23:31 1774639411

Genuinely cannot tell whether this is satire.

firesteelrain · 2026-03-27T19:27:20 1774639640

Own what space ?

carlsborg · 2026-03-23T19:23:00 1774293780

> “ The agent acted like a hyperparameter optimization algorithm with some basic reasoning baked in.”

Good lens.

The crux of the auto research repo is basically one file - program.md which is a system prompt that can be summarized as “do this in a loop: improve train.py, run the training, run evals, record result. Favor simplicity”. The other files are an arbitrary ML model that is being trained.

MITSardine · 2026-03-24T09:55:24 1774346124

This is something I could almost never be bothered to do before, but I can now very lazily set up large parameter sweeps and visualization scripts to really probe things. There's a danger of "analysis paralysis" but I've still found it quite useful. Although I'm not sure it saves me time as much as sanity.

carlsborg · 2026-03-21T20:53:36 1774126416

3I/ATLAS first detected on: July 1 2025

Gamma ray burst that kept going for seven hours, fired three distinct bursts spread across an entire day: July 2 2025

just saying

EA-3167 · 2026-03-21T21:02:01 1774126921

This event originated in a different galaxy.

stouset · 2026-03-23T21:23:34 1774301014

Obviously this was the distant alien civilization remotely beaming power to the probe they sent to our solar system :)

VladVladikoff · 2026-03-21T23:04:23 1774134263

Thanks Avi, very cool!