wdym? Nobody's paying me or rewarding me for using these tokens. I had some spare in my subscription limit (we're not on token pricing), so I decided to try an ambitious task that may reduce our CI times and improve our DX significantly. That's hardly "the entire token-maxxing AI hype train in a nutshell".
I’m curious when folks will tire of lighting money on fire. Companies are already starting to scale back a bit, but the AI companies are still nowhere near profitability.
Maybe I missed something, but this article felt more like an ad for their modern matchbox designs, versus any sort of gallery of older ones - save for a collage near the end.
That's why I said "regular police". My understanding is that generally no one (including the police) is allowed to carry a gun in public unless someone already illegally introduced them into play. That's the world I want to live in.
I initially thought the same, but apparently with the inaccuracies inherent to floating-point arithmetic and various other such accuracy leakage, it’s not true!
This has nothing to do with FP inaccuracies, and your link does confirm that:
“Although the use of multiple GPUs introduces some randomness (Nvidia, 2024), it can be eliminated by setting random seeds, so that AI models are deterministic given the same input. […] In order to support this line of reasoning, we ran Llama3-8b on our local GPUs without any optimizations, yielding deterministic results. This indicates that the models and GPUs themselves are not the only source of non-determinism.”
I believe you've misread - the Nvidia article and your quote support my point. Only by disabling the fp optimizations, are the authors are able to stop the inaccuracies.
First, the “optimizations” are not IEEE 754 compliant. So nondeterminism with floating-point operations is not an inherent property of using floating-point arithmetics, it’s a consequence of disregarding the standard by deliberately opting in to such nondeterminism.
Secondly, as I quoted the paper is explicitly making the point that there is a source of nondeterminism outside of the models and GPUs, hence ensuring that the floating-point arithmetics are deterministic doesn’t help.
Yes it’s LTS but the point is that the LTS system has overlapping support so you can wait on an older LTS for a bit before upgrading to a newer one. And it’s somewhat prudent to do so if you value stability highly, because often a few new issues will be discovered and patched after LTS goes live for a bit.
Feels like a lot of words to avoid thinking about “black” money and favors in kind. For example, nobody would include Trump’s golden bar from Switzerland in such ann estimate - repeated ad nauseam for all lobbying corruption.
You say this, and yet there are no real comments i.e. discussion in either of them? This must be the HN equivalent of Stack Overflow's infamous "closed as duplicate".
This presumes that the labs themselves know how well their models perform. But all they have are overtuned benchmarks and hype vibes.
reply