Local privacy respecting inference can be worth it. I use a local model to log everything I do all week to automate my timesheet. I also have it do a bunch of other data tasks. I won't say that larger SOTA models wouldn't do these tasks better than a local model but PII is a concern and my employer wouldn't approve of me just setting tokens on fire everyday to do what I could do myself.
Not at all! My company has 100s of clients and we track time in 6 minute increments. I feed in my browser history, terminal logs, session scripts, calendar, git commits, etc etc into it and voila it produces a highly accurate timesheet in no time flat.
Automating it has been way better for me than the alternative of breaking my flow whenever I'm switching tasks to chart my time, or logging all my hours for the week in one sitting. Different strokes for different folks I suppose.
> My company has 100s of clients and we track time in 6 minute increments.
That’s an insane level of detail and I can understand why it makes sense to use an LLM to remove that busywork, otherwise you’d be spending half of your time on this bureaucracy.
For the company I would imagine it could be more cost efficient to stop being so onerous instead of burning tokens.
I thought these stopped working altogether with the release of Chrome 142? I know you could override it for awhile there, but I've been lead to believe that option is gone.
This is part of what forced my hand and made Firefox my daily driver, at least for personal use.
It works fine in Chrome 148. Earlier Chrome versions removed chrome://flags versions of the above even with "Temporarily unexpire M147 flags" (and similar), but command line invocation continues to work.
Awesome, thank you so much for that! I'll have to dig in and get this working for my work machine where I still have to use Chrome. I literally quit using Chrome for personal use over this but I guess it was premature.
Yes, it's been a way better deal to go for a subscription than pay as you go for me in the past. I had a month where I burnt through ~3.8b tokens which was somewhere in the ballpark of $8k worth of savings.
Now though I don't dare use spend tokens for basic note taking with Sonnet because I'm hitting the limit over a couple million tokens on the 20x plan, so they've really tightened the purse strings since November.
They must mean by creating a composite image with multiple in focus areas? Otherwise I agree, I can't see anyway that multiple exposures would help, at least from some light reading on Wikipedia - https://en.wikipedia.org/wiki/Multiple_exposure
I think some of this comes down to undeclared A/B testing. I've had the worst week of interactions I have ever had using Claude Code. The whole week whenever I have a session that isn't failing miserably I seem to get tapped for a session survey but on any that are out and out shitting the bed it never asks. It has felt a little surreal. I'd love to see a product wide stats graph for swearing, I would 100% believe that it is hitting an all time high but maybe I'm just a victim of a bad A/B round.
Oh I’ve been getting a lot more of those too lately even though I dismiss it every time. Wonder if I should report not satisfied every time so that I get routed to something better…
Not at all! My shop is having one of its best years ever. We've been switching to value based billing with our clients which has made them happier and us more profitable.
Using AI powered prototypes to sell clients on new features has gone really well for us. Many of our clients have presold themselves on adding AI features of some kind, which is nice because it generally means they will need support for these features going forwards.
I think it means charging based on the value of the work to the client not just the cost of doing the work.
That means if you get fast and efficient at doing something valuable for the clients you get to enjoy better margins because your costs are relatively low but the value is high so the customers still happily pay a good amount for it.
reply