• jasonjmcghee 5 hours ago

    Surprised that "controlling cost" isn't a section in this post. Here's my attempt.

    ---

    If you get a hang of controlling costs, it's much cheaper. If you're exhausting the context window, I would not be surprised if you're seeing high cost.

    Be aware of the "cache".

    Tell it to read specific files (and only those!), if you don't, it'll read unnecessary files, or repeatedly read sections of files or even search through files.

    Avoid letting it search - even halt it. Find / rg can have a thousands of tokens of output depending on the search.

    Never edit files manually during a session (that'll bust cache). THIS INCLUDES LINT.

    The cache also goes away after 5-15 minutes or so (not sure) - so avoid leaving sessions open and coming back later.

    Never use /compact (that'll bust cache, if you need to, you're going back and forth too much or using too many files at once).

    Don't let files get too big (it's good hygiene too) to keep the context window sizes smaller.

    Have a clear goal in mind and keep sessions to as few messages as possible.

    Write / generate markdown files with needed documentation using claude.ai, and save those as files in the repo and tell it to read that file as part of a question. I'm at about ~$0.5-0.75 for most "tasks" I give it. I'm not a super heavy user, but it definitely helps me (it's like having a super focused smart intern that makes dumb mistakes).

    If i need to feed it a ton of docs etc. for some task, it'll be more in the few $, rather than < $1. But I really only do this to try some prototype with a library claude doesn't know about (or is outdated). For hobby stuff, it adds up - totally.

    For a company, massively worth it. Insanely cheap productivity boost (if developers are responsible / don't get lazy / don't misuse it).

    • BeetleB 4 hours ago

      Oh wow. Reading your comment guarantees I'll never use Claude Code.

      I use Aider. It's awesome. You explicitly specify the files. You don't have to do work to limit context.

      • Jerry2 a minute ago

        >I use Aider. It's awesome.

        What do you use for the model? Claude? Gemini? o3?

        • boredtofears 4 hours ago

          Yeah, I tried CC out and quickly noticed it was spending $5+ for simple LLM capable tasks. I rarely break $1-2 a session using aider. Aider feels like more of a precision tool. I like having the ability to manually specify.

          I do find Claude Code to be really good at exploration though - like checking out a repository I'm unfamiliar with and then asking questions about it.

        • pclmulqdq 2 hours ago

          It's interesting that this is a problem for people because I have never spent more than about $0.50 on a task with Claude Code. I have pretty good code hygiene and I tell Claude what to do with clear instructions and guidelines, and Claude does it. I will usually go through a few revisions and then just change anything myself if I find it not quite working. It's exactly like having an eager intern.

          • jjmarr an hour ago

            I don't think about controlling cost because I price my time at US$40/h and virtually all models are cheaper than that (with the exception of o1 or Gemini 2.5 pro).

            If I spend $2 instead of $0.50 on a session but I had to spend 6 minutes thinking about context, I haven't gained any money.

            • jasonjmcghee an hour ago

              If you do it a bit, it just becomes habit / no extra time or cognitive load.

              Correlation or causation aside, the same people I see complain about cost, complain about quality.

              It might indicate more tightly controlled sessions may also produce better results.

              Or maybe it's just people that tend to complain about one thing, complain about another.

              • owebmaster an hour ago

                Important to remind people this is only true if you have a profitable product, otherwise you’re spending money you haven’t earned.

                • jasonjmcghee an hour ago

                  If your expectation is to produce the same amount of output, you could argue when paying for AI tools, you're choosing to spend money to gain free time.

                  4 hours coding project X or 3 hours and a short hike with your partner / friends etc

              • chewz an hour ago

                My attempt is - Do not use Claude Code at all, it is terrible tool. It is bad at almost everything starting with making simple edits to files.

                And most of all Claude Code is overeager to start messing with your code and run unnecessary $$ instead of making sensible plan.

                This isn't problem with Claude Sonnet - it is fundamnetal problem with Claude Code.

                • winrid an hour ago

                  I pretty much one shot a scraper from an old Joomla site with 200+ articles to a new WP site, including all users and assets, and converting all the PDFs to articles. It cost me like $3 in tokens.

                • kiratp an hour ago

                  The productivity boost can be so massive that this amount of fiddling to control costs is counterproductive.

                  Developers tend to seriously underestimate the opportunity cost of their own time.

                  Hint - it’s many multiples of your total compensation broken down to 40 hour work weeks.

                  • bugglebeetle 5 hours ago

                    If I have to spend this much time thinking about any of this, congratulations, you’ve designed a product with a terrible UI.

                    • tetha an hour ago

                      Mh. Like, I'm deeply impressed what these AI assistants can do by now. But, the list in the parent comment there is very similar to my mental check-list of pair-programming / pair-admin'ing with less experienced people.

                      I guess "context length" in AIs is what I intuitively tracked with people already. It can be a struggle to connect the Zabbix alert, the ticket and the situation on the system already, even if you don't track down all the zabbix code and scripts. And then we throw in Ansible configuring the thing, and then the business requriements by more, or less controlled dev-teams. And then you realize dev is controlled by impossible sales-terms.

                      These are scope -- or I guess context -- expansions that cause people to struggle.

                      • jasonjmcghee 5 hours ago

                        Some tools take more effort to hold properly than others. I'm not saying there's not a lot of room for improvement - or that the ux couldn't hold the users hand more to force things like this in some "assisted mode" but at the end of the day, it's a thin, useful wrapper around an llm, and llms require effort to use effectively.

                        I definitely get value out of it- more than any other tool like it that I've tried.

                        • oxidant an hour ago

                          Think about what you would do in an unfamiliar project with no context and the ticket

                          "please fix the authorization bug in /api/users/:id".

                          You'd start by grepping the code base and trying to understand it.

                          Compare that to, "fix the permission in src/controllers/users.ts in the function `getById`. We need to check the user in the JWT is the same user that is being requested"

                          • sqs 5 hours ago

                            It's fundamentally hard. If you have an easy solution, you can go make a easy few billion dollars.

                          • datavirtue 2 hours ago

                            GitHub copilot follows your context perfectly. I don't have to tell it anything about files. I tried this initially and it just screwed up the results.

                          • zoogeny 5 hours ago

                            So I have been using Cursor a lot more in a vibe code way lately and I have been coming across what a lot of people report: sometimes the model will rewrite perfectly working code that I didn't ask it to touch and break it.

                            In most cases, it is because I am asking the model to do too much at once. Which is fine, I am learning the right level of abstraction/instruction where the model is effective consistently.

                            But when I read these best practices, I can't help but think of the cost. The multiple CLAUDE.md files, the files of context, the urls to documentation, the planning steps, the tests. And then the iteration on the code until it passes the test, then fixing up linter errors, then running an adversarial model as a code review, then generating the PR.

                            It makes me want to find a way to work at Anthropic so I can learn to do all of that without spending $100 per PR. Each of the steps in that last paragraph is an expensive API call for us ISV and each requires experimentation to get the right level of abstraction/instruction.

                            I want to advocate to Anthropic for a scholarship program for devs (I'd volunteer, lol) where they give credits to Claude in exchange for public usage. This would be structured similar to creator programs for image/audio/video gen-ai companies (e.g. runway, kling, midjourney) where they bring on heavy users that also post to social media (e.g. X, TikTok, Twitch) and they get heavily discounted (or even free) usage in exchange for promoting the product.

                            • Wowfunhappy 36 minutes ago

                              > So I have been using Cursor a lot more in a vibe code way lately and I have been coming across what a lot of people report: sometimes the model will rewrite perfectly working code that I didn't ask it to touch and break it.

                              I don't find this particularly problematic because I can quickly see the unnecessary changes in git and revert them.

                              Like, I guess it would be nice if I didn't have to do that, but compared to the value I'm getting it's not a big deal.

                              • zoogeny 21 minutes ago

                                I agree with this in the general sense but of course I would like to minimize the thrash.

                                I have become obsessive about doing git commits in the way I used to obsess over Ctrl-S before the days of source control. As soon as I get to a point I am happy, I get the LLM to do a check-point check in so I can minimize the cost of doing a full directory revert.

                                But from a time and cost perspective, I could be doing much better. I've internalized the idea that when the LLM goes off the rails it was my fault. I should have prompted it better. So I am now consider: how do I get better faster? And the answer is I do it as much as I can to learn.

                                I don't just want to whine about the process. I want to use that frustration to help me improve, while avoiding going bankrupt.

                              • istjohn 3 hours ago

                                Why do you think it's supposed to be cheap? Developers are expensive. Claude doesn't have to be cheap to make software development quicker and cheaper. It just has to be cheaper than you.

                                There are ways to use LLMs cheaply, but it will always be expensive to get the most out of them. In fact, the top end will only get more and more costly as the lengths of tasks AIs can successfully complete grows.

                                • qudat 9 minutes ago

                                  > It just has to be cheaper than you.

                                  Not when you need an SWE in order for it to work successfully.

                                  • zoogeny 3 hours ago

                                    I am not implying in any sense a value judgement on cost. I'm stating my emotions at the realization of the cost and how that affects my ability to use the available tools in my own education.

                                    It would be no different than me saying "it sucks university is so expensive, I wish I could afford to go to an expensive college but I don't have a scholarship" and someone then answers: why should it be cheap.

                                    So, allow me the space to express my feelings and propose alternatives, of which scholarships are one example and creative programs are another. Another one I didn't mention would be the same route as universities force now: I could take out a loan. And I could consider it an investment loan with the idea it will pay back either in employment prospects or through the development of an application that earns me money. Other alternatives would be finding employment at a company willing to invest that $100/day through me, the limit of that alternative being working at an actual foundational model company for presumably unlimited usage.

                                    And of course, I could focus my personal education on squeezing the most value for the least cost. But I believe the balance point between slightly useful and completely transformative usages levels is probably at a higher cost level than I can reasonably afford as an independent.

                                  • k__ 2 hours ago

                                    That's why I like Aider.

                                    You can protect your files in a non-AI way: by simply not giving write access to Aider.

                                    Also, apparently Aider is a bit more economic with tokens than other tools.

                                    • zoogeny an hour ago

                                      I haven't used Aider yet, but I see it show up on HN frequently recently (the last couple of days specifically).

                                      I am hesitant because I am paying for Cursor now and I get a lot of model usage included within that monthly cost. I'm cheap, perhaps to a fault even when I could afford it, and I hate the idea of spending twice when spending once is usually enough. So while Aider is potentially cheaper than Claude Code, it is still more than what I am already paying.

                                      I would appreciate any comments on people who have made the switch from Cursor to Aider. Are you paying more/less? If you are paying more, do you feel the added value is worth the additional cost? If you are paying less, do you feel you are getting less, the same or even more?

                                  • remoquete 6 hours ago

                                    What's the Gemini equivalent of Claude Code and OpenAI's Codex? I've found projects like reugn/gemini-cli, but Gemini Code Assist seems limited to VS Code?

                                  • joshstrange 9 hours ago

                                    The most interesting part of this article for me was:

                                    > Have multiple checkouts of your repo

                                    I don’t know why this never occurred to me probably because it feels wrong to have multiple checkouts, but it makes sense so that you can keep each AI instance running at full speed. While LLM‘s are fast, this is one of the annoying parts of just waiting for an instance of Aider or Claude Code to finish something.

                                    Also, I had never heard of git worktrees, that’s pretty interesting as well and seems like a good way to accomplish effectively having multiple checkouts.

                                    • m0rde 8 hours ago

                                      I've never used Claude Code or other CLI-based agents. I use Cursor a lot to pair program, letting the AI do the majority of the work but actively guiding.

                                      How do you keep tabs on multiple agents doing multiple things in a codebase? Is the end deliverable there a bunch of MRs to review later? Or is it a more YOLO approach of trusting the agents to write the code and deploy with no human in the loop?

                                      • oxidant an hour ago

                                        Multiple terminal sessions. Well written prompts and CLAUDE.md files.

                                        I like to start by describing the problem and having it do research into what it should do, writing to a markdown file, then get it to implement the changes. You can keep tabs on a few different tasks at a time and you don't need to approve Yolo mode for writes, to keep the cost down and the model going wild.

                                        • rfoo 6 hours ago

                                          In the same way how you manage a group of brilliant interns.

                                          • mh- 26 minutes ago

                                            Really? My LLMs seem entirely uninterested in free snacks and unlimited vacation.

                                      • sbszllr 5 hours ago

                                        The issue with many of these tips is that they require you use to claude code (or codex cli, doesn't matter) to spend way more time in it, feed it more info, generate more outputs --> pay more money to the LLM provider.

                                        I find LLM-based tools helpful, and use them quite regularly but not 20 bucks+, let alone 100+ per month that claude code would require to be used effectively.

                                        • dist-epoch 5 hours ago

                                          > let alone 100+ per month that claude code would require

                                          I find this argument very bizarre. $100 is pay for 1-2 hours of developer time. Doesn't it save at least that much time in a whole month?

                                          • nrvn 3 hours ago

                                            what happened to the "$5 is just a cup o' coffee" argument? Are we heading towards the everything-for-$100 land?

                                            On a serious note, there is no clear evidence that any of the LLM-based code assistants will contribute to saving developer time. Depends on the phase of the project you are in and on a multitude of factors.

                                            • rsyring 7 minutes ago

                                              I'm a skeptical adopter of new tech. But I cut my teeth on LLMs a couple years ago when I was dropped into a project using an older framework I wasn't familiar with. Even back then, LLMs helped me a ton to get familiar with the project and use best practices when I wasn't sure what those were.

                                              And that was just copy & past into ChatGPT.

                                              I don't know about assistants or project integration. But, in my experience, LLMS are a great tool to have and worth learning how to use well, for you. And I think that's the key part. Some people like heavily integrated IDEs, some people prefer a more minimal approach with VS Code or Vim.

                                              I think LLMs are going to be similar. Some people are going to want full integration and some are just going to want minimal interface, context, and edits. It's going to be up to the dev to figure out what works best for him or her.

                                            • owebmaster 5 hours ago

                                              No, it doesn't. If you are still looking for product market fit, it is just cost.

                                              After 2 years of GPT4 release, we can safely say that LLMs don't make finding PMF that much easier nor improve general quality/UX of products, as we still see a general enshittification trend.

                                              If this spending was really game-changing, ChatGPT frontend/apps wouldn't be so bad after so long.

                                              • mikeg8 4 hours ago

                                                Finding product market fit is a human directional issue, and LLMs absolutely can help speed up iteration time here. I’ve built two RoR MVPs for small hobbby projects spending ~$75 in Claude code to make something in a day that would have previously taken me a month plus. Again, absolutely bizarre that people can’t see the value here, even as these tools are still working through their kinks.

                                                • owebmaster 3 hours ago

                                                  And how much did these two MVPs make in sales?

                                                  If they just helped you to ship something valueless, you paid $75 for entertainment, like betting.

                                                  • dist-epoch 2 hours ago

                                                    You can now do 30 MVPs in a month instead of just one.

                                                    • owebmaster an hour ago

                                                      And then 3000 people will magically come to try them.

                                                • mrbombastic 4 hours ago

                                                  Enshittification is the result of shitty incentives in the market not because coding is hard

                                            • 0x696C6961 5 hours ago

                                              I mostly work in neovim, but I'll open cursor to write boilerplate code. I'd love to use something cli based like Claude Code or Codex, but neither of them implement semantic indexing (vector embeddings) the way Cursor does. It should be possible to implement an MCP server which does this, but I haven't found a good one.

                                              • isaksamsten 4 hours ago

                                                I use a small plugin I’ve written my self to interact with Claude, Gemini 2.5 pro or GPT. I’ve not really seen the need for semantic searching yet. Instead I’ve given the LLM access to LSP symbol search, grep and the ability to add files to the conversation. It’s been working well for my use cases but I’ve never tried Cursor so I can’t comment on how it compares. I’m sure it’s not as smooth though. I’ve tried some of the more common Neovim plugins and for me it works better, but the preference here is very personal. If you want to try it out it’s here: https://github.com/isaksamsten/sia.nvim

                                                • sqs 4 hours ago

                                                  Tool-calling agents with search tools do very well at information retrieval tasks in codebases. They are slower and more expensive than good RAG (if you amortize the RAG index over many operations), but they're incredibly versatile and excel in many cases where RAG would fall down. Why do you think you need semantic indexing?

                                                  • 0x696C6961 4 hours ago

                                                    > Why do you think you need semantic indexing?

                                                    Unfortunately I can only give an anecdotal answer here, but I get better results from Cursor than the alternatives. The semantic index is the main difference, so I assume that's what's giving it the edge.

                                                    • sqs 3 hours ago

                                                      Is it a very large codebase? Anything else distinctive about it? Are you often asking high-level/conceptual questions? Those are the questions that would help me understand why you might be seeing better results with RAG.

                                                      • 0x696C6961 2 hours ago

                                                        I'll ask something like "where does X happen?" But "X" isn't mentioned anywhere in the code because the code is a complete nightmare.

                                                • LADev an hour ago

                                                  This is so helpful!

                                                  • bugglebeetle 5 hours ago

                                                    Claude Code works fairly well, but Anthropic has lost the plot on the state of market competition. OpenAI tried to buy Cursor and now Windsurf because they know they need to win market share, Gemini 2.5 pro is better at coding than their Sonnet models, has huge context and runs on their TPU stack, but somehow Anthropic is expecting people to pay $200 in API costs per functional PR costs to vibe code. Ok.

                                                    • owebmaster 44 minutes ago

                                                      > but somehow Anthropic is expecting people to pay $200 in API costs per functional PR costs to vibe code. Ok.

                                                      Reading the thread, somehow people are paying. It is mindblowing how in place of getting cheaper, development just got more expensive for businesses.

                                                    • andrewstuart an hour ago

                                                      I’m too scared of the cost to use this.

                                                      • Wowfunhappy 4 hours ago

                                                        > Use /clear to keep context focused

                                                        The only problem is that this loss is permanent! As far as I can tell, there's no way to go back to the old conversation after a `/clear`.

                                                        I had one session last week where Claude Code seemed to have become amazingly capable and was implementing entire new features and fixing bugs in one-shot, and then I ran `/clear` (by accident no less) and it suddenly became very dumb.

                                                        • zomglings 4 hours ago

                                                          You can ask it to store its current context to a file, review the file, ask it to emphasize or de-emphasize things based on your review, and then use `/clear`.

                                                          Then, you can edit the file at your leisure if you want to.

                                                          And when you want to load that context back in, ask it to read the file.

                                                          Works better than `/compact`, and is a lot cheaper.

                                                          • Wowfunhappy 4 hours ago

                                                            Neat, thanks, I had no idea!

                                                            Edit: It so happens I had a Claude Code session open in my Terminal, so I asked it:

                                                                Save your current context to a file.
                                                            
                                                            Claude produced a 91 line md file... surely that's not the whole of its context? This was a reasonably lengthy conversation in which the AI implemented a new feature.
                                                          • datavirtue 2 hours ago

                                                            Compared to my experience with the free GitHub Copilot in VS Code it sounds like you guys are in a horse and buggy.

                                                            • shmoogy 42 minutes ago

                                                              I'm excited for the improvements they've had recently but I have better luck with Cline in regular vs code, as well as cursor.

                                                              I've tried Claude code this week and I really didn't like it - Claude did an okay job but was insistent on deleting some shit and hard coding a check instead of an actual conditional. It got the feature done in about $3, but I didn't really like the user experience and it didn't feel any better than using 3.7 in cursor.

                                                          • jasonjmcghee 4 hours ago

                                                            They've worked to improve this with "memories" (hash symbol to "permanently" record something - you can edit later if you want).

                                                            And there's CLAUDE.md. it's like cursorrules. You can also have it modify it's own CLAUDE.md.

                                                          • m00dy 6 hours ago

                                                            well, the best practice is to use gemini 2.5 pro instead :)

                                                            • replwoacause 5 hours ago

                                                              Yep I learned this the hard way after racking up big bills just using Sonnet 3.7 in my IDE. Gemini is just as good (and not nearly as willing to agree with every dumb thing I say) and it’s way cheaper.

                                                            • zomglings 6 hours ago

                                                              If anyone from Anthropic is reading this, your billing for Claude Code is hostile to your users.

                                                              Why doesn’t Claude Code usage count against the same plan that usage of Claude.ai and Claude Desktop are billed against?

                                                              I upgraded to the $200/month plan because I really like Claude Code but then was so annoyed to find that this upgrade didn’t even apply to my usage of Claude Code.

                                                              So now I’m not using Claude Code so much.

                                                              • jdance 5 hours ago

                                                                This would put anthropic in the business of minimizing the context to increase profits, same as Cursor and others who cheap out on context and try to RAG etc. Which would quickly make it worse, so I hope they stay on api pricing

                                                                Some base usage included in the plan might be a good balance

                                                                • zomglings 4 hours ago

                                                                  You know, I wouldn't mind if they just applied the API pricing after Claude Code ran through the plan limits.

                                                                  It would definitely get me to use it more.

                                                                  • Wowfunhappy 4 hours ago

                                                                    But the Claude Pro plan is almost certainly priced under the assumption that some users will use it below the usage limit.

                                                                    If everyone used the plan to the limit, the plan would cost the same as the API with usage equal to the limit.

                                                                    • karbon0x 4 hours ago

                                                                      Claude Code and Claude.ai are separate products.

                                                                  • twalkz 5 hours ago

                                                                    I’ve been using codemcp (https://github.com/ezyang/codemcp) to get “most” of the functionality of Claude code (I believe it uses prompts extracted from Claude Code), but using my existing pro plan.

                                                                    It’s less autonomous, since it’s based on the Claude chat interface, and you need to write “continue” every so often, but it’s nice to save the $$

                                                                    • fcoury 3 hours ago

                                                                      Just tried it and it's indeed very good, thanks for mentioning it! :-)

                                                                      • zomglings 4 hours ago

                                                                        Thanks, makes sense that an MCP server that edits files is a workaround to the problem.

                                                                      • replwoacause 5 hours ago

                                                                        Their API billing in general is hostile to users. I switched completely to Gemini for this reason and haven’t looked back.

                                                                        • fcoury 6 hours ago

                                                                          I totally agree with this, I would rather have some kind of prediction than using the Claude Code roulette. I would definitely upgrade my plan if I got Claude Code usage included.

                                                                          • datavirtue 2 hours ago

                                                                            I don't what you guys are on about but I have been using the free GitHub Copilot in VS Code chats to absolutely crank out new UI features in Vue. All that stuff that makes you groan at the thought of it: more divs, bindings, form validation, a whole new widget...churned out in 30 seconds. Try it live. Works? Keep.

                                                                            I'm surprised at the complexity and correctness at which it infers from very simple, almost inadequate, prompts.

                                                                          • dist-epoch 5 hours ago

                                                                            Claude.ai/Desktop is priced based on average user usage. If you have 1 power user sending 1000 requests per day, and 99 sending 5, many even none, you can afford having a single $10/month plan for everyone to keep things simple.

                                                                            But every Claude Code user is a 1000 requests per day user, so the economics don't work anymore.

                                                                            • zomglings 5 hours ago

                                                                              I would accept a higher-priced plan (which covered both my use of Claude.ai/Claude Desktop AND my use of Claude Code).

                                                                              Anthropic make it seem like Claude Code is a product categorized like Claude Desktop (usage of which gets billed against your Claude.ai plan). This is how it signs off all its commits:

                                                                                   Generated with [Claude Code](https://claude.ai/code)
                                                                              
                                                                              At the very least, this is misleading. It misled me.

                                                                              Once I had purchased the $200/month plan, I did some reading and quickly realized that I had been too quick to jump to conclusions. It still left me feeling like they had pulled a fast on one me.

                                                                              • dist-epoch 4 hours ago

                                                                                Maybe you can cancel your subscription or charge back?

                                                                                I think it's just oversight on their part. They have nothing to gain by making people believe they would get Claude Code access through their regular plans, only bad word of mouth.

                                                                                • zomglings 4 hours ago

                                                                                  To be fair to them, they make it pretty easy to manage the subscription, downgrade it, etc.

                                                                                  This is definitely not malicious on their part. Just bears pointing out.

                                                                              • fcoury 5 hours ago

                                                                                Well, take that into consideration then. Just make it an option. Instead of getting 1000 requests per day with code, you get 100 on the $10/month plan, and then let users decide whether they want to migrate to a higher tier or continue using the API model.

                                                                                I am not saying Claude should stop making money, I'm just advocating for giving users the value of getting some Code coverage when you migrate from the basic plan to the pro or max.

                                                                                Does that make sense?

                                                                              • visarga 2 hours ago

                                                                                Yeah, tried it for a couple of minutes, $0.31, quickly stopped and moved away.

                                                                                • cypherpunks01 6 hours ago

                                                                                  Claude Pro and other website/desktop subscription plans are subject to usage limits that would make it very difficult to use for Claude Code.

                                                                                  Claude Code uses the API interface and API pricing, and writes and edits code directly on your machine, this is a level past simply interacting with a separate chat bot. It seems a little disingenuous to say it's "hostile" to users, when the reality is yeah, you do pay a bit more for more reliable usage tier, for a task that requires it. It also shows you exactly how much it's spent at any point.

                                                                                  • zomglings 4 hours ago

                                                                                    The writing of edits and code directly on my machine is something that happens on the client side. I don't see why that usage would be subject to anything but one-time billing or how it puts any strain on Anthropic's infrastructure.

                                                                                    • fcoury 5 hours ago

                                                                                      > ... usage limits that would make it very difficult to use for Claude Code.

                                                                                      Genuinely interested: how's so?

                                                                                      • cypherpunks01 5 hours ago

                                                                                        Well, I think it'd be pretty irritating to see the message "3 messages remaining until 6PM" while you are in the middle of a complex coding task.

                                                                                        • unshavedyak 4 hours ago

                                                                                          Conversely I have to manually do this and monitor the billing instead.

                                                                                          • fcoury 5 hours ago

                                                                                            No, that's the whole point: predictability. It's definitely a trade off, but if we could save the work as is we could have the option to continue the iteration elsewhere, or even better, from that point on offer the option to fallback to the current API model.

                                                                                            A nice addition would be having something like /cost but to check where you are in regards to limits.

                                                                                      • ghuntley 5 hours ago

                                                                                        $200/month isn't that much. Folks, I'm hanging around with are spending $100 USD to $500 USD daily as the new norm as a cost of doing business and remaining competitive. That might seem expensive, but it's cheap... https://ghuntley.com/redlining

                                                                                        • oytis 5 hours ago

                                                                                          When should we expect to see the amazing products these super-competitive businesses are developing?

                                                                                          • zomglings 4 hours ago

                                                                                            $100/day seems reasonable as an upper-percentile spend per programmer. $500/day sounds insane.

                                                                                            A 2.5 hour session with Claude Code costs me somewhere between $15 and $20. Taking $20/2.5 hours as the estimate, $100 would buy me 12.5 hours of programming.

                                                                                            • bambax 4 hours ago

                                                                                              Asking very specific questions to Sonnet 3.7 costs a couple of tenths of a cent every time, and even if you're doing that all day it will never amount to more than maybe a dollar at the end of the day.

                                                                                              On average, one line of, say, JavaScript represents around 7 tokens, which means there are around 140k lines of JS per million tokens.

                                                                                              On Openrouter, Sonnet 3.7 costs are currently:

                                                                                              - $3 / one million input tokens => $100 = 33.3 million input tokens = 420k lines of JS code

                                                                                              - $15 / one million output tokens => $100 = 3.6 million output tokens = 4.6 million lines of JS code

                                                                                              For one developer? In one day? It seems that one can only reach such amounts if the whole codebase is sent again as context with each and every interaction (maybe even with every keystroke for type completion?) -- and that seems incredibly wasteful?

                                                                                              • bambax an hour ago

                                                                                                I can't edit the above comment, but there's obviously an error in the math! ;-) Doesn't change the point I was trying to make, but putting this here for the record.

                                                                                                33.3 million input tokens / 7 tokens per loc = 4.8 million locs

                                                                                                3.6 million output tokens / 7 tokens per loc = 515k locs

                                                                                                • cma 3 hours ago

                                                                                                  That's how it works, everything is recomputed again every additional prompt. But it can cache the state of things and restore for a lower fee, and reingesting what was formerly output is cheaper than making new output (serial bottleneck) so sometimes there is a discount there.

                                                                                                • ghuntley 4 hours ago

                                                                                                  It sounds insane until you drive full agentic loops/evals. I'm currently making a self-compiling compiler; no doubt you'll hear/see about it soon. The other night, I fell asleep and woke up with interface dynamic dispatch using vtables with runtime type information and generic interface support implemented...

                                                                                                  • UltraSane 3 hours ago

                                                                                                    Do you actually understand the code Claude wrote?

                                                                                                    • cpursley 2 hours ago

                                                                                                      Do you understand all of the code in the libraries that your applications depend on? Or your coworker for that matter?

                                                                                                      All of the gate keeping around llm code tools are amusing. But whatever, I’m shipping 10x and making money doing it.

                                                                                                    • zomglings 3 hours ago

                                                                                                      Fair enough. That's pretty cool, I haven't gone that far in my own work with AI yet, but now I am inspired to try.

                                                                                                      The point is to get a pipeline working, cost can be optimized down after.

                                                                                                  • m00dy 5 hours ago

                                                                                                    Seriously? That’s wild. What kind of CS field could even handle that kind of daily spend for a bunch of people?

                                                                                                    • ghuntley 5 hours ago

                                                                                                      Consider L5 at Google: outgoings of $377,797 USD per year just on salary/stock, before fixed overheads such as insurance, leave, issues like ramp-up time and cost of their manager. In the hands of a Staff+ engineer, these tools enable replication of Staff+ engineers and don't sleep. My 2c: the funding for the new norm will come from either compressing the manager layer or engineering layer or both.

                                                                                                      • malfist 5 hours ago

                                                                                                        LLMs absolutely don't replicate staff+ engineers.

                                                                                                        If your staff engineers are mostly doing things AI can do, then you don't need staff. Probably don't even need senior

                                                                                                        • cpursley 2 hours ago

                                                                                                          Oh but they absolutely do. Have you not used any of this llm tooling? It’s insanely good once you learn how to employ it. I no longer need a front end team, for example. It's that good at TypeScript and React. And the design is even better.

                                                                                                          • ghuntley 5 hours ago

                                                                                                            That's my point.

                                                                                                            - L3 SWE II - $193,712 USD (before overheads)

                                                                                                            - L4 SWE III - $297,124 USD (before overheads)

                                                                                                            - L5 Senior SWE - $377,797 USD (before overheads)

                                                                                                            These tools and foundational models get better every day, and right now, they enable Staff+ engineers and businesses to have less need for juniors. I suspect there will be [short-to-medium-term] compression. See extended thoughts at https://ghuntley.com/screwed

                                                                                                            • StefanBatory 3 hours ago

                                                                                                              I wonder what will happen first - will companies move to LLMs, or to programmers from abroad (because ultimately, it will be cheaper than using LLMs - you've said ~$500 per day, in Poland ~$1500 will be a good monthly wage - and that still will make us expensive! How about moving to India, then? Nigeria? LATAM countries?)

                                                                                                              • throwawayb299 an hour ago

                                                                                                                > in Poland ~$1500 will be a good monthly wage

                                                                                                                The minimum wage in Poland is around USD 1240/month. The median wage in Poland is approximately USD 1648/month. Tech salaries are considerably higher than the median.

                                                                                                                Idk, maybe for an intern software developer it's a good salary...

                                                                                                                • StefanBatory an hour ago

                                                                                                                  Minimal is ~$930 after taxes, though; I rarely see people talk here about salary pre-tax, tbh.

                                                                                                                  ~$1200 is what I'd get paid here after a few years of experience; I have never saw an internship offer in my city that paid more than minimal wage (most commonly, it's unpaid).

                                                                                                                • ghuntley 2 hours ago

                                                                                                                  The industry has tried that, and the problems are well known (timezones, unpredictable outcomes in terms of quality and delivery dates)...

                                                                                                                  Delivery via LLMs is predictable, fast, and any concerns about outcome [quality] can be programmed away to reject bad outcomes. This form of programming the LLMs has a one-time cost...

                                                                                                          • mmikeff 5 hours ago

                                                                                                            The kind of field where AI builds more in a day than a team or even contract dev does.

                                                                                                            • ghuntley 5 hours ago

                                                                                                              correct; utilised correctly these tools ship teams of output in a single day.

                                                                                                              • rudedogg 4 hours ago

                                                                                                                Do you have a link to some of this output? A repo on Github of something you’ve done for fun?

                                                                                                                I get a lot of value out of LLMs but when I see people make claims like this I know they aren’t “in the trenches” of software development, or care so little about quality that I can’t relate to their experience.

                                                                                                                Usually they’re investors in some bullshit agentic coding tool though.

                                                                                                                • ghuntley 3 hours ago

                                                                                                                  I will shortly; am building a serious self-compiling compiler rn out of an brand-new esoteric language. Meaning the LLM is able to program itself without training data about the programming language...

                                                                                                                  • throwawayb299 an hour ago

                                                                                                                    > a serious self-compiling compiler

                                                                                                                    Well, virtually every production-grade compiler is self-compiling. Since you bring it up explicitly, I'm wondering what implications of begin self-compiling you have in mind?

                                                                                                                    > Meaning the LLM is able to program itself without training data about the programming language...

                                                                                                                    Could you clarify this sentence a bit? Does it mean the LLM will code in this new language without training in it before hand? Or is it going to enable the LLM to programm itself to gain some new capabilities?

                                                                                                                    Frankly, with the advent of coding agents, building a new compiler sounds about as relevant as introducing a new flavor of assembly language and then a new assembly may at least be justified by a new CPU architecture...

                                                                                                                    • lostmsu 3 hours ago

                                                                                                                      I would hold on on making grand claims until you have something grand to show for it.

                                                                                                                      • ghuntley 3 hours ago

                                                                                                                        Honestly, I don't know what to make of it. Stage 2 is almost complete, and I'm (right now) conducting per-language benchmarks to compare it to the Titans.

                                                                                                                        Using the proper techniques, Sonet 3.7 can generate code in the custom lexical/stdlib. So, in my eyes, the path to Stage 3 is unlocked, but it will chew lots and lots of tokens.

                                                                                                            • sbszllr 5 hours ago

                                                                                                              All can be true depending on the business/person:

                                                                                                              1. My company cannot justify this cost at all.

                                                                                                              2. My company can justify this cost but I don't find it useful.

                                                                                                              3. My company can justify this cost, and I find it useful.

                                                                                                              4. I find it useful, and I can justify the cost for personal use.

                                                                                                              5. I find it useful, and I cannot justify the cost for personal use.

                                                                                                              That aside -- 200/day/dev for a "nice to have service that sometimes makes my work slightly faster" is much in the majority of the world.