• shaztopher@lemmy.zip
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 hours ago

    If you click the most expensive model and then click max/fast mode, the same task can easily cost 10 or 20x of the cheaper models

    • aksdb@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 hours ago

      I watched two colleagues this week and both had Opus 4.8 1M max thinking. No matter which task. It’s also slow as fuck. I work almost all day with GPT-5.4 low thinking and get good results… but faster and cheaper.

      I guess good model selection and promoting will be what sets devs apart in the near future. Once that bubble bursts a bit more and prices increase further that will be an interesting reckoning. Also for companies who basically taunted their employees into tokenmaxxing.