It would be on the order of aN intensive video game, maybe. Depends on the size of the model, etc.
Training is definitely expensive but you are right in that it’s a one-time cost.
Overall, the challenge is that it’s very inefficient. To use a machine learning algorithm to do something that could be implemented deductively is not ideal (On the other hand, if it saves human effort…)
To a degree, trained models can also be retrained on newer data (eg freezing layers, LoRa, GaLore, Hypernetworks etc). Also newer data can be injected into a prompt to make sure that the responses are aligned with newer versions of software, for example.
The electricity consumption is a concern, but it’s probably not going to be the end of the world.
They can also be really good for quickly writing code if you line up a whole bunch of tests and line up all the types and then copy and paste that a few times, maybe with a macro in Vim.
The LLM will fill in the middle correctly, like 90% of the time. Compare it in git, make sure the tests pass, and then that’s an extra 20 minutes I get to spend with my wife and kids.
Yeah, that’s always a risk, but as you said, humans make mistakes too. And if you change your approach to software development by writing more tests and using strict interfaces or type annotations, etc., it is pretty reliable and definitely saves time.
It would be on the order of aN intensive video game, maybe. Depends on the size of the model, etc.
Training is definitely expensive but you are right in that it’s a one-time cost.
Overall, the challenge is that it’s very inefficient. To use a machine learning algorithm to do something that could be implemented deductively is not ideal (On the other hand, if it saves human effort…)
To a degree, trained models can also be retrained on newer data (eg freezing layers, LoRa, GaLore, Hypernetworks etc). Also newer data can be injected into a prompt to make sure that the responses are aligned with newer versions of software, for example.
The electricity consumption is a concern, but it’s probably not going to be the end of the world.
the one place you would use an LLM is going to be something where nothing else can be used, cataloging information for example.
This pretty much lines up with my understanding of AI.
They can also be really good for quickly writing code if you line up a whole bunch of tests and line up all the types and then copy and paste that a few times, maybe with a macro in Vim.
The LLM will fill in the middle correctly, like 90% of the time. Compare it in git, make sure the tests pass, and then that’s an extra 20 minutes I get to spend with my wife and kids.
that’s definitely one of the potentials, if you prefer a schizoprehnic person writing code which you then have to make work. I suppose.
I’m autistic about things and would write it all myself to my very high standards of “how autistic is it? If the answer is yes, than good enough.”
if other people look at the shit im doing and say “you could do this better” it’s not done well enough.
Which I can totally understand, but I would like to spend more time with my family and less time writing code.
This also allows me to iterate faster and identify useful ideas That justify deeper effort.
And then the one fucking hallucination you missed ends up crashing the whole thing on the weekend.
On the other hand, you could have done that by accident yourself.
Yeah, that’s always a risk, but as you said, humans make mistakes too. And if you change your approach to software development by writing more tests and using strict interfaces or type annotations, etc., it is pretty reliable and definitely saves time.