LLMs are just a tool, just like airplanes or hammers. An airplane is very expensive, but better at going really far distances than humans can on foot. A hammer is cheaper than a human, but by itself is useless unless operated properly. Despite the tone of the outputs, LLMs should not be authoritative and human judgement shouldn’t be replaced with them.
Just on the security side of coding, highly skilled security engineers at Mozilla were able to use Claude Mythos to identify and address many issues to make Firefox more secure. Some if these issues were introduced over 10 years ago, and a human could have identified and fixed them but human speed of reading and finding will always be a bottleneck. Having highly skilled humans offload the slow task to go through the codebase and raise issues, allowed them to find and understand the nuanced problem, and work on a fix. The key here is giving the people with the skills the ability be enhanced with LLMs, not replace them with one.
Name one single use case for LLMs that shows they are better or cheaper than humans.
LLMs are just a tool, just like airplanes or hammers. An airplane is very expensive, but better at going really far distances than humans can on foot. A hammer is cheaper than a human, but by itself is useless unless operated properly. Despite the tone of the outputs, LLMs should not be authoritative and human judgement shouldn’t be replaced with them.
Just on the security side of coding, highly skilled security engineers at Mozilla were able to use Claude Mythos to identify and address many issues to make Firefox more secure. Some if these issues were introduced over 10 years ago, and a human could have identified and fixed them but human speed of reading and finding will always be a bottleneck. Having highly skilled humans offload the slow task to go through the codebase and raise issues, allowed them to find and understand the nuanced problem, and work on a fix. The key here is giving the people with the skills the ability be enhanced with LLMs, not replace them with one.
In short, they’re great at finding and flagging things for a human to review.
The problem is when someone overestimates how well these models perform and they try to automate everything and put too much trust into these models.
I once had an AI chatbot clean up a 3D model in under a minute with just a simple command